论文发表

论文发表

2017 

  1. Yanzhuo Ding,Yang Liu,Huanbo Luan, and Maosong Sun.2017.Visualizing and Understanding Neural Machine Translation. In Proceedings of ACL 2017, Vancouver, Canada, July.ACL 2017 Outstanding Paper
  2. Jiacheng Zhang, Yang Liu, Huanbo Luan, Jingfang Xu, and Maosong Sun. 2017. Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization. In Proceedings of ACL 2017, Vancouver, Canada, July.
  3. Meng Zhang, Yang Liu, Huanbo Luan, and Maosong Sun. 2017. Adversarial Training for Unsupervised Bilingual Lexicon Induction.In Proceedings of ACL 2017, Vancouver, Canada, July.
  4. Yun Chen, Yang Liu, Yong Cheng, and Victor O.K. Li. 2017. A Teacher-Student Framework for Zero-Resource Neural Machine Translation.In Proceedings of ACL 2017, Vancouver, Canada, July.
  5. Meng Zhang, Yang Liu, Huanbo Luan, and Maosong Sun. 2017. Earth Mover's Distance Minimization for Unsupervised Bilingual Lexicon Induction. In Proceedings of EMNLP 2017,Copenhagen, Denmark, September.
  6. Hao Zheng, Yong Cheng, and Yang Liu. 2017. Maximum Expected Likelihood Estimation for Zero-resource Neural Machine Translation.In Proceedings of IJCAI 2017, Melbourne, Australia,August.
  7. Yong Cheng, Qian Yang, Yang Liu, Maosong Sun, and Wei Xu. 2017. Joint Training for Pivot-based Neural Machine Translation. In Proceedings of IJCAI 2017, Melbourne, Australia, August.
  8. Meng Zhang, Haoruo Peng, Yang Liu, Huanbo Luan, and Maosong Sun. 2017. Bilingual Lexicon Induction from Non-Parallel Data with Minimal Supervision.In Proceedings of AAAI 2017, San Francisco, USA, February.
  9. Yong Cheng, Yang Liu, and Wei Xu. 2017. Maximum Reconstruction Estimation for Generative Latent-Variable Models.In Proceedings of AAAI 2017, San Francisco, USA, February.
  10. Jinsong Su, Zhixing Tan, Deyi Xiong, Rongrong Ji, Xiaodong Shi, and Yang Liu. 2017. Lattice-Based Recurrent Neural Network Encoders for Neural Machine Translation.In Proceedings of AAAI 2017, San Francisco, USA, February.
  11. Zhaopeng Tu, Yang Liu, Lifeng Shang, Xiaohua Liu, and Hang Li. 2017. Neural Machine Translation with Reconstruction.In Proceedings of AAAI 2017, San Francisco, USA, February.
  12. Zhaopeng Tu, Yang Liu, Zhengdong Lu, Xiaohua Liu, and Hang Li. 2017. Context Gates for Neural Machine Translation.Transactions of the Association for Computational Linguistics,5:87–99.
  13. Shiqi Shen, Yang Liu, and Maosong Sun. 2017. Optimizing Non-Decomposable Evaluation Metrics for Neural Machine Translation. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY. 32(4): 796–804.
  14. Liner Yang, Maosong Sun, Jiacheng Zhang, Zhenghao Liu, Huanbo Luan, and Yang Liu. 2017.Neural Parse Combination. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY. 32(4): 749–757.
  15. Zhuoran Liu, and Yang Liu. 2017. Exploiting Unlabeled Data for Neural Grammatical Error Detection.Journal of Computer Science and Technology. 32(4):758-767.
  16. Jiacheng Zhang, Yanzhuo Ding, Shiqi Shen, Yong Cheng, Maosong Sun, Huanbo Luan, and Yang Liu. 2017. THUMT: An Open Source Toolkit for Neural Machine  Translation.
  17. Ayana, Shiqi Shen, Yankai Lin, Cunchao Tu, Yu Zhao, Zhiyuan Liu, and Maosong Sun. Recent Advances on Neural Headline Generation. Journal of Computer Science and Technology,32(4): 768–784.
  18. Yixin Cao, Juanzi Li, Jiaxin Shi, Zhiyuan Liu. On Modeling Sense Relatedness in Multi-prototype Word Embedding.In Proceedings of IJCNLP 2017, Taipei,Taiwan,November. 
  19. Wenyuan Zeng, Yankai Lin, Zhiyuan Liu, and Maosong Sun. Incorporating Relation Paths in Neural Relation Extraction.In Proceedings of EMNLP 2017, Copenhagen,Denmark, September.
  20. Cunchao Tu, Zhengyan Zhang, Zhiyuan Liu, and Maosong Sun. TransNet: Translation-Based Network Representation Learning for Social Relation Extraction.In Proceedings of IJCAI 2017, Melbourne, Australia, August.
  21. Cheng Yang, Maosong Sun, Zhiyuan Liu, and Cunchao Tu. Fast Network Embedding Enhancement via High Order Proximity Approximation.In Proceedings of IJCAI 2017, Melbourne, Australia, August.
  22. Hao Zhu, Ruobing Xie, Zhiyuan Liu, and Maosong Sun. Iterative Entity Alignment via Joint Knowledge Embeddings.In Proceedings of IJCAI 2017, Melbourne, Australia, August.
  23. Ruobing Xie, Zhiyuan Liu, Huanbo Luan, and Maosong Sun. Image-embodied Knowledge Representation Learning.In Proceedings of IJCAI 2017, Melbourne, Australia, August.
  24. Ruobing Xie, Xingchi Yuan, Zhiyuan Liu, and Maosong Sun. Lexical Sememe Prediction via Word Embeddings and Matrix Factorization. In Proceedings of IJCAI 2017, Melbourne, Australia, August.
  25. Zhuyun Dai, Chenyan Xiong, Jamie Callan, and Zhiyuan Liu. End-to-End Neural Ad-hoc Ranking with Kernel Pooling.In Proceedings of SIGIR 2017, Tokyo,Japan,August.
  26. Cunchao Tu, Han Liu, Zhiyuan Liu, and Maosong Sun. CANE: Context-Aware Network Embedding for Relation Modeling. In Proceedings of ACL 2017,Vancouver, Canada, July.
  27. Yankai Lin, Zhiyuan Liu, and Maosong Sun. Neural Relation Extraction with Multi-lingual Attention. In Proceedings of ACL 2017, Vancouver, Canada, July.
  28. Yilin Niu, Ruobing Xie, Zhiyuan Liu, and Maosong Sun. Improved Word Representation Learning with Sememes. In Proceedings of ACL 2017, Vancouver, Canada, July.
  29. Cunchao Tu, Zhiyuan Liu, Huanbo Luan, and Maosong Sun. PRISM: Profession Identification in Social Media.ACM Transactions on Intelligent Systems and Technology, Vol. 9, No. 4, Article 39.
  30. Cheng Yang, Maosong Sun, Wayne Xin Zhao, Zhiyuan Liu, and Edward Chang. A Neural Network Approach to Joint Modeling Social Networks and Mobile Trajectories. ACM Transactions on Information Systems, Vol. 35, No. 4, Article 8.
  31. Liner Yang, Xinxiong Chen, Zhiyuan Liu, and Maosong Sun. Improving Word Representations with Document Labels.IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING,25(4): 863-870.
  32. 哈里旦木·阿布都克里木,程勇,刘洋,孙茂松. 2017. 基于双向门限递归单元神经网络的维吾尔语形态切分. 清华大学学报(自然科学版), 57(1):1-6.
  33. 哈里旦木·阿布都克里木, 刘洋, 孙茂松. 2017. 神经机器翻译系统在维吾尔语-汉语翻译中的性能对比. 清华大学学报(自然科学版),57(8):878-883.
  34. 刘洋. 2017. 神经机器翻译前沿进展. 计算机研究与发展,54(6):1144-1149.
  35. 涂存超, 杨成, 刘知远, 孙茂松. 2017.网络表示学习综述.中国科学:信息科学, 47(8):980-996.

2016

  1. Shiqi Shen, Yong Cheng, Zhongjun He, Wei He, Hua Wu, Maosong Sun, and Yang Liu. 2016. Minimum Risk Training for Neural Machine Translation. In Proceedings of ACL 2016, Berlin, Germany, August. 
  2. Yong Cheng, Wei Xu, Zhongjun He, Wei He, Hua Wu, Maosong Sun, and Yang Liu. 2016. Semi-Supervised Learning for Neural Machine Translation. In Proceedings of ACL 2016, Berlin, Germany, August.
  3. Chunyang Liu, Yang Liu, Huanbo Luan, Maosong Sun, and Heng Yu. 2016. Agreement-based Learning of Parallel Lexicons and Phrases from Non-Parallel Corpora. In Proceedings of ACL 2016, Berlin, Germany, August.
  4. Yankai Lin, Shiqi Shen, Zhiyuan Liu, Huanbo Luan, Maosong Sun. 2016. Neural Relation Extraction with Selective Attention over Instances. In Proceedings of ACL 2016, Berlin, Germany, August.
  5. Yong Cheng, Shiqi Shen, Zhongjun He, Wei He, Hua Wu, Maosong Sun, and Yang Liu. 2016. Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation. In Proceedings of IJCAI 2016, New York, USA, July.
  6. Cunchao Tu, Weicheng Zhang, Zhiyuan Liu, Maosong Sun, Huanbo Luan. 2016. Max-Margin DeepWalk: Discriminative Learning of Network Representation. In Proceedings of IJCAI 2016, New York, USA, July.
  7. Yankai Lin, Zhiyuan Liu, Maosong Sun. 2016. Knowledge Representation Learning with Entities, Attributes and Relations. In Proceedings of IJCAI 2016, New York, USA, July.
  8. Ruobing Xie, Zhiyuan Liu, Maosong Sun. 2016. Representation Learning of Knowledge Graphs with Hierarchical Types. In Proceedings of IJCAI 2016, New York, USA, July.
  9. Meng Zhang, Yang Liu, Huanbo Luan, Maosong Sun, Tatsuya Izuha, and Jie Hao. 2016. Building Earth Mover's Distance on Bilingual Word Embeddings for Machine Translation. In Proceedings of AAAI 2016, Phoenix, USA, February.
  10. Ruobing Xie, Zhiyuan Liu, Jia Jia, Huanbo Luan, Maosong Sun. 2016. Representation Learning of Knowledge Graphs with Entity Descriptions. In Proceedings of AAAI 2016, Phoenix, USA, February.
  11. Meng Zhang, Yang Liu, Huanbo Luan, and Maosong Sun. 2016. Listwise Ranking Functions for Statistical Machine Translation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(8): 1464-1472.
  12. 刘知远, 孙茂松, 林衍凯, 谢若冰. 2016. 知识表示学习研究进展. 计算机研究与发展, 53(2): 247-261.

2015

  1. Chunyang Liu, Yang Liu, Huanbo Luan, Maosong Sun, and Heng Yu. 2015. Generalized Agreement for Bidirectional Word Alignment. In Proceedings of EMNLP 2015, Lisbon, Portugal, September.[link]
  2. Shiqi Shen, Yang Liu, Huanbo Luan, and Maosong Sun. 2015. Consistency-Aware Search for Word Alignment. In Proceedings of EMNLP 2015, Lisbon, Portugal, September.[link]
  3. Jinsong Su, Deyi Xiong, Biao Zhang, Yang Liu, and Junfeng Yao. 2015. Bilingual Correspondence Recursive Autoencoder for Statistical Machine Translation. In Proceedings of EMNLP 2015, Lisbon, Portugal, September.[link]
  4. Yankai Lin, Zhiyuan Liu, Huanbo Luan, Maosong Sun, Siwei Rao, Song Liu. Modeling Relation Paths for Representation Learning of Knowledge Bases. In Proceedings of EMNLP 2015, Lisbon, Portugal, September.[link]
  5. Hongyin Luo, Zhiyuan Liu, Huanbo Luan, Maosong Sun. Online Learning of Interpretable Word Embeddings. In Proceedings of EMNLP 2015, Lisbon, Portugal, September.[link]
  6. Meiping Dong, Yang Liu, Huanbo Luan, Maosong Sun, Tatsuya Izuha, and Dakun Zhang. 2015. Iterative Learning of Parallel Lexicons and Phrases from Non-Parallel Corpora. In Proceedings of IJCAI 2015, Buenos Aires, Argentina, July.[link]
  7. Yu Zhao, Zhiyuan Liu, Maosong Sun. Representation Learning for Measuring Entity Relatedness with Rich Information. In Proceedings of IJCAI 2015, Buenos Aires, Argentina, July.[link]
  8. Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, Edward Chang. Network Representation Learning with Rich Text Information. In Proceedings of IJCAI 2015, Buenos Aires, Argentina, July.[link]
  9. Xinxiong Chen*, Lei Xu*, Zhiyuan Liu, Maosong Sun, Huanbo Luan. Joint Learning of Character and Word Embeddings. In Proceedings of IJCAI 2015, Buenos Aires, Argentina, July. (* indicates equal contribution)[link]
  10. Yang Liu and Maosong Sun. 2015. Contrastive Unsupervised Word Alignment with Non-Local Features. In Proceedings of AAAI 2015, Austin, Texas, January. [link]
  11. Yu Zhao, Zhiyuan Liu, Maosong Sun. Phrase Type Sensitive Tensor Indexing Model for Semantic Composition. In Proceedings of AAAI 2015, Austin, Texas, January. [link]
  12. Yang Liu, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun. Topical Word Embeddings. In Proceedings of AAAI 2015, Austin, Texas, January. [link]
  13. Yankai Lin, Zhiyuan Liu, Maosong Sun, Yang Liu, Xuan Zhu. Learning Entity and Relation Embeddings for Knowledge Graph Completion. In Proceedings of AAAI 2015, Austin, Texas, January. [link]
  14. Jinsong Su, Deyi Xiong, Yang Liu, Xianpei Han, Hongyu Lin, and Junfeng Yao. 2015. A Context-Aware Topic Model for Statistical Machine Translation. In Proceedings of ACL 2015, Beijing, China, July.[link]
  15. Tianze Shi, Zhiyuan Liu, Yang Liu, and Maosong Sun. 2015. Learning Cross-lingual Word Embeddings via Matrix Co-factorization. In Proceedings of ACL 2015 (short paper), Beijing, China, July.[link]
  16. Yang Liu and Min Zhang. 2015. Statistical Machine Translation. Routledge Encyclopedia of Translation Technology (Edited by Sin-Wai Chan), Chapter 11.[link]
  17. Yan Wang, Zhiyuan Liu, Maosong Sun. Incorporating Linguistic Knowledge for Learning Distributed Word Representations. PLOS ONE.[link]
  18. 刘知远, 张乐, 涂存超, 孙茂松. 中文社交媒体谣言统计语义分析. 中国科学 信息科学, 45(12): 1536-1546, 2015.[link
  19. Cunchao Tu, Zhiyuan Liu, Huanbo Luan, Maosong Sun. PRISM: Profession Identification in Social Media with Personal Information and Community Structure. National Conference of Social Media Processing (SMP 2015), 2015.[link]
  20. 刘扬, 刘知远, 孙茂松. 面向语义变迁分析的多向量表示词义模型. 全国社会媒体处理大会 (SMP 2015), 2015.
  21. Cunchao Tu, Zhiyuan Liu, Maosong Sun. Tag Correspondence Model for User Tag Suggestion. Journal of Computer Science and Technology (JCST), 2015.[link]
  22. Halidanmu Abudukelimu, Yang Liu, Xinxiong Chen, Maosong Sun, and Abudoukelimu Abulizi. Learning Distributed Representations of Uyghur Word and Morphemes. In Proceedings of CCL/NLP-NABD 2015, Guangzhou, China, November.[link]
  23. Liner Yang, Maosong Sun. Improved Learning of Chinese Word Embeddings with Semantic Knowledge. In Proceedings of CCL/NLP-NABD 2015, Guangzhou, China, November.[link]

2014

  1. 孙茂松,刘挺,姬东鸿,穗志方,赵军,张钹,吾守尔·斯拉木,俞士汶,朱军,李建民,刘洋,王厚峰,吐尔根·依布拉音,刘群,刘知远. 语言计算的重要国际前沿. 中文信息学报, 第28卷, 第1期, 1-8, 2014.
  2. Maosong Sun, Yang Liu, and Jun Zhao. 2014. Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data.Springer Lecture Notes in Computer, ScienceVolume 8801. ISBN: 978-3-319-12276-2.[link]
  3. Sebastian Beschke, Yang Liu, and Wolfgang Menzel. 2014. Large-Scale CCG Induction from the Groningen Meaning Bank.In Proceedings of ACL 2014 Workshop on Semantic Parsing, Baltimore, USA, June. [pdf]
  4. Peng Li, Yang Liu, Maosong Sun, Tatsuya Izuha, and Dakun Zhang. 2014. A Neural Reordering Model for Phrase-based Translation.(Oral)In Proceedings of COLING 2014,Dublin, Ireland, August.[pdf][ppt]
  5. Meiping Dong, Yong Cheng, Yang Liu, Jia Xu, Maosong Sun, Tatsuya Izuha, and Jie Hao. 2014. Query Lattice for Translation Retrieval.In Proceedings of COLING 2014,Dublin, Ireland, August.[pdf]
  6. Xinxiong Chen, Zhiyuan Liu, Maosong Sun. A Unified Model for Word Sense Representation and Disambiguation.Proc. of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP 2014),Doha, Qatar, 2014, pp. 1025–1035.[pdf]
  7. Xinxiong Chen, Zhiyuan Liu, Maosong Sun. Estimating Translation Probabilities for Social Tag Suggestion.Expert System with Applications (2014),DOI: 10.1016/j.eswa.2014.10.002.[link]
  8. Li Li, Maosong Sun and Zhiyuan Liu. Discriminating Gender on Chinese Microblog:A Study of Online Behaviour, Writing Style and Preferred Vocabulary.The 2014 10th International Conference on Natural Computation (ICNC 2014),Xiamen, China, 2014.[pdf]
  9. Cunchao Tu, Zhiyuan Liu, and Maosong Sun. 2014. Inferring Correspondences from Multiple Sources for Microblog User Tags.The 3rd Conference on Social Media Processing(SMP 2014), Beijing, China, 2014. Springer Berlin Heidelberg, 2014: 1-12.[pdf]
  10. Chong Kuang, Zhiyuan Liu, Maosong Sun, Feng Yu, Pengfei Ma. Quantifying Chinese Happiness via Large-Scale Microblogging Data. The 11th Web Information System and Application Conference (WISA'14).
  11. 匡冲,刘知远,孙茂松. 微博转发者的个性化排序. 山东大学学报 (理学版), 第49卷, 第11期, 31-36, 2014. [pdf]
  12. 孙茂松,李莉,刘知远. 面向中英平行专利语料的双语术语自动抽取.清华大学学报(自然科学版), 2014年10期, 1339-1343, 2014.

2013

  1. Jiayu Tang, Zhiyuan Liu, Maosong Sun and Jiahua Liu. Portraying User Life Status from Microblogging Posts. Tsinghua Science and Technology, vol. 18, no. 2, pp. 182-195, 2013. [pdf]
  2. Can Wang, Yang Liu and Maosong Sun. Minimum Error Rate Training for Bilingual News Alignment. Proc. of the 14th Chinese Lexical Semantics Workshop (CLSW 2013), Zhengzhou, China, 2013. [pdf]
  3. Jiayu Tang, Zhiyuan Liu and Maosong Sun. Measuring and Visualizing the Interest Similarity between Microblog Users. Proc. of the 14th International Conference on Web-Age Information Management (WAIM 2013)Beidaihe, China, 2013, Lecture Notes in Computer Science, vol. 7923, pp. 478-489. [pdf]
  4. Peng Li, Yang Liu and Maosong Sun. An Extended GHKM Algorithm for Inducing Lambda-SCFG.(Oral)Proc. of the 27th AAAI Conference on Artificial Intelligence (AAAI 13), Bellevue, Washington, USA, 2013, pp. 605-611. [pdf][ppt]
  5. Yang Liu. A Shift-Reduce Parsing Algorithm for Phrase-based String-to-Dependency Translation. Proc. of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), Sofia, Bulgaria, 2013, pp. 1-10. [pdf]
  6. Yu Zhao, and Maosong Sun. Exploiting Lexicalized Statistical Patterns in Chinese Linguistic Analysis. Proc. of the 1st Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data(NLP-NABD 2013),  Suzhou, China, 2013, Lecture Notes in Computer Science, vol.8202, pp. 238-246. [pdf]
  7. Peng Li, Yang Liu and Maosong Sun. Recursive Autoencoders for ITG-based Translation.(Poster) Proc. of  the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013), Seattle, Washington, USA, 2013, pp. 567-577.[pdf]
  8. Yan Zhang, Qixia Jiang and Maosong Sun, HMeanMax: Placing HMAX and HoG into a unified framework. Proc. of the 2013 International Joint Conference on Neural Networks (IJCNN 2013), Dalla, Dexas, USA, 2013.
  9. 唐家渝, 刘知远, 孙茂松. 文本可视化研究综述. 计算机辅助设计与图形学学报, 第25卷, 第03期, 273-285, 2013. [pdf]
  10. 刘奇, 刘洋, 孙茂松. URL模式与HTML结构相结合的平行网页获取方法. 中文信息学报, 第27卷, 第03期, 91-99, 2013. [pdf]
  11. 沈世奇, 刘洋, 孙茂松. 基于对偶分解的词语对齐搜索算法. 中文信息学报, 第27卷, 第04期, 9-15, 2013. [pdf]
  12. 孙茂松. 国内一流大学计算机教学改革三个值得注意的问题. 计算机教育, 2013年, 第20期, 65-67, 2013. [pdf]
  13. 张燕, 张扬, 孙茂松. 基于中文拼音输入法数据的汉语方言词汇自动识别. 中文信息学报, 第27卷, 第05期, 22-28, 2013. [pdf]
  14. 李莉, 刘知远, 孙茂松. 基于中英平行专利语料的短语复述自动抽取研究. 中文信息学报, 第27卷, 第06期, 151-157, 2013.
  15. 孙茂松. MOOC:太阳照常升起,境界已然不同. 中国教育网络, 第09期, 34-35, 2013.
  16. 唐家渝, 孙茂松. 新媒体中的词云:内容简明表达的一种可视化形式. 中国传媒科技, 第11期, 18-19, 2013.
  17. 李鹏, 刘洋, 薛平, 孙茂松. 双语文本的词语对齐方法及装置. 申请号: 201310003841.3. 申请日期: 2013.01.06. [专利申请受理通知书]
  18. 唐家渝, 孙茂松, 刘知远. 一种文本集合相似性的可视化方法和装置. 申请号: 201310022589.0. 申请日期: 2013.1.22. [专利申请受理通知书]
  19. 刘奇, 刘洋, 孙茂松. 平行网页获取方法及装置. 申请号: 201310174218.4. 申请日期: 2013.05.10. [专利申请受理通知书]
  20. 沈世奇, 刘洋, 孙茂松. 一种词语对齐方法及装置. 申请号: 201310389092.2. 申请日期:2013.08.30.  [专利申请受理通知书]
  21. 唐家渝, 刘知远, 孙茂松. 一种图文集合的可视化方法和装置. 申请号: 201310538293.4. 申请日期: 2013.11.04. [专利申请受理通知书]
  22. 董梅平, 刘洋, 孙茂松. 双语篇章对齐标记系统[简称:BDOCAS]. 登记号:2013SR109662. [软件登记证书]
  23. 刘家骅, 沈世奇, 刘洋, 孙茂松. 双语词语对齐标记系统[简称:BWAAS]. 登记号:2013SR109701. [软件登记证书
  24. 李鹏, 刘洋, 孙茂松. 双语句子对齐标注系统[简称:BASA]. 登记号:2013SR109732. [软件登记证书]

2012

  1. Qixia Jiang, Jun Zhu, Maosong Sun and Eric P. Xing. Monte Carlo Methods for Maximum Margin Supervised Topic Models, Proc. of the 26th Annual Conference on Neural Information Processing Systems (NIPS’12).[pdf]
  2. Zhiyuan Liu, Xinxiong Chen, Maosong Sun. Mining the Interests of Chinese Microbloggers via Keyword Extraction. Frontiers of Computer Science,vol. 6, no. 1, pp. 76-87.[pdf]
  3. Zhang Kaixu, Sun Maosong, Unified Framework of Performing Chinese Word Segmentation and Part-of-Speech Tagging, China Communications, 2012 9 (3): 1-9.[pdf]
  4. Yang Feng, Yang Liu, Qun Liu, and Trevor Cohn. 2012. Left-to-Right Tree-to-String Decoding with Prediction. Proc. of EMNLP 2012, pages 1191-1200, Jeju, Korea, July.[pdf]
  5. Yan Zhang,Qixia Jiangand Maosong Sun. Particle Mixed Membership Stochastic Block Model. Proc. of the 8th International Conference on Semantics, Knowledge & Grids (SKG’12).[pdf]
  6. Zhiyuan Liu, Chen Liang, and Maosong Sun. Topical Word Trigger Model for Keyphrase Extraction, Proc. of the 24th International Conference on Computational Linguistics (COLING 2012), Mumbai, India, 2012, pp. 1715-1730. [pdf]
  7. Zhiyuan Liu, Chen Liang, and Maosong Sun. Expert Finding for Microblog Misinformation Identification, Proc. of the 24th International Conference on Computational Linguistics (COLING 2012): Posters, Mumbai, India, 2012, pp. 703-712. [pdf]
  8. Han Li, Zhiyuan Liu, and Maosong Sun. Random Walks on Context-Aware Relation Graphs for Ranking Social Tags, Proc. of the 24th International Conference on Computational Linguistics (COLING 2012): Posters, Mumbai, India, 2012, pp. 653-662. [pdf]
  9. Zhiyuan Liu, Cunchao Tu and Maosong Sun. Tag Dispatch Model with Social Network Regularization for Microblog User Tag Suggestion, Proc. of the 24th International Conference on Computational Linguistics (COLING 2012): Posters, Mumbai, India, 2012, pp. 755-764. [pdf]
  10. Peng Li, Yang Liu, and Maosong Sun. A Beam Search Algorithm for ITG Word Alignment, Proc. of the 24th International Conference on Computational Linguistics (COLING 2012): Posters, Mumbai, India, 2012, pp. 673-682. [pdf]
  11. Chunyang Liu, Qi Liu, Yang Liu and Maosong Sun. THUTR: A Translation Retrieval System, Proc. of the 24th International Conference on Computational Linguistics (COLING 2012): Demonstration Papers, Mumbai, India, 2012, pp. 321-328. [pdf]
  12. Xinyan Xiao, Deyi Xiong, Yang Liu, Qun Liu, and Shouxun Lin. Unsupervised Discriminative Induction of Synchronous Grammar for Machine Translation, Proc. of the 24th International Conference on Computational Linguistics (COLING 2012), Mumbai, India, 2012, pp. 2883-2898. [pdf]
  13. Zhaopeng Tu, Yang Liu, Yifan He, Qun Liu, Shouxun Lin and Josef van Genabith. Combining Multiple Alignments to Improve Machine Translation, Proc. of the 24th International Conference on Computational Linguistics (COLING 2012): Posters, Mumbai, India, 2012, pp. 1249-1260. [pdf]
  14. 谢丽星, 周明, 孙茂松. 基于层次结构的多策略中文微博情感分析和特征抽取. 中文信息学报, 第26卷, 第01期, 73-83, 2012. [pdf]
  15. 刘奇, 刘洋, 柳春洋, 孙茂松. 译文检索方法与装置. 申请号: 201210438968.3. 申请日期: 2012.11.06. [专利申请受理通知书]
  16. 雷升涛, 孙茂松, 刘洋, 唐家渝. 维吾尔语搜索引擎系统[简称:USES]. 登记号:2012SR063705. [软件登记证书] [软件开发者证书]

2011

  1. 孙茂松. 基于互联网自然标注资源的自然语言处理. 中文信息学报, 第25卷, 第06期, 26-32, 2011. [pdf]
  2. Qixia Jiang and Maosong Sun. Fast Query Recommendation by Search. Proc. of the 25th AAAI Conf. on Artificial Intelligence (AAAI 2011), San Francisco, USA, 2011, pp. 1192-1197.[pdf]
  3. Qixia Jiang and Maosong Sun. Semi-Supervised SimHash for Efficient Document Similarity Search. Proc. of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL 2011), Portland, Oregon, USA, 2011, pp. 93–101.[pdf]
  4. Qixia Jiang, Yan Zhang, Liner Yang and Maosong Sun. Is Simhash Achilles? Proc. of the 7th Asian Information Retrieval Society Conf. (AIRS'11), Dubai, United Arab Emirates, 2011, Lecture Notes in Computer Science, vol. 7097, pp. 61-72.[pdf]
  5. Zhongguo Li. Parsing the Internal Structure of Words: A New Paradigm for Chinese Word Segmentation. Proc. of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL 2011), Portland, Oregon, USA, 2011, pp. 1405-1414.[pdf]
  6. Kaixu Zhang and Maosong Sun. A Comparison Study of Candidate Generation for Chinese Word Segmentation. Proc. of the 7th International Conf. on Natural Language Processing and Knowledge Engineering (NLPKE 2011), Tokushima, Japan, 2011.
  7. Kaixu Zhang, Ruining Wang, Ping Xue and Maosong Sun. Extract Chinese Unknown Words from a Large-scale Corpus Using Morphological and Distributional Evidences. Prof. of the 5th International Joint Conf. on Natural Language Processing (IJCNLP 2011), Chiang Mai, Thailand, 2011, pp. 837–845.[pdf]
  8. Yabin Zheng, Chen Li, Maosong Sun. CHIME: An Efficient Error-Tolerant Chinese Pinyin Input Method. Proc. of the 22nd International Joint Conf. on Artificial Intelligence (IJCAI 2011), Barcelona, Spain, 2011, pp. 2551-2556.[pdf]
  9. Zhiyuan Liu, Xinxiong Chen, Maosong Sun. A Simple Word Trigger Method for Social Tag Suggestion. Proc. of the Conf. on Empirical Methods in Natural Language Processing (EMNLP 2011), Edinburgh, Scotland, 2011, pp. 1577–1588.[pdf]
  10. Zhiyuan Liu, Xinxiong Chen, Yabin Zheng, Maosong Sun. Automatic Keyphrase Extraction by Bridging Vocabulary Gap. Proc. of the 15th Conf. on Computational Natural Language Learning (CoNLL 2011), Portland, Oregon, USA, 2011, pp. 135–144.[pdf]
  11. Zhiyuan Liu, Yabin Zheng, Lixing Xie, Maosong Sun, Liyun Ru. User Behaviors in Related Word Retrieval and New Word Detection: A Collaborative Perspective. ACM Transactions on Asian Language Information Processing (ACM TALIP, Special Issue on Chinese Language Processing), vol. 10, no. 4, pp. 20:1-20:26, 2011.[pdf]
  12. Zhiyuan Liu, Yuzhou Zhang, Edward Y. Chang, Maosong Sun. PLDA+: Parallel Latent Dirichlet Allocation with Data Placement and Pipeline Processing. ACM Transactions on Intelligent Systems and Technology (ACM TIST, Special Issue on Large Scale Machine Learning), vol. 2, no. 3, pp. 26:1-26:18, 2011.[pdf]
  13. Yabin Zheng, Lixing Xie, Zhiyuan Liu, Maosong Sun, Liyun Ru. Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method. Proc. of the 49th Annual Meeting of the Association for Computational Linguistics and Human Language Technologies (ACL-HLT 2011), Portland, Oregon, USA, 2011, pp. 485–490.[pdf]
  14. Lixing Xie, Yabin Zheng, Zhiyuan Liu, Maosong Sun, Canhui Wang. Extracting Chinese Abbreviation-definition Pairs from Anchor Texts. Proc. of the 10th International Conf. on Machine Learning and Cybernetics (ICMLC 2011), Guilin, Guangxi, China, 2011, pp. 1485 - 1491.[pdf]
  15. Zhiyuan Liu, Maosong Sun. Can Prior Knowledge Help Graph-based Methods for Keyword Extraction? Accepted by Frontiers of Electrical and Electronic Engineering in China.
  16. 李鹏, 孙茂松, 薛平. 双语文本的对齐方法及装置. 申请号: 200910093061.6. 申请日期: 2009.09.23. 公开日期: 2010.03.10. 公开号: CN101667177. 授权号:ZL200910093061.6. [专利证书]

2010

  1. Xiance Si, Zhiyuan Liu and Maosong Sun, Modeling Social Annotations via Latent Reason Identification. IEEE Intelligent Systems, vol. 25, no. 6, pp. 42-49, 2010.[pdf]
  2. Batuer Aisha. A Novel Method for Uyghur Tokenizatin and Morpheme Analysis, Accepted by the International Conference on Asian Language Processing 2010 (IALP 2010)
  3. Kaixu Zhang, Maosong Sun and Ping Xue. A Local Generative Model for Chinese Word Segmentation, Proc. 6th Asia Information Retrieval Society Conf. (AIRS 2010), Taipei, 2010, Lecture Notes in Computer Science, vol. 6458, pp. 420-431.[pdf]
  4. Zhiyuan Liu, Chuan Shi, Maosong Sun. FolkDiffusion: A Graph-based Tag Suggestion Method for Folksonomies. Proc. 6th Asia Information Retrieval Society Conf. (AIRS 2010), Taipei, 2010, Lecture Notes in Computer Science, vol. 6458, pp. 231-240.[pdf]
  5. Zhiyuan Liu, Maosong Sun. Domain-Specific Term Rankings Using Topic Models. Proc. 6th Asia Information Retrieval Society Conf. (AIRS 2010), Taipei, 2010, Lecture Notes in Computer Science, vol. 6458, pp. 454-465.[pdf]
  6. Batuer Aisha, Maosong Sun. Uyghur-Chinese Statistical Machine Translation by Incorporating Morphological Information, Journal of Computational Information Systems, vol. 6, no. 10, pp. 3137-3145, 2010.[pdf]
  7. Xiance Si, Edward Y. Chang, Zoltan Gyongyi and Maosong Sun, Confucius and its Intelligent Disciples: Integrating Social with Search. Proc. of the 36th International Conference on Very Large Data Bases (VLDB 2010), Singapore, 2010, pp 1505-1517.[pdf]
  8. Xiance Si, Zhiyuan Liu and Maosong Sun. Explore the Structure of Social Tags by Subsumption Relations. Proc. of the 23rd International Conference on Computational Linguistics (COLING 2010), Beijing, China, 2010, pp. 1011-1019.[pdf]
  9. Xiance Si and Maosong Sun. Tag Allocation Model: Modeling Noisy Social Annotations by Reason Finding, Proc. of the 2010 IEEE/ACM/WIC Conferences on Web Intelligence (WI-IAT 2010), Toronto, Canada, 2010.
  10. Xiance Si and Maosong Sun. Exploring the Concept Levels of Social Tags in Chinese Blogs, Proc. of the 11th Chinese Lexical Semantics Workshop (CLSW 2010), Suzhou, China, 2010
  11. Zhiyuan Liu, Wenyi Huang, Yabin Zheng, Maosong Sun. Automatic Keyphrase Extraction via Topic Decomposition. Proc. of Conf. on Empirical Methods in Natural Language Processing (EMNLP 2010), Massachusetts, USA, 2010, pp. 366-376.[pdf]
  12. Yabin Zheng, Zhiyuan Liu and Lixing Xie. Growing Related Words from Seed via User Behaviors: A Re-ranking Based Approach. Proc. of the ACL 2010 Student Research Workshop (ACL 2010), Uppsala, Sweden, 2010, pp. 49-54.[pdf]
  13. Yan Zhang, Maosong Sun and Yang Zhang. Chinese New Word Detection from Query Logs, Proc. of the Advanced Data Mining and Applications (ADMA2010), Chongqing, China, 2010.
  14. Peng Li, Maosong Sun, Ping Xue. Fast-Champollion: A Fast and Robust Sentence Alignment Algorithm. Proc. of the 23rd International Conference on Computational Linguistics (COLING 2010): Posters, Beijing, China, 2010, pp. 710-718.[pdf]
  15. 乔维, 孙茂松. 基于M3N的中文分词与命名实体识别一体化方法.清华大学学报(自然科学版), 2010年05期, 758-762, 2010.
  16. 张开旭, 孙茂松. 运用模板匹配的汉语生语料动宾关系提取.第十一届汉语词汇语义学研讨会(CLSW2010), 苏州, 中国, 397-403, 2010. [pdf]

2009

  1. Qixia Jiang, Yan Zhang and Maosong Sun. Community Detection on Weighted Networks: A Variational Bayesian Method, Proc. of the 1st Asian Conference on Machine Learning (ACML2009), Nanjing, China, 2009, Lecture Notes in Computer Science, vol. 5828, pp. 176-190. [pdf]
  2. Batuer Aisha, Maosong Sun, A Uyghur Morpheme Analysis Method based on Conditional Random Fields, International Journal of Asian Language Processing, vol. 19, no. 2, pp. 69-83, 2009. [pdf]
  3. Batuer Aisha, Maosong Sun. A Statistical Method for Uyghur Tokenization. Proc. of the 2009 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'09), Dalian, China, 2009, pp. 383-387. [pdf]
  4. Xiance Si, Zhiyuan Liu, Peng Li, Qixia Jiang, Maosong Sun. Content-based and Graph-based Tag Suggestion, Proc. of ECML/PKDD 2009 Discovery Challenge Workshop, Bled, Slovenia, 2009. pp. 243-260.
  5. Xiance Si, Maosong Sun. Disambiguating Blog Tags. Proc. of the 12th International Conference on Text, Speech and Dialogue (TSD 2009), Pilsen, Czech Republic, 2009, Lecture Notes in Computer Science, vol. 5729, pp 139-146.[pdf]
  6. Xiance Si, Maosong Sun. Tag-LDA for Scalable and Realtime Tag Recommendation, Journal of Computational Information Systems, vol. 6, no. 2, 2009. pp 1009-1016.
  7. Yabin Zheng, Zhiyuan Liu, Maosong Sun, Liyun Ru and Yang Zhang. Incorporating User Behaviors in New Word Detection. Proc. of the 21st International Joint Conference on Artificial Intelligence (IJCAI 2009), Pasadena, California, USA, 2009, pp. 2101-2106.[pdf]
  8. Yabin Zheng, Zhiyuan Liu, Shaohua Teng and Maosong Sun. Efficient Text Classification Using Term Projection. Proc. of the 5th Asia Information Retrieval Symposium on Information Retrieval Technology (AIRS 2009), Sapporo, Japan, 2009, Lecture Notes in Computer Science, vol. 5839, pp. 230-241. [pdf]
  9. Zhongguo Li, Maosong Sun. Punctuation as Implicit Annotations for Chinese Word Segmentation. Computational Linguistics, vol. 35, no. 4, pp. 505-512, 2009.[pdf]
  10. Wei Qiao and Maosong Sun. An Efficient OOV Strategy for Chinese Word Segmentation: Harnessing Web Search and Conditional Random Field Model. Proc. of the 23rd Pacific Asia Conference on Language, Information and Computation (PACLIC 2009), Hong Kong, China, 2009, pp. 454-463.
  11. Zhiyuan Liu, Peng Li, Yabin Zheng, Maosong Sun. Clustering to Find Exemplar Terms for Keyphrase Extraction. Proc. of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2009), Singapore, 2009. pp. 257-266. [pdf]
  12. Zhiyuan Liu, Yabin Zheng, Maosong Sun. Quantifying Asymmetric Semantic Relations from Query Logs by Resource Allocation. Proc. of the 13th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2009), Bangkok, Thailand, 2009, Lecture Notes in Artifical Intelligence, vol. 5476, pp. 254-265. [pdf]
  13. Wei Qiao and Maosong Sun. Chinese Word Frequency Approximation Using Multiple-type Corpora. Accepted by the International Journal of Quantitative Linguistics.
  14. Yan Liu, Yan Zhang, Maosong Sun, Wenji Li, Full-reference Quality Diagnosis for Video Summary, Proc. of the 2008 IEEE International conference on Multimedia & Expo (ICME 2008), Hannover, Germany, 2008, pp. 1489-1492. [pdf]
  15. 谢丽星, 孙茂松, 佟子健, 王灿辉. 基于用户查询日志和锚文字的汉语缩略语识别. 全国第十届计算语言学学术会议 (CNCCL-2009), 烟台, 中国, 551-556, 2009. [pdf]
  16. 张开旭,夏云庆,宇航. 基于条件随机场的古文自动断句与标点方法. 清华大学学报, 第49卷, 第10期, 1733-1736, 2009.
  17. 司宪策, 孙茂松. 一个基于Web的汉语例句自动检索系统, 辞书编纂现代化研究,上海辞书出版社, 上海,2009.
  18. 郑亚斌, 刘知远, 孙茂松, 茹立云, 张扬. 获取新词的方法和装置. 申请号: 200910083143.2. 申请日期: 2009.05.04. 公开日期: 2009.09.23. 公开号: CN101539940.
  19. 谢丽星, 孙茂松, 佟子健, 王灿辉. 汉语缩略语处理方法及装置. 申请号: 200910088377.6. 申请日期: 2009.07.02. 公开日期: 2009.12.09. 公开号: CN101599075.
  20. 李鹏, 孙茂松, 薛平. 双语文本的对齐方法及装置. 申请号: 200910093061.6. 申请日期: 2009.09.23. 公开日期: 2010.03.10. 公开号: CN101667177.
  21. 司宪策, 郑亚斌, 李景阳, 孙茂松, 谢丽星. 中英文文本自动分类系统, 软件登记号2009SRBJ0174, 2009.
  22. 张开旭, 孙茂松. 中文填字游戏辅助设计软件, 软件登记号2009SRBJ1112, 2009.

2008

  1. Maosong Sun, Dongliang Xu, Benjamin K Tsou and Huaming Lu. Disyllabic Chinese Word Extraction Based on Character Thesaurus and Semantic Constraints in Word-Formation. Proc. of the 11th International Conference on Text Speech and Dialogue (TSD 2008), Brno, Czech Republic, 2008, Lecture Notes in Computer Science, vol. 5246, pp. 141-151.[pdf]
  2. Wei Li, Maosong Sun, Multi-modal Multi-label Semantic Indexing of Images using Unlabeled Data. Proc. of the 7th International Conference on Advanced Language Processing and Web Information Technology (ALPIT 2008), Los Alamitos, CA, USA, 2008. pp: 204-209.[pdf]
  3. Wei Qiao, Maosong Sun, Wolfgang Menzel. Statistical Properties of Overlapping Ambiguities in Chinese Word Segmentation and a Strategy for Their Disambiguation, Lecture Notes in Computer Science, vol. 5246, 2008. pp. 177-186. [pdf]
  4. Yabin Zheng, Shaohua Teng, Zhiyuan Liu, Maosong Sun. Text Classification Based on Transfer Learning and Self-Training. Proc. of the 4th International Conference on Natural Computation (ICNC'08) - Volume 03, Jinan, China, 2008, pp. 363-367. [pdf]
  5. Zhiyuan Liu, Maosong Sun. Asymmetrical Query Recommendation Method Based on Bipartite Network Resource Allocation. Proc. of the 17th International World Wide Web Conference (WWW 2008), Beijing, China, 2008. pp. 1049-1050. [pdf]
  6. Yang Liu, Yan Liu, Yan Zhang, The Hong Kong Polytechnic University at TRECVID 2007 BBC rushes summarization, Proc. of the International Workshop on TRECVID Video Summarization, Augsburg, Germany, 2007, pp 50-54. [pdf]
  7. Yabin Zheng, Shaohua Teng, Zhiyuan Liu and Maosong Sun. Text Classification Based on Transfer Learning and Self-Training. Proc. of the 2008 Fourth International Conference on Natural Computation (ICNC 2008), Jinan, Shandong, China, 2008, pp. 363-367.[pdf]
  8. 乔维, 孙茂松. 汉语交集型歧义切分字段关于专业领域的统计特性. 中文信息学报, 第22卷, 第4期, 10-18, 2008. [pdf]
  9. 张开旭, 孙茂松. 统计与规则结合的古文对联应对模型.中文信息学报, 第23卷, 第1期, 100-105, 2008.[pdf]
  10. 刘知远, 郑亚斌, 孙茂松. 汉语依存句法网络的复杂网络性质. 复杂系统与复杂性科学, 第5卷, 第2期, 37-45, 2008. [pdf]
  11. 郑亚斌, 曹嘉伟, 刘知远. 基于最大匹配和马尔科夫模型的对联系统. 第四届全国学生计算语言学研讨会, 山西省太原市, 中国, 2008, pp. 452-458. [pdf]

2007

  1. Wei Li, Maosong Sun, Christopher Habel. Multi-modal Multi-label Semantic Indexing of Images Based on Hybrid Ensemble Learning. Proc. of the 8th Pacific Rim Conference on Multimedia (PCM 2007), Hong Kong, China, 2007. Lecture Notes in Computer Science, vol. 4810, pp. 744-754. [pdf]
  2. Jingyang Li, Maosong Sun. Exploiting Category Information and Document Information to Improve Term Weighting for Text Categorization. Proc. of the 8th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2007), Mexico City, Mexico, 2007. Lecture Notes in Computer Science, vol. 4394, pp. 587-598. [pdf]
  3. Jun Li, Maosong Sun. Experimental Study on Sentiment Classification of Chinese Review using Machine Learning Techniques. Proc. of 2007 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE 2007), Beijing, China, 2007, pp. 393-400.[pdf]
  4. Jingyang Li, Maosong Sun. Scalable Term Selection for Text Categorization. Proc. of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), Prague, Czech Republic, 2007, pp. 774-782. [pdf]
  5. 刘知远,孙茂松. 汉语词同现网络的小世界效应和无标度特性. 中文信息学报, 第21卷, 第6期, 52-58, 2007. [pdf]
  6. 郑亚斌,刘知远,孙茂松. 中文歌词的统计特征及其检索应用. 中文信息学报, 第21卷, 第5期, 61-67, 2007. [pdf]
  7. 陈涛,孙茂松. 基于SOM的语义词典自动构建实验研究. 情报学报, 第26卷,第1期,77-83, 2007. [pdf]
  8. 刘知远,司宪策,郑亚斌,孙茂松. 中文博客标签的若干统计性质. 第七届中文处理国际会议, 湖北省武汉市, 中国, 2007, pp. 533-539.
  9. 刘知远,孙茂松. 基于WEB的计算机领域新术语的自动检测. 第九届全国计算语言学学术会议, 辽宁省大连市, 中国, 2007, pp. 515-521.

2006

  1. Wei Li, Maosong Sun. Automatic Image Annotation Based on WordNet and Hierarchical Ensembles. Proc. of the 7th Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2006), Mexico City, Mexico, 2006. Lecture Notes in Computer Science, vol. 3878, pp. 417-428. [pdf]
  2. Maosong Sun, Zhengcao Zhang, Benjamin K Tsou, Huaming Lu. Word Frequency Approximation for Chinese without Using Manually-annotated Corpus. Proc. of the 7th Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2006), Mexico City, Mexico, 2006. Lecture Notes in Computer Science, vol. 3878, pp. 105-116. [pdf]
  3. Jingyang Li, Maosong Sun, Xian Zhang. A Comparison and Semi-Quantitative Analysis of Words and Character-Bigrams as Features in Chinese Text Categorization. Proc. of the 2006 Joint Conference of the International Committee on Computational Linguistics and the Association for Computational Linguistics (COLING-ACL 2006), Sydney, Australia, 2006, pp. 545-552. [pdf]
  4. Wei Li, Maosong Sun. Semi-supervised Learning for Image Annotation Based on Conditional Random Fields. Proc. of the 5th International Conference on Image and Video Retrieval (CIVR 2006), Tempe, AZ, USA, 2006, Lecture Notes in Computer Science, vol. 4071, pp. 463-472. [pdf]
  5. Wei Li, Maosong Sun. Incorporating Prior Knowledge into Multi-label Boosting for Cross-Modal Image Annotation and Retrieval. Proc. of the Third Asia Information Retrieval Symposium (AIRS 2006), Singapore, 2006, Lecture Notes in Computer Science, vol. 4182, pp. 404-415. [pdf]
  6. Wei Qiao, Maosong Sun. Word Frequency Approximation for Chinese Using Raw, MM-segmented and Manually Segmented Corpora. Proc. of the 21st International Conference on the Computer Processing of Oriental Languages (ICCPOL 2006), Singapore, 2006, Lecture Notes in Artificial Intelligence, vol. 4285, pp. 256-267. [pdf]
  7. Maosong Sun, Dongliang Xu, Benjamin K. Tsou, Huaming Lu. An Integrated Approach to Chinese Word Segmentation and Part-of-Speech Tagging.Proc. of the 21st International Conference on the Computer Processing of Oriental Languages (ICCPOL 2006), Singapore, 2006, Lecture Notes in Artificial Intelligence, vol. 4285, pp. 299-309. [pdf]
  8. Emile Kroeger, Maosong Sun. A Chinese Character Lookup System Based on Morphological Similarity. Research and Application of Digitized Chinese Teaching and Learning, 2005, pp. 354-362.
  9. Xinghua Fan, Maosong Sun. Knowledge Representation and Reasoning Based on Entity and Relation Propagation Diagram/Tree. Intelligent Data Analysis - An International Journal, vol. 10, no. 1, pp. 81-102, 2006. [pdf]
  10. 樊兴华, 孙茂松. 一种高性能的两类中文文本分类方法. 计算机学报, 第29卷,第1期, 124-131, 2006. [pdf]

2005

  1. Maosong Sun, Shengfen Luo, Benjamin K Tsou. Word Extraction Based on Semantic Constraints in Chinese Word-formation. Proc. of the 6th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2005), Mexico City, Mexico, 2005, Lecture Notes in Computer Science, vol. 3406, pp. 202-213. [pdf]
  2. Hongtao Wang, Maosong Sun, Shaoming Liu. Merging Case Relations in VSM to Improve Information Retrieval Precision. Proc. of the 6th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2005), Mexico City, Mexico, 2005, Lecture Notes in Computer Science, vol. 3406, pp. 584-592. [pdf]
  3. Xinghua Fan, Maosong Sun, Key-sun Choi, Qin Zhang. Classifying Chinese Texts in Two Steps. Proc. of the 2nd International Joint Conference on Natural Language Processing (IJCNLP 2005), Jeju Island, Republic of Korea, 2005, Lecture Notes in Artificial Intelligence, vol. 3651, pp. 302-313. [pdf]
  4. Xinghua Fan, Maosong Sun. A Method of Recognizing Entity and Relation. Proc. of the 2nd International Joint Conference on Natural Language Processing (IJCNLP 2005), Jeju Island, Republic of Korea, 2005, Lecture Notes in Artificial Intelligence, vol. 3651, pp. 245-256. [pdf]
  5. Wei Li, Maosong Sun. Automatic Image Annotation Using Maximum Entropy Model. Proc. of the 2nd International Joint Conference on Natural Language Processing (IJCNLP 2005), Jeju Island, Republic of Korea, 2005, Lecture Notes in Artificial Intelligence, vol. 3651, pp. 34-45. [pdf]
  6. Fan Sun, Maosong Sun. A New Transductive Support Vector Machine Approach to Text Categorization. Proc. of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE 2005), Wuhan, China, 2005, pp. 631-635. [pdf]
  7. Emile Kroeger, Maosong Sun. Sentence Difficulty Evaluation for a Learner's Dictionary. Proc. of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE 2005), Wuhan, China, 2005, pp. 678-682. [pdf]
  8. Zhengcao Zhang, Maosong Sun, Shaoming Liu. Automatic Content Based Title Extraction for Chinese Documents using Support Vector Machine.Proc. of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE 2005), Wuhan, China, 2005, pp. 553-558. [pdf]
  9. Fan Sun, Maosong Sun. Transductive Support Vector Machines Using Simulated Annealing. Proc. of the 2005 International Conference of Computational Intelligence and Security (CIS 2005), Xian, China, 2005, pp. 536-543. [pdf]
  10. Tao Chen, Maosong Sun, Huaming Lu. Automated Construction of Chinese Thesaurus Based on Self-Organizing Map. Proc. of the 7th International Conference on Terminology and Knowledge Engineering (TKE2005), Copenhagen, Denmark, 2005.
  11. 亢世勇, 徐艳华, 孙茂松, 许小星. 基于语料库的现代汉语新词语构词法统计研究. Proc. of 2005 International Conference on Chinese Computing, Singapore, 2005. [pdf]
  12. 孙茂松. 语言计算:信息科学技术中长期发展的战略制高点. 语言文字应用, 第3期, 38-40, 2005. [pdf]
  13. 谢永芳, 孙茂松. 《现汉》ABCD与AB(CD)同时出条初步考察. 辞书研究, 第3期, 118-127, 2005.
  14. 孙道功, 亢世勇, 孙茂松. 基于标注语料库的现代汉语单句句型句模的对应关系. 自然语言理解与大规模内容计算, 清华大学出版社, 北京, 234-240, 2005.
  15. 李毅, 亢世勇, 孙茂松 ,孙道功. 基于奥运语料的语义成分标注规范. 自然语言理解与大规模内容计算, 清华大学出版社, 北京, 633-635, 2005.
  16. 宋兰, 孙茂松. 一种高性能的两类中文文本分类方法. 自然语言理解与大规模内容计算, 清华大学出版社, 北京, 514-520, 2005.

2004

  1. Dejun Xue, Maosong Sun. Eliminating High-degree Biased Character Bigrams for Dimensionality Reduction in Chinese Text Categorization. Proc. of the 26th European Conference on IR Research (ECIR 2004), Sunderland, UK, 2004, Lecture Notes in Computer Science, vol. 2997, pp. 197-208. [pdf]
  2. Dejun Xue, Maosong Sun. Raising High-Degree Overlapped Character Bigrams into Trigrams for Dimensionality Reduction in Chinese Text Categorization. Proc. of the 5th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2004), CAU, Seoul, Korea, 2004, Lecture Notes in Computer Science, vol. 2945, pp. 584-595. [pdf]
  3. Xinghua Fan, Maosong Sun. A Reasoning Algorithm of Applying Causality Diagram to Fault Diagnosis of Complex Hybrid Systems. Proc. of the 5th World Congress on Intelligent Control and Automation (WCICA 2004), Hangzhou, China, 2004, pp. 1741-1745. [pdf]
  4. Dejun Xue, Maosong Sun. Select Strong Information Features to Improve Text Categorization Effectiveness. Journal of Intelligent Systems, vol. 13, no. 4, 270-290, 2004.
  5. Maosong Sun, Dejun Xue. A Prototype System for Chinese Text Categorization. Proc. of the 5th China-Korea Joint Symposium on Oriental Language Processing and Pattern Recognition, Qingdao, China, 2004, pp. 85-90.
  6. Shiyong Kang, Maosong Sun. The Research on Chinese Semantic Word-formation based on a Semantically Annotated Lexicon. Proc. of the 5th Chinese Lexical Semantics Workshop (CLSW-5), Singapore, 2004, pp. 120-127.
  7. 孙茂松. 中文信息处理发展战略之我见. 21世纪的中国语言学(一), 商务印书馆, 北京, 244-249, 2004.
  8. 苏新春, 孙茂松. 常用双音释词词量及提取方法. 词汇学理论与应用(二), 商务印书馆, 北京, 65-79, 2004.
  9. 孙茂松, 肖明, 邹嘉彦. 基于无指导学习策略的无词表条件下汉语自动分词. 计算机学报, 第27卷,第6期, 736-742, 2004. [pdf]

2003

  1. Dejun Xue, Maosong Sun. Chinese Text Categorization Based on the Binary Weighting Model with Non-Binary Smoothing. Proc. of the 25th European Conference on IR Research (ECIR 2003), Pisa, Italy, 2003, Lecture Notes in Computer Science, vol. 2633, pp. 548-559. [pdf]
  2. Dejun Xue, Maosong Sun. A Study on Feature Weighting in Chinese Text Categorization. Proc. of the 4th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2003), Mexico City, Mexico, 2003, Lecture Notes in Computer Science, vol. 2588, pp. 592-601. [pdf]
  3. Maosong Sun. LFG for Chinese: Issues of Representation and Computation. Journal of Chinese Linguistics, vol. 19, pp. 129-151, 2003.
  4. Maosong Sun, Dongliang Xu, Benjamin K Tsou. Integrated Chinese Word Segmentation and Part-of-speech Tagging Based on the Divide-and-Conquer Strategy. Proc. of 2003 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE 2003), Beijing, China, pp. 610-615, 2003. [pdf]
  5. Shiyong Kang, Maosong Sun. Corpus-Based Study on Semantic Structure Patterns of Sentences in Contemporary Chinese. Proc. of 20th International Conference on Computer Processing of Oriental Languages (ICCPOL '03), Shenyang, China, 2003.
  6. Tao Chen, Maosong Sun. Automatic Clustering of Chinese Nouns Based on SOM. Proc. of 20th International Conference on Computer Processing of Oriental Languages (ICCPOL '03), Shenyang, China, 2003.
  7. Jun Zhao, Maosong Sun, Bo Xu. The Progress of Chinese Linguistic Data Consortium (Chinese LDC). Proc. of the 6th East Asia Forum on Terminology, 2003.
  8. Shengfen Luo, Maosong Sun. Two-Character Chinese Word Extraction Based on Hybrid of Internal and Contextual Measures. Proc. of Second SIGHAN Workshop on Chinese Language Processing, Sapporo, Japan, pp. 24-30, 2003. [pdf]
  9. Dejun Xue, Maosong Sun. Feature Selection Combining CHI and Restrained IG Weighting Measures in Chinese Text Categorization. Proc. of First Indian International Conference on Artificial Intelligence (IICAI-03), Hyderabad, India, 2003.
  10. 樊兴华, 张勤, 孙茂松, 黄席樾. 多值因果图的推理算法研究. 计算机学报, 第26卷, 第3期, 310-322, 2003. [pdf]
  11. 罗盛芬, 孙茂松. 基于字串内部结合紧密度的汉语自动抽词实验研究. 中文信息学报, 第17卷, 第3期, 9-14, 2003. [pdf]
  12. 孙茂松, 王洪君, 董秀芳. 《信息处理用现代汉语分词词表》规范. 语言计算与基于内容的文本处理, 清华大学出版社, 北京, 391-398, 2003.
  13. 孙茂松. 对统计语言模型的若干认识. 中文信息处理若干重要问题, 科学出版社, 北京, 3-13, 2003.
  14. 苏新春, 孙茂松. 常用双音释词词量及提取方法——对《现汉》双音同义释词的量化分析. 语言教学与研究, 第6期, 31, 2003.
  15. 孙茂松, 陈群秀主编. 语言计算与基于内容的文本处理. 清华大学出版社, 北京, 2003.
  16. 孙茂松, 姚天顺, 苑春法主编. Advances in Computation of Oriental Languages. 清华大学出版社, 北京, 2003.
  17. 徐波, 孙茂松, 靳光瑾主编. 中文信息处理若干重要问题. 科学出版社, 北京, 2003.

2002

  1. Maosong Sun, Shiyong Kang. A Well-formed Chinese Lexicon for Word Segmentation and POS Tagging. Proc. of International Conference on Multilingual Information Processing, 2002.
  2. Dejun Xue, Maosong Sun. Automated Text Categorization for Chinese Based on Multinominal Bayesian Model. Proc. of Digital Library: IT Opportunities and Challenges in the New Millennium, Beijing, China, 2002.
  3. Xiao Luo, Maosong Sun, Jiayan Zou. Covering Ambiguity Resolution in Chinese Word Segmentation Based on Contextual Information. Proc. of the 19th International Conference on Computational Linguistics (COLING 2002), Taipei, pp. 1-7, 2002. [pdf]
  4. Maosong Sun, Shengfen Luo, Shiyong Kang. Sense Tagging of Characters in Chinese Lexicon. Proc. of the 3rd Symposium on Chinese Lexical Semantics, 2002.
  5. Shiyong Kang, Maosong Sun. Comparative Study on Distribution of Semantic Categories of Characters and Words in Contemporary Chinese. Proc. of the 3rd Symposium on Chinese Lexical Semantics, pp. 1-7, 2002.
  6. 奚晨海,孙茂松. 基于神经元网络的汉语短语边界识别. 中文信息学报, 第16卷, 第2期, 20-26, 2002. [pdf]

2001

  1. Maosong Sun. Finding Chinese Personal Names in Unrestricted Texts. Proc. of 2001 Pacific Neighborhood Consortium Annual Conference and Joint Meetings, 2001.
  2. Maosong Sun. LFG for Chinese: Issues of Representation and Computation. Proc. of LFG'01, 2001.
  3. Xiaohua Liu, Maosong Sun. A Prototype of Chinese Search Engine Based on Word Segmentation Techniques. Proc. of 2001 IEEE International Conference on Systems, Man, and Cybernetics, Tucson, AZ , USA, 2001, vol. 4, pp. 2215-2218. [pdf]
  4. 孙茂松,邹嘉彦. 汉语自动分词研究评述. 当代语言学, 第3卷, 第1期, 22-32, 2001. [pdf]
  5. 肖云,孙茂松,邹嘉彦. 利用上下文信息解决汉语自动分词中的组合型歧义. 计算机工程与应用, 第37卷, 第9期, 87-89, 2001. [pdf]
  6. 陶跃华,孙茂松. 搜索引擎结果的评价技术. 情报科学, 第19卷, 第8期, 861-863, 2001.
  7. 陶跃华,孙茂松. 搜索引擎中相关性反馈技术. 情报理论与实践, 第24卷, 第4期, 295-297, 2001. [pdf]
  8. 陶跃华,孙茂松. 基于潜语义标引的自然语言检索. 现代图书情报技术, 第5期, 40-41, 2001.
  9. 陶跃华, 孙茂松, 王锡钢. 因特网搜索引擎评价系统. 计算机工程与科学, 第23卷, 第3期, 25-31, 2001. [pdf]
  10. 孙茂松, 王洪君, 李行健, 富丽, 黄昌宁, 陈松岑, 谢自立, 张卫国. 信息处理用现代汉语分词词表. 语言文字应用, 第4期, 84-89, 2001. [pdf]
  11. 孙茂松. 站在语言信息处理基础研究的前沿——"973"相关课题介绍. 术语标准化与信息技术, 第23卷, 第3期, 19-23, 2001. [pdf]
  12. 孙茂松. 中国数字图书馆建设中的若干关键问题. 辉煌二十周年——中国中文信息学会二十周年学术会议, 清华大学出版社, 北京, 298-302, 2001. [pdf]
  13. 亢世勇, 孙茂松, 刘海润. 《汉字义类信息库》的研究与实现. Proc. of International Conference on Chinese Computing 2001, Singapore, 2001.
  14. 刘晓华, 孙茂松. 分词策略对中文搜索引擎查全率和查准率的影响. Proc. of International Conference on Chinese Computing 2001, Singapore, 2001.

Log in