参考文献
[1]林政,吕雅娟,刘群,等.Web平行语料挖掘及其在机器翻译中的应用[J].中文信息学报,2010,24(5):85-91.
[2]Shi Lixin,Nie Jianyun.Filtering or Adapting:Two Strategies to Exploit Noisy Parallel Corpora for Cross-language Information Retrieval[C]//Proceedings of the 15th ACM International Conference on Information and Knowledge
Management.Arlington,USA:ICST Press,2006:216-222.
[3]Yee S C,Hwee T N.Scaling up Word Sense Dis-ambiguation via Parallel Texts[C]//Proceedings of the 20th National Conference on Artificial Intelligence and the 17th Innovative Applications of Artificial Intelligence
Conference.Pittsburgh,USA:[s.n.],2005:125-134.
[4]Kit C,Yee J,Ng H.An Intelligent Web Agent to Mine Bilingual Parallel Pages via Automatic Discovery of URL Pairing Patterns[C]//Proceedings of Web Intelligence and Intelligent Agent Technology Work-shops.Silicon Valley,USA:IEEE
Press,2007:236-246.
[5]Ye Shani,Lv Yajuan,Huang Yun,et al.Automatic Parallel Sentences Extracting from Web[J].Journal of Chinese Information Processing,2008,22(5):67-73.
[6]任翔,刘滨.基于超链接分析的网页正文提取算法[J].泰山学院学报,2010,32(3):180-187.
[7]Zhang Ying,Wu Ke,Gao Jianfeng,et al.Automatic Acquisition of Chinese-English Parallel Corpus from the Web[C]//Proceedings of the 28th European Conference on Information Retrieval.Berlin,Germany:Springer-Verlag,2006:457-465.
[8]Shi Lei,Niu Cheng,Zhou Ming,et al.A Dom Tree Alignment Model for Mining Parallel Data from the Web[C]//Proceedings of the 21st International Con-ference on Computional Linguistics and 44th Annual Meeting of the
ACL.Stroudsburg,USA:Association for Computional Linguistics,2006:251-263.
[9]Chen Jisong,Chau Rowena,Yeh Chung-hsing.Discovering Parallel Text from the World Wide Web[C]//Proceedings of the 2nd Workshop on Australasian Information Security,Data Mining and Web Intel-ligence,and Software
Internationalisation.Darlinghurst,Australia:Australian Computer Society,Inc.,2004:326-335.
[10]刘奇,刘洋,孙茂松.URL模式与HTML结构相结合的平行网页获取方法[J].中文信息学报,2013,27(3):91-99.
[11]Chen Jiang,Nie Jianyun.Automatic Construction of Parallel English-Chinese Corpus for Cross-language Information Retrieval[C]//Proceedings of the 6th Conference on Applied Natural Language Processing.Stroudsburg,USA:Association
for Computational Linguistics,2000:621-635.
[12]Resnik P,Smith N A.The Web as a Parallel Corpus[J].Computational Linguistics,2003,29(3):349-380.
[13]Zhang Ying,Wu Ke,Gao Jianfeng,et al.Automatic Acquisition of Chinese-English Parallel Corpus from the Web[C]//Proceedings of the 28th European Conference on Information Retrieval ACL.London,UK:Springer,2006:126-135.
[14]Ma Xiaoyi,Liberman M Y.Bits:A Method for Bilingual Text Search over the Web[C]//Proceedings of Machine Translation Summit VII. Singapore:[s.n.],1999:111-124.
[15]Jiang Long,Shi Quanyang,Zhou Ming,et al.Mining Bilingual Data from the Web with Adaptively Learnt Patterns[C]//Proceedings of the 47th Annual Meeting of the Association for Computa-tional Linguistics and the 4th International
Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing.Stroudsburg,USA:Association for Com-putional Linguistics,2009:870-878
[16]Resnik P.Semantic Similarity in a Taxonomy:An Information-aased Measure and Its Application to Problems of Ambiguity in Natural Language[J].Journal of Artificial Intelligence Research,1999,11(1):95-130.
[17]Lin Dekang.An Information-theoretic Definition of Similarity[C]//Proceedings of the 15th International Conference on Machine Learning.San Francisco,USA:Morgan Kaufmann Publishers Inc.,1998:465-478.
编辑索书志 |