Textual Entailment Recognition Fused with Syntactic Structure Transformation and Lexical Semantic Features

doi:10.3969/j.issn.1000-3428.2015.09.037

Computer Engineering

Previous Articles Next Articles

Textual Entailment Recognition Fused with Syntactic Structure Transformation and Lexical Semantic Features

ZHANG Zhichang,YAO Dongren,LIU Xia,CHEN Songyi,LU Xiaoyong

(College of Computer Science and Engineering,Northwest Normal University,Lanzhou 730070,China)

Received:2014-11-19 Online:2015-09-15 Published:2015-09-15

融合句法结构变换与词汇语义特征的文本蕴涵识别

张志昌,姚东任,刘霞,陈松毅,鲁小勇

(西北师范大学计算机科学与工程学院,兰州 730070)

作者简介:张志昌(1976-)，男，副教授、博士，主研方向：自然语言处理，数据挖掘；姚东任、刘霞、陈松毅，硕士研究生；鲁小勇，工程师。
基金资助:
国家自然科学基金资助项目(61163039,61163036，61363058)；西北师范大学青年教师科研能力提升计划基金资助项目(NWNU-LKQN-10-2,NWNU-LKQN-12-23)。

Abstract

Abstract: The traditional textual entailment recognition methods only stay at vocabulary level,not involving the influence of the syntactic and semantic aspects,and reduce the F value of the identification results.In order to solve this problem,a Chinese text recognition method is proposed which is fused with the transformation of syntactic structure and traditional lexical semantic characteristics.This method makes the text preprocessing based on syntax analysis tree transformation,adds the text contains identification features of syntactic analysis into related statistics and lexical semantic characteristics,uses the statistical machine learning methods to make entailment relationship classification of text T and assumptions text H,and gets the final recognition result through the correction processing of semantic rules.Evaluation results with NTCIR RITE3 show that compared with III&CYUT,Yamraj,etc,the method can obtain higher F value.

Key words: Chinese textual entailment, syntactic structure transformation, lexical semantic feature, lexical statistical featur, statistical machine learning

摘要： 传统文本蕴涵识别方法仅停留在词汇级的识别,无法涉及句法、语义等方面,造成识别结果的F值较低。针对该问题,提出一种将句法结构的变换和传统词汇语义特征结合的中文文本蕴涵识别方法。对文本进行基于句法分析树变换的预处理,将句法分析中适用于文本蕴涵识别的特征加入到相关的统计和词汇语义特征中,使用统计机器学习的方法对由文本片段T和假设的文本片段H组成的文本对进行蕴涵关系分类,并经过语义规则的修正处理得到最终的识别结果。在NTCIR RITE3上的评测结果表明,与III&CYUT,Yamraj等相比,该方法能获得较高的F值。

关键词: 中文文本蕴涵, 句法结构变换, 词汇语义特征, 词汇统计特征, 统计机器学习

CLC Number:

TP399

ZHANG Zhichang,YAO Dongren,LIU Xia,CHEN Songyi,LU Xiaoyong. Textual Entailment Recognition Fused with Syntactic Structure Transformation and Lexical Semantic Features[J]. Computer Engineering.

张志昌,姚东任,刘霞,陈松毅,鲁小勇. 融合句法结构变换与词汇语义特征的文本蕴涵识别[J]. 计算机工程.

/ Recommend / Download Citations

URL:

https://www.ecice06.com/EN/Y2015/V41/I9/199

References

参考文献［1］Dagan I,Glickman O.Probabilistic Textual Entailment:Generic Applied Modeling of Language Variability［C］//Proceedings of PASCAL Workshop on Learning Methods for Text Understanding and Mining.Grenoble,France:Association for Computational Linguistics,2004. ［2］袁毓林,王明华.文本蕴涵的推理模型与识别模型［J］.中文信息学报,2010,24(2):3-13. ［3］Tatu M,Moldovan D.COGEX at RTE 3［C］//Pro-ceedings of ACL-PASCAL Workshop on Textual Entail-ment and Paraphrasing.Prague,Czech Republic:Association for Computational Linguistics,2007:22-27. ［4］Harmeling S.Inferring Textual Entailment with a Pro-babilistically Sound Calculus［J］.Natural Language Engineering,2009,15(4):459-477. ［5］Bar-Haim R,Berant J,Dagan I.A Compact Forest for Scal-able Inference over Entailment and Paraphrase Rules［C］//Proceedings of Conference on Empirical Methods in Natural Language Processing.Singapore:Association for Computational Linguistics,2009:1056-1065. ［6］Malakasiotis P,Androutsopoulos I.Learning Textual Entail-ment Using SVMs and String Similarity Measures［C］//Proceedings of ACL-PASCAL Workshop on Textual Entailment and Paraphrasing.Association for Computational Linguistics.Prague,Czech Republic:Association for Computational Linguistics,2007:42-47. ［7］Maytham A,Allan R.Natural Language Inference for Ara-bic Using Extended Tree Edit Distance with Subtrees［J］.Journal of Artificial Intelligence Research,2013,48(5):1-22. ［8］吴晓锋,宗成庆.基于语义角色标注的新闻领域复述句识别方法［J］.中文信息学报,2010,24(5):3-9. ［9］Wang Xiaolin,Zhao Hai,Lu Baoliang.BCMI-NLP Labeled-alignment-based Entailment System for NTCIR-10 RITE-2 Task［C］//Proceedings of the 10th NTCIR Con-ference.Tokyo,Japan:National Institute of Informatics,2013:18-21. ［10］Galitsky B.Machine Learning of Syntactic Parse Trees for Search and Classification of Text［J］.Engineering Applica-tions of Artificial Intelligence,2013,26(3):1072-1091. ［11］田久乐,赵蔚.基于同义词林的词语相似度计算方法［J］.吉林大学学报,2010,28(6):602-608. ［12］刘茂福,李妍,姬东鸿.基于事件语义特征的中文文本蕴涵识别［J］.中文信息学报,2013,27(5):129-136. ［13］Suguru M,Yusuke M,Tomohide S,et al.Overview of the NTCIR-11 Recognizing Inference in Text and Vali-dation(RITE-VAL) Task［C］//Proceedings of the 11th NTCIR Conference.Tokyo,Japan:National Institute of Informatics,2014:9-12. 编辑刘冰

Please choose a citation manager

Content to export

Textual Entailment Recognition Fused with Syntactic Structure Transformation and Lexical Semantic Features

融合句法结构变换与词汇语义特征的文本蕴涵识别

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 1

Recommended Articles

Metrics

Comments

模态框（Modal）标题

Please choose a citation manager

Content to export

Textual Entailment Recognition Fused with Syntactic Structure Transformation and Lexical Semantic Features

融合句法结构变换与词汇语义特征的文本蕴涵识别

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 1

Recommended Articles

Metrics

Comments