Computer Engineering ›› 2021, Vol. 47 ›› Issue (1): 44-49. doi: 10.19678/j.issn.1000-3428.0056841

• Artificial Intelligence and Pattern Recognition •

  • Author biography: LI Shibao (born 1978), male, associate professor with a master's degree; his research interests include wireless communication and natural language processing. LI He, ZHAO Qingshuai and YIN Lele are master's students; LIU Jianhang and HUANG Tingpei are associate professors with doctoral degrees.
  • Funding:
    National Natural Science Foundation of China (61972417, 61872385); Fundamental Research Funds for the Central Universities (18CX02134A, 19CX05003A-4, 18CX02137A).

Chinese Textual Entailment Recognition Fused with External Semantic Knowledge

LI Shibao, LI He, ZHAO Qingshuai, YIN Lele, LIU Jianhang, HUANG Tingpei   

  1. College of Oceanography and Space Informatics, China University of Petroleum(East China), Qingdao, Shandong 266580, China
  • Received:2019-12-09 Revised:2020-01-17 Published:2020-02-11



Abstract: Neural network-based textual entailment recognition models typically learn inference knowledge only from training data, which weakens their generalization ability. This paper proposes a Chinese Knowledge Enhanced Inference Model (CKEIM) that fuses external semantic knowledge. Word-level semantic knowledge features are extracted from the HowNet knowledge base to construct an attention weight matrix, while word similarity features and hypernym-hyponym features selected from the CiLin synonym thesaurus form a feature vector. The attention weight matrix and the feature vector are then combined with the encoded text vectors and incorporated into the training of the neural network model, enabling enhanced recognition of Chinese textual entailment. Experimental results show that, compared with the Enhanced Sequential Inference Model (ESIM), CKEIM improves recognition accuracy by 3.7%, 1.5% and 0.9% on 15%, 50% and 100% of the CNLI training set respectively, demonstrating better Chinese textual entailment recognition performance and generalization ability.
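The fusion step the abstract describes (biasing ESIM-style soft co-attention between premise and hypothesis with a knowledge-derived weight matrix) can be sketched as follows. This is a minimal illustration, not the paper's exact formulation: the function name `knowledge_enhanced_attention`, the additive fusion with weight `lam`, and the toy knowledge matrix `K` (nonzero where a word pair is related in HowNet/CiLin) are all assumptions made for the sketch.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def knowledge_enhanced_attention(premise, hypothesis, knowledge, lam=1.0):
    """Soft co-attention between an encoded premise (m x d) and hypothesis
    (n x d), biased by an external knowledge matrix (m x n) whose entries
    mark word pairs related in a knowledge base such as HowNet or CiLin."""
    scores = premise @ hypothesis.T       # (m, n) content-based alignment scores
    scores = scores + lam * knowledge     # add the knowledge-derived bias
    alpha = softmax(scores, axis=1)       # premise-to-hypothesis attention weights
    beta = softmax(scores, axis=0)        # hypothesis-to-premise attention weights
    premise_ctx = alpha @ hypothesis      # (m, d) hypothesis summary per premise word
    hypothesis_ctx = beta.T @ premise     # (n, d) premise summary per hypothesis word
    return premise_ctx, hypothesis_ctx

# Toy example: 3-word premise, 2-word hypothesis, 4-dimensional encodings.
rng = np.random.default_rng(0)
P = rng.standard_normal((3, 4))
H = rng.standard_normal((2, 4))
K = np.array([[1.0, 0.0],               # e.g. premise word 0 is a synonym of
              [0.0, 0.0],               # hypothesis word 0, premise word 2 of
              [0.0, 1.0]])              # hypothesis word 1
p_ctx, h_ctx = knowledge_enhanced_attention(P, H, K)
```

Raising an entry of the knowledge matrix pulls attention toward the corresponding word pair even when the learned encodings alone would not align them, which is how external lexical knowledge can compensate for limited training data.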

Key words: Chinese textual entailment, natural language inference, attention mechanism, Bi-directional Long Short-Term Memory (BiLSTM) network, HowNet, CiLin

CLC number: