Computer Engineering, 2022, Vol. 48, Issue (1): 99-104. doi: 10.19678/j.issn.1000-3428.0059917

• Artificial Intelligence and Pattern Recognition •

Research on Multi-Hop Reading Comprehension Based on Graph Neural Network with Improved Graph Nodes

SHU Chong1, OUYANG Zhi2, DU Nisuo1,2, HE Qing2, WEI Qin2

  1. College of Computer Science and Technology, Guizhou University, Guiyang 550025, China;
    2. Guizhou Big Data Academy, Guizhou University, Guiyang 550025, China
  • Received: 2020-11-05  Revised: 2021-01-04  Published: 2021-01-08
  • About the authors: SHU Chong (born 1996), male, M.S. candidate; his main research interest is natural language processing. OUYANG Zhi, DU Nisuo, HE Qing, and WEI Qin are associate professors with Ph.D. degrees.
  • Funding:
    National Key Research and Development Program of China (2018YFB1004300); Major Science and Technology Program of the Science and Technology Department of Guizhou Province (黔科合重大专项字[2018]3002); Guizhou University Cultivation Project (黔科合平台人才[2017]5788).

Abstract: Multi-hop reading comprehension requires gathering question-related information scattered across multiple supporting documents and reasoning over it in successive hops to answer a question. The entity graphs built by existing multi-hop reading comprehension models lack key question information while containing redundant information. To address this problem, we propose a multi-hop reading comprehension model based on a graph neural network with improved graph nodes. We employ a demonstrative pronoun-based method to extract entities, and build an entity graph from the extracted entities around the question-associated entities. After the nodes of the entity graph are encoded and preprocessed, a gated Graph Convolutional Network (GCN) simulates the reasoning process to obtain reasoning information. Bidirectional attention between the reasoning information and the question information is then computed, and the answer is predicted on this basis. Experimental results on the WikiHop dataset show that the model achieves a prediction accuracy of 73.1% on the test set, and that, compared with multi-hop reading comprehension models based on graph neural networks, Recurrent Neural Networks (RNN), and attention mechanisms, it offers higher accuracy and stronger generalization performance.
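
To make the described pipeline easier to picture, below is a minimal PyTorch sketch written for this page rather than taken from the paper: it assumes pre-encoded node and question features and shows one plausible form of the gated graph-convolution update, the bidirectional attention between node states and question tokens, and scoring over candidate entities. The class names (GatedGCNLayer, BiAttention, MultiHopReader), dimensions, hop count, and the scoring head are illustrative assumptions, not the authors' implementation.

# Minimal sketch (assumptions noted above), not the authors' code.
import torch
import torch.nn as nn
import torch.nn.functional as F


class GatedGCNLayer(nn.Module):
    """One hop of message passing over the entity graph with an update gate."""

    def __init__(self, dim: int):
        super().__init__()
        self.msg = nn.Linear(dim, dim)       # transform neighbour messages
        self.gate = nn.Linear(2 * dim, dim)  # decide how much to update each node

    def forward(self, h: torch.Tensor, adj: torch.Tensor) -> torch.Tensor:
        # h: (num_nodes, dim); adj: (num_nodes, num_nodes) 0/1 adjacency matrix
        deg = adj.sum(dim=-1, keepdim=True).clamp(min=1.0)
        m = adj @ self.msg(h) / deg                     # mean of neighbour messages
        g = torch.sigmoid(self.gate(torch.cat([h, m], dim=-1)))
        return g * torch.tanh(m) + (1.0 - g) * h        # gated node update


class BiAttention(nn.Module):
    """Bidirectional attention between node states and question tokens."""

    def __init__(self, dim: int):
        super().__init__()
        self.score = nn.Linear(3 * dim, 1)

    def forward(self, nodes: torch.Tensor, question: torch.Tensor) -> torch.Tensor:
        # nodes: (N, d); question: (Q, d)
        n, q = nodes.size(0), question.size(0)
        ne = nodes.unsqueeze(1).expand(n, q, -1)
        qe = question.unsqueeze(0).expand(n, q, -1)
        s = self.score(torch.cat([ne, qe, ne * qe], dim=-1)).squeeze(-1)   # (N, Q)
        n2q = F.softmax(s, dim=-1) @ question                  # node-to-question context
        w = F.softmax(s.max(dim=-1).values, dim=0)             # question-to-node weights
        pooled = (w.unsqueeze(-1) * nodes).sum(dim=0)          # question-aware node summary
        return torch.cat([nodes, n2q, nodes * n2q, nodes * pooled], dim=-1)  # (N, 4d)


class MultiHopReader(nn.Module):
    """K gated GCN hops over the entity graph, then bi-attention and candidate scoring."""

    def __init__(self, dim: int = 128, hops: int = 3):
        super().__init__()
        self.layers = nn.ModuleList([GatedGCNLayer(dim) for _ in range(hops)])
        self.biatt = BiAttention(dim)
        self.out = nn.Linear(4 * dim, 1)   # one score per node

    def forward(self, node_feats, adj, question_feats, cand_index):
        h = node_feats
        for layer in self.layers:          # simulate multi-hop reasoning on the graph
            h = layer(h, adj)
        fused = self.biatt(h, question_feats)
        scores = self.out(fused).squeeze(-1)
        return scores[cand_index]          # restrict to candidate answer entities


if __name__ == "__main__":
    torch.manual_seed(0)
    num_nodes, q_len, dim = 6, 8, 128
    adj = (torch.rand(num_nodes, num_nodes) > 0.5).float()
    adj = ((adj + adj.t()) > 0).float()    # make the toy graph symmetric
    model = MultiHopReader(dim=dim, hops=3)
    logits = model(torch.randn(num_nodes, dim), adj,
                   torch.randn(q_len, dim), torch.tensor([0, 2, 5]))
    print(logits.shape)                    # torch.Size([3]): scores for three candidates

In this sketch the sigmoid gate decides, per node and per hop, how much of the aggregated neighbourhood message replaces the node's current state, which is one simple way to realise a gated GCN reasoning step; the bidirectional attention follows the familiar node-to-question / question-to-node pattern before a linear layer scores each candidate entity.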

Key words: multi-hop reading comprehension, entity graph, question-associated entity, Graph Convolutional Network (GCN), bidirectional attention mechanism

CLC Number: