作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2020, Vol. 46 ›› Issue (12): 43-51. doi: 10.19678/j.issn.1000-3428.0056056

• 人工智能与模式识别 • 上一篇    下一篇

基于双向注意力机制的多文档神经阅读理解

唐竑轩, 武恺莉, 朱朦朦, 洪宇   

  1. 苏州大学 计算机科学与技术学院, 江苏 苏州 215006
  • 收稿日期:2019-09-18 修回日期:2019-11-29 发布日期:2019-12-11
  • 作者简介:唐竑轩(1995-),男,硕士研究生,主研方向为机器阅读理解;武恺莉、朱朦朦,硕士研究生;洪宇(通信作者),教授。
  • 基金资助:
    国家自然科学基金(61672368,61672367,2017YFB1002104)。

Multi-Document Neural Reading Comprehension Based on Bi-Directional Attention Mechanism

TANG Hongxuan, WU Kaili, ZHU Mengmeng, HONG Yu   

  1. School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China
  • Received:2019-09-18 Revised:2019-11-29 Published:2019-12-11

摘要: 机器阅读理解是一项针对给定文本和特定问题自动生成或抽取相应答案的问答任务,该任务是评估计算机系统对自然语言理解程度的重要任务之一。相比于传统的阅读理解任务,多文档阅读理解需要计算模型具备更高的推理和理解能力。为此,提出一种基于多任务联合训练的阅读理解模型,该模型是由一组功能各异的神经网络构成的联合学习模型,其仿效人们推理和回答问题的基本方式分别执行文档选择和答案抽取两个关键步骤。文档选择过程融入了基于注意力矩阵的关联性判别机制,旨在建立各文档间的联系,而答案抽取过程则使用了语篇级的双向注意力机制,来找寻与答案相关的文字线索,将两者附着于一套神经阅读理解模型上,可形成一种基于联合学习的多文档阅读理解方法。在HotpotQA数据集上的实验结果表明,与基线模型相比,该模型的EM值和F1值分别提升了2.1%和1.7%。

关键词: 机器阅读理解, 多文档, 推理, 联合训练, 注意力机制

Abstract: Machine Reading Comprehension(MRC) is a question and answer task that automatically generates or extracts corresponding answers for a given text and specific questions.This task is of great significance to evaluating the understanding of computer systems for natural languages.Compared with traditional reading comprehension tasks,multi-document reading comprehension requires computation models with higher reasoning and comprehension capabilities.Therefore,this paper proposes a reading comprehension model based on multi-task joint training.The model is a joint learning model composed of a set of neural networks with different functions.It executes the two key steps,document selection and answer extraction,by imitating the basic way people reason and answer questions.The document selection process incorporates a relevance discrimination mechanism based on the attention matrix,which aims to establish the relationship between documents,while the answer extraction process uses a text-level bi-directional attention mechanism to find text clues related to the answer.The two parts are attached to a set of neural reading comprehension models to form a multi-document reading comprehension method based on joint learning.Experimental results on the HotpotQA dataset show that compared with the baseline model,the proposed model increases the EM value and F1 value by 2.1% and 1.7%,respectively.

Key words: Machine Reading Comprehension(MRC), multi-document, reasoning, joint training, attention mechanism

中图分类号: