
Computer Engineering ›› 2020, Vol. 46 ›› Issue (2): 59-64, 71. doi: 10.19678/j.issn.1000-3428.0053545

• Artificial Intelligence and Pattern Recognition •


Anaphora Resolution Algorithm Based on Multilayer Attention Mechanism

LIU Yujiang1,2, FU Lijun1,2, LIU Junming1,2, Lü Pengfei3   

  1. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China;
    2. Graduate Faculty, Shenyang Institute of Computing Technology, University of Chinese Academy of Sciences, Shenyang 110168, China;
    3. Research Center of Information Technology, National Geological Library of China, Beijing 100083, China
  • Received:2019-01-02 Revised:2019-03-04 Published:2019-03-14
  • About the authors: LIU Yujiang (1994-), male, M.S. candidate, whose main research interest is natural language processing; FU Lijun and LIU Junming, professors; Lü Pengfei, senior engineer.
  • Funding: Big Data Scientific Research Special Project of the Ministry of Land and Resources (201511079-3).


Abstract: In the process of information extraction, anaphora that cannot be resolved leads to incomplete extraction. Such anaphoric relations can be judged from the unique discriminative information generated, in the current context, by the anaphoric part, the antecedent part, the surrounding information of each, and the original content. To this end, a multilayer attention mechanism model is constructed that performs attention-based probability calculations over these information sources at different levels, and the final result is used to decide whether an anaphoric relation holds. After the anaphoric part and the antecedent part are vectorized, four probability calculations on two attention layers make each training result unique before judgment. Experimental results on the OntoNotes 5.0 dataset show that the F-score of the proposed model reaches 70.1% when both overt anaphora and zero anaphora are present and 60.7% when only zero anaphora are present, outperforming the model proposed by YIN Qingyu et al.
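The computation the abstract describes (vectorize the anaphoric and antecedent parts, run four attention-based probability calculations across two attention layers over the surrounding information and the original content, then judge the relation) can be sketched as follows. This is a minimal illustration only, not the authors' released implementation: the scaled dot-product attention, the 64-dimensional embeddings, the residual combination, and the names `attend` and `anaphora_probability` are all assumptions made for exposition.

```python
# Hypothetical sketch of the two-layer, four-calculation attention scoring
# described in the abstract; dimensions and layer wiring are assumptions.
import numpy as np

rng = np.random.default_rng(0)
d = 64  # embedding size; an assumption, not from the paper

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attend(query, context):
    """One attention-based probability calculation: softmax scores of
    `query` over the rows of `context`, returning the weighted summary."""
    scores = context @ query / np.sqrt(d)  # (n,) similarity scores
    weights = softmax(scores)              # attention probabilities, sum to 1
    return weights @ context               # (d,) context summary vector

def anaphora_probability(anaphor, antecedent, ana_ctx, ant_ctx, document):
    # Attention layer 1 (two probability calculations): each mention vector
    # attends to its own surrounding information.
    a = anaphor + attend(anaphor, ana_ctx)
    b = antecedent + attend(antecedent, ant_ctx)
    # Attention layer 2 (two more probability calculations): the enriched
    # mentions attend to the original document content, so the pair's
    # representation is unique to this context before judgment.
    a = a + attend(a, document)
    b = b + attend(b, document)
    # Final judgment: a fixed logistic scorer stands in for the trained
    # classifier that decides whether the anaphoric relation holds.
    w = rng.normal(size=d) / np.sqrt(d)
    return 1.0 / (1.0 + np.exp(-(a * b) @ w))

# Toy usage: random vectors stand in for learned span/context embeddings.
anaphor, antecedent = rng.normal(size=(2, d))
ana_ctx = rng.normal(size=(5, d))    # words around the anaphor
ant_ctx = rng.normal(size=(5, d))    # words around the candidate antecedent
document = rng.normal(size=(30, d))  # the original text content
p = anaphora_probability(anaphor, antecedent, ana_ctx, ant_ctx, document)
print(f"P(anaphoric relation holds) = {p:.3f}")
```

In the trained model the embeddings and the final classifier are learned; here random vectors merely make the sketch runnable, and the residual combination is an illustrative default rather than the paper's actual wiring.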

Key words: anaphora relations, attention mechanism, overt anaphora, zero anaphora, multilayer attention mechanism model
