作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2022, Vol. 48 ›› Issue (8): 98-104,112. doi: 10.19678/j.issn.1000-3428.0062103

• 人工智能与模式识别 • 上一篇    下一篇

基于位置和注意力联合表示的知识图谱问答

吴天波1, 周欣1,2, 程军军2, 朱晗1, 何小海1   

  1. 1. 四川大学 电子信息学院, 成都 610065;
    2. 中国信息安全测评中心, 北京 100085
  • 收稿日期:2021-07-16 修回日期:2021-09-15 发布日期:2021-09-23
  • 作者简介:吴天波(1996-),男,硕士研究生,主研方向为自然语言处理;周欣,博士研究生;程军军,副研究员;朱晗,硕士研究生;何小海(通信作者),教授。
  • 基金资助:
    国家自然科学基金(U1836118);四川省科技计划项目(2018HH0143);成都市重大科技应用示范项目(2019-YF09-00120-SN)。

Knowledge Graph Question-Answering Based on Joint Location and Attention Representation

WU Tianbo1, ZHOU Xin1,2, CHENG Junjun2, ZHU Han1, HE Xiaohai1   

  1. 1. College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China;
    2. China Information Technology Security Evaluation Center, Beijing 100085, China
  • Received:2021-07-16 Revised:2021-09-15 Published:2021-09-23

摘要: 知识图谱是人工智能的重要组成部分,其以结构化的方式描述客观世界中的概念、实体及关系,提供了一种更优的组织、管理和理解互联网海量信息的能力。随着深度学习技术的发展,基于表示学习的知识图谱问答方法陆续出现。利用表示学习的方法实现知识图谱问答的核心目标是将问题嵌入到与三元组相同维度的表示向量空间中,通过合适的答案预测方法来匹配问题与答案。参考复数域编码的思路,构建一种基于位置和注意力联合表示的三元组表示模型Pos-Att-complex。在三元组表示部分,将词本身的特征和位置特征联合编码,并通过解码器网络进一步挖掘深层次特征,从而对三元组进行打分。在知识图谱问答部分,将问题通过RoBERTa嵌入到与三元组向量相同维度的向量空间中,并与通过关系筛选的关系集合进行向量融合。在此基础上,通过联合表示解码器为候选答案打分,以筛选出问题的答案。实验结果表明,该模型在三元组分类和多跳问答基准数据集上均能取得良好的测试结果,准确率优于GraftNet、VRN等模型。

关键词: 表示学习, 知识图谱问答, 复数域编码, 联合表示, 向量融合

Abstract: Knowledge graph is an important part of artificial intelligence.It describes the concepts, entities, and relationships in the objective world in a structured way and provides a better ability to organize, manage, and understand the massive amount of information available on the Internet.With the development of deep learning technology, representation-learning-based knowledge graph question-answering methods have emerged.The core goal of such methods is to embed the question into the representation vector space with the same dimension as triples and match the questions and answers through appropriate answer prediction methods.Referring to the idea of complex field coding, this paper presents a triple-representation model, Pos-Att-complex, based on joint location and attention representation.In the triplet representation part, the features of the word itself and the location features are jointly encoded.The deep-seated features are further mined through the decoder network, so as to score the triplet.In the question and answer part of the knowledge graph, the question is embedded into the vector space with the same dimension as the triplet vector through RoBERTa, and the vector is fused with the relationship set filtered through the relationship.On this basis, the candidate answers are scored by the joint representation decoder to screen them out.Experimental results show that the model can achieve good test results on triple classification and multi hop question and answer benchmark datasets.Furthermore, it outperforms GraftNet, VRN, and other existing models.

Key words: representation learning, knowledge graph question-answering, complex field coding, joint representation, vector fusion

中图分类号: