作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2023, Vol. 49 ›› Issue (10): 80-88. doi: 10.19678/j.issn.1000-3428.0066219

• 人工智能与模式识别 • 上一篇    下一篇

基于T-HDGN模型的对话摘要生成方法

高玮军, 刘健, 毛文静   

  1. 兰州理工大学 计算机与通信学院, 兰州 730050
  • 收稿日期:2022-11-10 出版日期:2023-10-15 发布日期:2023-10-10
  • 作者简介:

    高玮军(1973—),男,副教授、硕士,主研方向为高性能计算、自然语言处理、计算机视觉

    刘健, 硕士研究生

    毛文静,硕士研究生

  • 基金资助:
    国家自然科学基金(61762059)

Dialogue Summary Generation Method Based on T-HDGN Model

Weijun GAO, Jian LIU, Wenjing MAO   

  1. GAO Weijun, LIU Jian, MAO Wenjing School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China
  • Received:2022-11-10 Online:2023-10-15 Published:2023-10-10

摘要:

随着对话系统和文本摘要生成技术的发展,生成式对话摘要引起了广泛的关注。由于会话中的信息流至少在2个对话者之间交换,关键信息往往分散在各说话者的不同话语中,因此传统文本摘要模型生成的对话摘要包含冗余或者不正确的内容。针对传统文本摘要模型在生成对话摘要时对会话的上下文理解不充分且难以将说话人与其正确的行动相联系的问题,提出一种基于T-HDGN模型的对话摘要生成方法。利用抽取的行动三元组对会话结构进行显式建模,将话语和行动三元组作为2种不同类型的数据来构建异质对话图,并通过1个异质图网络对这2种信息进行建模。同时,还增加说话人作为异质节点以促进信息流的传播。此外,在解码阶段使用主题词特征辅助摘要的生成。在SAMSum数据集上的实验结果表明,所提方法在ROUGE-1、ROUGE-2、ROUGE-L评价指标上分别达到42.05%、18.09%、39.48%,相比Longest-3、PGN、Fast Abs RL等基线模型,能有效地融合信息并且准确地将说话人与其对应动作相关联。

关键词: 对话摘要, 异质图, 行动三元组, 主题词, 异质图网络

Abstract:

With the development of dialogue systems and text summary generation technology, generative dialogue summarization has attracted widespread attention. Because the information flow in a conversation is exchanged between at least two interlocutors, key information is often scattered across different discourses of each speaker. Therefore, the dialogue summary generated by traditional text summarization models contains redundant or incorrect content. To address the issue of insufficient understanding of the conversation context and difficulty in linking the speaker with their correct actions in traditional text summarization models, this study proposes a T-HDGN model-based method for generating dialogue summary. The conversation structure is explicitly modeled using extracted action triplets, a heterogeneous dialogue graph is contrasted using discourse and action triplets as two different types of data, and these two types of information are modeled through the T-HDGN. In addition, speakers are added as heterogeneous nodes to promote the dissemination of information flow. In addition, theme word features are used to assist in the generation of abstracts during the decoding phase. Experimental results on the SAMSum dataset show that the proposed method achieves 42.05%, 18.09%, and 39.48% of the ROUGE-1、ROUGE-2、ROUGE-L evaluation indicators. Compared with the baseline models, such as Longest-3, PGN, and Fast Abs RL, it can effectively fuse information and accurately associate the speaker with their corresponding actions.

Key words: dialogue summary, heterogeneous graph, action triplet, topic word, heterogeneous graph network