作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2021, Vol. 47 ›› Issue (4): 48-55. doi: 10.19678/j.issn.1000-3428.0057448

• 人工智能与模式识别 • 上一篇    下一篇

融合文本分类的多任务学习摘要模型

周伟枭, 蓝雯飞   

  1. 中南民族大学 计算机科学学院, 武汉 430074
  • 收稿日期:2020-02-21 修回日期:2020-04-11 发布日期:2021-04-23
  • 作者简介:周伟枭(1997-),男,硕士研究生,主研方向为自然语言处理、文本摘要、机器翻译;蓝雯飞(通信作者),教授、博士。
  • 基金资助:
    国家自然科学基金(61772562)。

Summarization Model Using Multi-Task Learning Fused with Text Classification

ZHOU Weixiao, LAN Wenfei   

  1. School of Computer Science, South-Central University for Nationalities, Wuhan 430074, China
  • Received:2020-02-21 Revised:2020-04-11 Published:2021-04-23

摘要: 文本摘要应包含源文本中所有重要信息,传统基于编码器-解码器架构的摘要模型生成的摘要准确性较低。根据文本分类和文本摘要的相关性,提出一种多任务学习摘要模型。从文本分类辅助任务中学习抽象信息改善摘要生成质量,使用K-means聚类算法构建Cluster-2、Cluster-10和Cluster-20文本分类数据集训练分类器,并研究不同分类数据集参与训练对摘要模型的性能影响,同时利用基于统计分布的判别法全面评价摘要准确性。在CNNDM测试集上的实验结果表明,该模型在ROUGE-1、ROUGE-2和ROUGE-L指标上相比强基线模型分别提高了0.23、0.17和0.31个百分点,生成摘要的准确性更高。

关键词: 编码器-解码器架构, 文本摘要, 文本分类, 多任务学习, 聚类算法, 统计分布

Abstract: The text summary should include all the important information in the source text,but the summaries generated by traditional summarization models based on encoder-decoder architecture are not accurate.Based on the correlation between text classification and text summarization,this paper proposes a summarization model using Multi-Task Learning(MTL).The model learns abstract information from the auxiliary tasks of text classification to improve the quality of generated summaries.The K-means clustering algorithm is used to construct text classification datasets Cluster-2,Cluster-10 and Cluster-20 to train the classifier.On this basis,the impact of different classification datasets participating in the training on the performance of the summarization model is studied,and a discriminant method based on statistical distribution is proposed to reflect the accuracy of the summary. Experimental results on the CNNDM test set show that the proposed model improves the ROUGE-1,ROUGE-2 and ROUGE-L indexes by 0.23,0.17 and 0.31 percentage points compared with the strong baseline model,which demonstrates the summaries generated by this model are more accurate.

Key words: encoder-decoder architecture, text summarization, text classification, Multi-Task Learning(MTL), clustering algorithm, statistical distribution

中图分类号: