
Computer Engineering ›› 2024, Vol. 50 ›› Issue (2): 68-77. doi: 10.19678/j.issn.1000-3428.0067225

• Artificial Intelligence and Pattern Recognition •

Research on Low-Resource Cross-Lingual Summarization Method Based on Multi-Strategy Reinforcement Learning

Xiongbo FENG1,2,*, Yuxin HUANG1,2, Hua LAI1,2, Yumeng GAO1,2

  1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, Yunnan, China
  2. Yunnan Key Laboratory of Artificial Intelligence, Kunming University of Science and Technology, Kunming 650504, Yunnan, China
  • Received: 2023-03-22 Online: 2024-02-15 Published: 2024-02-21
  • Corresponding author: Xiongbo FENG
  • Funding:
    National Natural Science Foundation of China (U21B2027); Yunnan Provincial Major Science and Technology Project (202202AD080003); Yunnan Fundamental Research Projects (202201AT070915, 202201AT070768); Kunming University of Science and Technology "Double First-Class" Joint Special Project (202201BE070001-021)

Abstract:

Cross-Lingual Summarization (CLS) aims to generate a summary in a target language (e.g., Chinese) for a given source-language document (e.g., in Vietnamese). End-to-end CLS models achieve strong performance when trained on large-scale, high-quality labeled data, which is usually constructed by using machine translation models to translate monolingual summarization corpora into CLS corpora. However, because translation models for low-resource languages perform poorly, translation noise is introduced into the CLS corpus, degrading CLS model performance. This paper proposes a low-resource CLS method based on multi-strategy reinforcement learning. Multi-strategy reinforcement learning addresses the problem of training a CLS model on low-resource, noisy training data: the source-language summary is introduced as an additional supervision signal to mitigate the impact of the noisy translated target summary. A reinforcement reward is learned by computing the word correlation and the degree of missing words between the source-language summary and the generated target-language summary, and the CLS model is optimized under the joint constraints of the cross-entropy loss and the reinforcement reward. To verify the performance of the proposed model, a noisy Chinese-Vietnamese CLS corpus is constructed. Experimental results on the Chinese-Vietnamese and Vietnamese-Chinese CLS datasets show that the proposed model achieves significantly better ROUGE scores than the baseline models; compared with the NCLS baseline, it improves ROUGE-1 by 0.71 and 0.84, respectively, effectively weakening noise interference and improving the quality of the generated summaries.

Key words: Chinese-Vietnamese Cross-Lingual Summarization (CLS), low-resource, noisy data, noise analysis, multi-strategy reinforcement learning
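The reward and training objective described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's exact formulation: the names `word_reward`, `mixed_loss`, and the weight `lam` are hypothetical, simple token-overlap ratios stand in for the paper's word-correlation and word-missing measures, and cross-lingual token alignment between the source-language summary and the generated summary (e.g., via a bilingual dictionary) is assumed to have been applied beforehand.

```python
def word_reward(source_summary_tokens, generated_tokens):
    """Reward from word correlation minus the degree of missing words.

    Assumes both token lists are already mapped into a shared vocabulary
    (cross-lingual alignment is out of scope for this sketch).
    """
    src = set(source_summary_tokens)
    gen = set(generated_tokens)
    if not src or not gen:
        return 0.0
    overlap = len(src & gen)
    correlation = overlap / len(gen)   # how relevant the generated words are
    missing = 1.0 - overlap / len(src) # fraction of source words not covered
    return correlation - missing       # higher is better


def mixed_loss(ce_loss, reward, lam=0.7):
    # Joint objective: cross-entropy loss mixed with an RL term that
    # shrinks as the reward grows (the REINFORCE-style policy-gradient
    # surrogate is omitted; only the scalar mixing is shown).
    return lam * ce_loss + (1.0 - lam) * (1.0 - reward)
```

For example, a generated summary covering every source-summary word earns the maximum reward of 1.0, which zeroes out the RL term and leaves only the weighted cross-entropy loss.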