
Computer Engineering ›› 2026, Vol. 52 ›› Issue (5): 326-335. doi: 10.19678/j.issn.1000-3428.0070161

• Large Models and Generative Artificial Intelligence •

  • About the authors:

    QU Jinghong, male, master's degree; his main research interests are natural language processing and question-answering reasoning.

    WANG Zhongqing (corresponding author), associate professor, Ph.D.

    ZHOU Guodong, professor, Ph.D.

Knowledge Text Generation Based on Pre-trained Model for Question and Answer

QU Jinghong, WANG Zhongqing*, ZHOU Guodong

  1. College of Computer Science and Technology, Soochow University, Suzhou 215031, Jiangsu, China
  • Received: 2024-07-22 Revised: 2024-10-09 Online: 2026-05-15 Published: 2024-11-28
  • Contact: WANG Zhongqing

Abstract:

Generative models perform well on many question-answering reasoning tasks, but substantial manual effort is usually required to match each data point with relevant knowledge text that ensures the reliability of the model's output. If a language model can be sufficiently trained to internalize a knowledge base and output reliable question-answering knowledge, the cost of supplying relevant knowledge for such tasks can be greatly reduced. In addition, generating the knowledge texts that support an answer helps reveal which knowledge the model relies on during reasoning, which is important for studying model interpretability. To this end, this paper proposes a new natural language generation task: given a question-answer pair as input, the model directly generates the relevant knowledge text, which should support the reasoning behind the given answer, thereby helping the model consolidate an internal knowledge base during training. Benchmark models are established for the new task. The results show high generation quality, confirming the task's feasibility, and the model's generation performance improves significantly when the declarative (statement) form of the question-answer pair is also included in the input. A comparison of three generative models shows that models with more parameters achieve better generation performance, likely because they encode more comprehensive knowledge. Furthermore, experiments with different input fusion methods and with varying numbers of generated knowledge statements identify the optimal task configuration. The analysis indicates that the new task is feasible and valuable for future research.

Key words: generative model, interpretability, knowledge text, open-domain question answering, attention mechanism
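The task format described in the abstract can be illustrated with a minimal sketch. All function and field names below are illustrative assumptions, not the paper's actual implementation: the model input fuses the question, the answer, and optionally the declarative (statement) form of the question-answer pair, and the generation target is the supporting knowledge text.

```python
from typing import Optional


def build_input(question: str, answer: str,
                statement: Optional[str] = None) -> str:
    """Fuse a question-answer pair (and, optionally, its declarative
    statement form) into a single model input string, mirroring the
    input-fusion setups compared in the paper's experiments."""
    parts = [f"question: {question}", f"answer: {answer}"]
    if statement is not None:
        # The abstract reports that including the statement form of the
        # QA pair significantly improves generation quality.
        parts.append(f"statement: {statement}")
    return " ".join(parts)


# A pre-trained sequence-to-sequence model would then be fine-tuned to map
# this input to the relevant knowledge text that supports the answer.
example = build_input(
    "Where is Soochow University located?",
    "Suzhou",
    statement="Soochow University is located in Suzhou.",
)
print(example)
```

Varying which fields are fused (and how many knowledge statements the model is asked to produce) corresponds to the task-configuration experiments described above.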