
Computer Engineering ›› 2026, Vol. 52 ›› Issue (5): 336-348. doi: 10.19678/j.issn.1000-3428.0070301

• Large Models and Generative Artificial Intelligence •

Research on Collaborative Decision-Making by Large Language Models in Multi-Agent Game Environments

YU Tao*(), DONG Jun   

  1. School of Economics and Management, North China Electric Power University, Beijing 100096, China
  • Received: 2024-08-29 Revised: 2024-11-11 Online: 2026-05-15 Published: 2025-01-10
  • Contact: YU Tao
  • About the authors:

    YU Tao (CCF student member), male, Ph.D. candidate; his research interests include complex networks, artificial intelligence, and game theory

    DONG Jun, professor, Ph.D.

  • Funding:
    Science and Technology Project of the State Grid Corporation of China (1400-202256459A-2-0-ZN)



Abstract:

In multi-agent game simulations, the performance of Large Language Models (LLMs) has been widely studied; however, their ability to guide multi-agent cooperation often breaks down under ambiguous task objectives or uncertain environments. To address this issue, a multi-level collaborative decision-making framework based on distributed Bayesian inference is proposed. The framework integrates three functional modules: decision making, peer evaluation, and supervision. It employs multiple LLMs for collaborative decision making and is experimentally validated in a spatial prisoner's dilemma game. The experimental results show that the framework effectively overcomes the decision-making bottleneck of LLMs in ambiguous task environments and successfully promotes the emergence of multi-agent cooperative behavior. In addition, a quantitative evaluation of decision-making ability across experimental scenarios reveals that a model's decision error is not linearly related to its size. Under ambiguous task instructions, the decision error of the LLaMA3 (70×10⁹ parameters) model is 16.6% higher than that of the LLaMA3 (8×10⁹) model and 7.2% higher than that of the LLaMA2 (7×10⁹) model, indicating that in more complex environments, simply scaling up model size does not significantly improve decision-making performance. By contrast, collaborative decision making among LLMs shows significant advantages in improving decision consistency and effectiveness. These results reveal the crucial role of multi-model collaboration in complex decision-making environments and provide an important reference for the design of agent systems operating under uncertain tasks.
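The distributed Bayesian inference underlying the framework can be illustrated with a minimal sketch: several independent decision modules (standing in for the LLM decision, peer-evaluation, and supervision roles) each report a likelihood ratio for the action "cooperate", and the reports are fused by multiplying odds. The function name, the prior, and all numbers below are illustrative assumptions, not the paper's actual method.

```python
def bayesian_pool(prior: float, likelihood_ratios: list[float]) -> float:
    """Fuse independent evidence for 'cooperate' by multiplying odds.

    prior: prior probability of cooperation.
    likelihood_ratios: one ratio per module, >1 favors cooperation.
    Returns the posterior probability of cooperation.
    """
    odds = prior / (1.0 - prior)
    for lr in likelihood_ratios:
        odds *= lr
    return odds / (1.0 + odds)

# Three hypothetical modules (decision, peer evaluation, supervision)
# each contribute one likelihood ratio; values chosen for illustration.
posterior = bayesian_pool(prior=0.5, likelihood_ratios=[2.0, 1.5, 0.8])
print(round(posterior, 3))
```

With independent modules, this product-of-odds pooling is the standard naive-Bayes combination rule; a disagreeing module (ratio below 1) pulls the posterior back toward defection.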

Key words: Large Language Models (LLMs), multi-agent simulation, spatial prisoner's dilemma game, distributed Bayesian inference, multi-level collaborative decision-making framework
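For readers unfamiliar with the experimental setting, one round of a spatial prisoner's dilemma can be sketched as follows: agents sit on a periodic lattice, and each agent accumulates payoffs from its four von Neumann neighbors under the common weak-PD payoff matrix (R=1, P=S=0, T=b). The grid, the value of b, and the neighborhood choice are illustrative assumptions, not the paper's configuration.

```python
import numpy as np

def round_payoffs(strategies: np.ndarray, b: float = 1.3) -> np.ndarray:
    """strategies: 2D array, 1 = cooperate, 0 = defect.

    Returns each agent's total payoff against its four lattice
    neighbors (periodic boundary), with R=1, P=S=0, T=b.
    """
    payoff = np.zeros_like(strategies, dtype=float)
    for shift in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        nbr = np.roll(strategies, shift, axis=(0, 1))  # wrap-around neighbor
        # A cooperator earns R=1 per cooperating neighbor (S=0 vs defectors);
        # a defector earns T=b per cooperating neighbor (P=0 vs defectors).
        payoff += np.where(strategies == 1, nbr * 1.0, nbr * b)
    return payoff

grid = np.array([[1, 1, 0],
                 [1, 0, 0],
                 [1, 1, 1]])
print(round_payoffs(grid))
```

In evolutionary simulations, a payoff round like this is typically followed by a strategy-update step (e.g. imitating the best-scoring neighbor), which is where an LLM-driven decision policy would plug in.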