| 1 |
叶广大, 高鲁, 曹腾. 基于多智能体系统的合成部队弹药保障模式选择. 指挥控制与仿真, 2025, 47(5): 84- 95.
|
|
YE G D, GAO L, CAO T. Selection of ammunition support modes for combined forces based on multi-agent system. Command Control & Simulation, 2025, 47(5): 84- 95.
|
| 2 |
GRONAUER S, DIEPOLD K. Multi-agent deep reinforcement learning: a survey. Artificial Intelligence Review, 2022, 55(2): 895- 943.
doi: 10.1007/s10462-021-09996-w
|
| 3 |
黄昌勤, 钟益华, 王希哲, 等. 从单智能体到多智能体: 大模型智能体支持下的激励型学习活动设计与实证研究. 华东师范大学学报(教育科学版), 2025, 43(5): 44- 56.
|
|
HUANG C X, ZHONG Y H, WANG X Z, et al. From single agent to multi-agent: design and empirical study of motivational learning activities supported by large-scale intelligent agents. Journal of East China Normal University (Educational Sciences), 2025, 43(5): 44- 56.
|
| 4 |
|
| 5 |
WANG S, ZHANG G, YU M, et al. G-safeguard: a topology-guided security lens and treatment on LLM-based multi-agent systems[EB/OL]. [2025-02-16]. https://arxiv.org/abs/2502.11127.
|
| 6 |
|
| 7 |
|
| 8 |
董之南, 张勤学, 胡进, 等. 面向大模型多智能体系统的多维评估方法. 指挥控制与仿真, 2025, 47(2): 121- 131.
|
|
DONG Z N, ZHANG Q X, HU J, et al. A multi-dimensional evaluation method for large language model-powered multi-agent systems. Command Control & Simulation, 2025, 47(2): 121- 131.
|
| 9 |
LI X Y, WANG S, ZENG S Q, et al. A survey on LLM-based multi-agent systems: workflow, infrastructure, and challenges. Vicinagearth, 2024, 1(1): 9.
doi: 10.1007/s44336-024-00009-2
|
| 10 |
HUANG J T, ZHOU J, JIN T, et al. On the resilience of LLM-based multi-agent collaboration with faulty agents[EB/OL]. [2025-02-16]. https://arxiv.org/abs/2408.00989.
|
| 11 |
|
| 12 |
SUNG Y Y, KIM H, ZHANG D. VeriLA: a human-centered evaluation framework for interpretable verification of LLM agent failures[EB/OL]. [2025-02-16]. https://arxiv.org/abs/2503.12651.
|
| 13 |
EPPERSON W, BANSAL G, DIBIA V C, et al. Interactive debugging and steering of multi-agent AI systems[C]//Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems. New York, USA: ACM, 2025: 1-15.
|
| 14 |
LI G, HAMMOUD H, ITANI H, et al. CAMEL: communicative agents for "mind" exploration of large language model society. Advances in Neural Information Processing Systems, 2023, 36, 51991- 52008.
|
| 15 |
FOURNEY A, BANSAL G, MOZANNAR H, et al. Magentic-One: a generalist multi-agent system for solving complex tasks[EB/OL]. [2025-02-16]. https://arxiv.org/abs/2411.04468.
|
| 16 |
|
| 17 |
|
| 18 |
|
| 19 |
LI Q, CUI L, ZHAO X, et al. GSM-plus: a comprehensive benchmark for evaluating the robustness of LLMs as mathematical problem solvers[EB/OL]. [2025-02-16]. https://arxiv.org/abs/2402.19255.
|
| 20 |
TRIVEDI H, KHOT T, HARTMANN M, et al. AppWorld: a controllable world of apps and people for benchmarking interactive coding agents[EB/OL]. [2025-02-16]. https://arxiv.org/abs/2407.18901.
|
| 21 |
LI Y, XU J, HAN L, et al. Q-star meets scalable posterior sampling: bridging theory and practice via HyperAgent[C]//Proceedings of the 41st International Conference on Machine Learning. Vienna, Austria: JMLR, 2024: 29022-29062.
|
| 22 |
JIMENEZ C E, YANG J, WETTIG A, et al. SWE-bench: can language models resolve real-world GitHub issues?[C]//Proceedings of the 12th International Conference on Learning Representations. Vienna, Austria: ICLR, 2024: 1-14.
|
| 23 |
|
| 24 |
MIALON G, FOURRIER C, WOLF T, et al. GAIA: a benchmark for general AI assistants[C]//Proceedings of the 12th International Conference on Learning Representations. Vienna, Austria: ICLR, 2024: 1-15.
|
| 25 |
HONG S, ZHUGE M, CHEN J, et al. MetaGPT: meta programming for a multi-agent collaborative framework[C]// Proceedings of International Conference on Learning Representations (ICLR). New York, USA: ICLR, 2024: 1-10.
|
| 26 |
|
| 27 |
HENDRYCKS D, BURNS C, BASART S, et al. Measuring massive multitask language understanding[C]//Proceedings of International Conference on Learning Representations. Vienna, Austria: ICLR, 2021: 1-10.
|
| 28 |
|
| 29 |
LUNE H, BERG B L. Qualitative research methods for the social sciences. Boston, USA: Pearson, 2017.
|
| 30 |
KOHEN J. A coefficient of agreement for nominal scale. Educational and Psychological Measurement, 1960, 20, 37- 46.
doi: 10.1177/001316446002000104
|