[1] Zhu C, Dastani M, Wang S. A survey of multi-agent deep
reinforcement learning with communication[J].
Autono-mous Agents and Multi-Agent Systems, 2024,
38(1): 4.
[2] Lowe R, Wu Y I, Tamar A, et al. Multi-agent actor-critic
for mixed cooperative-competitive environments[J].
Advances in neural information processing systems, 2017,
30
[3] Rashid T, Samvelyan M, De Witt C S, et al. Monotonic
value function factorisation for deep multi-agent
reinforcement learning[J]. Journal of Machine Learning
Research, 2020, 21(178): 1-51.
[4] 王涵, 俞扬, 姜远. 基于通信的多智能体强化学习进展
综述[J]. 中国科学: 信息科学, 2022, 52(5): 742-764.
WANG H, YU Y, JIANG Y. Review of the progress of
communication-based multi-agent re-inforcement
learning[J]. Science in China(Information Sciences) ,
2022, 52(5): 742-764. (in Chinese)
[5] 罗彪,胡天萌,周育豪,等. 多智能体强化学习控制与决策
研 究 综 述 [J/OL]. 自 动 化 学 报 , 1-30[2024-12-14].
https://doi.org/10.16383/j.aas.c240392.
LUO B, HU T M, ZHOU Y H, et al. Survey on
Multi-agent Reinforcement Learning for Control and
Decision-making [J/OL]. ACTA AUTOMATICA SINICA,
1-30[2024-12-14]. https://doi.org/10.16383/j.aas.c240392.
(in Chinese)
[6] 丁世飞, 杜威, 张健, 等. 多智能体深度强化学习研究
进展[J]. 计算机学报, 2024, 47(07): 1547-1567. DING S
F, DU W, ZHANG J, et al. Research Progress of
Multi-Agent Deep Reinforcement Learning[J]. Chinese
Journal of Computers, 2024, 47(07): 1547-1567. (in
Chinese)
[7] Mao H, Zhang Z, Xiao Z, et al. Learning agent
communica-tion under limited bandwidth by message
pruning[C] //Proceedings of the AAAI Conference on
Artificial Intelligence, 2020, 34(04): 5142-5149.
[8] Ding Z, Huang T, Lu Z. Learning individually inferred
communication for multi-agent cooperation[J]. Advances
in neural information processing systems, 2020, 33:
22069-22079.
[9] Wang Y, Zhong F, Xu J, et al. ToM2C: Target-oriented
Multi-agent Communication and Cooperation with Theory
of Mind[C]//The Tenth International Conference on
Learning Representations, 2022.
[10] Hu S C, Shen L, Zhang Y, et al. Learning multi-agent
communication from graph modeling perspective[C]//
Proceedings of the 12th International Conference on
Learning Representations. Vienna, Austria, 2024.
[11] Wang X, Li X, Shao J, et al. AC2C: Adaptively Controlled
Two-Hop Communication for Multi-Agent Reinforcement
Learning[C]//Proceedings of the 2023 International
Conference on Autonomous Agents and Multiagent
Systems, 2023: 427–435.
[12] Zhang S Q, Zhang Q, Lin J. Succinct and robust
multi-agent communication with temporal message
control[J]. Advances in neural information processing
systems, 2020, 33: 17271-17282.
[13] Guan C, Chen F, Yuan L, et al. Efficient multi-agent
communication via self-supervised information
aggregati-on[J]. Advances in Neural Information
Processing Systems, 2022, 35: 1020-1033.
[14] Kim D, Moon S, Hostallero D, et al. Learning to schedule
communication in multi-agent reinforcement learning[C]//
Proceedings of the 7th International Conference on
Learn-ing Representations, 2019.
[15] Yuan L, Wang J, Zhang F, et al. Multi-agent incentive
communication via decentralized teammate modeling[C]//Proceedings of the AAAI Conference on Artificial
Intelligence, 2022, 36(9): 9466-9474.
[16] Guo X, Shi D, Fan W. Scalable Communication for
Multi-Agent Reinforcement Learning via
Transformer-Based Email Mechanism[C]//Proceedings of
the Thirty-Second International Joint Conference on
Artificial Intelligence, 2023: 126–134.
[17] Liu Z, Wan L, Sui X, et al. Deep Hierarchical
Communication Graph in Multi-Agent Reinforcement
Learning[C]//IJCAI, 2023: 208-216.
[18] Pina R, De Silva V, Artaud C, et al. Efficient Role-based
Communication for Multi-Agent Systems[C]//Proceedings
of the Autonomous Agents and Multi-Agent Systems,
2024.
[19] Duan W, Lu J, Xuan J, et al. Group-Aware Coordination
Graph for Multi-Agent Reinforcement Learning[C] //
Proceedings of the Thirty-Third International Joint
Conference on Artificial Intelligence, 2024: 3926-3934.
[20] Wang T, Dong H, Lesser VR, et al. ROMA: Multi-Agent
Reinforcement Learning with Emergent Roles[C]//
Proceedings of the 37th International Conference on
Machine Learning. 2020, 119: 9876-9886.
[21] Wang T, Gupta T, Mahajan A, et al. RODE: Learning
Roles to Decompose Multi-Agent Tasks[C]//Proceedings
of the 9th International Conference on Learning
Representations. Proceedings of Machine Learning
Research, 2021, 119: 9876-9886.
[22] Yang M, Zhao J, Hu X, et al. LDSA: Learning dynamic
subtask assignment in cooperative multi-agent
reinforce-ment learning[J]. Advances in Neural
Information Process-ing Systems, 2022, 35: 1698-1710.
[23] Yang M, Zhao K, Wang Y, et al. Team-wise effective
communication in multi-agent reinforcement learning[J].
Autonomous Agents and Multi-Agent Systems, 2024,
38(2): 36.
[24] Alemi AA, Fischer I, Dillon JV, et al. Deep Variational
Information Bottleneck[C]//Proceedings of the 5th
International Conference on Learning Representations,
Toulon, France, Conference Track Proceedings, 2017.
[25] Samvelyan M, Rashid T, Schröder de Witt C, et al. The
StarCraft Multi-Agent Challenge[C]//Proceedings of the
18th International Conference on Autonomous Agents and
Multi-Agent Systems. 2019: 2186-2188.
[26] Zhang S Q, Zhang Q, Lin J. Efficient communication in
multi-agent reinforcement learning via variance based
control[J]. Advances in neural information processing
systems, 2019, 32.
|