
Computer Engineering ›› 2009, Vol. 35 ›› Issue (9): 11-13,1. doi: 10.3969/j.issn.1000-3428.2009.09.004

• Degree Paper •

Multi-Agent Q-learning in RoboCup Based on Regional Cooperation

LIU Liang, LI Long-shu   

  1. (Key Lab of IC & SP of Ministry of Education, Anhui University, Hefei 230039)
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-05-05 Published:2009-05-05

RoboCup Multi-Agent Q-learning Based on Regional Cooperation

LIU Liang, LI Long-shu

  1. (Key Laboratory of Computational Intelligence and Signal Processing of Ministry of Education, Anhui University, Hefei 230039)

Abstract: Many multi-Agent Q-learning problems cannot be solved because the number of joint actions grows exponentially with the number of Agents, rendering the approach infeasible for most problems. This paper investigates a regionally cooperative decomposition of the Q-function that considers joint actions only in those states in which coordination is actually required. In all other states, single-Agent Q-learning is applied. This yields a compact state-action value representation without compromising much in terms of solution quality. Experiments are performed on RoboCup simulation 2D, an ideal testing platform for multi-Agent systems, and the algorithm is compared with other multi-Agent reinforcement learning algorithms, with promising results.
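The idea sketched in the abstract can be illustrated in a few lines: keep a joint-action Q-table only for designated coordination states, and fall back to independent per-agent Q-tables everywhere else. The following is a minimal sketch under assumed names (`RegionalCoopQ`, `coord_states`, etc. are illustrative, not the paper's actual implementation):

```python
import itertools
import random
from collections import defaultdict

class RegionalCoopQ:
    """Sketch of regionally cooperative Q-learning.

    Joint-action Q-values are stored only for states flagged as
    requiring coordination; all other states use independent
    single-agent Q-learning, so the table no longer grows
    exponentially with the number of agents in every state.
    """

    def __init__(self, n_agents, actions, coord_states,
                 alpha=0.1, gamma=0.9, eps=0.1):
        self.n_agents = n_agents
        self.actions = list(actions)
        self.coord_states = set(coord_states)   # states needing coordination
        self.alpha, self.gamma, self.eps = alpha, gamma, eps
        self.q_joint = defaultdict(float)       # (s, joint_action) -> value
        self.q_single = [defaultdict(float) for _ in range(n_agents)]

    def _joint_actions(self):
        return list(itertools.product(self.actions, repeat=self.n_agents))

    def best_value(self, s):
        """Max Q over actions in state s (joint table or per-agent sum)."""
        if s in self.coord_states:
            return max(self.q_joint[(s, ja)] for ja in self._joint_actions())
        return sum(max(q[(s, a)] for a in self.actions) for q in self.q_single)

    def act(self, s):
        """Epsilon-greedy joint action (always returned as a tuple)."""
        if s in self.coord_states:
            jas = self._joint_actions()
            if random.random() < self.eps:
                return random.choice(jas)
            return max(jas, key=lambda ja: self.q_joint[(s, ja)])
        return tuple(
            random.choice(self.actions) if random.random() < self.eps
            else max(self.actions, key=lambda a: q[(s, a)])
            for q in self.q_single)

    def update(self, s, joint_a, r, s_next):
        """Standard Q-learning backup on whichever table covers state s."""
        target = r + self.gamma * self.best_value(s_next)
        if s in self.coord_states:
            key = (s, tuple(joint_a))
            self.q_joint[key] += self.alpha * (target - self.q_joint[key])
        else:
            for i, a in enumerate(joint_a):
                q = self.q_single[i]
                q[(s, a)] += self.alpha * (target - q[(s, a)])
```

For two agents with `k` actions each, a coordination state stores `k**2` values while every other state stores only `2*k`, which is the saving the abstract describes.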

Key words: Markov Decision Process(MDP), Q-learning, regional cooperation, simulation 2D

Abstract (Chinese): To address the exponential growth of joint actions in multi-Agent Q-learning, a regionally cooperative Q-learning method is adopted: joint actions are examined only when Agents need to coordinate; otherwise, simple individual-Agent Q-learning is performed, reducing the number of state-action values to be examined during learning. Experiments on the RoboCup soccer simulation 2D platform show that this method is more efficient than commonly used multi-Agent reinforcement learning techniques.

Key words (Chinese): Markov decision process, Q-learning, regional cooperation, simulation 2D
