Novel Heuristic Q-learning Algorithm

doi:10.3969/j.issn.1000-3428.2009.22.059

Computer Engineering ›› 2009, Vol. 35 ›› Issue (22): 173-175. doi: 10.3969/j.issn.1000-3428.2009.22.059

• Artificial Intelligence and Recognition Technology • Previous Articles Next Articles

Novel Heuristic Q-learning Algorithm

WANG Hong-yan

(School of Computer Science, Shenyang Institute of Aeronautical Engineering, Shenyang 110136)

Received:1900-01-01 Revised:1900-01-01 Online:2009-11-20 Published:2009-11-20

新的启发式Q学习算法

王洪彦

(沈阳航空工业学院计算机学院，沈阳 110136)

Abstract

Abstract: Aiming at the continuity consolidate study, this paper presents a Q-learning algorithm which integrates heuristic function and evaluation function. It takes advance of heuristic function to accelerate learning, uses evaluation function to reduce the unnecessary exploration and improves learning efficiency. To assure the effect of the algorithm, heuristic function and evaluation function are calculated by Q function. Simulation experimental result of the Tank game proves that the algorithm can improve the learning efficiency of Q-learning.

Key words: Q-learning, heuristic function, evaluation function, online game

摘要： 针对连续型强化学习问题，提出一种综合启发函数和评估函数的Q学习算法，利用启发函数加快学习速度，采用评估函数减少不必要的探索，提高学习效率。为了保证该算法的有效性，启发函数和评估函数根据Q函数进行计算。坦克大战游戏的仿真实验结果证明，该方法可以较大地提高Q学习的学习效率。

关键词: Q学习, 启发函数, 评估函数, 网络游戏

CLC Number:

TP181

WANG Hong-yan.

Novel Heuristic Q-learning Algorithm

[J]. Computer Engineering, 2009, 35(22): 173-175.

王洪彦.

新的启发式Q学习算法

[J]. 计算机工程, 2009, 35(22): 173-175.

/ / Recommend / Download Citations

URL: http://www.ecice06.com/EN/10.3969/j.issn.1000-3428.2009.22.059

http://www.ecice06.com/EN/Y2009/V35/I22/173

[1]	ZHANG Guofu, SHEN Yufeng, SONG Xiaoxiao, SU Zhaopin. Method for Modeling and Solving the Dynamic Scheduling Problem of the Repair Crew for Restoring Damaged Road Network [J]. Computer Engineering, 2023, 49(6): 300-313.
[2]	ZHANG Zundong, WANG Yannan, ZHOU Huijuan, ZHANG Yifan. The Influence of Decision Mechanisms on Network Cooperation Level in Q-learning Evolutionary Game [J]. Computer Engineering, 2023, 49(6): 99-106,114.
[3]	BI Xiang, HUANG Huang, ZHANG Benhong, WEI Xing. V2V Composite Routing Algorithm for Internet of Vehicles Based on Clustering and Improved Q-Learning [J]. Computer Engineering, 2023, 49(3): 221-230,247.
[4]	ZHAO Beiying, JI Weifeng, WENG Jiang, WU Xuan, LI Yingqi. Trusted Routing Algorithm Based on Heuristic Q-Learning for FANET [J]. Computer Engineering, 2022, 48(5): 162-169.
[5]	ZHANG Ran, GAO Yingxue, ZHAO Yu, DING Yuanming. Routing Algorithm for Micro-Nano-Satellite Based on Q-Learning Quantum Ant Colony [J]. Computer Engineering, 2022, 48(3): 162-169,188.
[6]	JIANG Baoqing, CHEN Hongbin. Trajectory Planning for Unmanned Aerial Vehicle Assisted WSN Data Collection Based on Q-Learning [J]. Computer Engineering, 2021, 47(4): 127-134,165.
[7]	ZHOU Yunteng, ZHANG Xueying, LI Fenglian, LIU Shuchang, JIAO Jiangli, TIAN Dou. SVDPP Recommendation Algorithm Optimized by Q-learning Algorithm [J]. Computer Engineering, 2021, 47(2): 46-51.
[8]	XIE Yongsheng, YANG Yuwang, QIU Xiulin, WANG Yinyin. Optimized FANET Routing Algorithm with Reinforcement Learning Based on Function Approximation [J]. Computer Engineering, 2021, 47(11): 207-213.
[9]	SHI Zhao, SUN Changyin, JIANG Fan. Block-aware Power Allocation Based on Q-Learning in Millimeter-Wave Network [J]. Computer Engineering, 2020, 46(12): 185-192.
[10]	YU Jinliang, TU Shanshan, MENG Yuan. Impersonation Attack Detection Algorithm Based on Reinforcement Learning in Mobile Fog Computing [J]. Computer Engineering, 2020, 46(1): 38-44.
[11]	WEI Debin, LIU Jian, PAN Chengsheng, ZOU Qijie. Ant Colony Optimization Routing Algorithm Based on Multi-QoS Constraints in Satellite Networks [J]. Computer Engineering, 2019, 45(7): 114-120.
[12]	WANG Xiaolei,CHEN Yunjie,WANG Chen,NIU Ben. Scheduling Method of Virtual Network Function Based on Q-learning [J]. Computer Engineering, 2019, 45(2): 64-69.
[13]	ZHAO Hongsheng,DING Hua,LIU Jiancheng. Shape from Focus Based on Image Regional Pixel Reconstruction [J]. Computer Engineering, 2019, 45(2): 233-239,244.
[14]	ZHANG Jinfeng,ZHANG Jiye. Automatic Focusing Method with Variable Threshold and Variable Step Based on Function Change Rate [J]. Computer Engineering, 2016, 42(7): 216-219.
[15]	GE Shun,XIA Xuezhi. An Intelligence Decision Model Based on Probabilistic Influence Analysis [J]. Computer Engineering, 2016, 42(6): 213-217.

Please choose a citation manager

Content to export

Novel Heuristic Q-learning Algorithm

新的启发式Q学习算法

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments

模态框（Modal）标题

Please choose a citation manager

Content to export

Novel Heuristic Q-learning Algorithm

新的启发式Q学习算法

PDF

Knowledge

Cited

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

Comments