
Computer Engineering ›› 2009, Vol. 35 ›› Issue (22): 173-175. doi: 10.3969/j.issn.1000-3428.2009.22.059

• Artificial Intelligence and Recognition Technology •

Novel Heuristic Q-learning Algorithm

WANG Hong-yan   

  1. (School of Computer Science, Shenyang Institute of Aeronautical Engineering, Shenyang 110136)
  • Online: 2009-11-20 Published: 2009-11-20

新的启发式Q学习算法

王洪彦   

  1. (沈阳航空工业学院计算机学院,沈阳 110136)

Abstract: For continuous reinforcement learning problems, this paper presents a Q-learning algorithm that combines a heuristic function with an evaluation function. The heuristic function accelerates learning, and the evaluation function reduces unnecessary exploration, which improves learning efficiency. To ensure the algorithm's effectiveness, both the heuristic function and the evaluation function are computed from the Q function. Simulation results on the Tank game show that the method considerably improves the learning efficiency of Q-learning.

Key words: Q-learning, heuristic function, evaluation function, online game
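
The abstract does not give the formulas for the heuristic or evaluation function, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a toy five-state chain environment, a placeholder heuristic table `h` that adds a small bonus to one action during greedy selection, and a hypothetical `explore_prob` function that stands in for the evaluation function by lowering the exploration rate as the gap between Q-values grows. All names and constants here are illustrative assumptions.

```python
import random

# Toy chain environment (an assumption for illustration, not the Tank game).
N_STATES = 5
ACTIONS = (0, 1)          # 0 = move left, 1 = move right
GOAL = N_STATES - 1


def step(s, a):
    """Move along the chain; reward 1.0 only when the goal state is reached."""
    s2 = max(0, s - 1) if a == 0 else min(GOAL, s + 1)
    done = s2 == GOAL
    return s2, (1.0 if done else 0.0), done


def explore_prob(q_row, base_eps=0.3):
    """Hypothetical stand-in for an evaluation function computed from Q:
    explore less when the Q-values already separate the two actions clearly."""
    gap = abs(q_row[0] - q_row[1])
    return base_eps / (1.0 + 10.0 * gap)


def choose_action(q_row, h_row, rng, xi=1.0):
    """Explore with a Q-dependent probability; otherwise act greedily on
    Q(s, a) + xi * H(s, a), where H is the heuristic bonus."""
    if rng.random() < explore_prob(q_row):
        return rng.choice(ACTIONS)
    scores = [q_row[a] + xi * h_row[a] for a in ACTIONS]
    return max(ACTIONS, key=lambda a: scores[a])


def train(episodes=200, alpha=0.5, gamma=0.9, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    # Placeholder heuristic: a fixed small bonus for "right". The paper
    # derives its heuristic from the Q function; that rule is not shown here.
    h = [[0.0, 0.1] for _ in range(N_STATES)]
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            a = choose_action(q[s], h[s], rng)
            s2, r, done = step(s, a)
            # Standard Q-learning update; the heuristic only biases action
            # selection and never enters the learned values.
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s = s2
    return q


if __name__ == "__main__":
    for state, row in enumerate(train()):
        print(state, [round(v, 3) for v in row])
```

Keeping the heuristic out of the update target means a poor heuristic can slow learning but does not change the values Q-learning converges to, which is one common reason for applying the bias at action-selection time only.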
