作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2011, Vol. 37 ›› Issue (8): 149-152. doi: 10.3969/j.issn.1000-3428.2011.08.051

• 人工智能及识别技术 • 上一篇    下一篇

基于强化学习的多策略选择遗传算法

王晓燕,刘 全,傅启明,张 乐   

  1. (苏州大学计算机科学与技术学院,江苏 苏州 215006)
  • 出版日期:2011-04-20 发布日期:2012-10-31
  • 作者简介:王晓燕(1985-),女,硕士研究生,主研方向:遗传算法,强化学习;刘 全,教授、博士后、博士生导师;傅启明、张 乐,硕士研究生
  • 基金资助:
    国家自然科学基金资助项目(60673092, 60873116);江苏省自然科学基金资助项目(BK2008161);江苏省高校自然科学研究基金资助项目(09KJA520002);江苏省现代企业信息化应用支撑软件工程技术研究开发中心基金资助项目(SX200804)

Multiple Policy Selection Genetic Algorithm Based on Reinforcement Learning

WANG Xiao-yan, LIU Quan, FU Qi-ming, ZHANG Le   

  1. (School of Computer Science and Technology, Soochow University, Suzhou 215006, China)
  • Online:2011-04-20 Published:2012-10-31

摘要: 为解决传统遗传算法早熟收敛和收敛速度慢的问题,提出一种基于强化学习的多策略选择遗传算法MPSGA。通过使用不同的选择策略将整个种群划分为3个子种群并分别进化,能提高种群的多样性,有效避免遗传算法的早熟收敛问题。将种群的多样性和算法的运行机制相结合,根据种群多样性的变化运用强化学习算法动态地优化各子种群间的比例参数,从而将种群多样性保持在合适的范围,一定程度上解决了收敛速度和全局收敛性之间的矛盾。实验结果表明,该算法在收敛精度和搜索效率上都表现出较好的性能。

关键词: 遗传算法, 多策略选择, 强化学习, 种群多样性, 比例参数

Abstract: A new multiple policy selection Genetic Algorithm(GA) based on reinforcement learning is proposed to avoid the premature convergence and low speed of convergence. The whole population is divided into three sub-populations and each of them evolves respectively by using several different selection policies, which improves the diversity of population and avoids the premature convergence effectively. Population diversity is associated with the running mechanism of the algorithm, and the parameters of the sub-populations are optimized dynamically using reinforcement learning according to the variance diversity, which can maintain the population diversity in the appropriate range and it solves the contradiction between convergence speed and global convergence to a certain extent. Experimental results show that the algorithm has a high performance in precision of convergence and search efficiency.

Key words: Genetic Algorithm(GA), multiple policy selection, reinforcement learning, population diversity, proportion parameter

中图分类号: