作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2020, Vol. 46 ›› Issue (1): 87-92. doi: 10.19678/j.issn.1000-3428.0053381

• 人工智能与模式识别 • 上一篇    下一篇

基于深度确定性策略梯度的智能车汇流模型

吴思凡a, 杜煜b, 徐世杰a, 杨硕a, 杜晨c   

  1. 北京联合大学 a. 智慧城市学院;b. 机器人学院;c. 北京市信息服务工程重点实验室, 北京 100101
  • 收稿日期:2018-12-12 修回日期:2019-01-28 出版日期:2020-01-15 发布日期:2019-01-24
  • 作者简介:吴思凡(1994-),男,硕士研究生,主研方向为无人驾驶决策算法;杜煜,教授;徐世杰、杨硕、杜晨,硕士研究生。
  • 基金资助:
    国家自然科学基金(91420202)。

Traffic Merging Model for Intelligent Vehicle Based on Deep Deterministic Policy Gradient

WU Sifana, DU Yub, XU Shijiea, YANG Shuoa, DU Chenc   

  1. a. Smart City College;b. College of Robotics;c. Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, China
  • Received:2018-12-12 Revised:2019-01-28 Online:2020-01-15 Published:2019-01-24

摘要: 采用离散动作空间描述速度变化的智能车汇流模型不能满足实际车流汇入场景的应用要求,而深度确定性策略梯度(DDPG)结合策略梯度和函数近似方法,采用与深度Q网络(DQN)相同的网络结构,并使用连续动作空间对问题进行描述,更适合描述智能车速度变化。为此,提出一种基于DDPG算法的智能车汇流模型,将汇流问题转化为序列决策问题进行求解。实验结果表明,与基于DQN的模型相比,该模型的收敛速度较快,稳定性和成功率较高,更适合智能车汇入车辆场景的应用。

关键词: 智能车, 汇流, 深度确定性策略梯度, 深度Q网络, 连续动作空间

Abstract: Traffic merging models for intelligent vehicle that use discrete action space to describe changing speed cannot meet the application requirements of actual traffic merging scenarios.Deep Deterministic Policy Gradient(DDPG),which integrates policy gradient with function approximation methods and adopts the same network structure as Deep Q-Network(DQN),uses continuous action space for problem description.So DDPG is more suitable for describing the changing speed of intelligent vehicles.On this basis,this paper proposes a traffic merging model for intelligent vehicles based on the DDPG algorithm,reducing the traffic merging problem to a sequence decision problem to be resolved.Experimental results show that compared with DQN-based models,the proposed model has a faster convergence speed,higher reliability and a higher success rate,which means it is more applicable to traffic merging scenarios of intelligent vehicle.

Key words: intelligent vehicle, traffic merging, Deep Deterministic Policy Gradient(DDPG), Deep Q-Network(DQN), continuous action space

中图分类号: