Intersection Signal Control Based on Reinforcement Learning with CMAC

doi:10.3969/j.issn.1000-3428.2011.17.051

Computer Engineering, 2011, Vol. 37, Issue (17): 152-154.

WEN Kai-ge, YANG Zhao-hui

(School of Electronic and Control Engineering, Chang'an University, Xi'an 710064, China)

Received: 2011-02-23 Published: 2011-09-05 Online: 2011-09-05
About the authors: WEN Kai-ge (born 1976), male, lecturer, Ph.D., main research interest: intelligent traffic control; YANG Zhao-hui, lecturer, M.S.
Funding:
Fundamental Research Funds for the Central Universities (CHD2009JC060)

Abstract: A reinforcement learning (RL) method with neural network value function approximation is applied to intersection signal control. Considering the stochastic nature of the traffic system, the state space, action space, and reward space of the RL model are defined from the characteristics of the traffic flow and the intersection signals, and the signal is optimized with the control objective of minimizing vehicle delay at the intersection. A Cerebellar Model Articulation Controller (CMAC) neural network is introduced to approximate the Q values of the RL agent. The model is tested on a typical isolated signalized intersection under varying traffic conditions and compared with conventional pre-timed control and full-actuated control. Simulation results show that the RL controller learns effectively, adapts to dynamic changes in traffic flow, and remains stable, with a significant improvement over traditional full-actuated control, especially in the cases of accidents and over-saturated traffic demand.

Key words: traffic control, reinforcement learning, Cerebellar Model Articulation Controller (CMAC), non-uniform quantization, signalized intersection
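The idea of approximating Q values with a CMAC can be illustrated with a minimal sketch. This is not the authors' implementation: the two-dimensional state (queue lengths on two approaches), the two actions (0 = keep the current phase, 1 = switch phase), the reward (negative delay), and all parameter values are assumptions chosen only for illustration. The class name `CMACQ` and its methods are likewise hypothetical.

```python
from collections import defaultdict


class CMACQ:
    """Q-value approximator: Q(s, a) is the sum of one weight per tiling."""

    def __init__(self, n_tilings=8, tile_width=4.0, n_actions=2,
                 alpha=0.1, gamma=0.9):
        # All defaults are illustrative assumptions, not values from the paper.
        self.n_tilings = n_tilings
        self.tile_width = tile_width
        self.n_actions = n_actions
        self.alpha = alpha    # learning rate
        self.gamma = gamma    # discount factor
        self.weights = defaultdict(float)  # sparse weight table, zero by default

    def active_tiles(self, state, action):
        """Return one tile key per tiling; each tiling is shifted by a fraction of a tile."""
        tiles = []
        for t in range(self.n_tilings):
            offset = t * self.tile_width / self.n_tilings
            coords = tuple(int((x + offset) // self.tile_width) for x in state)
            tiles.append((t, coords, action))
        return tiles

    def q(self, state, action):
        return sum(self.weights[k] for k in self.active_tiles(state, action))

    def best_action(self, state):
        return max(range(self.n_actions), key=lambda a: self.q(state, a))

    def update(self, state, action, reward, next_state):
        """One Q-learning step; the error is shared equally among the active tiles."""
        target = reward + self.gamma * max(
            self.q(next_state, a) for a in range(self.n_actions))
        error = target - self.q(state, action)
        step = self.alpha * error / self.n_tilings
        for k in self.active_tiles(state, action):
            self.weights[k] += step
        return error


# Hypothetical transition: queues of 10 and 2 vehicles, switching costs 3 s of delay.
agent = CMACQ()
s, a, r, s_next = (10.0, 2.0), 1, -3.0, (30.0, 20.0)
for _ in range(200):
    agent.update(s, a, r, s_next)
print(round(agent.q(s, a), 3))            # converges toward the reward
print(round(agent.q((10.5, 2.0), a), 3))  # nearby state shares most tiles
```

Because neighbouring states activate mostly the same tiles, an update at one state also shifts the estimate for similar states, which is what lets a CMAC generalize over a continuous traffic state without storing a full Q table.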

CLC number: U491

WEN Kai-ge, YANG Zhao-hui. Intersection Signal Control Based on Reinforcement Learning with CMAC[J]. Computer Engineering, 2011, 37(17): 152-154.

https://www.ecice06.com/CN/Y2011/V37/I17/152

