基于联合Q值分解的强化学习网约车订单派送
黄晓辉, 张雄, 杨凯铭, 熊李艳
Reinforcement Learning Online Car-Hailing Order Dispatch Based on Joint Q-value Decomposition
HUANG Xiaohui, ZHANG Xiong, YANG Kaiming, XIONG Liyan
计算机工程
.
2022, (12): 296
-303,311
.
DOI: 10.19678/j.issn.1000-3428.0063438