Author Login Chief Editor Login Reviewer Login Editor Login Remote Office
Reinforcement Learning Online Car-Hailing Order Dispatch Based on Joint Q-value Decomposition
HUANG Xiaohui, ZHANG Xiong, YANG Kaiming, XIONG Liyan
Computer Engineering . 2022, (12): 296 -303,311 .  DOI: 10.19678/j.issn.1000-3428.0063438