基于策略熵监督的在线强化学习任务调度算法
张玉樟, 田乐, 魏华利, 林雨茂, 吕世宾, 郭茂祖
An Online Reinforcement Learning Task Scheduling Algorithm Based on Policy Entropy Supervision
ZHANG Yuzhang, TIAN Le, WEI Huali, LIN Yumao, LV Shibin, GUO Maozu
计算机工程
.
0, (): 0
-0
.
DOI: 10.19678/j.issn.1000-3428.0253414