[1]姜同全,王子磊,奚宏生,等.基于动态阈值分配的流媒体边缘云会话迁移策略[J].计算机工程,2017,43(1):55-60.
[2]WANG Feng,LIU Jiangchuan,CHEN Minghua.CALMS:cloud-assisted live media streaming for globalized demands with time/region diversities[C]//Proceedings of IEEE International Conference on Computer Communications.Washington D.C.,USA:IEEE Press,2012:199-207.
[3]WOLF J L,YU P S,SHACHNAI H.Disk load balancing for video-on-demand systems[J].Multimedia Systems,1997,5(6):358-370.
[4]SUTTON R S,PRECUP D,SINGH S.Between MDPs and semi-MDPs:a framework for temporal abstraction in reinforcement learning[J].Artificial Intelligence,1999,112(1/2):181-211.
[5]MIYAZAWA T,KAFLE V P,HARAI H.Reinforcement learning based dynamic resource migration for virtual networks[C]//Proceedings of Symposium on Integrated Network and Service Management.Washington D.C.,USA:IEEE Press,2017:428-434.
[6]WANG Jinzhi,QU Shuhui,WANG Jie,et al.Real-time decision support with reinforcement learning for dynamic flowshop scheduling[C]//Proceedings of European Conference on Smart Objects,Systems and Technologies.Munich,Germany:[s.n.],2017:1-9.
[7]PENG Zhiping,CUI Delong,MA Yuanjia,et al.A reinforcement learning-based mixed job scheduler scheme for cloud computing under SLA constraint[C]// Proceedings of the 3rd International Conference on Cyber Security and Cloud Computing.Washington D.C.,USA:IEEE Press,2016:142-147.
[8]ZHAO Yang,XIAO Mingqing,GE Yawei.Dynamic resource scheduling of cloud-based automatic test system using reinforcement learning[C]//Proceedings of the 13th IEEE International Conference on Electronic Measurement and Instruments.Washington D.C.,USA:IEEE Press,2017:159-165.
[9]WANG Y C,USHER J M.Application of reinforcement learning for agent-based production scheduling[J].Engineering Applications of Artificial Intelligence,2005,18(1):73-82.
[10]MNIH V,KAVUKCUOGLU K,SILVER D,et al.Human-level control through deep reinforcement learning[J].Nature,2015,518(7540):529-533.
[11]SILVER D,HUANG A,MADDISON C J,et al.Mastering the game of go with deep neural networks and tree search[J].Nature,2016,529(7587):484-489.
[12]LILLICRAP T P,HUNT J J,PRITZEL A,et al.Continuous control with deep reinforcement learning[EB/OL].[2018-02-08].https://arxiv.org/pdf/1509.02971.pdf.
[13]李军,倪宏,王玲芳,等.流媒体系统中基于请求迁移的任务调度算法[J].吉林大学学报(工学版),2015,45(3):938-945.
[14]温暖,刘正华,祝令谱,等.深度强化学习在变体飞行器自主外形优化中的应用[J].宇航学报,2017,38(11):1153-1159.
[15]CHEN Liang,ZHOU Yipeng,CHIU D M.Smart streaming for online video services[J].IEEE Transactions on Multimedia,2015,17(4):485-497. |