[1] Hamzaoui I, Duthil B, Courboulay V, et al. A survey on the current challenges of energy-efficient cloud resources management[J].SN Computer Science,2020,1(7):1263-1283.
[2] 田倬璟,黄震春,张益农.云计算环境任务调度方法研究综述[J].计算机工程与应用,2021,57(02):1-11.
Tian Z J, Huang Z C, Zhang Y N. A review of task scheduling methods in cloud computing environments [J]. Computer Engineering and Applications, 2021, 57(02): 1-11.(in Chinese)
[3] Yang Y, Shen H. Deep reinforcement learning enhanced greedy optimization for online scheduling of batched tasks in cloud HPC systems[J]. IEEE Transactions on Parallel and Distributed Systems, 2021, 33(11): 3003-3014.
[4] Arunarani A ,Manjula D ,Sugumaran V . Task scheduling techniques in cloud computing: a literature survey[J].Future Generation Computer Systems,2019,91407-415.
[5] Soltani N, Soleimani B, Barekatain B. Heuristic algorithms for task scheduling in cloud computing: a survey[J]. International Journal of Computer Network and Information Security,2017,9(8):16-22.
[6] Houssein E H, Gad A G, Wazery Y M, et al. Task scheduling in cloud computing based on meta-heuristics: review, taxonomy, open challenges, and future trends[J]. Swarm and Evolutionary Computation, 2021, 62: 100841.
[7] 汪婷,邵鹏,李光泉,等.改进的粒子群优化算法在云计算任务调度中的应用[J].科学技术与工程, 2023, 23(29):12594-12603.
Wang T, Shao P, Li G G, et al. Improved particle swarm optimization algorithm for cloud computing task scheduling[J]. Science Technology and Engineering, 2023, 23(29): 12594-12603.(in Chinese)
[8] 王宏杰,徐胜超.基于改进遗传算法的云计算任务调度方法[J].计算机技术与发展,2024,34(02):40-45.
Wang H J, Xu S C. Cloud computing task scheduling method based on improved genetic algorithm [J]. Computer Technology and Development, 2024, 34(02): 40-45.(in Chinese)
[9] Paulraj D, Sethukarasi T, Neelakandan S, et al. An efficient hybrid job scheduling optimization (EHJSO) approach to enhance resource search using Cuckoo and grey wolf job optimization for cloud environment[J]. PloS one, 2023, 18(3): e0282600.
[10] Hosseini S M, Kanaan S K. A survey on meta-heuristic-based workflow scheduling algorithms running in the cloud computing platforms[J]. Service Oriented Computing and Applications, 2025,(prepublish): 1-21.
[11] Pan J, Wei Y, Meng L, et al. A dual scheduling framework for task and resource allocation in clouds using deep reinforcement learning[J]. Journal of King Saud University Computer and Information Sciences, 2025, 37(5): 81-81.
[12] 王立红,张延华,孟德彬,等.基于DDPG算法的云数据中心任务节能调度研究[J].高技术通讯,2023,33(09):927-936.
Wang L H, Zhang Y H, Meng D B et al. Research on task energy-saving scheduling of cloud data center based on DDPG algorithm [J]. High Technology Communications, 2023, 33(09): 927-936.(in Chinese)
[13] Mauro F, Gianluca R. Application of proximal policy optimization for resource orchestration in serverless edge computing[J]. Computers,2024,13(9): 224-224.
[14] Haarnoja T, Zhou A, Abbeel P, et al. Soft actor-critic: off-policy maximum entropy deep reinforcement learning with a stochastic actor[C]//International conference on machine learning. Pmlr, 2018: 1861-1870.
[15] Xiao Y, Yao Y, Zhu F. Parallel simulation multi-sample task scheduling approach based on deep reinforcement learning in cloud computing environment[J]. Mathematics, 2025, 13(14): 2249.
[16] Hou H, Ismail A. EETS: an energy-efficient task scheduler in cloud computing based on improved DQN algorithm[J]. Journal of King Saud University-Computer and Information Sciences, 2024, 36(8): 102177.
[17] Beloglazov A, Buyya R. Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in cloud data centers[J]. Concurrency and Computation: Practice and Experience, 2012, 24(13): 1397-1420.
[18] Schulman J, Levine S, Abbeel P, et al. Trust region policy optimization[C]//International conference on machine learning. PMLR, 2015: 1889-1897.
[19] Kumar A, Fu J, Tucker G, et al. Stabilizing off-policy Q-Learning via bootstrapping error reduction[J]. CoRR,2019,abs/1906.00949
[20] Neu G. Explore no more: improved high-probability regret bounds for non-stochastic bandits[C]//Proceedings of the 29th International Conference on Neural Information Processing Systems-Volume 2. 2015: 3168-3176.
[21] Dudík M, Erhan D , Langford J, et al. Doubly Robust Policy Evaluation and Optimization[J].Statistical Science,2014,29(4):485-511.
[22] Cisse M, Bojanowski P, Grave E, et al. Parseval networks: Improving robustness to adversarial examples[C]//International conference on machine learning. PMLR, 2017: 854-863.
[23] Li F, Hu B. Deepjs: job scheduling based on deep reinforcement learning in cloud data center[C]//Proceedings of the 4th International Conference on Big Data and Computing. 2019: 48-53.
[24] Peng Z, Cui D, Zuo J, et al. Random task scheduling scheme based on reinforcement learning in cloud computing[J]. Cluster computing, 2015, 18(4): 1595-1607.
[25] Lu J, Yang J, Li S, et al. A2C-DRL: dynamic scheduling for stochastic edge–cloud environments using A2C and deep reinforcement learning[J]. IEEE Internet of Things Journal, 2024, 11(9): 16915-16927.
[26] Zhang X, Li S, Tang J, et al. DRL-Enabled computation offloading for AIGC services in IoIT-assisted edge computing networks[J]. IEEE Internet of Things Journal, 2024,: 1.
|