
Computer Engineering ›› 2023, Vol. 49 ›› Issue (6): 42-52. doi: 10.19678/j.issn.1000-3428.0066095

• Hot Topics and Reviews •

Dependent Task Offloading and Resource Allocation Based on A3C in Mobile Edge Computing

LI Qiang, YI Jinhui, DU Tingting, WANG Shengchun

  1. College of Information Science and Engineering, Hunan Normal University, Changsha 410081, China
  • Received: 2022-10-25  Revised: 2022-12-28  Published: 2023-02-08
  • About the authors: LI Qiang (born 1979), male, associate professor, Ph.D., CCF member; his main research interests are edge computing, cloud computing, and edge intelligence. YI Jinhui, M.S. candidate; DU Tingting, M.S.; WANG Shengchun (corresponding author), associate professor, Ph.D.
  • Funding: Science and Technology Program of Hunan Province (2021GK5014, 2019SK2161).

Abstract: In Mobile Edge Computing (MEC), Mobile Devices (MDs) are usually limited by their processing performance and battery capacity and need other devices to assist with task processing. Offloading a series of dependent tasks from an MD to edge servers for execution copes with the resource constraints of the MD and can improve task computation and energy efficiency. To address the joint optimization of task delay and MD energy consumption in MEC environments with dynamically changing channel states, the dependent task scheduling order and the optimization objective are derived from a dependent task offloading model, and a Dependent Task Offloading and Resource Allocation (DTORA) algorithm based on Asynchronous Advantage Actor-Critic (A3C) is designed. By defining the state space, action space, and reward function, the dependent task offloading problem is transformed into an optimal policy problem under a Markov Decision Process (MDP), and an efficient task offloading and resource allocation policy is obtained through asynchronous concurrent solving. Learning proceeds in parallel on a single machine with a standard multi-core CPU, which reduces the correlation among neural network parameter updates and improves the learning effect. Experimental results show that, for tasks with various dependency structures under dynamically changing channel states, DTORA reduces task delay by 14%-61% and MD energy consumption by 8%-66% compared with four baseline algorithms.
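
The abstract describes casting each dependent-task offloading decision as a step in an MDP (state, action, reward) and training the policy with asynchronous parallel actor-critic (A3C) workers on a multi-core CPU. The Python sketch below illustrates that general pattern only; it is not the authors' DTORA implementation, and the network layout, the reward weights w_delay/w_energy, and the single-worker n-step update shown here are illustrative assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F

def offloading_reward(delay, energy, w_delay=0.5, w_energy=0.5):
    # Negative weighted cost: higher reward for lower task delay and lower
    # mobile-device energy consumption (weights are assumed for illustration).
    return -(w_delay * delay + w_energy * energy)

class ActorCritic(nn.Module):
    # Shared-body actor-critic: a policy over discrete offloading/resource
    # actions (actor) and a state-value estimate (critic).
    def __init__(self, state_dim, num_actions, hidden=128):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(state_dim, hidden), nn.ReLU())
        self.policy_head = nn.Linear(hidden, num_actions)
        self.value_head = nn.Linear(hidden, 1)

    def forward(self, state):
        h = self.body(state)
        return F.softmax(self.policy_head(h), dim=-1), self.value_head(h)

def a3c_worker_update(net, optimizer, trajectory, gamma=0.99,
                      value_coef=0.5, entropy_coef=0.01):
    # One n-step update from a trajectory of (state, action, reward) tuples
    # collected while scheduling the dependent tasks of one application in
    # their precedence (topological) order.
    states = torch.stack([s for s, _, _ in trajectory])
    actions = torch.tensor([a for _, a, _ in trajectory])
    rewards = [r for _, _, r in trajectory]

    # Discounted returns, assuming the trajectory ends in a terminal state.
    returns, ret = [], 0.0
    for r in reversed(rewards):
        ret = r + gamma * ret
        returns.insert(0, ret)
    returns = torch.tensor(returns, dtype=torch.float32)

    probs, values = net(states)
    values = values.squeeze(-1)
    advantage = returns - values.detach()

    log_probs = torch.log(probs.gather(1, actions.unsqueeze(1)).squeeze(1) + 1e-8)
    policy_loss = -(log_probs * advantage).mean()           # actor loss
    value_loss = F.mse_loss(values, returns)                # critic loss
    entropy = -(probs * torch.log(probs + 1e-8)).sum(1).mean()

    loss = policy_loss + value_coef * value_loss - entropy_coef * entropy
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()   # in full A3C the gradients go to a shared global network

In a full A3C setup, several such workers collect trajectories in parallel on the cores of a single machine and apply their gradients asynchronously to a shared global network, which is what decorrelates the parameter updates mentioned in the abstract.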

Key words: Mobile Edge Computing (MEC), Deep Reinforcement Learning (DRL), dependent task offloading, resource allocation, energy consumption optimization

CLC number: