UAV assisted MEC offloading strategy with layered attention TD3

doi:10.19678/j.issn.1000-3428.0253349

Abstract

To address the core challenges of task offloading decisions in UAV-assisted mobile edge computing (MEC) systems, namely multi-dimensional temporal coupling, adaptation to dynamic environments, and insufficient policy robustness, this paper proposes HTAN-TD3, a twin delayed deep deterministic policy gradient (TD3) algorithm that integrates a hierarchical temporal attention mechanism. The contributions are threefold. First, a composite optimization objective is constructed that combines total system latency, worst-case user experience, and multi-user fairness, overcoming the limitations of traditional single-objective modeling. Second, a hierarchical attention network (HTAN) with macro-micro dual-stream temporal analysis is designed; through heterogeneous cooperation and attention-weighted fusion of LSTM and GRU, it captures dynamic features of the system state at multiple time scales. Third, temporally correlated Ornstein-Uhlenbeck exploration noise and a dynamically adaptive Huber loss function are introduced, strengthening the algorithm in exploration smoothness and training robustness, respectively. In simulated complex edge scenarios with high load, strong occlusion, and multi-user competition, HTAN-TD3 significantly outperforms mainstream baselines such as DDPG, TD3, and MATOPO on key metrics including total system latency and user fairness, demonstrating strong environmental adaptability. The study provides a useful reference for improving the autonomous decision-making capability of intelligent edge computing systems in dynamic and complex environments.
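
The Ornstein-Uhlenbeck exploration noise and the Huber loss named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the class name `OUNoise`, the function `huber_loss`, and the parameter values (`theta`, `sigma`, `dt`, `delta`) are assumptions chosen for illustration, and the threshold `delta` is held fixed here because the abstract does not say how it is adapted during training.

```python
import numpy as np


class OUNoise:
    """Ornstein-Uhlenbeck process: temporally correlated exploration noise.

    Hypothetical helper; parameter defaults are illustrative, not from the paper.
    """

    def __init__(self, dim, mu=0.0, theta=0.15, sigma=0.2, dt=1.0, seed=None):
        self.mu = np.full(dim, mu, dtype=float)  # long-run mean the noise reverts to
        self.theta = theta                       # mean-reversion rate
        self.sigma = sigma                       # scale of the random perturbation
        self.dt = dt                             # discretization step
        self.rng = np.random.default_rng(seed)
        self.state = self.mu.copy()

    def reset(self):
        """Restart the process at its mean, e.g. at the start of an episode."""
        self.state = self.mu.copy()

    def sample(self):
        """Advance one Euler-Maruyama step and return the new noise vector."""
        drift = self.theta * (self.mu - self.state) * self.dt
        diffusion = (self.sigma * np.sqrt(self.dt)
                     * self.rng.standard_normal(self.state.shape))
        self.state = self.state + drift + diffusion
        return self.state.copy()


def huber_loss(td_error, delta=1.0):
    """Mean Huber loss: quadratic for |e| <= delta, linear beyond it.

    delta is fixed in this sketch (assumption); the paper's adaptive rule
    is not described in the abstract.
    """
    abs_err = np.abs(td_error)
    quadratic = 0.5 * abs_err ** 2
    linear = delta * (abs_err - 0.5 * delta)
    return float(np.mean(np.where(abs_err <= delta, quadratic, linear)))
```

In a TD3-style training loop, the noise would typically be added to the actor's output before clipping, for example `np.clip(action + noise.sample(), -1.0, 1.0)`, and `huber_loss` would replace the mean squared error on the critic's temporal-difference errors so that large errors contribute linearly rather than quadratically.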

Ding Li, Yang Jun. UAV assisted MEC offloading strategy with layered attention TD3[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0253349.


URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0253349

