Priority-Aware Task Offloading and Allocation in SAGIN with Multi-Agent Learning

doi:10.19678/j.issn.1000-3428.0253100

Abstract

Abstract: In remote and disaster-stricken areas, ground Internet of Things (IoT) devices are constrained by limited computing capabilities and insufficient communication infrastructure, making it difficult to support a large number of emergency tasks with stringent latency requirements within a short time. Existing studies mainly adopt single unmanned aerial vehicle (UAV) or low Earth orbit (LEO) satellite architectures, or treat UAVs merely as communication relay nodes, and their optimization objectives primarily focus on minimizing system latency or a weighted sum of latency and energy consumption, failing to fully exploit the cooperative computing potential of multiple UAVs and multiple LEO satellites as well as to satisfy the heterogeneous quality-of-service (QoS) requirements arising from different task priorities and latency constraints. Therefore, this paper proposes a multi-agent deep reinforcement learning–based task offloading and adaptive resource allocation strategy, termed TOARA. First, a space–air–ground integrated network (SAGIN) architecture with cooperative multiple UAVs and multiple LEO satellites is constructed and integrated with edge computing technologies to effectively alleviate ground resource limitations. In this architecture, UAVs collect ground tasks and make intelligent offloading decisions, dynamically assigning tasks to local edge nodes or LEO satellite nodes for execution. Then, the joint task offloading and resource allocation problem is formulated as a decentralized partially observable Markov decision process and solved using a multi-agent deep deterministic policy gradient (MADDPG) algorithm under a centralized training and decentralized execution framework, enabling agents to autonomously learn efficient offloading decisions and adaptive resource allocation strategies to jointly optimize task processing latency, system energy consumption, and the completion rates of tasks with different priority levels. Finally, simulation results demonstrate that, compared with several baseline strategies, the proposed algorithm reduces the average task processing latency and system energy consumption by at least 26.09% and 27.53%, respectively, while improving the completion rate of high-priority tasks by at least 22.24%, validating its effectiveness in learning efficient task offloading and resource allocation decisions in dynamic and complex environments.

摘要： 针对偏远及灾害地区地面物联网（IoT）设备计算能力受限、通信基础设施不足，难以在短时间内支撑大量具有严格时延约束的紧急任务处理问题，现有研究多采用单独无人机（UAV）或低轨（LEO）卫星架构，或仅将UAV作为通信中继节点，且优化目标主要侧重系统时延或时延与能耗的加权和，未能充分考虑多UAV与多LEO卫星协同计算潜力以及不同任务优先级和时延约束的差异化服务质量需求。因此，本文提出了一种基于多智能体深度强化学习的任务卸载和自适应资源分配策略（TOARA）。首先，构建了多UAV和多LEO卫星协同的空天地一体化网络（SAGIN）架构，并将该架构与边缘技术相结合，有效缓解了地面资源受限问题。其中，UAV负责收集地面任务并进行智能卸载决策，将任务动态分配到本地边缘节点或LEO卫星节点进行处理。其次将上述问题建模为分布式部分可观察马尔可夫决策过程并采用基于多智能体深度确定性策略梯度（MADDPG）方法求解，该策略采用基于集中式训练-分布式执行的训练框架，使各智能体能够自主学习高效的卸载决策和动态资源分配，优化任务处理时延、系统能耗及不同优先级任务完成率等多个目标。最后，仿真结果表明，相较于多种基线策略，该算法的任务处理平均时延和系统能耗分别至少降低26.09%和27.53%，高优先级任务完成率至少提升22.24%，验证了该算法在动态复杂的环境下高效学习任务卸载和资源分配决策的有效性。

Jiang Xiao, Qin Tuanfa, Sun Hongmin, Zhou Huayang, Gu Weiyu, Wang Suhong. Priority-Aware Task Offloading and Allocation in SAGIN with Multi-Agent Learning[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0253100.

江潇, 覃团发, 孙洪民, 周华阳, 顾伟玉, 王素红. SAGIN中多智能体优先级任务卸载与分配[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0253100.

/ Recommend / Download Citations

URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0253100

References

[1] S. Du, X. Chen, L. Jiao, et al. Energy Efficient Task Offloading for UAV-assisted Mobile Edge Computing[C]// 2021 China Automation Congress. Beijing, China: IEEE Press ,2021: 6567-6571 .
[2] A. Al-Bakhrani, M. Li, M. S. Obaidat, et al. MOALF-UAV-MEC: Adaptive Multiobjective Optimization for UAV-Assisted Mobile Edge Computing in Dynamic IoT Environments[J]. IEEE Internet of Things Journal, 2025, 12(12): 20736-20756.
[3] M. Mukherjee, V. Kumar, A. Lat, et al. Distributed Deep Learning-based Task Offloading for UAV-enabled Mobile Edge Computing[C]//IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops. Toronto, Canada: IEEE Press, 2020: 1208-1212.
[4] M. Centenaro, C. E. Costa, F. Granelli, et al. A Survey on Technologies, Standards and Open Challenges in Satellite IoT[J]. IEEE Commun. Surveys Tuts., 2021, 23(3): 1693–1720.
[5] Z. Chen, H. Xu, C. Cheng. Task Offloading Strategy in Multi-UAV Mobile Edge Computing via Deep Reinforcement Learning[C]//2025 5th International Conference on Consumer Electronics and Computer Engineering. Dongguan, China: IEEE Press, 2025: 727-732.
[6] J. Choi, S. Krishnan, J. Park. Latency-Optimal Resource Allocation for UAV-Aided LEO Communication[J]. IEEE Transactions on Vehicular Technology, 2024, 73(8): 12096-12109.
[7] 杨黎明,周玉前,王文浩,等.多低轨卫星协作的边缘计算卸载与资源分配策略[J].现代电子技术,2025,48(17):7-15.YANG L M,ZHOU Y Q,WANG W H, et al. Collaborative Computation Offloading and Resource Allocation Strategy for Multi-LEO Satellite Edge Computing[J].Modern Electronics Technique: 2025,48(17):7-15.
[8] J. Lai, H. Liu, Y. Sun, et al. Multi-Agent Deep Reinforcement Learning Based Computation Offload Approach for LEO Satellite Broadband Networks[C]//2023 IEEE Symposium on Computers and Communications. Gammarth, Tunisia: IEEE Press, 2023: 1435-1440.
[9] Y. Zuo, M. Yue, H. Yang, et al. Integrating Communication, Sensing and Computing in Satellite Internet of Things: Challenges and Opportunities[J]. IEEE Wireless Communications, 2024, 31(3): 332-338.
[10] S. Jung, S. Jeong, J. Kang, et al. Marine IoT systems with space–air–sea integrated networks: Hybrid LEO and UAV edge computing[J]. IEEE Internet of Things Journal, 2023, 10(23): 20498–20510.
[11] 王怡,覃团发,韦睿,等.SAG-MEC网络下支持WPT的无人机动态任务卸载与资源分配[J/OL].计算机工程:1-10[2025-09-28].https://doi.org/10.19678/j.issn.1000-3428.0070030.WANG Y,QIN T F,WEI R,et al. Dynamic Task Offloading and Resource Allocation for WPT-Enabled UAVs in SAG-MEC Networks[J/OL]. Computer Engineering: 1-10[2025-09-28].https://doi.org/10.19678/j.issn.1000-3428.0070030.
[12] B. Wondmagen, T. T. Win, D. Won, et al. A Review on Task Offloading and Resource Allocation in Aerial and Satellite-Assisted MEC Systems[C]//2025 International Conference on Artificial Intelligence in Information and Communication. Fukuoka, Japan: IEEE Press, 2025: 0007-0011.
[13] H. Zhang, S. Xi, H. Jiang, et al. Resource Allocation and Offloading Strategy for UAV-Assisted LEO Satellite Edge Computing[J]. Drones, 2023, 7(6): 383-403.
[14] S. Zhang, A. Liu, C. Han, et al. Multiagent Reinforcement Learning-Based Orbital Edge Offloading in SAGIN Supporting Internet of Remote Things[J]. IEEE Internet of Things Journal, 2023, 10(23): 20472-20483.
[15] M. Cao, L. Liu, C. Wang, et al. Computation Resource Offloading in Mobile Edge Computing: A Deep Reinforcement Approach[C]//2023 3rd International Symposium on Computer Technology and Information Science. Chengdu, China: IEEE Press, 2023: 360-364.
[16] J. Wu, M. Jia, Q. Guo, et al. Efficient Resource Management Based on DQN in LEO Satellite Edge Computing System[C]//2023 IEEE Globecom Workshops. Kuala Lumpur, Malaysia: IEEE Press, 2023: 135-140.
[17] H. Zhang, M. Jiang, L. Ma, et al. Computing Offloading Strategy for Internet of Medical Things in Space-Air-Ground Integrated Network[C]//2023 IEEE International Conference on E-health Networking, Application & Services Chongqing, China: IEEE Press, 2023: 177-182.
[18] C. Huang, G. Chen, P. Xiao, et al. Joint Offloading and Resource Allocation for Hybrid Cloud and Edge Computing in SAGINs: A Decision Assisted Hybrid Action Space Deep Reinforcement Learning Approach[J]. IEEE Journal on Selected Areas in Communications, 2024, 42(5): 1029-1043.
[19] M. D. Nguyen, L. B. Le, A. Girard. Joint Computation Offloading, UAV Trajectory, User Scheduling, and Resource Allocation in SAGIN[C]//GLOBECOM 2022 - 2022 IEEE Global Communications Conference. Rio de Janeiro, Brazil: IEEE Press, 2022: 5099-5104.
[20] H. Wu, J. Zheng, S. Jin. Adaptive Computation Offloading Scheme Based on a Collaborative Architecture with Heterogeneous MEC Nodes: a DRL Approach[J/OL]. IEEE Transactions on Mobile Computing: 1-18[2025-09-28]. 10.1109/TMC.2025.3586623
[21] J. Yu, C. Ma, B. Yang, et al. Blockchain-enabled RCS Task Offloading and Resource Allocation Policy Using DRL Approach[C]//2022 IEEE 10th International Conference on Computer Science and Network Technology. Dalian, China: IEEE Press, 2022: 104-108.
[22] S. Nath, J. Wu. Deep reinforcement learning for dynamic computation offloading and resource allocation in cache-assisted mobile edge computing systems[J]. Intelligent and Converged Networks, 2020, 1(2): 181-198.
[23] S. Kapil, M. Chawla. Performance evaluation of K-means clustering algorithm with various distance metrics[C]//2016 IEEE 1st International Conference on Power Electronics, Intelligent Control and Energy Systems. Delhi, India: IEEE Press, 2016: 1-4.
[24] L. Luo. Priority Task Offloading Decision Optimization Based on Multi-Objective Grey Wolf Algorithm in Internet of Things Edge Computing Scenarios[C]//2023 IEEE 3rd International Conference on Data Science and Computer Application. Dalian, China: IEEE Press, 2023: 1574-1577.
[25] 周华阳,廖杨,江潇,等.基于多优先级动态调度的高密度WBAN任务卸载策略[J/OL].计算机应用研究,1-10[2025-09-28].https://d:oi.org/10.19734/j.issn.1001-3695.2025.03.0065.ZHOU H Y,LIAO Y,JIANG X, et al. A Multi-Priority-Based Dynamic Scheduling Strategy for Task Offloading in High-Density WBAN[J/OL]. Computer Applications Research,1-10[2025-09-28].https://doi.org/10.1973 4/j.issn.1001-3695.2025.03.0065.
[26] S. S. Ehsan, M. S. Hossain, R. A. Ar Rafi, et al. Optimizing K-Means for the Deployment of UAV-Assisted Wireless Networks: A Parametric Analysis[C]//2024 IEEE International Conference on Signal Processing, Information, Communication and Systems, Khulna, Bangladesh: IEEE Press, 2024: 1-5.
[27] T. Bose, N. Chatur and A. Adhya. Energy-Efficient Disjoint Multi-UAV Deployment for Emergency Communication in Disaster-Affected Areas[C]//2024 IEEE 21st India Council International Conference, Kharagpur, India: IEEE Press, 2024: 1-6.
[28] H. Shen, T. Yang and Z. Mao. Research on UAV-assisted maritime communication method based on K-means algorithm[C]//2024 4th International Conference on Communication Technology and Information Technology, Guangzhou, China: IEEE Press, 2024: 1-5.
[29] S. Badarneh, D. B. Da Costa. Shadowing, Multi-Path, and Line-of-Sight Effects: Are They Friends or Foes?[C]//2024 IEEE Wireless Communications and Networking Conference. Dubai, United Arab Emirates: IEEE Press, 2024: 1-6.
[30] S. Du, X. Chen, L. Jiao, et al. Energy Efficient Task Offloading for UAV-assisted Mobile Edge Computing[C]//2021 China Automation Congress Beijing, China: IEEE Press, 2021: 6567-6571.
[31] T. Ma, H. Zhou, B. Qian, et al. UAV-LEO Integrated Backbone: A Ubiquitous Data Collection Approach for B5G Internet of Remote Things Networks[J]. IEEE Journal on Selected Areas in Communications, 2021, 39(11): 3491-3505.
[32] P. An, L. Du, Y. Chen. Learning-Based Task Offloading and UAV Trajectory Optimization in SAGIN[C]//2024 33rd Wireless and Optical Communications Conference. Hsinchu, Taiwan: IEEE Press, 2024: 12-16.
[33] D. Wang, J. Tian, H. Zhang, et al. Task Offloading and Trajectory Scheduling for UAV-Enabled MEC Networks: An Optimal Transport Theory Perspective[J]. IEEE Wireless Communications Letters, 2022, 11(1): 150-154.
[34] R. Lowe, Y. Wu, A. Tamar, et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments[EB/OL]. (2020-03-14)[2025-09-28]. https://arxiv.org/pdf/1706.02275.
[35] G. Cui, P. Duan, L. Xu, et al. Latency Optimization for Hybrid GEO–LEO Satellite-Assisted IoT Networks[J].IEEE Internet Things J., 2023, 10(7): 6286–6297.
[36] F. Wei, S. Chen and W. Zou. A greedy algorithm for task offloading in mobile edge computing system[J]. China Communications, 2018, 15(11): 149-157.
[37] Z. Chen, H. Xu and C. Cheng. Task Offloading Strategy in Multi-UAV Mobile Edge Computing via Deep Reinforcement Learning[C]//2025 5th International Conference on Consumer Electronics and Computer Engineering, Dongguan, China: IEEE Press, 2025: 727-732.

Please choose a citation manager

Content to export