作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程 ›› 2026, Vol. 52 ›› Issue (5): 371-382. doi: 10.19678/j.issn.1000-3428.0070030

• 新一代网络与边缘计算 • 上一篇    下一篇

SAG-MEC网络下支持WPT的无人机动态任务卸载与资源分配

王怡, 覃团发*(), 韦睿, 黄金宝   

  1. 广西大学计算机与电子信息学院广西多媒体通信与网络技术重点实验室, 广西 南宁 530004
  • 收稿日期:2024-06-24 修回日期:2024-09-07 出版日期:2026-05-15 发布日期:2024-12-05
  • 通讯作者: 覃团发
  • 作者简介:

    王怡(CCF学生会员), 女, 硕士研究生, 主研方向为空天地一体化、无人机通信

    覃团发(通信作者), 教授、博士生导师

    韦睿, 硕士研究生

    黄金宝, 硕士研究生

  • 基金资助:
    国家自然科学基金(61563004)

Dynamic Task Offloading and Resource Allocation of UAVs Supported by WPT in SAG-MEC Network

WANG Yi, QIN Tuanfa*(), WEI Rui, HUANG Jinbao   

  1. Guangxi Key Laboratory of Multimedia Communications and Network Technology, School of Computer and Electronic Information, Guangxi University, Nanning 530004, Guangxi, China
  • Received:2024-06-24 Revised:2024-09-07 Online:2026-05-15 Published:2024-12-05
  • Contact: QIN Tuanfa

摘要:

针对偏远地区蜂窝网络覆盖不足且物联网(IoT)设备能量和计算能力低而无法满足大量延迟敏感型任务卸载和计算需求的问题, 考虑将空天地一体化网络(SAGIN)和移动边缘计算(MEC)相结合, 提出一种支持无线电力传输(WPT)技术的无人机辅助IoT设备的动态任务卸载和资源分配方案, 其中无人机负责收集IoT设备产生的计算密集型任务, 采用部分卸载模式将这些任务根据当前状态进行本地计算或动态卸载给基站和低地球轨道(LEO)卫星进一步处理。由于动态的异构网络和长期排队延迟与短期决策的耦合性, 因此在排队延迟的约束下提出一种基于Lyapunov优化的双延迟深度确定性策略梯度(TD3PG)算法, 该算法通过优化无人机动态关联、任务分配、计算资源分配和带宽分配来协调无人机学习最优卸载策略和资源分配。仿真结果表明, 所提出的动态方案与其他对比方案相比能够有效降低无人机网络的能耗、网络积压总和及平均排队延迟, 在2种学习率组合下TD3PG算法相对于深度确定性策略梯度(DDPG)算法和双重深度Q网络(DDQN)算法的奖励分别提高了13.6%、24.0%和20.4%、17.9%。

关键词: 空天地一体化网络, 移动边缘计算, 无人机, 任务卸载, 资源分配

Abstract:

Remote areas face problems such as insufficient cellular network coverage as well as low energy and computing power of Internet of Things (IoT) devices. Hence, the requirements for delay-sensitive task offloading and computing for a large number of tasks cannot be met. Considering the combination of the Space—Air—Ground Integrated Network (SAGIN) and Mobile Edge Computing (MEC), this paper proposes a strategy for dynamic task offloading and resource allocation for Unmanned Aerial Vehicle (UAV)-assisted IoT devices that support Wireless Power Transmission (WPT) technology, in which UAVs are responsible for collecting compute-intensive tasks generated by IoT devices. These tasks are locally calculated or dynamically unloaded to the base station and a Low Earth Orbit (LEO) satellite for further processing using a partial unloading mode, according to the current state. Given the dynamic heterogeneous network environment, as well as the tight coupling between long-term queuing delays and short-term decision-making, this paper proposes a Twin Delayed Deep Deterministic Policy Gradient (TD3PG) algorithm based on Lyapunov optimization under queuing delay constraints. The algorithm coordinates UAVs to learn the optimal offloading strategy and resource allocation by optimizing UAV dynamic association, task allocation, computing resource allocation, and bandwidth allocation. Simulation results show that, compared with other schemes, the proposed dynamic scheme can effectively reduce the energy consumption, network backlog sum, and average queue delay in the UAV network. Under different learning rate combinations, the reward of the TD3PG algorithm increases by 13.6% and 24.0% compared with that of the Deep Deterministic Policy Gradient (DDPG) algorithm, and by 20.4% and 17.9% compared with that of the Double Deep Q-Network (DDQN) algorithm.

Key words: Space—Air—Ground Integrated Network (SAGIN), Mobile Edge Computing (MEC), Unmanned Aerial Vehicle (UAV), task offloading, resource allocation