Author Login Chief Editor Login Reviewer Login Editor Login Remote Office

Computer Engineering ›› 2026, Vol. 52 ›› Issue (4): 339-348. doi: 10.19678/j.issn.1000-3428.0069899

• Next-Generation Networks and Edge Computing • Previous Articles     Next Articles

Adaptive Cooperative Task Offloading Decision for Multiple Unmanned Aerial Vehicles

LIU Yi1, LUO Chun1, ZHONG Weifeng1,*(), YU Yi2, OU Zhiqing1   

  1. 1. School of Automation, Guangdong University of Technology, Guangzhou 510000, Guangdong, China
    2. Changsha Electronic Industry School, Changsha 410016, Hunan, China
  • Received:2024-05-22 Revised:2024-08-12 Online:2026-04-15 Published:2026-04-08
  • Contact: ZHONG Weifeng

多无人机自适应合作任务卸载决策

刘义1, 罗淳1, 钟伟锋1,*(), 余意2, 欧智清1   

  1. 1. 广东工业大学自动化学院, 广东 广州 510000
    2. 长沙市电子工业学校, 湖南 长沙 410016
  • 通讯作者: 钟伟锋
  • 作者简介:

    刘义, 男, 教授、博士、博士生导师, 主研方向为智能电网、网络资源调度

    罗淳, 硕士研究生

    钟伟锋(通信作者), 副教授、博士

    余意, 硕士研究生

    欧智清, 硕士研究生

  • 基金资助:
    国家重点研发计划(2020YFB1807805); 国家重点研发计划(2020YFB1807800); 国家自然科学基金(62003099); 广东省自然科学基金(2024A1515011795); 广州市基础与应用基础研究项目(2023A04J1704)

Abstract:

This study investigates adaptive cooperative task offloading and allocation in a multiple Unmanned Aerial Vehicles (UAVs) collaborative mobile edge computing network. To enhance collaboration among UAVs in a time-varying environment and improve the efficiency of task execution, this study constructs a UAV task queuing model in a time-varying environment and establishes a UAVs task offloading decision model based on the Markov Decision Process (MDP). Moreover, this study proposes a Cooperative-based Deep Deterministic Policy Gradient (CODDPG) algorithm to address the optimization problem of multiple UAVs offloading. The CODDPG algorithm, which integrates CommNet with the traditional Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm, facilitates the sharing of environmental observations among all UAVs. This approach effectively extends the UAVs' perception of the environment and enhances their collaborative decision capability. It also addresses the issue of local optima in the MADDPG algorithm caused by its sole dependence on local information during agent training, thereby minimizing total computation delay. Experimental results demonstrate that the CODDPG algorithm not only significantly reduces task computation delay effectively but also converges faster than the traditional MADDPG algorithm.

Key words: mobile edge computing, multiple Unmanned Aerial Vehicles (UAVs) collaboration, deep reinforcement learning, offloading optimization, computation delay minimization

摘要:

研究多无人机(UAV)协同移动边缘计算网络中自适应合作任务卸载与分配问题。为了提高时变环境下无人机之间的协同性, 进而提升任务的执行效率, 构建时变环境下无人机任务队列模型, 并建立基于马尔可夫决策过程(MDP)的无人机任务卸载决策模型。提出一种基于合作的深度确定性策略梯度(CODDPG)算法, 以解决多无人机卸载决策优化问题。CODDPG算法结合神经网络CommNet与传统的多智能体深度确定性策略梯度(MADDPG)算法, 实现了无人机的环境观测值共享, 有效拓展了无人机的环境感知范围并增强了它们之间的协同决策能力, 并且解决了MADDPG算法中智能体的训练仅依赖局部信息而陷入局部最优解问题, 从而最小化总计算时延。经过实验证明, CODDPG算法不仅有效降低了任务计算时延, 而且与传统的MADDPG算法相比收敛速度更快。

关键词: 移动边缘计算, 多无人机协同, 深度强化学习, 卸载优化, 计算时延最小化