Resource Scheduling Algorithm for Space Science Satellite Ground Data Processing

doi:10.19678/j.issn.1000-3428.0260191

Abstract

Abstract: With the increasing number of space science satellites, the types of onboard scientific payloads have become increasingly diverse, and the volume of downlinked scientific data has grown continuously. However, the available computational resources of ground data processing systems for space science satellites remain limited. Consequently, data processing tasks generated by satellites during in-orbit operations must be completed under constrained resource conditions. Meanwhile, different tasks exhibit significant heterogeneity in terms of timeliness requirements and computational resource consumption characteristics, and the system workload and resource states vary dynamically over time. Therefore, scheduling strategies need to dynamically adjust the execution order of data processing tasks and resource allocation schemes based on real-time system states (including task loads and computational resource utilization) to improve overall processing efficiency and system responsiveness.To address these challenges, we propose an online decision-making deep reinforcement learning–based resource scheduling algorithm, DeepRL-Sched, which is built upon Proximal Policy Optimization (PPO) and models the satellite data processing task scheduling problem as a Markov Decision Process (MDP). To mitigate the short-sighted decision-making issue caused by reinforcement learning methods relying solely on the current system state, as well as the challenges of slow convergence and unstable training, we design two key components: a computational resource demand prediction module and an imitation learning module. The former predicts future task workloads and resource demands to provide the scheduling policy with foresight information, thereby alleviating short-sighted decisions caused by partial observability. The latter employs imitation learning to extract prior knowledge from high-quality expert scheduling strategies, guiding the training of the policy network and significantly improving convergence speed and training stability.Experimental results demonstrate that the proposed algorithm effectively enhances the scheduling efficiency of space science satellite ground data processing systems, reduces the overall task completion time, and significantly improves the timeliness of processing high-priority tasks.

摘要： 随着空间科学卫星数量的持续增加及科学载荷类型的多样化，下行科学数据规模不断增长，数据处理任务在数量、类型及处理流程等方面的复杂度显著提升，不同数据处理任务在时效性要求和计算资源占用特征方面存在显著差异，给地面数据处理系统带来了更高的计算与调度压力，因此需要结合空间科学卫星数据处理任务特点进行计算资源调度策略的研究，对卫星数据处理任务的执行顺序与计算资源进行更高效的调度与分配，从而提升整体处理效率与系统响应能力。本文提出了一种支持在线决策的深度强化学习资源调度算法DeepRL- Sched，该算法以近端策略优化（Proximal Policy Optimization，PPO）为核心，将卫星数据处理任务调度过程建模为马尔可夫决策过程。为了解决强化学习方法仅依赖当前系统状态进行决策而易产生短视性的问题，以及训练过程中收敛慢、稳定性差的挑战，设计了计算资源需求预测模块和模仿学习模块两个关键组件：前者通过预测未来任务负载与资源需求，构建扩展状态表示，从而增强策略对系统未来演化趋势的感知能力，缓解因局部观测导致的短视决策；后者则采用模仿学习方法，从高质量专家调度策略中提取先验知识，引导策略网络训练，从而有效提升算法的收敛速度与训练稳定性。实验结果表明，该算法能够有效提升空间科学卫星地面数据处理系统的调度效率，降低任务整体完成时间，并显著改善高时效性任务的处理及时性。

ZOU Shengpeng, MA Fuli, LI Yunlong, YU Qinsi, HU Xiaoyan, ZOU Ziming. Resource Scheduling Algorithm for Space Science Satellite Ground Data Processing[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0260191.

邹圣鹏, 马福利, 李云龙, 于勤思, 胡晓彦, 邹自明. 面向空间科学卫星数据处理的资源调度算法[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0260191.

/ Recommend / Download Citations

URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0260191

References

[1] 马福利,纪珍,杨晓艳,等.EP卫星数据预处理与分发服务平台设计与实现[J].空间科学学报,2025,45(04):1098-1113. Ma Fuli, Ji Zhen, Yang Xiaoyan, et al. Design and Implementation of EP Satellite Data Preprocessing and Distribution Service Platform[J].空间科学学报,2025,45(04):1098-1113.
[2] 杨阳,张红梅,王爽.基于科学工作流的通用卫星数据处理调度系统[J].计算机技术与发展,2024,34(06):164-170. YANG Yang,ZHANG Hong-mei,WANG Shuang.A Universal Satellite Data Processing and Scheduling System Based on Scientific Workflow[J].计算机技术与发展,2024,34(06):164-170
[3] H. Chen, Y. Zhou, C. Du and J. Li, "A satellite cluster data transmission scheduling method based on genetic algorithm with rote learning operator," 2016 IEEE Congress on Evolutionary Computation (CEC), Vancouver, BC, Canada, 2016
[4] Malte Schwarzkopf, Andy Konwinski, Michael Abd-El-Malek, and John Wilkes. 2013. Omega: flexible, scalable schedulers for large compute clusters. In Proceedings of the 8th ACM European Conference on Computer Systems (EuroSys '13). Association for Computing Machinery, New York, NY, USA, 351–364.
[5] Hongzi Mao, Mohammad Alizadeh, Ishai Menache, and Srikanth Kandula. 2016. Resource Management with Deep Reinforcement Learning. In Proceedings of the 15th ACM Workshop on Hot Topics in Networks (HotNets '16). Association for Computing Machinery, New York, NY, USA, 50–56.
[6] Huang, Y.; Mu, Z.; Wu, S.; Cui, B.; Duan, Y. Revising the Observation Satellite Scheduling Problem Based on Deep Reinforcement Learning. Remote Sens. 2021, 13, 2377.
[7] Ying Du, Shuo Zhang, Pu Cheng, Rita Yi Man Li, Xiao-Guang Yue,Remote Sensing Data Processing Process Scheduling Based on Reinforcement Learning in Cloud Environment,CMES - Computer Modeling in Engineering and Sciences,Volume 135, Issue 3,2022.
[8] 马福利,石涛,陈玲,等.面向多星多任务的大数据处理系统设计[J].大数据,2021,7(05):3-16. Ma Fuli, Shi Tao, Chen Ling, et al. Design of big data processing system supporting multi-satellites and multi-tasks[J].大数据,2021,7(05):3-16.
[9] 马福利,陈玲,李冰,等.GECAM卫星快速预处理流程设计与实现[J].天文研究与计术,2022,,19(03):274-282. Ma Fuli, Chen Ling, Li Bing, et al. Design and Implementation of Rapid Preprocessing Flow for GECAM Satellite[J].天文研究与计术,2022,,19(03):274-282.
[10] W. Guo, W. Tian, Y. Ye, L. Xu and K. Wu, "Cloud Resource Scheduling With Deep Reinforcement Learning and Imitation Learning," in IEEE Internet of Things Journal, vol. 8, no. 5, pp. 3576-3586, 1 March1, 2021
[11] 汤佳伟,郭铁铮,闻英友.基于强化学习的Kubernetes云边协同计算调度算法[J].浙江大学学报(工学版),2025,59(11):2400-2408.
[12] VARGHESE B, WANG N, BARBHUIYA S, et al. Challenges and opportunities in edge computing [C]//IEEE International Conference on Smart Cloud. New York: IEEE, 2016: 20-26.
[13] MAO H, SCHWARZKOPF M, VENKATAKRISHNAN S B, et al. Learning scheduling algorithms for data processing clusters [C]//Proceedings of the ACM Special Interest Group on Data Communication. New York: ACM, 2019: 270-288.
[14] 李婷,贾鹏德,杨宇,等.面向卫星常规测控任务的地面站资源调度[J].系统仿真技术,2024,20(01):65-72. Li Ting, Jia Pengde, Yang Yu, et al. Ground Station Resource Scheduling for Routine TT&C Tasks of Satellites[J].系统仿真技术,2024,20(01):65-72.
[15] 高立京,陈志敏,王春梅,等.基于Flink的空间科学卫星数据实时处理方法[J].计算机仿真,2023,40(07):26-31+290. Gao Lijing, Chen Zhimin, Wang Chunmei, et al. A Real-time Processing Method for Space Science Satellite Data Based on Flink[J].计算机仿真,2023,40(07):26-31+290.
[16] 王爽,张红梅,杨阳,等.基于Daisy的天文卫星数据处理软件系统设计与实现[J].核电子学与探测技术,2024,44(02):191-199. Wang Shuang, Zhang Hongmei, Yang Yang, et al. Design and Implementation of an Astronomical Satellite Data Processing Software System Based on Daisy[J].核电子学与探测技术,2024,44(02):191-199.
[17] Huilong Fan, Zhan Yang, Xi Zhang, Shimin Wu, Jun Long, Limin Liu,A novel multi-satellite and multi-task scheduling method based on task network graph aggregation,Expert Systems with Applications,Volume 205,2022,117565.
[18] He, Yongming & Wu, Guohua & Chen, Ying-Wu & Pedrycz, Witold. (2021). A Two-stage Framework and Reinforcement Learning-based Optimization Algorithms for Complex Scheduling Problems. 10.48550/arXiv.2103.05847.
[19] Fengcun Li and Bo Hu. 2019. DeepJS: Job Scheduling Based on Deep Reinforcement Learning in Cloud Data Center. In Proceedings of the 4th International Conference on Big Data and Computing (ICBDC '19). Association for Computing Machinery, New York, NY, USA, 48–53.
[20] Jitendra Kumar, Rimsha Goomer, Ashutosh Kumar Singh,Long Short Term Memory Recurrent Neural Network (LSTM-RNN) Based Workload Forecasting Model For Cloud Datacenters,Procedia Computer Science,Volume 125,2018,Pages 676-682.
[21] Y. Li et al., "Task Placement and Resource Allocation for Edge Machine Learning: A GNN-Based Multi-Agent Reinforcement Learning Paradigm," in IEEE Transactions on Parallel and Distributed Systems, vol. 34, no. 12, pp. 3073-3089, Dec. 2023.
[22] J. Gao, H. Wang and H. Shen, "Task Failure Prediction in Cloud Data Centers Using Deep Learning," in IEEE Transactions on Services Computing, vol. 15, no. 3, pp. 1411-1422, 1 May-June 2022.
[23] L. P. Kaelbling, M. L. Littman, and A. W. Moore, “Reinforcement learning: A survey,” J. Artif. Intell. Res., vol. 4, pp. 237–285, May 1996.
[24] D. Zeng, L. Gu, S. Pan, J. Cai, and S. Guo, “Resource management at the network edge: A deep reinforcement learning approach,” IEEE Netw., vol. 33, no. 3, pp. 26–33, May/Jun. 2019.
[25] V. R. Konda and J. N. Tsitsiklis, “Actor-critic algorithms,” in Advances in Neural Information Processing Systems. Cambridge, MA, USA: MIT Press, 2000, pp. 1008–1014.
[26] Zhang, H.; Buchmeister, B.; Li, X.; Ojstersek, R. An Efficient Metaheuristic Algorithm for Job Shop Scheduling in a Dynamic Environment. Mathematics 2023, 11, 2336.
[27] Li, F., Lang, S., Tian, Y. et al. A transformer-based deep reinforcement learning approach for dynamic parallel machine scheduling problem with family setups. J Intell Manuf 36, 4735–4768 (2025).
[28] 韩向敏,鲍泓,梁军,潘峰,玄祖兴. 一种基于深度强化学习的自适应巡航控制算法[J]. 计算机工程,2018, 44(7): 32 - 35,41.
[29] 朱炜,王俊,周迅钊. 基于负载均衡的医院云计算系统资源调度方案[J]. 计算机工程, 2018, 44(3): 37 - 41,54.

Please choose a citation manager

Content to export