中国航空学报(英文版)2024,Vol.37Issue(2) :417-430.DOI:10.1016/j.cja.2023.10.011

Efficient and fair PPO-based integrated scheduling method for multiple tasks of SATech-01 satellite

Qi SHI Lu LI Ziruo FANG Xingzi BI Huaqiu LIU Xiaofeng ZHANG Wen CHEN Jinpei YU
中国航空学报(英文版)2024,Vol.37Issue(2) :417-430.DOI:10.1016/j.cja.2023.10.011

Efficient and fair PPO-based integrated scheduling method for multiple tasks of SATech-01 satellite

Qi SHI 1Lu LI 2Ziruo FANG 3Xingzi BI 2Huaqiu LIU 2Xiaofeng ZHANG 2Wen CHEN 3Jinpei YU3
扫码查看

作者信息

  • 1. Shanghai Satellite Network Research Institute CO.,LTD,Shanghai 201210,China
  • 2. Innovation Academy for Microsatellites of Chinese Academy of Sciences,Shanghai 201306,China
  • 3. Innovation Academy for Microsatellites of Chinese Academy of Sciences,Shanghai 201306,China;University of Chinese Academy of Sciences,Beijing 100039,China
  • 折叠

Abstract

SATech-01 is an experimental satellite for space science exploration and on-orbit demon-stration of advanced technologies.The satellite is equipped with 16 experimental payloads and sup-ports multiple working modes to meet the observation requirements of various payloads.Due to the limitation of platform power supply and data storage systems,proposing reasonable mission plan-ning schemes to improve scientific revenue of the payloads becomes a critical issue.In this article,we formulate the integrated task scheduling of SATech-01 as a multi-objective optimization prob-lem and propose a novel Fair Integrated Scheduling with Proximal Policy Optimization(FIS-PPO)algorithm to solve it.We use multiple decision heads to generate decisions for each task and design the action mask to ensure the schedule meeting the platform constraints.Experimental results show that FIS-PPO could push the capability of the platform to the limit and improve the overall obser-vation efficiency by 31.5%compared to rule-based plans currently used.Moreover,fairness is con-sidered in the reward design and our method achieves much better performance in terms of equal task opportunities.Because of its low computational complexity,our task scheduling algorithm has the potential to be directly deployed on board for real-time task scheduling in future space projects.

Key words

Satellite observatories/SATech-01/Multi-modes platform/Scheduling algorithms/Reinforcement learning/Proximal Policy Optimiza-tion(PPO)

引用本文复制引用

基金项目

Strategic Priority Program on Space Science,Chinese Academy of Sciences()

出版年

2024
中国航空学报(英文版)
中国航空学会

中国航空学报(英文版)

CSTPCDEI
影响因子:0.847
ISSN:1000-9361
参考文献量1
段落导航相关论文