面向低轨星座边缘计算的博弈强化学习方法综述

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：博弈强化学习作为人工智能领域的新兴范式,是当前解决低轨星座边缘计算问题的主流方法.融入博弈论的多智能体深度强化学习方法为复杂、动态、不确定性的星座边缘计算问题提供了新思路.通过梳理总结卫星组网、任务卸载以及资源调度3种卫星边缘计算主要研究方向,详细阐述了博弈强化学习范式基础,并从博弈模型、深度Q网络、深度确定性策略梯度以及近端策略优化等方面分别阐述了3种研究方向上的典型应用现状,最后对该领域的前沿挑战进行分析,期望为博弈强化学习范式与低轨星座边缘计算领域的交叉融合研究提供参考.

外文标题：Overview on game reinforcement learning methods for edge computing of low-orbit constellation

外文摘要：As a new paradigm in the field of artificial intelligence,game reinforcement learning is an advanced mainstream method to solve the edge computing problem of low-orbit constellation.The multi-agent deep reinforcement learning inte-grated into the game perspective provides a new idea for dynamic,complex and uncertain constellation edge computing problems.By summarizing the three main research directions of satellite edge computing,namely satellite networking,task unloading and resource scheduling,the basis of game reinforcement learning paradigm is elaborated,and the typi-cal applications in the three research directions are described respectively from the methods of game model,deep Q network,deep deterministic strategy gradient and near-end strategy optimization.In the end,the paper looks forward to the frontier challenges in this field,expected to provide a reference for the cross-fusion research of game reinforce-ment learning paradigm and low-orbit constellation edge computing.

外文关键词：

low-orbit constellationedge computinggame theorymulti-agent reinforcement learning

作者：

谷学强、张万鹏、谭思雨、罗俊仁、周棪忠

展开 >

作者单位：

国防科技大学智能科学学院,湖南长沙 410073

湖南先进技术研究院,湖南长沙 410205

关键词：

低轨星座边缘计算博弈论多智能体强化学习

基金：

国家自然科学基金项目国家自然科学基金项目

项目编号：

9227110862173336

出版年：

2024

DOI：

10.11959/j.issn.2096-6652.202432

智能科学与技术学报

CSTPCD

ISSN：

年,卷(期)：2024.6(3)

参考文献量22