一种基于生成对抗模仿学习的作战决策方法

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：为研究有限作战指挥样本下的智能决策方法,针对作战决策经验难以表达和智能决策学习训练样本稀缺等问题,基于联合战役仿真推演环境,提出了一种基于生成对抗模仿学习的作战决策方法.该方法整合了作战决策经验表示与学习过程,在上层决策和底层动作分层的基础上,采用规则定义特定任务执行逻辑,并利用生成对抗模仿学习算法提升智能体场景泛化能力.在构设的典型对抗场景中,该方法达到了预期效果,算法训练收敛,智能体输出决策合理.实验结果初步表明,生成对抗模仿学习作为一种智能作战决策方法,具有进一步研究价值.

外文标题：A decision-making method based on generative adversarial imitation learning

外文摘要：To study the intelligent decision making methods under limited decision samples,aiming at the problems that op-erational decision-making experience is difficult to express and the training samples for intelligent decision learning are limit-ed,based on the joint operational simulation and drill environment,a decision-making method based on generative adversari-al imitation learning is proposed.This method integrates the operational decision-making experience representation and learn-ing process.On the basis of high-level decision-making and low-level action,rule definitions are used to specify the logic of task execution,and generative adversarial imitation learning algorithms are utilized to improve the generalization ability of in-telligent agents in scenarios.This method achieved expected results in the constructed typical adversarial scenarios.The algo-rithm training converged and the decisions output by the intelligent agent are reasonable.Preliminary experimental results in-dicate that generative adversarial imitation learning,as an intelligent operational decision-making method,has value for fur-ther research.

外文关键词：

intelligent decision-makingoperational decision-makingrule-based methodgenerative adversarial imitation learning

作者：

李东、许霄、吴琳

展开 >

作者单位：

国防大学联合作战学院, 北京 100091

关键词：

智能决策作战决策基于规则的方法生成对抗模仿学习

基金：

国家自然科学基金

项目编号：

62006235

出版年：

2024

DOI：

10.3969/j.issn.1673-3819.2024.02.003

指挥控制与仿真

中国船舶重工集团公司　第七一六研究所

指挥控制与仿真

CSTPCD

影响因子：0.309

ISSN：1673-3819

年,卷(期)：2024.46(2)

参考文献量15