首页|一种基于生成对抗模仿学习的作战决策方法

一种基于生成对抗模仿学习的作战决策方法

扫码查看
为研究有限作战指挥样本下的智能决策方法,针对作战决策经验难以表达和智能决策学习训练样本稀缺等问题,基于联合战役仿真推演环境,提出了一种基于生成对抗模仿学习的作战决策方法.该方法整合了作战决策经验表示与学习过程,在上层决策和底层动作分层的基础上,采用规则定义特定任务执行逻辑,并利用生成对抗模仿学习算法提升智能体场景泛化能力.在构设的典型对抗场景中,该方法达到了预期效果,算法训练收敛,智能体输出决策合理.实验结果初步表明,生成对抗模仿学习作为一种智能作战决策方法,具有进一步研究价值.
A decision-making method based on generative adversarial imitation learning
To study the intelligent decision making methods under limited decision samples,aiming at the problems that op-erational decision-making experience is difficult to express and the training samples for intelligent decision learning are limit-ed,based on the joint operational simulation and drill environment,a decision-making method based on generative adversari-al imitation learning is proposed.This method integrates the operational decision-making experience representation and learn-ing process.On the basis of high-level decision-making and low-level action,rule definitions are used to specify the logic of task execution,and generative adversarial imitation learning algorithms are utilized to improve the generalization ability of in-telligent agents in scenarios.This method achieved expected results in the constructed typical adversarial scenarios.The algo-rithm training converged and the decisions output by the intelligent agent are reasonable.Preliminary experimental results in-dicate that generative adversarial imitation learning,as an intelligent operational decision-making method,has value for fur-ther research.

intelligent decision-makingoperational decision-makingrule-based methodgenerative adversarial imitation learning

李东、许霄、吴琳

展开 >

国防大学联合作战学院, 北京 100091

智能决策 作战决策 基于规则的方法 生成对抗模仿学习

国家自然科学基金

62006235

2024

指挥控制与仿真
中国船舶重工集团公司 第七一六研究所

指挥控制与仿真

CSTPCD
影响因子:0.309
ISSN:1673-3819
年,卷(期):2024.46(2)
  • 15