面向动态毁伤概率和目标价值的武器目标分配方法

扫码查看

原文链接

万方数据
维普

中文摘要：面向毁伤和目标价值动态变化条件下的大规模武器目标动态分配问题，提出了一种基于深度强化学习的武器目标分配求解方法。该方法采用双神经网络结构，基于武器目标分配目标函数设计了一套简单、直观的状态与奖励建模方法。通过仿真实验对所提方法进行了验证，结果表明，所提方法能够较快实现收敛，且整体毁伤和计算效率上优于基于粒子群的方法。所提方法能够有效应对毁伤概率和目标价值动态变化条件下的武器目标分配问题，说明了其良好的拓展性。该方法可应用于作战任务规划、仿真单元自动交火等场景下的武器目标分配快速求解。

外文标题：A Method for Solving Weapon Target Assignment Problem with Dynamic Damage Rates and Target Values

外文摘要：The paper proposed a deep Q-learning method to solve the weapon target assignment problem with dynamic damage rates and target values.The DQN method adopted a double network structure.A straightforward method is proposed for modeling the state and reward function of the DQN,which is designed based on the objective function of the weapon target assignment problem.The proposed DQN model was tested by using several simulated scenarios.Results showed the model can converge fast and effectively solve the weapon target assignment problem with dynamic damage rates and target values.The proposed method achieved a better total damage rate and less computation time than the particle swarm-based method.The proposed method can be applied to weapon target assignment problem under the scenario of combat mission planning and combat simulation.

外文关键词：

weapon target assignmentdeep reinforcement learningdynamic damage ratedynamic target value

作者：

林雕、朱燕、杨剑

展开 >

作者单位：

陆军指挥学院,南京 210045

解放军61175部队,南京 210046

信息工程大学,郑州 450052

关键词：

武器目标分配深度强化学习动态毁伤动态目标价值

出版年：

2024

DOI：

10.3969/j.issn.1002-0640.2024.02.021

火力与指挥控制

火力与指挥控制研究会,火力与指挥控制专业情报网

火力与指挥控制

CSTPCD北大核心

影响因子：0.312

ISSN：1002-0640

年,卷(期)：2024.49(2)

参考文献量17