被动探测视场角约束下的深度强化学习制导方法

扫码查看

原文链接

万方数据
维普

中文摘要：针对红外制导导弹拦截机动目标的导引律设计问题,提出了一种纯角度量测下考虑视场角约束的深度强化学习制导方法.首先,将拦截制导问题转化为一个马尔可夫决策过程,建立了基于双延迟深度确定性策略梯度算法的深度强化学习制导模型,并充分考虑了导弹一阶自动驾驶仪特性;其次,设计了一种满足导引头视场角约束,同时又能权衡能量消耗和制导精度的综合奖励函数,并在典型场景下进行了深度强化学习制导律训练.在目标采用不同机动形式的条件下进行了对比仿真与蒙特卡洛仿真.仿真结果表明,该方法采用红外导引头探测到的纯角度信息,能够在满足视场角约束、过载指令饱和约束的前提下以较高精度命中目标,同时对目标的不同机动方式具有较强的鲁棒性.

外文标题：Deep Reinforcement Learning Guidance Method Considering the Field-of-view Angle Constraint of Passive Detection

外文摘要：A deep reinforcement learning guidance method is proposed to address the problem of guidance law design for intercepting maneuverable targets with infrared-guided missiles,taking into consideration pure angle measurements and field-of-view angle constraints.Firstly,the interception guidance problem is formulated as a Markov Decision Process.A deep reinforcement learning guidance model is established based on the double delay deep deterministic policy gradient(TD3)algorithm,giving thorough consideration to the first-order autopilot characteristics of the missile.Secondly,a comprehensive reward function is designed to consider the field-of-view angle constraints of the passive seeker while balancing energy consumption and guidance accuracy,and the guidance law of deep reinforcement learning is trained in a variety of typical scenarios.The comparison simulation and Monte Carlo simulation are carried out under the condition of different maneuvering modes of the target.The simulation results show that through the method,the mssile can hit the target with high accuracy under the premise of meeting the constraint of the field-of-view angle and the constraint of overload instruction saturation by using the pure angle information detected by the infrared seeker.Meanwhile,it has strong robustness to different maneuvering modes of the target.

外文关键词：

Deep reinforcement learningManeuvering targetField-of-view angle constraintPure angular measurementInfrared guidanceMissile interception

作者：

张青龙、赵斌、许新鹏

展开 >

作者单位：

西北工业大学精确制导与控制研究所,西安 710072

关键词：

深度强化学习机动目标视场约束纯角度量测红外制导导弹拦截

基金：

国家自然科学基金中央高校基本科研业务费

项目编号：

62373307G2022KY0608

出版年：

2024

DOI：

10.3873/j.issn.1000-1328.2024.08.012

宇航学报

中国宇航学会

宇航学报

CSTPCD北大核心

影响因子：0.887

ISSN：1000-1328

年,卷(期)：2024.45(8)