首页|被动探测视场角约束下的深度强化学习制导方法

被动探测视场角约束下的深度强化学习制导方法

扫码查看
针对红外制导导弹拦截机动目标的导引律设计问题,提出了一种纯角度量测下考虑视场角约束的深度强化学习制导方法.首先,将拦截制导问题转化为一个马尔可夫决策过程,建立了基于双延迟深度确定性策略梯度算法的深度强化学习制导模型,并充分考虑了导弹一阶自动驾驶仪特性;其次,设计了一种满足导引头视场角约束,同时又能权衡能量消耗和制导精度的综合奖励函数,并在典型场景下进行了深度强化学习制导律训练.在目标采用不同机动形式的条件下进行了对比仿真与蒙特卡洛仿真.仿真结果表明,该方法采用红外导引头探测到的纯角度信息,能够在满足视场角约束、过载指令饱和约束的前提下以较高精度命中目标,同时对目标的不同机动方式具有较强的鲁棒性.
Deep Reinforcement Learning Guidance Method Considering the Field-of-view Angle Constraint of Passive Detection
A deep reinforcement learning guidance method is proposed to address the problem of guidance law design for intercepting maneuverable targets with infrared-guided missiles,taking into consideration pure angle measurements and field-of-view angle constraints.Firstly,the interception guidance problem is formulated as a Markov Decision Process.A deep reinforcement learning guidance model is established based on the double delay deep deterministic policy gradient(TD3)algorithm,giving thorough consideration to the first-order autopilot characteristics of the missile.Secondly,a comprehensive reward function is designed to consider the field-of-view angle constraints of the passive seeker while balancing energy consumption and guidance accuracy,and the guidance law of deep reinforcement learning is trained in a variety of typical scenarios.The comparison simulation and Monte Carlo simulation are carried out under the condition of different maneuvering modes of the target.The simulation results show that through the method,the mssile can hit the target with high accuracy under the premise of meeting the constraint of the field-of-view angle and the constraint of overload instruction saturation by using the pure angle information detected by the infrared seeker.Meanwhile,it has strong robustness to different maneuvering modes of the target.

Deep reinforcement learningManeuvering targetField-of-view angle constraintPure angular measurementInfrared guidanceMissile interception

张青龙、赵斌、许新鹏

展开 >

西北工业大学精确制导与控制研究所,西安 710072

深度强化学习 机动目标 视场约束 纯角度量测 红外制导 导弹拦截

国家自然科学基金中央高校基本科研业务费

62373307G2022KY0608

2024

宇航学报
中国宇航学会

宇航学报

CSTPCD北大核心
影响因子:0.887
ISSN:1000-1328
年,卷(期):2024.45(8)