Domain Generalization D3QN for Machinery Fault Diagnosis Across Different Working Conditions

Deep reinforcement learning models depend on the environment they interact with, which makes them hard to transfer across working conditions in equipment fault diagnosis. To address this, a domain generalization D3QN (dueling double deep Q network) model, DGD3QN, is proposed for machinery fault diagnosis across different working conditions. An adaptively weighted max-relevance min-redundancy method is used for feature selection, which removes redundancy from the data environment and refines it. A domain recognition network branch is added to the dueling network and double Q network structure, so that fault state information can be separated and extracted while the working-condition environment is masked. A quantitative reward matrix is constructed from the inter-class distances of the fault modes and combined with a domain recognition reward to form a graded reward strategy, which strengthens the agent's ability to identify fault modes that overlap across working conditions. Cross-condition diagnosis results for gearbox faults and bearing faults show that the proposed DGD3QN can largely reconcile the environment dependence of deep reinforcement learning networks with the environment independence that cross-condition fault diagnosis requires. The trained model can be reused and transferred across working conditions, which improves the applicability of deep reinforcement learning to cross-domain fault diagnosis.
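The two standard building blocks that D3QN combines can be sketched as follows. This is a minimal NumPy illustration of the generic dueling aggregation and the double DQN target only; it does not reproduce the domain recognition branch or the graded reward strategy of the proposed model, and the function names, array shapes, and toy values are assumptions made for the example.

```python
import numpy as np

def dueling_q(value, advantage):
    """Combine a state value V(s) with advantages A(s, a) into Q(s, a).

    value has shape (batch, 1); advantage has shape (batch, n_actions).
    Subtracting the mean advantage keeps V and A identifiable.
    """
    return value + advantage - advantage.mean(axis=1, keepdims=True)

def double_dqn_target(reward, done, q_next_online, q_next_target, gamma=0.99):
    """Double DQN target: the online network selects the next action,
    and the target network evaluates that action."""
    best_action = q_next_online.argmax(axis=1)
    next_value = q_next_target[np.arange(len(reward)), best_action]
    # Terminal transitions (done == 1) contribute no bootstrapped value.
    return reward + gamma * (1.0 - done) * next_value

# Toy batch (assumed values): 2 samples, 3 candidate fault classes as actions.
value = np.array([[1.0], [0.5]])
advantage = np.array([[0.0, 1.0, 2.0], [1.0, 1.0, 1.0]])
q = dueling_q(value, advantage)
```

In a diagnosis setting, each action would typically correspond to a candidate fault class, so the largest Q value in a row indicates the predicted class for that sample.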

fault diagnosis; domain generalization; feature screening; graded reward strategy; deep reinforcement learning

柏林、何牧耕、陈兵奎、刘小峰

College of Mechanical and Vehicle Engineering, Chongqing University, Chongqing 400044

2024

Journal of Mechanical Engineering
Chinese Mechanical Engineering Society

CSTPCD; Peking University Core Journals
Impact factor: 1.362
ISSN: 0577-6686
Year, volume (issue): 2024, 60(22)