Integrated avionics system safety optimization method based on deep reinforcement learning
To solve the problem that traditional safety design methods based on manual inspection were difficult to cope with the explosion of optional residence solutions caused by the large-scale integration of avionics systems,an avionics system partition model,task model and safety criticality level quantification model were constructed,and the comprehensive design optimization considering safety was modeled as an MDP problem.An optimization method of Soft Action-Critic(SAC)algorithm based on Actor-Critic framework was proposed.In order to obtain the correlation between the parameter selection and training results of SAC algorithm,the sensitivity of the algorithm parameters was studied.At the same time,to verify the superiority of the optimization method based on the SAC algorithm in optimizing the comprehensive design considering safety,optimization comparison experiments were carried out with the Deep Deterministic Policy Gradient(DDPG)algorithm and the traditional allocation algorithm as the objects.The results show that under the optimal parameter combination,the maximum reward after using convergence of SAC algorithm increases by nearly 8%compared with other parameter combinations,and the convergence time is shortened by nearly 16.6%.Compared with the DDPG algorithm and the traditional allocation algorithm,the optimization method based on SAC algorithm has improved approximately 62%,7464%,8370%,2123%and 775%in terms of the maximum reward,cumulative constraint violation rate,partition balance risk effect,partition resource utilization and solution time.
deep reinforcement learningintegrated modular avionicssafetyMarkov decision process(MDP)integrated design