Joint Routing and Resource Scheduling Algorithm for Large-scale Multi-mode Mesh Networks Based on Reinforcement Learning
In order to balance the transmission reliability and efficiency of large-scale multi-mode mesh networks in the new power system,a two-stage algorithm is proposed based on reinforcement learning for joint routing selection and resource scheduling in large-scale multi-mode mesh networks,building upon the description and analysis of optimization problems.In the first stage,based on the network topology information and service requirements,a multi shortest path routing algorithm is utilized to generate all the shortest paths.In the second stage,a resource scheduling algorithm based on Multi-Armed Bandit(MAB)is proposed.The algorithm constructs the arms of the MAB based on the obtained set of shortest paths,then calculates the reward according to the service demands,and finally gives the optimal route selection and resource scheduling mode for service transmission.Simulation results show that the proposed algorithm can meet different service transmission requirements,and achieve an efficient balance between the average end-to-end path delay and the average transmission success rate.