Multi-Agent Reinforcement Learning Enabled Spectrum Sharing for Vehicular Networks
Aiming at the problem that it is difficult for base stations to collect and manage instantaneous channel state information in high dynamic vehicle networking environment,a spectrum allocation algorithm for vehicle networking based on multi-agent deep reinforcement learning is proposed.The algorithm aims to maximize the network throughput under the constraints of vehicle communication delay and reliability,and uses the learning algorithm to improve the spectrum and power allocation strategy.Firstly,the implicit cooperative agent is trained by improving DQN model and EXP3 strategy.Secondly,the nonstationary problem caused by multi-agent concurrent learning is solved by using hysteretic Q-learning and concurrent experience replay trajectory.The simulation results show that the average successful delivery rate of the payload of the proposed algorithm can reach 95.89%,which is 16.48%higher than the random baseline algorithm.It can quickly ob-tain the approximate optimal solution,and has significant advantages in reducing the signaling overhead of the Internet of vehicles communication system.