Research on Integrated Energy Optimal Scheduling Based on Improved TD3
Aiming at the economic optimal scheduling problem of the integrated energy system,a twin delayed deep deterministic strategy gradient algorithm(TD3)based on priority experience playback mechanism and absolute mean method is proposed.The priority experience playback mechanism optimizes the sampling process by distinguis-hing the sample value,and the absolute mean method calculates the TD error to ensure the reliability of the sample value.Taking the total operation cost of the system as the index,the system scheduling model is constructed,and the environment state,scheduling action and reward function are designed.The simulation results of a microgrid in a uni-versity show that the proposed algorithm can coordinate the output of equipment more effectively and improve the e-conomy of the system than the TD3 algorithm,deep deterministic policy gradient(DDPG)algorithm and CPLEX sol-ver.
Deep reinforcement learningIntegrated energy systemAbsolute meanPriority experience replay