首页|基于超松弛双Q学习的源荷储协同频率稳定算法研究

基于超松弛双Q学习的源荷储协同频率稳定算法研究

扫码查看
具有强随机特性的新能源规模化接入将给电网带来强随机扰动,传统控制方法无法有效解决分布式电网模式下由强随机扰动引起的频率失衡、控制性能标准变差的问题.该文从二次调频角度提出一种多区域互联电网的智能发电控制算法,即超松弛双Q学习算法,来获取多区域协同控制.所提算法在快速Q学习基础上引入超松弛因子ω来加速最优值函数的计算,同时引入双 Q 学习策略来解决强化学习Q算法体系中的动作探索值过高估计问题,以提升算法的收敛性与更新效率.在改进的IEEE标准两区负荷频率控制模型以及云南互联电网模型中进行仿真分析,结果可知,所提算法表现出更佳的控制性能与收敛速度.
Research on Source Load Storage Cooperative Frequency Stabilization Algorithm Based on Super Relaxed Double Q Learning
The large-scale access to new energy with strong random characteristics will bring strong random disturbance to the power grid.The traditional control methods can not effectively solve the problems of frequency instability and worse control performance standards caused by a strong random disturbance in the distributed power grid mode.From the point of secondary frequency modulation,this paper proposes a multi-agent cooperative control algorithm for distributed multi-area interconnected power grid,i.e.over-relaxation double Q learning algorithm to obtain multi-area cooperation control.The proposed algorithm introduces an over-relaxation factor based on fast Q-learning ω.To accelerate the calculation of the optimal value function,at the same time,the double Q learning strategy is introduced to solve the problem of overestimation of the active exploration value in the reinforcement learning of the Q algorithm system,so as to improve the update efficiency and convergence performance of the algorithm.Through the simulation of the improved IEEE standard two-area load frequency control model and Yunnan interconnected power grid model,the proposed algorithm shows better control performance and convergence speed.

new energysecondary frequency modulationreinforcement learningmulti agent

周博奇、柳丹、席磊、李彦营

展开 >

三峡大学电气与新能源学院,湖北省 宜昌市 443002

国网湖北省电力有限公司电力科学研究院,湖北省 武汉市 430000

新能源 二次调频 强化学习 多智能体

国家自然科学基金

52277108

2024

中国电机工程学报
中国电机工程学会

中国电机工程学报

CSTPCD北大核心
影响因子:2.712
ISSN:0258-8013
年,卷(期):2024.44(4)
  • 29