Coordinated Active Power-Frequency Control Based on Safe Deep Reinforcement Learning

The continuous increase in renewable energy penetration poses a severe challenge to frequency control in interconnected power grids. Because conventional automatic generation control (AGC) strategies do not consider power flow security constraints, the traditional approach relies on tentative generator power adjustments guided by expert knowledge and experience, which is time consuming. AGC optimization models based on optimal power flow are non-convex and large in scale, so they take a long time to solve and can fail to converge. Conventional deep reinforcement learning offers "offline training and online end-to-end strategy formation", but it cannot guarantee system security during action exploration. This paper therefore proposes a coordinated optimal control method for active power and frequency based on safe deep reinforcement learning. First, grid frequency control is modeled as a constrained Markov decision process, and the agent is designed by adding the relevant safety constraints to the decision process. Then, the agent is trained and its performance improved through continuous interaction with a case study built on the actual East China Power Grid system. Finally, the agent's decisions are compared with the conventional AGC strategy. The results show that the proposed method quickly generates active power-frequency control strategies under various operating modes, keeps the grid secure while system frequency recovers, and can assist dispatchers in online decision-making.
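
For readers unfamiliar with the term, a constrained Markov decision process is usually written in the generic textbook form below. This is a sketch of that standard form only; the abstract does not give the paper's actual states, actions, reward, or constraint functions, so the symbols $r$, $c_i$, $d_i$, $\gamma$, $T$, and $m$ are illustrative assumptions. In a frequency control setting, one might take $r$ to penalize frequency deviation and each $c_i$ to measure a violation such as a line flow limit, but that mapping is likewise an assumption.

```latex
% Generic CMDP: maximize expected discounted reward subject to
% bounds d_i on m expected discounted cost signals c_i.
% All symbols are illustrative, not taken from the paper.
\begin{aligned}
\max_{\pi}\quad & J_r(\pi) = \mathbb{E}_{\tau\sim\pi}\left[\sum_{t=0}^{T}\gamma^{t}\, r(s_t,a_t)\right] \\
\text{s.t.}\quad & J_{c_i}(\pi) = \mathbb{E}_{\tau\sim\pi}\left[\sum_{t=0}^{T}\gamma^{t}\, c_i(s_t,a_t)\right] \le d_i,
\qquad i = 1,\dots,m
\end{aligned}
```

One common way to handle such constraints is Lagrangian relaxation, which optimizes $J_r(\pi) - \sum_i \lambda_i \,(J_{c_i}(\pi) - d_i)$ with multipliers $\lambda_i \ge 0$. The abstract does not say which safe reinforcement learning technique the authors use.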

coordinated power and frequency control; artificial intelligence (AI); safe deep reinforcement learning; constrained Markov decision process; agent

ZHOU Yi (周毅), ZHOU Liangcai (周良才), SHI Di (史迪), ZHAO Xiaoying (赵小英), SHAN Xin (闪鑫)

East China Branch of State Grid Corporation of China, Shanghai 200002, China

AINERGY, Santa Clara, CA 95051, USA

NARI Technology Co., Ltd., Nanjing 210024, China

Science and Technology Project of the East China Branch of State Grid Corporation of China (SGHD0000DKJS2100235)

2024

Journal of Shanghai Jiao Tong University
Shanghai Jiao Tong University

Indexed in: CSTPCD; Peking University Core Journals (北大核心)
Impact factor: 0.555
ISSN: 1008-7095
Year, volume (issue): 2024, 58(5)