基于知识融合和深度强化学习的智能紧急切机决策

Intelligent Emergency Generator Rejection Schemes Based on Knowledge Fusion and Deep Reinforcement Learning

李舟平 ¹曾令康 ¹姚伟 ¹胡泽 ¹帅航 ²汤涌 ³文劲宇¹

扫码查看

作者信息

1. 强电磁工程与新技术国家重点实验室(华中科技大学电气与电子工程学院),湖北省武汉市 430074
2. 田纳西大学电气工程与计算机科学系,美国田纳西州诺克斯维尔市 37996
3. 中国电力科学研究院有限公司,北京市海淀区 100192
折叠

摘要

紧急控制是在严重故障后维持电力系统暂态安全稳定的重要手段.目前常用的"人在环路"离线紧急控制决策制定方式存在效率不高、严重依赖专家经验等问题,该文提出一种基于知识融合和深度强化学习(deep reinforcement learning,DRL)的智能紧急切机决策制定方法.首先,构建基于DRL的紧急切机决策制定框架.然后,在智能体处理多个发电机决策时,由于产生的高维决策空间使得智能体训练困难,提出决策空间压缩和应用分支竞争 Q(branching dueling Q,BDQ)网络的两种解决方法.接着,为了进一步提高智能体的探索效率和决策质量,在智能体训练中融合紧急切机控制相关知识经验.最后,在10机39 节点系统中的仿真结果表明,所提方法可以在多发电机决策时快速给出有效的紧急切机决策,应用BDQ网络比决策空间压缩的决策性能更好,知识融合策略可引导智能体减少无效决策探索从而提升决策性能.

Abstract

Emergency control is an important means of maintaining power system transient security and stability following serious faults.The current popular"human-in-the-loop"offline emergency control decision-making method has some drawbacks,including low efficiency and heavy reliance on expert experience.Therefore,this paper proposes an intelligent emergency generator rejection decision-making method based on knowledge fusion and deep reinforcement learning(DRL).First,a DRL-based emergency generator rejection decision-making framework is built.Then,when the agent deals with multi-generator decisions,the resulting high-dimensional decision space makes the agent training difficult.There are two solutions proposed:decision space compression and the application of a branching dueling Q(BDQ)network.Next,to further improve the exploration efficiency and the decision-making quality of the agent,the knowledge and experience related to emergency generator rejection control are integrated to the agent training.Finally,the simulation results in the 10-machine 39-bus system show that the proposed method can quickly give effective emergency generator rejection decisions in multi-generator decision-making.Applying a BDQ network has better decision performance than decision space compression.The knowledge fusion strategy can guide the agents to reduce ineffective decision-making explorations and improve decision-making performance.

关键词

紧急切机决策/深度强化学习/决策空间/分支竞争Q网络/知识融合

Key words

emergency generator rejection decision/deep reinforcement learning/decision space/branching dueling Q network/knowledge fusion

引用本文复制引用

基金项目

国家自然科学基金项目(U1866602)

出版年

2024

中国电机工程学报

中国电机工程学会

中国电机工程学报

CSTPCD北大核心

影响因子：2.712

ISSN：0258-8013

参考文献量27

段落导航