基于自注意力PPO算法的智能配电网多设备协同无功优化控制策略

扫码查看

原文链接

万方数据
维普

中文摘要：针对智能配电网无功可调控资源多样化场景下的快速趋优难题,提出了一种基于多头自注意力近端策略优化算法的多设备协同无功优化控制方法.首先,将无功优化问题建模为马尔可夫决策过程;然后,在深度强化学习框架下使用多头自注意力改进近端策略优化(PPO)算法对策略网络进行优化训练,算法采用多头自注意力网络获取配电网的实时状态特征,并通过剪切策略梯度法动态控制策略网络的更新幅度;最后,在改进IEEE 69节点系统进行仿真验证.结果表明,所提算法的控制性能优于现有先进强化学习算法.

外文标题：Multi-device Cooperative Reactive Power Optimization Control Strategy of Intelligent Distribution Network Based on Self-attention PPO Algorithm

外文摘要：Aiming at the fast optimization problem in the diversified scenarios of reactive power controllable resources in intelligent distribution networks,this paper proposes a multi-device collaborative reactive power optimization control method based on multi-head self-attention proximal policy optimization(PPO)algorithm.Firstly,the reactive power optimization problem is modeled as Markov decision process.Then,under the framework of deep reinforcement learning,the multi-head self-attention improved PPO algorithm is used to optimize and train the strategy network.The algorithm uses a multi-head self-attention network to obtain the real-time state characteristics of the distribution network,and dynamically controls the update amplitude of the strategy network by the pruning strategy gradient method.Finally,the simulation is done in the improved IEEE 69-node system.The results show that the control performance of the proposed algorithm is better than that of the existing advanced reinforcement learning algorithms.

外文关键词：

distribution networkdistributed photovoltaicvoltage reactive power controlmulti-head self-attentionproximal policy optimization algorithm

作者：

张黎元、宋兴旺、李冰洁、梁睿、刘长德、彭奕洲

展开 >

作者单位：

国网天津市电力公司城西供电分公司,天津 300190

天津大学电气自动化与信息工程学院,天津 300072

关键词：

配电网分布式光伏电压无功控制多头自注意力近端策略优化算法

基金：

国家自然科学基金资助项目国网天津市电力公司科技项目

项目编号：

52277118城西-研发2023-01

出版年：

2024

DOI：

10.20204/j.sp.2024.10006

智慧电力

陕西省电力公司

智慧电力

CSTPCD北大核心

影响因子：0.831

ISSN：1673-7598

年,卷(期)：2024.52(10)

参考文献量18