电力工程技术2024,Vol.43Issue(6) :88-99.DOI:10.12158/j.2096-3203.2024.06.009

基于PPO算法的CIES低碳优化调度方法

A low-carbon optimization scheduling method of CIES based on PPO algorithm

陈凡 吴凌霄 王曼 吕干云 张小莲
电力工程技术2024,Vol.43Issue(6) :88-99.DOI:10.12158/j.2096-3203.2024.06.009

基于PPO算法的CIES低碳优化调度方法

A low-carbon optimization scheduling method of CIES based on PPO algorithm

陈凡 1吴凌霄 1王曼 1吕干云 1张小莲1
扫码查看

作者信息

  • 1. 南京工程学院电力工程学院,江苏南京 211167
  • 折叠

摘要

阶梯式碳交易机制以及优化调度模型求解算法是进行园区综合能源系统(community integrated energy sys-tem,CIES)优化调度的重要因素,现有文献对这两个因素的考虑不够全面.为此,文中在考虑阶梯式碳交易机制的基础上,提出采用近端策略优化(proximal policy optimization,PPO)算法求解CIES低碳优化调度问题.该方法基于低碳优化调度模型搭建强化学习交互环境,利用设备状态参数及运行参数定义智能体的状态、动作空间及奖励函数,再通过离线训练获取可生成最优策略的智能体.算例分析结果表明,采用PPO算法得到的CIES低碳优化调度方法能够充分发挥阶梯式碳交易机制减少碳排放量和提高能源利用率方面的优势.

Abstract

The tiered carbon trading mechanism and optimization scheduling model solving algorithm are pivotal for the community integrated energy system(CIES).CIES plays a crucial role in optimizing scheduling,yet existing literature often does not fully consider these two factors.To address this gap,the adoption of the proximal policy optimization(PPO)algorithm is proposed,which incorporates a ladder-type carbon trading mechanism to solve the low-carbon optimization scheduling problem of CIES.This method constructs a reinforcement learning interactive environment based on a low-carbon optimization scheduling model.The intelligent agent's state,action space,and reward function are defined using device status and operating parameters.An intelligent agent capable of generating the optimal policy is obtained through offline training.Case study analysis results demonstrate that the low-carbon optimization scheduling scheme for CIES achieved through the PPO algorithm,effectively leverages the advantages of the tiered carbon trading mechanism,significantly reducing carbon emissions and improving energy utilization efficiency.

关键词

园区综合能源系统(CIES)/优化调度/近端策略优化(PPO)算法/阶梯式碳交易机制/惩罚系数/碳排放

Key words

community integrated energy system(CIES)/optimize scheduling/proximal policy optimization(PPO)algorithm/ladder-type carbon trading mechanism/penalty coefficient/carbon emission

引用本文复制引用

出版年

2024
电力工程技术
江苏省电力公司 江苏省电机工程学会

电力工程技术

CSTPCD北大核心
影响因子:0.969
ISSN:2096-3203
段落导航相关论文