基于深度强化学习的多能流楼宇低碳调度方法

A low-carbon scheduling method for multi-energy flow buildings based on deep rein-forcement learning

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
维普
万方数据

中文摘要：建筑减排已成为中国达到"双碳"目标的重要途径,智慧楼宇作为多能流网络耦合的综合能源主体,面临碳排放量较多、多能流网络耦合程度高、负荷用能行为动态特性明显等问题.针对这一问题,提出基于深度强化学习的多能流楼宇低碳调度方法.首先,根据智慧楼宇的实际碳排放量,建立了一种奖惩阶梯型碳排放权交易机制.其次,面向碳市场和多能流耦合网络,以最小化运行成本为目标函数,建立多能流低碳楼宇调度模型,并将该调度问题转换为马尔可夫决策过程.然后,利用Rainbow算法进行优化调度问题的求解.最后,通过仿真分析验证了优化调度模型的可行性及有效性.

外文摘要：Building emissions reduction has become a crucial pathway for China to achieve its'dual-carbon'goals.As an integrated energy entity coupled with multi-energy flow networks,smart buildings face challenges such as high carbon emissions,a high degree of coupling in multi-energy flow networks,and distinct dynamic characteris-tics in load energy consumption behavior.In response to these challenges,a low-carbon scheduling method for multi-energy flow buildings based on deep reinforcement learning(deep RL)is proposed.Firstly,a reward and punish-ment ladder-type carbon emissions trading mechanism is established based on the actual carbon emissions of smart buildings.Secondly,targeting the carbon market and multi-energy flow coupling networks,a low-carbon scheduling model for multi-energy flow buildings is developed,aiming to minimize operating costs as the objective function,and the scheduling is transformed into a Markov decision process(MDP).Subsequently,the Rainbow algorithm is employed to solve the optimal scheduling.Finally,the feasibility and effectiveness of the optimal scheduling model are verified through simulation analysis.

外文关键词：

'dual-carbon'goalsmulti-energy flowlow-carbon schedulingdeep RL

作者：

胥栋、李逸超、李赟、徐刚、杜佳玮

展开 >

作者单位：

国网上海市电力公司浦东供电公司,上海 200122

关键词：

"双碳"目标多能流低碳调度深度强化学习

基金：

国网上海市电力公司浦东供电公司营销项目

项目编号：

640921220001

出版年：

2024

DOI：

10.19585/j.zjdl.202402014

浙江电力

浙江省电力学会浙江省电力试验研究院

浙江电力

CSTPCD

影响因子：0.438

ISSN：1007-1881

年,卷(期)：2024.43(2)

参考文献量20