Research on Scheduling Strategies Simulation for Building Air-conditioning Systems Based on Transfer Imitation Learning

To address unstable performance and inefficient training under low-quality data conditions in the initial stage of online deployment of air-conditioning scheduling, this paper proposes a method for formulating simulated air-conditioning scheduling strategies based on transfer imitation learning. Building operation strategies are obtained by reinforcement learning; a standard building simulation model is established as the source domain for transfer learning; and an imitation learning loss is incorporated into the agent's loss function to enhance algorithm performance. The results show that, compared with the method without transfer learning, the operating benefit improves by 16.2%, which effectively resolves the operating instability in the early stage of reinforcement learning training. Compared with the method without imitation learning, the operating benefit improves by 11.5%, which effectively improves the training efficiency of reinforcement learning.
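
The abstract states that an imitation learning loss is added to the agent's loss function and that a source-domain model is used for transfer learning, but it does not give the formulas. The following is only a minimal sketch of how such a combination is commonly written, assuming a mean-squared-error imitation term, a scalar weight, and parameter copying as the transfer step. The names `imitation_loss`, `combined_loss`, `warm_start`, and `weight` are hypothetical and are not taken from the paper.

```python
from typing import Dict, Sequence


def imitation_loss(agent_actions: Sequence[float],
                   expert_actions: Sequence[float]) -> float:
    """Mean squared error between agent and expert actions.

    Assumption: a behavior-cloning style term; the paper's actual
    imitation loss is not specified in the abstract.
    """
    if len(agent_actions) != len(expert_actions) or not agent_actions:
        raise ValueError("action sequences must be non-empty and equally long")
    squared = [(a - e) ** 2 for a, e in zip(agent_actions, expert_actions)]
    return sum(squared) / len(squared)


def combined_loss(rl_loss: float,
                  agent_actions: Sequence[float],
                  expert_actions: Sequence[float],
                  weight: float = 0.5) -> float:
    """Hypothetical total loss: RL loss plus a weighted imitation term."""
    return rl_loss + weight * imitation_loss(agent_actions, expert_actions)


def warm_start(source_params: Dict[str, float],
               target_params: Dict[str, float]) -> Dict[str, float]:
    """Assumed transfer step: reuse source-domain values for matching names,
    keep the target's own value where the source has no counterpart."""
    return {name: source_params.get(name, value)
            for name, value in target_params.items()}
```

In this sketch, a larger `weight` pulls the agent toward the expert's actions early in training, while `warm_start` lets the target building begin from parameters learned on the standard simulation model instead of from scratch.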

transfer learning; reinforcement learning; imitation learning; air conditioning control; room temperature control

王翘楚、丁研、梁传志、张颢正、黄宸

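School of Environmental Science and Engineering, Tianjin University, Tianjin 300354

Center of Science and Technology & Industrialization Development, Ministry of Housing and Urban-Rural Development, Beijing 100835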

2024

Journal of System Simulation
Beijing Simulation Center; Chinese Association for System Simulation

CSTPCD; Peking University Core Journals
Impact factor: 0.551
ISSN: 1004-731X
Year, volume (issue): 2024, 36(12)