改进的A2C算法在交通信号控制中的应用

Application of improved A2C algorithm in traffic signal control

曹桐 ¹黄德启 ¹赵军¹

扫码查看

作者信息

1. 新疆大学电气工程学院,新疆乌鲁木齐 830047
折叠

摘要

针对目前以数据为驱动的交通控制算法在处理交通数据时容易忽略道路本身的空间信息的问题,提出一种结合道路拓扑结构信息的A2C(advantage actor-critic,A2C)算法.以A2C算法为基础,提取路网中车流量的信息,经过MLP(multilayer perceptron,MLP)对路口观测到的交通状态特征进行编码;结合图卷积神经网络提取道路之间的空间信息,引入多头注意力机制关注智能体之间的影响,在SUMO仿真环境中进行仿真验证.实验结果表明,改进的A2C算法相较于基线算法在等待时间、平均行驶速度上性能分别提升9.84％、7.57％,可以更好提高车辆通行效率.

Abstract

Aiming at the problem that the current data-driven traffic control algorithms tend to ignore the spatial information of the road itself when processing traffic data,an A2C algorithm combined with road topology information was proposed.The algo-rithm was based on the advantage actor-critic algorithm,the information of traffic flow in the road network was extracted.Through MLP,the observed traffic state characteristics were encoded at the intersection.Combined with the graph convolutional network,the spatial information between roads was extracted.The multi-head attention mechanism was introduced to focus on the influence between agents,and the simulation verification was carried out in the SUMO simulation environment.Experimental results show that compared with the baseline algorithm,the improved A2C algorithm improves the performance of 9.84％and 7.57％in terms of waiting time and average driving speed,respectively,which can better improve the efficiency of vehicle traffic.

关键词

强化学习/图卷积神经网络/优势行动者-评论家/多层感知机/多头注意力机制/交通信号控制/多智能体

Key words

reinforcement learning/graph convolutional network/advantage actor-critic/multilayer perceptron/multi-head-at-tention/traffic signal control/multi-agents system

引用本文复制引用

基金项目

国家自然科学基金(51468062)

出版年

2024

计算机工程与设计

中国航天科工集团二院706所

计算机工程与设计

CSTPCD北大核心

影响因子：0.617

ISSN：1000-7024

参考文献量3

段落导航