首页|基于时空依赖关系多智能体强化学习的多路口交通信号协同控制方法

基于时空依赖关系多智能体强化学习的多路口交通信号协同控制方法

扫码查看
面对日益严重的交通拥堵现象,智能交通信号控制已成为提升城市道路网络性能必不可少的手段。提出一种基于时空依赖关系多智能体强化学习算法的多路口交通信号控制方法STLight(spatiotemporal traffic light control)。通过基于注意力机制的时空依赖模块 STDM(spatiotemporal dependent module),STLight可将初始交通观测数据提取为时空特征,以有效捕获各交叉路口间的时空依赖关系。此外,基于所提取的时空特征,STLight在基于集中训练分散执行框架的多智能体强化学习算法基础之上进一步为各个智能体引入全局时空信息,从而进一步提升多智能体之间的协作能力。实验结果表明,STLight在提升城市道路网络的性能方面具有显著的优势,有助于缓解当前大规模城市道路网络的交通拥堵问题。
Cooperative traffic signal control method for multi-intersection:an approach based on spatiotemporal dependence multi-agent reinforcement learning
In the face of increasingly serious traffic congestion,intelligent traffic signal control has become an indispensable means to improve the performance of urban road network.In this paper,a spatiotemporal traffic light control(STLight)based on multi-agent reinforcement learning algorithm is proposed.Through the spatiotemporal dependent module(STDM)based on the attention mechanism,STLight can extract the initial traffic observation data as spatiotemporal features,so as to effectively capture the spatiotemporal dependence relationship between intersections.In addition,based on the extracted spatiotemporal characteristics,STLight further introduces global spatiotemporal information to each agent on the basis of the multi-agent reinforcement learning algorithm based on the centralized training decentralized execution framework,so as to further improve the cooperation ability among multi-agents.The experimental results show that STLight has significant advantages in improving the performance of urban road networks,and helps to alleviate the traffic congestion problem of current large-scale urban road networks.

multi-agent reinforcement learningmulti-intersection traffic signal controlattention mechanismMarkov gamespatiotemporal dependent

王兆瑞、岩延、张宝贤

展开 >

中国科学院大学人工智能学院,北京 100049

多智能体强化学习 多路口交通信号控制 注意力机制 马尔可夫博弈 时空依赖

国家重点研发计划项目国家自然科学基金

2018AAA010080461872331

2024

中国科学院大学学报
中国科学院大学

中国科学院大学学报

CSTPCD北大核心
影响因子:0.614
ISSN:2095-6134
年,卷(期):2024.41(3)
  • 43