基于时空依赖关系多智能体强化学习的多路口交通信号协同控制方法

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据

中文摘要：面对日益严重的交通拥堵现象,智能交通信号控制已成为提升城市道路网络性能必不可少的手段.提出一种基于时空依赖关系多智能体强化学习算法的多路口交通信号控制方法STLight(spatiotemporal traffic light control).通过基于注意力机制的时空依赖模块 STDM(spatiotemporal dependent module),STLight可将初始交通观测数据提取为时空特征,以有效捕获各交叉路口间的时空依赖关系.此外,基于所提取的时空特征,STLight在基于集中训练分散执行框架的多智能体强化学习算法基础之上进一步为各个智能体引入全局时空信息,从而进一步提升多智能体之间的协作能力.实验结果表明,STLight在提升城市道路网络的性能方面具有显著的优势,有助于缓解当前大规模城市道路网络的交通拥堵问题.

外文标题：Cooperative traffic signal control method for multi-intersection:an approach based on spatiotemporal dependence multi-agent reinforcement learning

外文摘要：In the face of increasingly serious traffic congestion,intelligent traffic signal control has become an indispensable means to improve the performance of urban road network.In this paper,a spatiotemporal traffic light control(STLight)based on multi-agent reinforcement learning algorithm is proposed.Through the spatiotemporal dependent module(STDM)based on the attention mechanism,STLight can extract the initial traffic observation data as spatiotemporal features,so as to effectively capture the spatiotemporal dependence relationship between intersections.In addition,based on the extracted spatiotemporal characteristics,STLight further introduces global spatiotemporal information to each agent on the basis of the multi-agent reinforcement learning algorithm based on the centralized training decentralized execution framework,so as to further improve the cooperation ability among multi-agents.The experimental results show that STLight has significant advantages in improving the performance of urban road networks,and helps to alleviate the traffic congestion problem of current large-scale urban road networks.

外文关键词：

multi-agent reinforcement learningmulti-intersection traffic signal controlattention mechanismMarkov gamespatiotemporal dependent

作者：

王兆瑞、岩延、张宝贤

展开 >

作者单位：

中国科学院大学人工智能学院,北京 100049

关键词：

多智能体强化学习多路口交通信号控制注意力机制马尔可夫博弈时空依赖

基金：

国家重点研发计划项目国家自然科学基金

项目编号：

2018AAA010080461872331

出版年：

2024

DOI：

10.7523/j.ucas.2023.076

中国科学院大学学报

中国科学院大学

中国科学院大学学报

CSTPCD北大核心

影响因子：0.614

ISSN：2095-6134

年,卷(期)：2024.41(3)

参考文献量43