基于上下文多摇臂赌博机的交通信号控制算法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：近年来,由于交通拥堵问题日益严重,引起了学术界对交通信号灯控制算法研究的广泛关注.现有研究表明,基于深度强化学习(DRL)的方法在模拟环境中表现良好,但在实际应用中存在着数据和计算资源需求大、难以实现路口之间协同等问题.为解决这一问题,本文提出了一种基于上下文多摇臂赌博机的新型交通信号控制算法.与传统方法相比,本文所提算法通过从路网中提取主干道的方式,实现了路口之间的高效协同,并利用上下文多摇臂赌博机模型实现了交通信号的快速、有效控制.最后,通过在真实数据集以及合成数据集上进行充分的实验验证,证明了本文算法相较于过去算法的优越性.

外文标题：Traffic Signal Control Algorithm Based on Contextual Multi-armed Bandit

外文摘要：In recent years,the exacerbation of traffic congestion has sparked widespread interest in the research on traffic signal control algorithms.Current studies indicate that methods based on deep reinforcement learning(DRL)exhibit promising performance in simulated environments.However,challenges persist in their practical application,including substantial requirements for data and computational resources,as well as difficulties in achieving coordination between intersections.To address these challenges,this study proposes a novel traffic signal control algorithm based on a contextual multi-armed bandit model.In contrast to conventional algorithms,the proposed algorithm achieves efficient coordination between intersections by extracting the main arteries from the road network.Moreover,it employs a contextual multi-armed bandit model to facilitate rapid and effective traffic signal control.Finally,through extensive experimentation on both real and synthetic datasets,the superiority of the proposed algorithm over previous algorithms is empirically demonstrated.

外文关键词：

intelligent trafficreinforcement learningcontextual multi-armed banditmulti-agent systemtraffic signal control

作者：

邵俊杰、肖明军

展开 >

作者单位：

中国科学技术大学计算机科学与技术学院,合肥 230027

中国科学技术大学苏州高等研究院,苏州 215127

关键词：

智能交通强化学习上下文多臂赌博机多智能体系统交通信号控制

基金：

国家自然科学基金面上项目江苏省自然科学基金面上项目

项目编号：

62172386BK20231212

出版年：

2024

DOI：

10.15888/j.cnki.csa.009645

计算机系统应用

中国科学院软件研究所

计算机系统应用

CSTPCD

影响因子：0.449

ISSN：1003-3254

年,卷(期)：2024.33(10)