Autonomous Driving Decision-Making at Signal-Free Intersections Based on MAPPO

Xu Manchen (许曼晨)¹, Yu Di (于镝)¹, Zhao Li (赵理)², Guo Chendong (郭陈栋)²


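Author information

1. School of Automation, Beijing Information Science and Technology University, Beijing 100192
2. School of Mechanical and Electrical Engineering, Beijing Information Science and Technology University, Beijing 100192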

Abstract

Dense traffic flow and the stochastic, uncertain behavior of vehicles make unsignalized intersections a significant challenge for autonomous driving. To address this, an autonomous driving decision-making scheme for unsignalized intersections is proposed based on the MAPPO (Multi-Agent Proximal Policy Optimization) algorithm. A multi-agent simulation environment is built on the MetaDrive simulation platform, and a reward function is designed that jointly accounts for traffic rules, safety (safe arrival versus collision), and traffic-flow efficiency (the maximum and minimum vehicle speeds at the intersection), with the aim of achieving safe and efficient driving decisions. Simulation experiments show that, compared with other algorithms, the proposed scheme is more stable and converges better during training, and that it achieves higher success rates and safety across different traffic densities. The scheme shows significant potential for handling unsignalized intersection environments and helps advance research on autonomous driving decision-making under complex road conditions.
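The abstract names three ingredients of the reward function (traffic rules, safety, and speed-based efficiency) without giving its exact form. As a rough illustration only, the sketch below shows one way such a per-vehicle reward could be composed. The `StepInfo` fields, the weights, and the speed bounds `v_min` and `v_max` are all hypothetical assumptions chosen for the example; they are not the values or the formulation used in the paper, and the function does not depend on MetaDrive.

```python
from dataclasses import dataclass


@dataclass
class StepInfo:
    """Outcome of one simulation step for one vehicle (hypothetical fields)."""
    speed_kmh: float          # current speed of the vehicle
    arrived: bool = False     # reached its destination safely
    crashed: bool = False     # collided with another vehicle or object
    broke_rule: bool = False  # violated a traffic rule, e.g. left the lane


def intersection_reward(info: StepInfo,
                        v_min: float = 10.0,
                        v_max: float = 40.0) -> float:
    """Combine safety, rule, and efficiency terms into one scalar reward.

    All weights and speed bounds are illustrative assumptions.
    """
    reward = 0.0

    # Safety terms: large bonus for safe arrival, large penalty for a crash.
    if info.arrived:
        reward += 10.0
    if info.crashed:
        reward -= 10.0

    # Traffic-rule term: penalize violations.
    if info.broke_rule:
        reward -= 5.0

    # Efficiency term: small bonus inside [v_min, v_max],
    # small penalty that grows with the distance outside the band.
    if info.speed_kmh < v_min:
        reward -= 0.1 * (v_min - info.speed_kmh) / v_min
    elif info.speed_kmh > v_max:
        reward -= 0.1 * (info.speed_kmh - v_max) / v_max
    else:
        reward += 0.1 * info.speed_kmh / v_max

    return reward
```

In a multi-agent setting each vehicle would receive its own value of such a function at every step. Keeping the terminal safety terms much larger than the per-step speed term is a common design choice so that the policy does not trade collisions for speed, though the appropriate balance would have to be tuned experimentally.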

Keywords

autonomous driving/intelligent decision-making/signal-free intersections/multi-agent proximal policy optimization (MAPPO) algorithm


Funding

National Natural Science Foundation of China (52077007)

Publication year

2024

Journal of Jilin University (Information Science Edition)

Jilin University

CSTPCD

Impact factor: 0.607

ISSN: 1671-5896
