首页|仿真到现实环境的自动驾驶决策技术综述

仿真到现实环境的自动驾驶决策技术综述

扫码查看
自动驾驶汽车作为未来交通的重要发展方向,决策技术是其进行安全高效行驶的关键。基于成本和安全性的考虑,最新的自动驾驶决策技术往往先在仿真环境中研究,再在现实世界中应用,故在自动驾驶决策领域,仿真到现实的方法能帮助自动驾驶系统更有效地进行学习、训练和验证。然而,仿真环境和现实环境之间的差距会在这些模型和技术转移到真实车辆时带来挑战,这种仿真到现实环境域差距的问题促使研究人员探索解决该问题的途径,并且提出各种有效的方法。本文将这些方法总结为两大类:虚实迁移和平行智能。前者通过不同方法将在模拟环境中训练的车辆决策迁移到现实环境中,以解决域差距问题;后者通过构建虚拟的人工系统和现实的物理系统,将二者进行交互、比较、学习和实验,从而解决自动驾驶决策在现实环境中的适配问题。本文首先从虚实迁移和平行智能的原理,以及自动驾驶决策领域应用的角度进行了详细综述,这也是首次从平行智能的角度来思考自动驾驶决策技术中仿真到现实环境的问题,然后总结了搭建仿真环境常用的自动驾驶模拟器,最后归纳了仿真到现实环境的自动驾驶面临的挑战和未来的发展趋势,既为自动驾驶在现实场景的应用与推广提供技术方案,也为自动驾驶研究人员提供新的想法和方向。
Decision technologies of simulation to reality for autonomous driving:a survey
Since the mid-1980s,numerous research institutions have been developing autonomous driving technologies.The main idea of autonomous driving technology is to perceive the ego-vehicle states and its surroundings in real time through sensors,utilize an intelligent system for decision-making planning,and execute the driving operation through the control system.The decision-making module,which is an important component in autonomous driving systems,bridges perception and vehicle control.This module is mainly responsible for finding optimal paths or correct and reliable behav-iors for the ego-vehicle to effectively drive on the road.In the research process of autonomous driving decision-making tech-nologies,which are remarkably strict for safety,if the training is performed directly in the real world,then it will not only lead to a considerable cost increment but will also miss some marginal driving scenarios.In this case,numerous studies are first conducted in the simulation world before applying new autonomous driving models in the real world.However,the simulation can only provide an approximate model of vehicle dynamics and its interaction with the surrounding environ-ment,and the vehicle agent trained only in the simulation world cannot be generalized to the real world.A gap still exists between reality and simulation,which is called the reality gap(RG)and poses a challenge for the transfer of developed autonomous driving models from simulated vehicles to real vehicles.Researchers have proposed numerous approaches to addressing the reality gap.This paper presents the principles and state-of-the-art methods of transferring knowledge from simulation to reality(sim2real)and parallel intelligence(PI),as well as their applications in decision-making for autono-mous driving.Sim2real approaches reduce RG by simply transferring the learned models from the simulation to the reality environment.In autonomous driving,the basic idea of sim2real is to train the vehicle agent in the simulation environment and then transfer it to the reality environment using various methods,which can substantially reduce the number of interac-tions between the vehicle agent and the reality environment.Sim2real can also improve the effectiveness and performance of decision-making algorithms for autonomous driving.At present,the main sim2real methods include robust reinforcement learning(RL),meta-learning,curriculum learning,knowledge distillation,and transfer learning,as well as some other helpful techniques such as domain randomization and system identification,which have their own way of reducing the real-ity gap.For example,transfer learning bridges the reality gap by directly addressing the differences between domains.Vehicle agents in the real world may be exposed to problems that do not exist in the simulation world;thus,some research-ers use meta-learning to bridge the gap.Sim2real methods handle the RG problem in some way,but their computational cost remains a challenge,especially when dealing with complex and dynamic environments,which limits the application range of sim2real methods.The PI,which solves the RG problem by parallelly performing the simulation environment with the reality environment,is proposed to solve the aforementioned problem.PI is a new paradigm based on the ACP method(artificial society,computational experiment,and parallel execution),which deeply integrates simulated and real sce-narios.The main process of parallel intelligence involves the formation of a complete system through repeated interactions between the artificial and physical systems and the reduction of the RG through parallel execution and computational experi-ments.Among them,the computational experiment is divided into description learning,prediction learning,and prescrip-tive learning,which gradually transitions from the simulation environment to the real world.Parallel intelligence and sim2real technologies extend the physical space to the virtual space and model the real world through virtual-real interac-tion.Therefore,the vehicle agent can gain knowledge and experience through the simulation and real-life environments.The core technology of PI is to make decisions through the interaction between the real and artificial driving systems and realize the management and control of the driving system using comparison,learning,and experimentation of the two sys-tems.Compared with sim2real methods,parallel intelligence deals with the relationship between simulated and real sce-narios from a higher technical level,solves complex modeling problems,and markedly reduces the difference between simulated and real scenarios.In the field of autonomous driving,PI has developed several branches,mainly including the parallel system,parallel learning,parallel driving,and parallel planning.Moreover,the theoretical system has been con-tinuously developed and has achieved remarkable results in numerous fields,such as transportation,medical treatment,manufacturing,and control.Subsequently,some autonomous driving simulators,such as AirSim and CARLA,are pre-sented in this paper.Simulators for autonomous driving generally aim to minimize the mismatch between real and simulated setups by providing training data and experience,thus enabling the deployment of vehicle agents into the real world.Finally,existing challenges and future perspectives in sim2real and PI methods are summarized.With the continuous development of simulation-to-reality technologies,additional breakthroughs and progress in autonomous driving will be achieved in the future.

autonomous drivingdecision technologyreality gap(RG)sim2realparallel intelligence(PI)

胡学敏、黄婷玉、余雅澜、任佳佳、谢微、陈龙

展开 >

湖北大学人工智能学院,武汉 430062

智能感知系统与安全教育部重点实验室,武汉 430062

湖北大学计算机与信息工程学院,武汉 430062

临沂大学计算机与信息工程学院,临沂 276000

中国科学院自动化研究所,北京 100190

展开 >

自动驾驶 决策技术 域差距(RG) 虚实迁移(sim2real) 平行智能(PI)

2024

中国图象图形学报
中国科学院遥感应用研究所,中国图象图形学学会 ,北京应用物理与计算数学研究所

中国图象图形学报

CSTPCD北大核心
影响因子:1.111
ISSN:1006-8961
年,卷(期):2024.29(11)