Monocular Visual Odometry Based on Unsupervised Deep Learning
白宇 1, 钟锐 2, 王奥博 2, 方浩 2, 刘建涛 3
Author information
- 1. China Construction First Group Co., Ltd., Beijing 100071, China; China Construction Municipal Engineering Co., Ltd., Beijing 100071, China
- 2. School of Automation, Beijing Institute of Technology, Beijing 100081, China
- 3. Hebei Linchuang Electronic Technology Co., Ltd., Shijiazhuang 050035, China
Abstract
Visual simultaneous localization and mapping (VSLAM) technology has made rapid progress in recent years. However, most traditional VSLAM systems suffer from poor robustness, while deep-learning-based VSLAM suffers from low accuracy. To improve the performance of VSLAM systems, a monocular VSLAM based on unsupervised deep learning is proposed. First, a depth estimation network is designed to estimate the depth of each image. Then, a pose estimation network is designed to estimate the camera pose. Finally, a suitable loss function is used to ensure that the network converges effectively during training. The performance of the system is verified on the KITTI dataset. Experimental results show that, compared with SfMLearner, the absolute trajectory error (ATE) is reduced by about 50%. Compared with traditional VSLAM systems, the translational part of the absolute pose error (APE) is also significantly reduced, and robustness is improved.
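The abstract does not spell out the loss function, so as an illustration only, the sketch below assumes an SfMLearner-style combination: an L1 photometric reconstruction term between the target frame and a source frame warped into the target view (via the predicted depth and pose), plus an edge-aware depth smoothness term. All names and the weighting are illustrative, not the paper's actual design:

```python
import numpy as np

def photometric_loss(target, warped):
    """L1 photometric difference between the target frame and a source
    frame synthesized (warped) into the target view."""
    return np.abs(target - warped).mean()

def smoothness_loss(depth, image):
    """Edge-aware smoothness: penalize depth gradients, down-weighted
    where the image itself has strong gradients (likely object edges)."""
    dx_d = np.abs(np.diff(depth, axis=1))          # depth gradient, x
    dy_d = np.abs(np.diff(depth, axis=0))          # depth gradient, y
    dx_i = np.abs(np.diff(image, axis=1)).mean(axis=-1)  # image gradient, x
    dy_i = np.abs(np.diff(image, axis=0)).mean(axis=-1)  # image gradient, y
    return (dx_d * np.exp(-dx_i)).mean() + (dy_d * np.exp(-dy_i)).mean()

def total_loss(target, warped, depth, weight=0.1):
    # Weighted sum; the weight is an illustrative value, not a
    # hyperparameter taken from the paper.
    return photometric_loss(target, warped) + weight * smoothness_loss(depth, target)
```

Because both terms depend only on images and the network outputs, no ground-truth depth or pose labels are needed, which is what makes the training unsupervised.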
Keywords
SLAM robotics / depth estimation network / pose estimation network / unsupervised learning
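The evaluation metric named in the abstract, absolute trajectory error (ATE), is conventionally computed by aligning the estimated trajectory to ground truth with a similarity transform (Umeyama alignment) and reporting the RMSE of the remaining position errors; the scale component of the alignment matters because monocular VO recovers the trajectory only up to scale. A minimal sketch of this standard computation (not code from the paper):

```python
import numpy as np

def ate_rmse(gt, est):
    """ATE: align est (Nx3 positions) to gt with a similarity transform
    (Umeyama), then return the RMSE of the residual position errors."""
    n = gt.shape[0]
    mu_g, mu_e = gt.mean(axis=0), est.mean(axis=0)
    gc, ec = gt - mu_g, est - mu_e                # centered trajectories
    cov = gc.T @ ec / n                           # 3x3 cross-covariance
    U, S, Vt = np.linalg.svd(cov)
    d = np.eye(3)
    if np.linalg.det(U) * np.linalg.det(Vt) < 0:
        d[2, 2] = -1.0                            # avoid a reflection
    R = U @ d @ Vt                                # best-fit rotation
    scale = (S * np.diag(d)).sum() / ((ec ** 2).sum() / n)
    t = mu_g - scale * R @ mu_e
    aligned = scale * est @ R.T + t               # apply sim(3) alignment
    return float(np.sqrt(((aligned - gt) ** 2).sum(axis=1).mean()))
```

An estimate that differs from ground truth only by a rigid motion and a global scale yields ATE 0 after alignment; any residual error reflects actual drift.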
Funding
National Key Research and Development Program of China (2022YFA1004703)
National Natural Science Foundation of China (62133002)
Publication year
2024