基于改进YOLOv8和GMM图像点集匹配的双目测距方法

Binocular ranging method based on improved YOLOv8 and GMM image point set matching

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：针对无人塔吊系统的研究需求,提出一种基于改进YOLOv8和GMM图像点集匹配的双目测距方法,对驾驶室外环境中的塔吊吊钩进行检测识别并测距.通过双目摄像头进行图像采集,引入 FasterNet 骨干网络和Slim-neck颈部连接层,对YOLOv8目标检测算法进行改进,有效检测画面中的塔吊吊钩并获取检测框的二维坐标信息;采用局部敏感哈希方法,并融合分阶段匹配策略,提升GMM图像点集匹配模型的匹配效率,针对检测框中的塔吊吊钩,进行特征点匹配;最后通过双目相机三角测量原理计算得出塔吊吊钩的深度信息.实验结果表明,改进后的YOLOv8 算法与原算法相比,精确率P提高了 2.9%,平均精度AP50提高了 2.2%,模型复杂度降低了10.01 GFLops,参数量减少了3.37 M,在提升检测精度的同时实现了模型的轻量化.改进后的图像点集匹配算法与原算法相比,各个指标表现出更加良好的鲁棒性.最后在工程现场对塔吊吊钩进行识别与测距,误差可接受范围内有效完成了塔吊吊钩的检测识别与测距任务,验证了本方法的可行性.

外文摘要：Addressing the research needs for unmanned tower crane systems,a binocular ranging method was proposed,based on the improved YOLOv8 and GMM image point set matching to detect and recognize the hooks of tower cranes in the outdoor environment of the driver's cab and measure the distance.Image acquisition was performed through binocular cameras,and the FasterNet backbone network and Slim-neck connection layer was introduced to improve the YOLOv8 target detection algorithm,thereby effectively detecting the hooks of tower cranes in the image and obtaining the two-dimensional coordinate information of the detection box.The local sensitive hashing method was employed,and a phased matching strategy was integrated to improve the matching efficiency of the GMM image point set matching model,performing feature point matching for the hooks of tower cranes in the detection box.Finally,the depth information of the tower crane hook was calculated through the principle of binocular camera triangulation.The experimental results demonstrated that compared to the original algorithm,the improved YOLOv8 algorithm had increased precision P by 2.9%,average precision AP50 by 2.2%,reduced model complexity by 10.01 GFLops,and reduced parameter quantity by 3.37 M.This achieved model light-weighting while enhancing detection accuracy.Compared with the original algorithm,the improved image point set matching algorithm exhibited better robustness in various indicators.Finally,the recognition and ranging of tower crane hooks were effectively completed within an acceptable margin of error at the engineering site,verifying the feasibility of this method.

外文关键词：

YOLOv8 object detectiongaussian mixture modelpoint set matchingdeep learningbinocular visionsmart construction site visualization

作者：

胡欣、常娅姝、秦皓、肖剑、程鸿亮

展开 >

作者单位：

长安大学能源与电气工程学院,陕西西安 710018

比亚迪汽车有限公司,陕西西安 710119

长安大学电子与控制工程学院,陕西西安 710061

关键词：

YOLOv8目标检测高斯混合模型点集匹配深度学习双目视觉智慧工地可视化

基金：

陕西省秦创原"科学家+工程师"队伍建设项目西安市重点产业链项目

项目编号：

2024QCY-KXJ-16123ZDCYJSGG0013-2023

出版年：

2024

DOI：

10.11996/JG.j.2095-302X.2024040714

图学学报

中国图学学会

图学学报

CSTPCD北大核心

影响因子：0.73

ISSN：2095-302X

年,卷(期)：2024.45(4)