首页|基于CNN和Transformer混合网络模型的车道线检测

基于CNN和Transformer混合网络模型的车道线检测

扫码查看
车道线检测技术在自动驾驶系统中发挥着重要作用,目前基于深度学习的车道线检测方法通常在主干网络提取特征之后分别获取车道线关键点的置信度以及这些点相对车道线起始点的偏移.但由于车道线是细长结构,现有的主干网络无法有效提取这种结构特征,偏移网络也难以回归车道线上关键点相对起始点的偏移.鉴于注意力机制在提取空间结构特征、表征长距离图像序列间依赖关系方面的优越性能,在基于点的车道线检测方法的基础上提出了一种基于卷积神经网络(convolutional neural network,CNN)和 Transformer 的混合网络(CNN-Transformer hybrid network,CTNet)模型,该模型通过特征金字塔和增强的坐标注意力机制提高特征的表征能力,使用基于视觉Transformer的偏移网络回归关键点的偏移量,因此,CTNet能够提取细长车道线特征、捕获长距离点间的偏移,有效提升车道线检测的精度.实验对比了 CTNet和6种常用车道线检测算法在数据集TuSimple和CULane上的效果,在TuSimple上CTNet各项精度指标均优于现有方法,在CULane数据集的9种不同车道场景中,CTNet在6个场景中取得了最佳精度.
Lane Line Detection Based on CNN and Transformer Hybrid Network
Lane detection technology plays a crucial role in autonomous driving systems.Currently,deep learning-based methods for lane detection typically involve extracting fea-tures from a backbone network,followed by confidence estimation of key points on the lane lines and their offsets relative to a starting point.However,existing backbone networks struggle to effectively capture features of elongated lanes,and offset networks face chal-lenges in regressing the offsets of key points along the lane line.In this paper,we propose a hybrid network model called CTNet(CNN-Transformer hybrid network)based on a point-based lane detection approach.CTNet enhances feature representation through a feature pyramid network and an augmented coordinate attention mechanism.Additionally,it em-ploys a vision transformer-based offset network to regress crucial offsets.Consequently,CTNet extracts elongated lane line features,captures long-range offsets between points,and significantly improves the accuracy of lane detection.Experiments conducted on the TuSimple and CULane datasets demonstrate that CTNet outperforms six commonly used lane detection algorithms across various accuracy metrics.Specifically,CTNet achieves su-perior results on TuSimple across all evaluation metrics.Furthermore,when tested across nine different lane scenarios in the CULane dataset,CTNet achieves the highest accuracy in six scenarios.

lane line detectionvisual Transformercoordinate attention(CA)feature pyramid network(FPN)

唐洪、邓锋、张恺、聂学方、李光辉

展开 >

华东交通大学信息与软件工程学院,江西南昌 330013

江西省交通科学研究院有限公司,江西南昌 330038

车道线检测 视觉Transformer 坐标注意力 特征金字塔网络

国家自然科学基金江西省03专项江西省自然科学基金面上项目江西省自然科学基金江西省教育厅科学基金

5206201620203ABC03W0720212BAB20200920212BAB202004GJJ190319

2024

应用科学学报
上海大学 中国科学院上海技术物理研究所

应用科学学报

CSTPCD北大核心
影响因子:0.594
ISSN:0255-8297
年,卷(期):2024.42(5)