基于数据增强与掩码学习的移动激光扫描点云分类方法

扫码查看

原文链接

万方数据
维普

中文摘要：车载移动激光扫描(MLS)点云可以精确描述道路周围场景,其分类结果可为智能交通、数字孪生城市、高精地图及辅助驾驶等任务提供数据基础.为增强点云分类模型提取特征的表达能力,提高预测的鲁棒性,提出一个基于数据增强与掩码学习的MLS点云分类方法.所提方法主要由高程校准的Mix3D(EC-Mix3D)点云增强策略和掩码学习框架构成.其中,EC-Mix3D策略用于对训练数据进行高程校准的场景混合,扩充训练样本分布,提高模型预测的鲁棒性;掩码学习框架首先对输入点云施加随机块掩码操作获取掩码点云,然后对原始点云和掩码点云的预测进行标签监督学习、一致性约束及错误预测熵最大化,提高对点云特征的表达能力.采用公共MLS数据集(Toronto3D和Pairs数据集)进行方法验证.实验结果表明,所提方法可以对MLS点云进行有效分类,在两个测试数据集上分别获得了 83.8％和68.74％的平均交并比,优于其他对比方法.

外文标题：Mobile Laser Scanning Point Cloud Classification Based on Data Augmentation and Mask Learning

外文摘要：Objective A mobile laser scanning(MLS)system can quickly and accurately acquire 3D point cloud data of a scene around a road,and such a system has the advantages of fast acquisition speed,high positioning accuracy,high point cloud density,strong anti-interference ability,and information richness.Thus,MLS systems are widely used in many fields,such as intelligent transportation,digital twin cities,high-precision maps,and assisted driving.As the basis of this system application,people need to extract accurate and semantically rich information from large-scale and complex MLS point clouds.Although there are many large-scale deep learning point cloud classification methods that achieve competitive classification accuracy,they still suffer from problems such as insufficient original training scenes and incomplete point cloud feature expression.To further expand the distribution of the training data and improve the feature representation of the model,we propose an MLS point cloud classification method based on data augmentation and mask learning.Methods The proposed method is divided into two main parts:an elevation-calibrated Mix3D(EC-Mix3D)data augmentation strategy and a mask learning framework.Specifically,the EC-Mix3D data augmentation strategy first extracts the normalized elevation of the point cloud through a cloth simulation filtering algorithm.Then,the point clouds of two independent sub-point clouds are mixed via normalized elevation calibration.Because the point clouds of the two scenes have the same elevation reference,they can be effectively mixed to generate training data with a new context,which expands the distribution of the training data.The mask learning framework first generates mask data by applying a random block mask operation to the input data.Then,the original input data and mask data are fed into the Siamese two-branch network and predicted.Finally,label supervision,consistency constraints,and error prediction entropy maximization based on two-branch predictions are performed to enhance the expressiveness of the point cloud features and reduce the model's overconfidence when predicting in complex regions.Specifically,the label supervision of the original branch provides the underlying supervised signal for model training,and the label supervision of the mask branch can motivate the model to efficiently learn additional point cloud context features by predicting masked data from unmasked data.Consistency constraints increase point cloud feature expressiveness and model prediction stability by minimizing the difference between the predicted values of the masked input branch and the original point cloud branch,and error prediction entropy maximization increases prediction uncertainty in complex scenarios by encouraging high entropy posterior probabilities for misclassified points,which suppresses classification model overfitting.Results and Discussions The Toronto3D and Paris public MLS datasets were used to validate the proposed method.Experimental results(Fig.6,Fig.7)show that the proposed method can effectively classify most MLS point clouds.However,the classification results for points far from the center and feature similarities are unconvincing.Comparing these results(Table 1,Table 2)with those obtained via other methods shows that the proposed method can effectively improve the accuracy of the baseline method and obtain optimal accuracy.Specifically,we obtain 97.7％and 93.20％overall accuracy(OA),and 83.8％and 68.74％mean intersection of union(mIoU)on the Toronto3D and Paris datasets,respectively.The ablation experiment results(Table 3)show that each component of the proposed method improves the classification performance of the model.By sequentially adding the EC-Mix3D data augmentation strategy and mask learning framework to the baseline method,the mIoU can be improved by 3.38 percentage points and 6.34 percentage points,respectively.The results of different mask voxel sizes(Table 4)and mask ratios(Table 5)indicate that the proposed method employs the mask parameters that are the most suitable for Toronto3D point cloud classification.Finally,the results of a model complexity analysis(Fig.8)show that the proposed method enhances model classification performance without increasing model complexity.Conclusions In the present study,a novel MLS point cloud classification method is proposed to improve model robustness and the expressiveness of the point cloud features via an EC-Mix3D data augmentation strategy and a mask learning framework.Specifically,the EC-Mix3D strategy expands the training sample distribution by scene mixing the training data to improve the robustness of the model.The mask learning framework obtains a masked point cloud by applying a random block mask to the input point cloud,and it then utilizes the predicted values of the original and masked point cloud for label-supervised learning,consistency constraints,and error prediction entropy regularization,aiming to enhance the feature representation capability of the model.Two public MLS datasets(namely,the Toronto3D and Paris datasets)are used to validate the proposed method.The results show that the proposed method can effectively classify MLS point clouds and obtain mIoU of 83.8％and 68.74％,respectively,which are better than those of other methods.

外文关键词：

vehicle-mounted mobile laser scanning point cloud classificationEC-Mix3D data augmentationmask learning frameworkconsistency constrainterror prediction entropy maximization

作者：

雷相达、管海燕、陈科、秦楠楠、臧玉府

展开 >

作者单位：

南京信息工程大学遥感与测绘工程学院,江苏南京 210044

自然资源部遥感导航一体化应用工程技术创新中心,江苏南京 210044

关键词：

车载移动激光扫描点云分类 EC-Mix3D数据增强掩码学习框架一致性约束错误预测熵最大化

出版年：

2024

DOI：

10.3788/CJL231396