DeepLabV3_DHC: Semantic Segmentation of Urban Unmanned Aerial Vehicle Remote Sensing Image

Abstract: High-resolution unmanned aerial vehicle (UAV) remote sensing images carry extremely rich semantic and ground-object features, so semantic segmentation of them is prone to incomplete target segmentation, missing edge information, and insufficient segmentation accuracy. To address these problems, an improved model, DeepLabV3_DHC, is proposed on the basis of DeepLabV3_plus. First, multiple backbone networks are used for downsampling to extract low-level and high-level image features. Second, the atrous spatial pyramid pooling (ASPP) of the original model is replaced entirely by depthwise separable hybrid dilated convolution, and an adaptive coefficient is added to weaken the grid (gridding) effect. Next, the bilinear interpolation used for conventional upsampling is abandoned in favor of learnable dense upsampling convolution. Finally, an attention mechanism is connected in series on the low-level features. Experiments are carried out with several backbone networks on a dataset of images from part of Longchang City, Sichuan Province, with mean intersection over union (mIoU) and class-wise mean pixel accuracy as the evaluation metrics. The experimental results show that the proposed method not only achieves high segmentation accuracy but also reduces the amount of computation and the number of parameters.
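The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes the standard confusion-matrix definitions (per-class IoU = TP / (TP + FP + FN), per-class pixel accuracy = TP / ground-truth pixels, each averaged over the classes that occur); the abstract does not give the paper's exact formulas, and the function names `confusion_matrix` and `miou_and_mpa` are hypothetical, chosen only for this illustration.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Build a confusion matrix from flat label sequences.

    Rows index the ground-truth class, columns the predicted class.
    """
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def miou_and_mpa(cm):
    """Return (mean IoU, class-wise mean pixel accuracy) for a confusion matrix.

    Assumption: classes absent from both labels and predictions are skipped
    for IoU, and classes absent from the labels are skipped for accuracy.
    """
    ious, accs = [], []
    for c in range(len(cm)):
        tp = cm[c][c]
        gt = sum(cm[c])                    # pixels whose true class is c
        pred = sum(row[c] for row in cm)   # pixels predicted as class c
        union = gt + pred - tp             # TP + FP + FN
        if union > 0:
            ious.append(tp / union)
        if gt > 0:
            accs.append(tp / gt)
    miou = sum(ious) / len(ious) if ious else 0.0
    mpa = sum(accs) / len(accs) if accs else 0.0
    return miou, mpa


# Toy example with three classes and six pixels (made-up labels).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
miou, mpa = miou_and_mpa(confusion_matrix(y_true, y_pred, 3))
print(f"mIoU = {miou:.3f}, mean pixel accuracy = {mpa:.3f}")
```

For real segmentation maps, the label images would be flattened into one sequence per image and the confusion matrices summed over the dataset before the averages are taken.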

Keywords (English):

urban unmanned aerial vehicle remote sensing image; semantic segmentation; depthwise separable hybrid dilated convolution; dense upsampling convolution; attention mechanism; grid effect

Authors:

Sun Guowen (孙国文), Luo Xiaobo (罗小波), Zhang Kunqiang (张坤强)

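Author affiliations:

School of Computer Science and Technology, Chongqing University of Posts and Telecommunications; Chongqing Engineering Research Center for Spatial Big Data Intelligent Technology, Chongqing 400065, China

Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, Yunnan 650500, China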

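Keywords (translated from Chinese):

urban unmanned aerial vehicle remote sensing image; semantic segmentation; depthwise separable hybrid dilated convolution; dense upsampling; attention mechanism; grid effect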

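Funding:

National Key Research and Development Program of China, Intergovernmental International Science and Technology Innovation Cooperation Project; Chongqing High-Tech Industry Major Industrial Technology Research and Development Project; Key Cooperation Project of the Chongqing Municipal Education Commission

Project numbers:

2021YFE0194700; D2018-82; HZ2021008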

Publication year:

2024

DOI:

10.3788/LOP230886

Laser & Optoelectronics Progress

Shanghai Institute of Optics and Fine Mechanics, Chinese Academy of Sciences

Indexed in: CSTPCD; Peking University Core Journals

Impact factor: 1.153

ISSN: 1006-4125

Year, volume (issue): 2024, 61(4)

Number of references: 28