南京邮电大学学报(自然科学版)2024,Vol.44Issue(2) :62-68.DOI:10.14132/j.cnki.1673-5439.2024.02.008

用于高分辨率遥感影像地类识别的Deeplabv3+改进模型

Improved Deeplabv3+for land class identification in high resolution remote sensing images

张载龙 徐杰
南京邮电大学学报(自然科学版)2024,Vol.44Issue(2) :62-68.DOI:10.14132/j.cnki.1673-5439.2024.02.008

用于高分辨率遥感影像地类识别的Deeplabv3+改进模型

Improved Deeplabv3+for land class identification in high resolution remote sensing images

张载龙 1徐杰2
扫码查看

作者信息

  • 1. 南京邮电大学 物联网学院,江苏 南京 210003
  • 2. 南京邮电大学 计算机学院,江苏 南京 210023
  • 折叠

摘要

在高分辨率遥感影像地类识别上,语义分割网络Deeplabv3+表现优异,但是所需参数非常多,训练时间久.遥感影像中的地类与普通RGB图片中的对象相比颗粒度非常大,其具有更显著的特征以及更少的类别,并不需要过深过大的网络.因此,文中提出了一种基于轻量级网络的语义分割模型(Thin-Deeplabv3+),对 Deeplabv3+的编码器进行了改进,利用轻量级膨胀网络(Light and Dilated Network,LDNet)提取输入遥感影像的特征,然后利用膨胀系数分别为 2、12、24 和 36 的空洞空间金字塔池化(Atrous Spatial Pyramid Pooling,ASPP)模块加强特征提取.在高分遥感影像数据集(Gaofen Image Dataset,GID)以及DeepGlobe土地覆盖分类挑战数据集(DeepGlobe Land Cover Classification Challenge,DLCCC)上的实验结果表明,Thin-Deeplabv3+的识别精度高于Deeplabv3+,并且所需参数仅约为Deeplabv3+的1/10.

Abstract

The semantic segmentation network Deeplabv3+is adept at land class recognition for high-resolution remote sensing images,but it requires considerable parameters and costs extra time for training.Compared with the objects in ordinary RGB images,the earth class in remote sensing images is very grainy,with more significant features and fewer categories.It does not need a network too deep or too large.Therefore,this paper proposes a semantic segmentation model based on lightweight network(Thin-Deeplabv3+),which improves the Deeplabv3+encoder,and uses the light and dilated network(LDNet)to extract features from input remote sensing images.Then,the atrous spatial pyramid pooling(ASPP)module with expansion coefficients of(2,12,24,36)is used to enhance feature extraction.Finally,experiments are conducted on the Gaofen Image Dataset(GID)and the DeepGlobe Land Cover Classification Challenge(DLCCC),and the results show that the recognition accuracy of Thin-Deeplabv3+is higher than that of Deeplabv3+,and the required parameter number is only about one tenth of that of Deeplabv3+.

关键词

LDNet/地类识别/遥感影像/Thin-Deeplabv3+

Key words

light and dilated network(LDNet)/land class identification/remote sensing image/Thin-Deeplabv3+

引用本文复制引用

基金项目

国家自然科学基金(61876091)

南京大学计算机软件新技术国家重点实验室开放基金(KFKT2022B01)

南京邮电大学校级科研项目(NY221071)

出版年

2024
南京邮电大学学报(自然科学版)
南京邮电大学

南京邮电大学学报(自然科学版)

CSTPCD北大核心
影响因子:0.486
ISSN:1673-5439
参考文献量23
段落导航相关论文