基于Swin-Unet的遥感卫星图像分割研究

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：在过去的几年中,卷积神经网络(CNN)在图像分割方向取得了很大的进展,但是由于卷积运算的局限性,不能很好地处理全局与长距离依赖关系等问题,提出了一种基于 Swin-Unet的图像分割模型,通过将Transformer Block模块引入到 U-Net网络模型中的编码与解码阶段,并使用更适合二分类的Dice＿loss损失函数,来进行特征提取和学习.使用用于城市建筑物遥感卫星图像研究的Inria Aerial Image Labeling数据集进行试验.结果表明,所采用的Swin-Unet模型可以从遥感卫星图像中提取更多的语义信息,从而达到更好的识别效果,IoU分数为 0.70.

外文标题：Research on Remote sensing satellite image segmentation based on Swin-Unet

外文摘要：In the past few years,convolutional neural network(CNN)has made great progress in the di-rection of image segmentation.However,due to the limitations of convolution operation,it can not deal with the global and long-distance dependence well.An image segmentation model based on Swin-Unet is proposed.By introducing the Transformer Block module into the Encoder and De-coder stages of the U-Net network model,and using the Dice＿loss loss function,which is more suitable for binary classification,feature extraction and learning are carried out.The Inria Aerial Image Labeling data set for remote sensing satellite image research of urban buildings was used for experiments.Experiments show that the Swin-Unet model can extract more semantic informa-tion from remote sensing satellite images,so as to achieve better recognition effect,and the IoU score is 0.70.

外文关键词：

Remote sensing satellite imagesImage segmentationDeep learningSwin-UnetTrans-former

作者：

王俊博、孙皓月、刘晓

展开 >

作者单位：

河北建筑工程学院,河北张家口 075000

关键词：

遥感卫星图像图像分割深度学习 Swin-Unet Transformer

出版年：

2024

DOI：

10.3969/j.issn.1008-4185.2024.03.039

河北建筑工程学院学报

河北建筑工程学院

河北建筑工程学院学报

影响因子：0.502

ISSN：1008-4185

年,卷(期)：2024.42(3)

浏览量1