基于Swin-Unet的遥感卫星图像分割研究
Research on Remote sensing satellite image segmentation based on Swin-Unet
王俊博 1孙皓月 1刘晓1
作者信息
- 1. 河北建筑工程学院,河北 张家口 075000
- 折叠
摘要
在过去的几年中,卷积神经网络(CNN)在图像分割方向取得了很大的进展,但是由于卷积运算的局限性,不能很好地处理全局与长距离依赖关系等问题,提出了一种基于 Swin-Unet的图像分割模型,通过将Transformer Block模块引入到 U-Net网络模型中的编码与解码阶段,并使用更适合二分类的Dice_loss损失函数,来进行特征提取和学习.使用用于城市建筑物遥感卫星图像研究的Inria Aerial Image Labeling数据集进行试验.结果表明,所采用的Swin-Unet模型可以从遥感卫星图像中提取更多的语义信息,从而达到更好的识别效果,IoU分数为 0.70.
Abstract
In the past few years,convolutional neural network(CNN)has made great progress in the di-rection of image segmentation.However,due to the limitations of convolution operation,it can not deal with the global and long-distance dependence well.An image segmentation model based on Swin-Unet is proposed.By introducing the Transformer Block module into the Encoder and De-coder stages of the U-Net network model,and using the Dice_loss loss function,which is more suitable for binary classification,feature extraction and learning are carried out.The Inria Aerial Image Labeling data set for remote sensing satellite image research of urban buildings was used for experiments.Experiments show that the Swin-Unet model can extract more semantic informa-tion from remote sensing satellite images,so as to achieve better recognition effect,and the IoU score is 0.70.
关键词
遥感卫星图像/图像分割/深度学习/Swin-Unet/TransformerKey words
Remote sensing satellite images/Image segmentation/Deep learning/Swin-Unet/Trans-former引用本文复制引用
出版年
2024