结合空间注意力机制与多任务学习的耕地地块分割模型

Agricultural field segmentation using spatial attention mechanism and multi-task learning strategy

扫码查看

原文链接

维普
万方数据

中文摘要：地块作为农业耕作的最小单元,对其精准识别是国土资源监测、耕地利用监测的需要.现有的方法多使用手工勾绘的方式获取,耗时费力,成本高昂,并且无法实现实时、近实时更新.本文设计了一种基于空间注意力机制与多任务学习的地块分割模型—Field-Net.模型基于UNet架构,增加了空间注意力机制,并采用多任务学习的策略,在语义分割的基础上增加了边界、像素到地块边界的距离等任务.在山东省东营市利津县对模型的性能进行了测试,结果发现耕地地块识别的交并比达到了 87.05％,总体精度为92.23％.Field-Net模型的性能优于几种高性能的深度学习框架,交并比较Link-Net模型高出0.26％,较DeepLab v3+高出7.59％.在空间泛化性能测试中,Field-Net模型的平均交并比Link-Net模型高出3.51％,空间泛化性能明显提升.通过消融试验发现,使用空间注意力机制的Field-Net较ResUNet模型F1-Score提高了 1.01％,交并比提高了 1.6％;多任务学习策略使得Field-Net模型的F.-Score提高了 0.18％,交并比提高了 0.21％;将模型权重特征进行可视化后发现空间注意力机制模块和多任务学习策略可以使模型学习到的特征更加聚集于地块边界和地块内部,使学习到的特征更具代表性.总体而言,Field-Net模型可以支撑地块级别国土资源和耕地非农化、非粮化利用监测,从而提高监测的效率和时效性.

外文摘要：The accurate identification of agricultural fields,which are the smallest units of agricultural farming,is crucial for monitoring land resources and arable land utilization.Manual mapping methods are time-consuming and expensive,and they are incapable of real-time or near-real-time updates.To enhance agricultural field delineation efficiency,we introduce Field-Net,a field segmentation model that leverages a spatial attention mechanism and multitask learning in this research.Field-Net is based on the UNet architecture.It combines a spatial attention mechanism and a multitask learning approach.In addition to the segmentation task,we incorporate boundary identification and distance to the field boundary as two additional tasks,enabling the model to learn representative features related to fields.The model's performance was evaluated using Gaofen-1 and Ziyuan-3 satellite images with a spatial resolution of 2 m in Lijin County,Dongying City,Shandong Province.We labeled 3,480 tiles that measured 256x256 pixels in YanWo District,with 3000 used for training,360 for validation,and 120 for the spatial generalization performance test.We initially analyzed loss weight for the three tasks,i.e.,mask,boundary,and pixel-to-boundary distance,in multitask learning by using a gradient test.For multitask learning,loss weight should prioritize the mask segmentation task as the primary task,while other tasks should be considered secondary.Across the entire test set,Field-Net achieved an overall accuracy of 92.23％and an Intersection Over Union(IOU)of 87.05％.We compared Field-Net with four state-of-the-art architectures:DeepLab v3+,HRNet,LinkNet,and D-LinkNet.Field-Net outperformed all of them in semantic segmentation tasks,with an IOU that was 0.26％higher than Link-Net,the most accurate among the four compared models,and 7.59％higher than that of DeepLab v3+.In the spatial generalization performance test,the average IOU of the Field-Net model was 3.51％higher than that of the Link-Net model,and spatial generalization performance was significantly improved.Ablation tests demonstrated that the spatial attention mechanism and multitask learning strategy improved the Fl score by 1.01％and IOU by 1.6％compared with the ResUNet model.The multitask learning strategy led to an improvement of 0.18％in Fl score for Field-Net and an improvement of 0.21％in IOU.Although challenges remain in identifying contiguous fields due to unclear boundaries,future enhancements can incorporate multitemporal and high-resolution remote sensing images to improve field feature discrimination.Feature visualization analysis revealed that the spatial attention mechanism and multitask learning strategy enabled the model to learn clustered features at field boundaries and within plots,enhancing feature representativeness.Overall,the Field-Net model supports the field-level monitoring of cropland use,including nonagricultural applications,such as grain production,enhancing the efficiency and timeliness of land resource monitoring.In generating the field dataset of China,complex and fragmented croplands pose considerable challenges to this task.In the future,the problem of lack of samples for model training can be solved by accumulating field segmentation datasets from different regions by borrowing the paradigm of Image-Net,while a general model for channels,regions,and sensors should be constructed subsequently.In the future,with the arrival of the"large model"era of deep learning,constructing a model for the task of parcel segmentation is also necessary to segment every field from the perspective of model and dataset.

外文关键词：

filed segmentationField-Net modelspatial attention mechanismmulti-task learningGF satellite data

作者：

田富有、曹玉佩、赵航、吴炳方、曾红伟、刘亚洲、覃星力、张淼、朱亮、朱伟伟

展开 >

作者单位：

中国科学院空天信息创新研究院遥感科学国家重点实验室,北京 100094

北京师范大学,北京 100101

山东土地集团数字科技有限公司,枣庄 277000

中国科学院大学,北京 100049

展开 >

关键词：

地块分割 Field-Net模型空间注意力机制多任务学习高分卫星数据

出版年：

2024

DOI：

10.11834/jrs.20243191

遥感学报

中国地理学会环境遥感分会中国科学院遥感应用研究所

遥感学报

CSTPCD北大核心

影响因子：2.921

ISSN：1007-4619

年,卷(期)：2024.28(11)