首页|基于多尺度信息融合的杂草语义分割方法

基于多尺度信息融合的杂草语义分割方法

扫码查看
玉米田间杂草有效分割是无人机精准变量施药的必要前提,针对传统CNN语义分割模型难以克服因玉米与杂草相互遮挡、小目标杂草漏检等问题,以玉米3~5叶龄期田间杂草无人机正射数码影像为对象,提出Transformer与CNN多尺度信息融合的杂草语义分割模型TFPSP-CA.首先,将基础模型PSPNet特征金字塔替换为BiFPN,加强特征融合,增强模型对图像细节的学习能力和上下文信息获取能力;使用MobileNet系列网络替换原有的ResNet网络,加快模型预测速度、减小模型规模,得到改进的杂草分割模型FPSPNet.结果表明:改进的分割模型FPSPNet在小目标、中目标、大目标的mloU、PA分别为84.48%、89.36%,82.05%、89.34%和83.32%、89.26%,尤其对小目标杂草分割精度提升明显,其mIoU、PA比基础分割模型分别提高4.60%和2.42%.其次,进一步在FPSPNet金字塔模块的输出端引入Transformer特征输出模块并行连接从而获得更多尺度信息,并加入结合坐标与通道信息的CA注意力机制,得到TFPSP-CA杂草分割模型,该模型能够更好地捕捉全局特征和长距离依赖关系,从而提高对复杂环境下杂草分割问题的处理能力.结果表明:改进后的TFPSP-CA模型在无遮挡、轻度遮挡、严重遮挡情况下杂草分割mIoU和PA分别为90.21%、91.98%,89.44%、89.11%和87.59%、87.53%,改进模型在严重遮挡情况下精度提升明显,对比原始模型PSPNet和FPSPNet,mIoU、PA分别提升6.34%、2.27%和10.96%、5.22%.
Semantic Segmentation of Weeds Based on Multiscale Information Fusion
Effective segmentation of weeds in maize fields is a prerequisite for accurate variable herbicide application by UAVs.Aiming at the problems of missing detection of the traditional CNN semantic segmentation model for the mutual occlusion of maize and weeds,and the small target weeds,etc.,taking UAV ortho-digital image of the field weed at three-five-leaf-age of maize as the object,we propose the weed semantic segmentation model TFPSP-CA based on multi-scale information fusion of Transformer and CNN.Firstly,the base model PSPNet feature pyramid is replaced with BiFPN to strengthen feature fusion and enhance the model's ability to learn image details and acquire contextual information;the original ResNet network is replaced with the MobileNet series network to speed up the model prediction speed and reduce the model size,and the improved weed segmentation model FPSPNet is obtained.The results show that the mloU and PA of the improved segmentation model FPSPNet are 84.48%,89.36%,82.05%,89.34%,and 83.32%,89.26%for small,medium,and large targets,respectively.The accuracy of weed segmentation is especially improved significantly for small targets,and its mIoU and PA are increased by 4.60%and 2.42%respectively compared to the base segmentation model.Secondly,the Transformer feature output module is further introduced at the output end of the FPSPNet pyramid module to connect in parallel to obtain more scale information.The CA attention mechanism combining coordinate and channel information is added to obtain the TFPSP-CA weed segmentation model.The model can better capture the global features and long-distance dependence,thus improving the processing ability of the weed segmentation problem in complex environments.The results show that the improved TFPSP-CA model has 90.21%,91.98%,89.44%,89.11%,and 87.59%,87.53%of weed segmentation mIoU and PA in the case of no occlusion,mild occlusion,and severe occlusion,respectively,and the accuracy of the improved model in the case of severe occlusion improves significantly,compared with the original models PSPNet and FPSPNet,the mIoU and PA are improved by 6.34%,2.27%and 10.96%and 5.22%,respectively.

weed segmentationmaize seedling stageprecision agricultureTransformerconvolutional neural network

曹英丽、陈晓安、蔺雨桐、李严、郭忠辉

展开 >

沈阳农业大学 信息与电气工程学院,沈阳 110161

国家数字农业区域创新分中心(东北),沈阳 110161

杂草分割 玉米苗期 精准农业 Transformer 卷积神经网络

2024

沈阳农业大学学报
沈阳农业大学

沈阳农业大学学报

CSTPCD北大核心
影响因子:0.667
ISSN:1000-1700
年,卷(期):2024.55(6)