首页|基于改进DeepLabv3+的林木图像分割方法

基于改进DeepLabv3+的林木图像分割方法

扫码查看
近年来,人们越来越重视森林资源管理规划,但是森林结构复杂、分布破碎,准确区分树木区域与非树木区域以及预测森林面积比较困难。针对森林区域提取困难、边界分割不精确的问题,提出改进的DeepLabv3+模型研究森林区域智能精准提取。首先,在编码器阶段使用CFF(cross feature fusion)模块融合主干网络与空洞卷积的多尺度低级和高级特征来获得高分辨率的掩码特征,有效地聚合多层次编码器特征;其次,在解码器阶段引入cSE(spatial squeeze and channel excitation)通道注意力模块,使模型能够更好地获取不同通道上的特征,提高网络的表现力使其关注到输入图像的边缘位置,从而提高分割准确率;最后,将卷积之后的深层特征与浅层特征进行融合,增强网络的分割性能。研究表明:基于改进的DeepLabv3+深度学习神经网络得到的森林类别平均像素准确率(mPA)达到了 93。85%,平均交并比(mIoU)达到了 89。17%,准确率(Accuracy)达到了 95。66%,相较于原始DeepLabv3+网络分别提升了 0。77%,1。8%和0。89%,模型参数量减少了 48。84 M,检测速度FPS提升了 17。93帧/s,检测效率更高,分割性能更好。
Forest image segmentation method based on improved DeepLabv3+
Forest resource management planning has gained increasing attention due to the vital role that forests play in ecological balance,climate regulation,and the provision of various resources.Due to the complex structure of forests and their fragmented distribution,it is difficult to accurately distinguish between tree areas and non-tree areas and it is difficult to accurately predict forest areas.To solve the problems of difficult extraction of forest area and inaccurate boundary segmentation,based on UAV remote sensing images of forest areas,an improved DeepLabv3+model was proposed to investigate the intelligent and accurate extraction of forest areas,to provide technical support for intelligent monitoring and management of forest area.Firstly,the backbone network Xception in the model was replaced with a lightweight MobileNetv2 to reduce the calculation of model parameters and improve the operation efficiency of the model.Secondly,the CFF(cross feature fusion)module was used to fuse the multi-scale low-level and high-level features of the backbone network and the cavity convolution in the encoder stage to obtain high-resolution mask fea-tures and effectively aggregate the multi-level encoder features.Thirdly,the cSE(spatial squeeze and channel excita-tion)channel attention module was introduced in the decoder stage by establishing the dependency relationship between channels.The cSE model can better obtain the features on different channels,emphasize the useful features,suppress the useless features,and improve the expressiveness of the network to pay attention to the edge position of the input image,so as to improve the accuracy of forest area segmentation and the efficiency of the model.Finally,the deep features after convolution were fused with the shallow features to enhance the segmentation performance of the network.The results showed that the average pixel accuracy of mPA model for forest categories obtained based on the improved DeepLabv3+deep learning neural network segmentation model reached 93.85%,the average intersection and merger ratio mIoU reached 89.17%,and the accuracy reached 95.66%,which were 0.77%,1.8%and 0.89%higher than the original DeepLabv3+network.The number of model parameters was reduced by 48.84 M,and the model detection speed FPS was increased by 17.93 frames per second,detection efficiency was enhanced and segmen-tation performance was improved.

forest image segmentationDeepLabv3+modelMobileNetfeature fusionattention mechanism

林洁如、朱洪前、杨国、肖恒玉、胡涛、何翔

展开 >

中南林业科技大学材料科学与工程学院,长沙 410004

林木图像分割 DeepLabv3+模型 MobileNet 特征融合 注意力机制

国家自然科学基金面上项目湖南教育厅科研项目

6207625621C0168

2024

林业工程学报
南京林业大学

林业工程学报

CSTPCD北大核心
影响因子:0.742
ISSN:2096-1359
年,卷(期):2024.9(3)