基于跨模态特征融合的RGB-D花椒图像显著性检测

扫码查看

原文链接

万方数据
维普

中文摘要：针对现有显著性检测模型无法有效地协同花椒枝干彩色图像和深度图像特征,建立基于注意力的RGB-D图像花椒枝干显著性检测模型.由两个单流卷积网络分别提取彩色和深度图像特征;设计基于空间和通道注意力机制的跨模态融合模块,用于融合多尺度的彩色流和深度流特征;研发多尺度监督机制,用于缓解由于采用最近邻域上采样的解码方式导致边缘预测不准确的问题.实验结果表明:该方法的平均精确度、平均召回率、综合评价指标和平均绝对误差均优于对比显著性目标检测方法.

外文标题：RGB-D Pepper Image Saliency Detection Based on Cross-modal Feature Fusion

外文摘要：To address the inability of existing saliency detection models to utilize the features of pepper branch color images and depth images effectively,an attention-based RGB-D image pepper branch saliency detection model is proposed.Color and depth image features are extracted separately by two single-stream convolutional networks.A cross-modal fusion module based on spatial and channel attention mechanisms is designed to fuse multi-scale color stream and depth stream features.A multi-scale supervision mechanism is developed to alleviate the inaccurate edge prediction caused by the use of nearest-neighbor upsampling decoding.Experimental results show that the average accuracy,average recall rate,comprehensive evaluation index and average absolute error of the proposed method are all superior to the compared salient object detection methods.

外文关键词：

automated pepper harvestingpicture processingRGB-D significance target detectioncross-mode fusionattention mechanismmulti-dimension supervision

作者：

李节、孙成龙、王逸涵、杨前、李柏林

展开 >

作者单位：

西南交通大学机械工程学院,四川成都 610031

关键词：

花椒自动化采摘图像处理 RGB-D显著性目标检测跨模态融合注意力机制多尺寸监督

出版年：

2024

DOI：

10.19344/j.cnki.issn1671-5276.2024.06.042

机械制造与自动化

南京机械工程学会南京机电产业（集团）有限公司

机械制造与自动化

CSTPCD

影响因子：0.29

ISSN：1671-5276

年,卷(期)：2024.53(6)