基于YOLOX-L-TN模型的番茄果实识别
Tomato Fruit Recognition Based on YOLOX-L-TN Model
李名博 1刘玉乐 1穆志民 2郭俊旺 1卫勇 1任东悦 1贾济深 1卫泽中 1栗宇红3
作者信息
- 1. 天津农学院工程技术学院,天津 300384
- 2. 天津农学院基础科学学院,天津 300384
- 3. 山西省长治市沁县农业农村局,山西 长治 046499
- 折叠
摘要
针对植物工厂对番茄采摘作业的智能化需求,为克服在采摘作业过程中因番茄果实大小不一、遮挡重叠造成的识别精度不高和速度不快的问题,提出了YOLOX-L的改进型目标识别模型YOLOX-L-TN.该模型依据特征图的通道和空间注意力机制的内部结构和原理,设计了含有残差结构的TN模块,并融入到YOLOX-L的主干网络中,在保持网络轻量化的同时,实现模型识别速度和精度的同时提升.与YOLOX-L相比,YOLOX-L-TN的AP值提高了4.81个百分点,单张图像的识别时间提升了0.141 7 s,TN模块的最佳位置为输入端与主干网络之间.进一步将TN模块与类似模块SENet、CAM、CBAM和CAM进行对比,AP值分别提高0.53、4.19、6.12、6.34个百分点,单张图像识别时间分别提升0.019 1、0.025 0、0.021 1、0.018 9 s.由此可见,提出的YOLOX-L-TN模型具有精度高、识别速度快、鲁棒性高等优点,可为番茄后期的智能采摘提供技术支持.
Abstract
Aiming at the intelligent demand for tomato picking operation in plant factories,in order to overcome the problems of low recognition accuracy and low speed caused by different sizes and overlapping of tomato fruits during picking operation,an improved target recognition model of YOLOX-L-TN was proposed,in which a TN module containing residual structure was designed according to the internal structure and principle of channel and spatial attention mechanism of feature graph,and integrated into the backbone network of YOLOX-L.This model improved the speed and accuracy of model recognition while maintaining the lightweight of the network.Compared with YOLOX-L,the AP value of YOLOX-L-TN was increased by 4.81 percentage points,and the recognition time of single image is increased by 0.141 7 s,and the optimal position of TN module was between the input and the backbone network.Furthermore,TN module was compared with similar modules SENet,CAM,CBAM and CAM,and the results showed that AP value was increased by 0.53,4.19,6.12 and 6.34 percentage points,respectively,and the recognition time of single image is increased by 0.019 1,0.025 0,0.021 1,0.018 9 s,respectively.In conclusion,the proposed YOLOX-L-TN model had the advantages of high precision,fast identification speed and high robustness,which provided technical support for the intelligent picking of tomatoes in the later stage.
关键词
番茄识别/注意力机制/TN模块/YOLOX-LKey words
tomato recognition/attention mechanism/TN module/YOLOX-L引用本文复制引用
基金项目
天津市科技计划(21YDTPJC00600)
天津市教委教学改革项目(A201006102)
出版年
2024