基于EfficientNet的无锚框目标检测模型

扫码查看

原文链接

国家科技期刊平台
NETL
NSTL
万方数据
维普

中文摘要：目标检测是计算机视觉的热门研究方向之一,包含分类和定位两个任务.针对单阶段目标检测模型普遍存在的两个问题:训练时正负样本的不均衡以及锚框的设置需要人工干预,提出一种基于EfficientNet的无锚框目标检测模型(Anchor-free Efficientnet-based Object Detector,AEOD).AEOD先筛选出落在目标框中的特征点,再根据特征点所作的预测计算代价矩阵,在训练时基于代价矩阵为目标动态分配正负样本,从而达到平衡二者数量的目的.此模型通过特征图中的特征点直接预测目标的位置和形状,不仅省去了人工设置锚框的环节,还提高了可检出目标的数量.此外,可缩放的EfficientNet进一步提高了模型的泛化能力,使之可以接收多尺度的输入.在PASCAL VOC07+12 数据集中,AEOD最高可以获得91.3%的平均精度(mAP),检测速度达到32.1 FPS,较其他主流的目标检测模型有显著提升.

外文标题：An Anchor-free Object Detection Model Based on EfficientNet

外文摘要：Object detection is one of the hot research areas in computer vision,which includes two tasks:classification and location.Due to the two common problems appearing in one-stage object detector:extreme imbalance between positive/negative samples during training and anchors pre-defined deeply depending on manual settings,an anchor-free efficientnet-based object detector(AEOD)is pro-posed.AEOD first selects out the feature points that fall in the target box,then calculates the cost matrix based on values predicted by these feature points,finally assigns the positive/negative samples to the target dynamically according to the cost matrix during the training.Therefore,the number of positive/negative samples is balanced to enhance the performance of the model.AEOD directly predicts location and shape of the object through the feature points in the feature maps.As a result,not only the step of pre-defining anchors can be skipped,but also the number of objects that successfully detected increases.In addition,the scalable backbone(EfficientNet)improves the generalization ability of AEOD,it can receive multi-scale input.AEOD achieves the highest 91.3%mAP on PASCAL VOC07+12 at speed of 32.1 FPS,showing a significant improvement compared to other modern models.

外文关键词：

deep learningcomputer visionobject detectionpositive/negative samples assignment algorithmanchor-free

作者：

卜子渝、杨哲、刘纯平

展开 >

作者单位：

苏州大学计算机科学与技术学院,江苏苏州 215006

江苏省计算机信息处理技术重点实验室,江苏苏州 215006

江苏省大数据智能工程实验室,江苏苏州 215006

关键词：

深度学习计算机视觉目标检测正负样本分配算法无锚框

基金：

国家自然科学基金资助项目江苏省高校自然科学基金资助项目

项目编号：

6200225319KJA230001

出版年：

2024

DOI：

10.3969/j.issn.1673-629X.2024.01.006

计算机技术与发展

陕西省计算机学会

计算机技术与发展

CSTPCD

影响因子：0.621

ISSN：1673-629X

年,卷(期)：2024.34(1)

参考文献量2