Journal of Applied Sciences (应用科学学报), 2024, Vol. 42, Issue 1: 1-14. DOI: 10.3969/j.issn.0255-8297.2024.01.001

Object Detection Based on Nonlinear Gaussian Squared Distance Loss

Li Rui (李瑞), Li Yi (李毅)

Author information

  • 1. College of Computer Science, Sichuan University, Chengdu 610065, Sichuan, China

Abstract

Loss functions in the intersection over union (IoU) family have inherent limitations that constrain the accuracy and stability of bounding box regression in object detection. To address this problem, a bounding box regression loss based on a nonlinear Gaussian squared distance is proposed. First, the bounding box is modeled as a Gaussian distribution so that three factors are considered jointly: overlap, center point distance, and aspect ratio. Then, a Gaussian squared distance is proposed to measure the distance between the two distributions. Finally, a nonlinear function consistent with the optimization trend is designed to transform the Gaussian squared distance into a loss function that facilitates neural network learning. Experimental results show that, compared with the IoU loss, the proposed method improves mean average precision on the mask region-based convolutional neural network (Mask R-CNN), the fully convolutional one-stage object detector (FCOS), and the adaptive training sample selection (ATSS) object detector by 0.3%, 1.1%, and 2.3%, respectively. These results demonstrate that the method effectively improves object detection performance and supports the regression of high-precision bounding boxes.

Key words

object detection / bounding box regression / Gaussian distribution / intersection over union (IoU) / convolutional neural network

Funding

National Natural Science Foundation of China (62172290)

Publication year

2024
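Journal of Applied Sciences (应用科学学报)
Shanghai University; Shanghai Institute of Technical Physics, Chinese Academy of Sciences

Indexed in: CSTPCD, CSCD, PKU Core (北大核心)
Impact factor: 0.594
ISSN: 0255-8297
Number of references: 32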