Modern Electronics Technique (现代电子技术), 2025, Vol. 48, Issue 1: 151-156. DOI: 10.16652/j.issn.1004-373x.2025.01.025

Design of MobileNetV1 object detection accelerator based on FPGA

Yan Fei (严飞)¹, Zheng Xuwen (郑绪文)², Meng Chuan (孟川)², Li Chu (李楚)², Liu Yinping (刘银萍)³

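Author information

  • 1. School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, Jiangsu, China; Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, Nanjing 210044, Jiangsu, China
  • 2. School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044, Jiangsu, China
  • 3. School of Emergency Management, Nanjing University of Information Science and Technology, Nanjing 210044, Jiangsu, China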

Abstract

Convolutional neural networks (CNNs) are widely used for object detection, but their large parameter counts and computational loads make detection slow and power-hungry, and make the networks difficult to deploy on hardware platforms. This paper therefore proposes a method that accelerates MobileNetV1 object detection on a combined CPU and FPGA architecture. First, the width and resolution hyperparameters are set and the network parameters are converted to fixed point, which reduces the model's parameter count and computational load. Second, the convolutional layer and the batch normalization layer are fused to reduce network complexity and increase computation speed. Third, an eight-channel inter-kernel parallel convolution engine is designed, in which each channel performs convolution using line-buffered multiplication and an adder tree. Finally, exploiting the FPGA's parallel computation and pipelining, the eight-channel engine is reused to carry out three different types of convolution, which reduces hardware resource usage and power consumption. Experimental results show that the design accelerates MobileNetV1 object detection in hardware, reaching a frame rate of 56.7 f/s at a power consumption of only 0.603 W.
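
The fusion of a convolutional layer with its batch normalization layer mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the authors' formulation, so what follows is the standard folding identity applied to a 1x1 (pointwise) convolution at a single pixel; all function names and the toy values are assumptions made for illustration, not taken from the paper.

```python
import math

def fold_batchnorm(weights, bias, gamma, beta, mean, var, eps=1e-5):
    """Fold per-output-channel batch-norm parameters into conv weights and bias.

    weights[o] is the flat list of kernel weights for output channel o.
    (Hypothetical helper; the paper's own implementation is not shown.)
    """
    folded_w, folded_b = [], []
    for o, kernel in enumerate(weights):
        scale = gamma[o] / math.sqrt(var[o] + eps)
        folded_w.append([w * scale for w in kernel])
        folded_b.append((bias[o] - mean[o]) * scale + beta[o])
    return folded_w, folded_b

def pointwise_conv(x, weights, bias):
    """1x1 convolution at one pixel; x holds one value per input channel."""
    return [sum(w * v for w, v in zip(kernel, x)) + b
            for kernel, b in zip(weights, bias)]

def batchnorm(y, gamma, beta, mean, var, eps=1e-5):
    """Inference-time batch normalization applied per output channel."""
    return [g * (v - m) / math.sqrt(s + eps) + bt
            for v, g, bt, m, s in zip(y, gamma, beta, mean, var)]

# Toy values (assumed): 3 input channels, 2 output channels.
x = [0.5, -1.0, 2.0]
weights = [[0.2, -0.1, 0.4], [1.0, 0.5, -0.3]]
bias = [0.1, -0.2]
gamma, beta = [1.5, 0.8], [0.05, -0.1]
mean, var = [0.3, -0.4], [0.9, 1.6]

# Two-step reference: convolution followed by batch normalization.
reference = batchnorm(pointwise_conv(x, weights, bias), gamma, beta, mean, var)

# Single-step equivalent: convolution with the folded weights and bias.
fw, fb = fold_batchnorm(weights, bias, gamma, beta, mean, var)
fused = pointwise_conv(x, fw, fb)
```

Under these assumptions `fused` and `reference` agree up to floating-point rounding, so the batch normalization step needs no separate hardware stage at inference time. Fixed-point conversion of the folded parameters would be a further step that this sketch does not cover.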

Key words

CNN/object detection/FPGA/MobileNetV1/parallel computing/hardware acceleration

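Publication year

2025
Modern Electronics Technique (现代电子技术)
Shaanxi Electronics Magazine Press (陕西电子杂志社)

Indexed in: CSTPCD, Peking University Core Journals (北大核心)
Impact factor: 0.417
ISSN: 1004-373X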