首页|一种改进型LeNet的交通标识多分类异构加速器的实现

一种改进型LeNet的交通标识多分类异构加速器的实现

扫码查看
提出一种基于改进型 LeNet的交通标志多分类异构加速器的实现方案.该加速器利用 ARM+FPGA异构平台,将改进型 LeNet的前向推理部署到 FPGA上,实现并行计算.在 FPGA端,采用 AXI-Stream协议,通过 DMA实现数据高速流转,使用数组分区和多级流水线等技术实现数据的并行处理.在 ARM端使用PYNQ框架进行数据更新和加速器调度.在 GTSRB数据集上的实验结果显示,该设计方案在工作时钟频率为 50 MHz时,平均推理时间为 14.489 ms,在 MCU上的推理时间为 710 ms,加速比可达 49,对于实现交通标识多分类的边缘端应用具有显著的作用.
Implementation of an Improved LeNet Traffic Sign Multi-classification Heterogeneous Accelerator
An implementation of traffic sign multi-classification heterogeneous accelerator based on improved LeNet is proposed.The accelerator utilizes an ARM+FPGA heterogeneous platform to deploy the forward inference of the improved LeNet on the FPGA for parallel computing.On the FPGA side,the AXI-Stream protocol is employed with DMA to achieve high-speed data streaming,and techniques such as array partitioning and multi-level pipeline are utilized for parallel data processing.On the ARM side,the PYNQ framework is used for data updates and accelerator scheduling.Experimental results on GTSRB demonstrate that proposed design achieves an average inference time of 14.489 ms at a working clock frequency of 50 MHz,compared to 710 ms on the MCU,resulting in a speedup of up to 49 times.This design provides significant assistance for edge applications involving traffic sign multi-classification.

LeNetFPGAPYNQheterogeneous computing

杨永杰、郑君泰、马立、杨昊

展开 >

南通大学信息科学技术学院,南通 226019

LeNet FPGA PYNQ 异构计算

2024

北京大学学报(自然科学版)
北京大学

北京大学学报(自然科学版)

CSTPCD北大核心
影响因子:0.785
ISSN:0479-8023
年,卷(期):2024.60(6)