Design of an energy-efficient, low-latency BNN hardware accelerator

周培培¹, 杜高明¹, 李桢旻¹, 王晓蕾¹


Author information

1. School of Microelectronics, Hefei University of Technology, Hefei 230601, Anhui, China

Abstract

In hardware implementations of binary neural networks (BNNs), the large number of zero values increases the amount of computation, and repeated operations between the same weight data and the same feature-map data increase the computing cycles and computing power. To address these problems, this paper proposes an all-zero skipping method and a precomputed result caching method, which effectively reduce the network's computation, computing cycles, and computing power. A BNN hardware accelerator, a handwritten digit recognition system, is then designed on a field programmable gate array (FPGA). Experimental results show that, with the two proposed methods applied, the accelerator reaches an average energy efficiency of 1.81 TOPS/W at a frequency of 100 MHz, an improvement of 1.27 to 4.34 times over other BNN accelerators.
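The two ideas named in the abstract can be illustrated in software with a minimal sketch. This is not the paper's hardware design, which the abstract does not describe. It assumes input windows packed as 0/1 bits, weights of +1/-1, and a dictionary standing in for the result cache; the names `dot_bits` and `run_layer` are hypothetical.

```python
from typing import Dict, List, Tuple


def dot_bits(window: int, weights: List[int]) -> int:
    """Sum the +1/-1 weights at positions where the window bit is 1."""
    return sum(w for i, w in enumerate(weights) if (window >> i) & 1)


def run_layer(
    windows: List[int], kernels: List[List[int]]
) -> Tuple[List[List[int]], Dict[str, int]]:
    """Evaluate every (window, kernel) pair with skipping and caching.

    Assumption: an all-zero window always yields 0, so it is skipped.
    Assumption: a repeated (window, kernel) pair reuses a cached result.
    """
    cache: Dict[Tuple[int, int], int] = {}
    stats = {"skipped": 0, "cache_hits": 0, "computed": 0}
    outputs: List[List[int]] = []
    for window in windows:
        row: List[int] = []
        for k, weights in enumerate(kernels):
            if window == 0:
                # All-zero skipping: no arithmetic is performed.
                stats["skipped"] += 1
                row.append(0)
                continue
            key = (window, k)
            if key in cache:
                # Precomputed result caching: reuse the earlier result.
                stats["cache_hits"] += 1
            else:
                cache[key] = dot_bits(window, weights)
                stats["computed"] += 1
            row.append(cache[key])
        outputs.append(row)
    return outputs, stats


if __name__ == "__main__":
    windows = [0b0000, 0b0101, 0b0101, 0b1111]
    kernels = [[1, -1, 1, -1], [1, 1, -1, -1]]
    outputs, stats = run_layer(windows, kernels)
    print(outputs)
    print(stats)
```

In this toy input, one window is all zeros and one window repeats, so only half of the eight window-kernel pairs need an actual dot product. How much a real accelerator saves depends on the data and on how the cache is built in hardware.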

Key words

binary neural network (BNN); weight sharing; repeated operation; field programmable gate array (FPGA); hardware accelerator


Publication year

2024

Journal of Hefei University of Technology (Natural Science)

Hefei University of Technology

Indexed in: CSTPCD; Peking University Core Journals

Impact factor: 0.608

ISSN: 1003-5060
