天地一体化信息网络2023,Vol.4Issue(4) :79-85.DOI:10.11959/j.issn.2096-8930.2023045

稀疏递归神经网络的可扩展低功耗加速器

Scalable Low Power Accelerator for Sparse Recurrent Neural Network

金磐石 李俊杰 王静逸 李鹏翀 邢磊 李晓栋
天地一体化信息网络2023,Vol.4Issue(4) :79-85.DOI:10.11959/j.issn.2096-8930.2023045

稀疏递归神经网络的可扩展低功耗加速器

Scalable Low Power Accelerator for Sparse Recurrent Neural Network

金磐石 1李俊杰 2王静逸 2李鹏翀 3邢磊 2李晓栋1
扫码查看

作者信息

  • 1. 中国建设银行股份有限公司,北京 100034
  • 2. 建信金融科技有限责任公司,上海 321004
  • 3. 浪潮电子信息产业股份有限公司,山东 济南 250000
  • 折叠

摘要

利用银行网点内边缘计算设备进行客流分析、安全保护、风险防控等应用日益广泛,其中 AI 推理芯片的性能和功耗已经成为边缘计算设备选型的一个非常重要的因素.针对递归神经网络由数据依赖性和低数据重用性导致的功耗大、推理性能弱、能效低,难以在低功耗平台上处理等问题,利用FPGA实现了一种电压可扩展的稀疏循环神经网络(RNN)低功率加速器,并在边缘设计算设备上进行了验证.首先,对稀疏RNN进行分析并采用网络压缩的方法设计了处理阵列;其次,由于稀疏RNN的工作负载不平衡,引入电压缩放方法以保持低功耗和高吞吐量.试验表明,该方法可以显著提高系统的RNN 推理速度并降低芯片的处理功耗.

Abstract

The use of edge computing devices in bank outlets for passenger flow analysis,security protection,risk prevention and con-trol is increasingly widespread,among which the performance and power consumption of AI reasoning chips have become a very im-portant factor in the selection of edge computing devices.Aiming at the problems of recurrent neural network,such as high power con-sumption,weak reasoning performance and low energy efficiency,which were caused by data dependence and low data reusability,this paper realized a sparse RNN low-power accelerator with scalable voltage by using FPGA,and verifies it on the edge design and calcula-tion equipment.Firstly,the sparse-RNN was analyzed and the processing array was designed by network compression.Secondly,due to the unbalanced workload of sparse RNN,it introduced voltage scaling method to maintain low power consumption and high throughput.Experiments show that this method could significantly improve the RNN reasoning speed of the system and reduce the processing pow-er consumption of the chip.

关键词

RNN/稀疏/低功耗/加速方案

Key words

RNN/sparse/low power consumption/acceleration scheme

引用本文复制引用

出版年

2023
天地一体化信息网络

天地一体化信息网络

CSTPCD
ISSN:
参考文献量9
段落导航相关论文