基于特征分布调整的深度神经网络二值量化方法

扫码查看

原文链接

NETL
NSTL
万方数据

中文摘要：二值卷积神经网络(BNNs)由于其占用空间小、计算效率高而受到关注.但由于量化激活特征的正负部分分布不均等问题,二值网络和浮点深度神经网络(DNNs)之间存在着明显的性能差距,影响了其在资源受限平台上的部署.二值网络性能受限的主要原因是特征离散性造成的信息损失以及分布优化不当造成的语义信息消失.针对此问题,应用特征分布调整引导二值化,通过调整特征的均值方差均衡特征分布,减小离散性造成的信息损失.同时,通过分组激励与特征精调模块设计,调整优化量化零点位置,均衡二值化激活分布,最大程度保留语义信息.实验表明,所提出方法在不同骨干网络、使用不同数据集时均能取得较好效果,其中在CIFAR-10上使用ResNet-18网络量化后网络准确率仅损失0.4％,高于当前主流先进二值量化算法.

外文标题：Feature distribution guided binary neural networks

外文摘要：In recent years,binary neural networks(BNNs)have received attention due to their small memory consumption and high computational efficiency.However,there exists a significant performance gap between BNNs and floating-point deep neural networks(DNNs)due to problems,such as imbalanced distributions of positive and negative parts of quantized activation features,which affects their deployment on resource-constrained platforms.The main reason for the limited accuracy of binary networks is the information loss caused by feature discretization and the disappearance of semantic information caused by improper distribution optimization.To address this problem,this paper applies feature distribution adjustment to guide binarization,which adjusts the mean-variance of features to balance the feature distribution and reduce the information loss caused by discretization.At the same time,through the design of group excitation and feature fine-tuning module,the quantization zero points are optimized to balance the binarization activation distributions and retain the semantic information to the maximum extent.Experiments show that the proposed method achieves better results on different backbone networks using different datasets,in which only 0.4％of accuracy is lost after binarizing ResNet-18 on CIFAR-10,which surpasses the current mainstream BNNs.

外文关键词：

feature distributionmean and variance adjustmentsemantic information speicherungmodel compressionbinary neural networksneural network quantization

作者：

刘畅、陈莹

展开 >

作者单位：

江南大学轻工过程先进控制教育部重点实验室,江苏无锡 214122

关键词：

特征分布均值方差调整语义信息保留模型压缩二值神经网络模型量化

基金：

国家自然科学基金

项目编号：

62173160

出版年：

2024

DOI：

10.13195/j.kzyjc.2022.1945

控制与决策

东北大学

控制与决策

CSTPCD北大核心

影响因子：1.227

ISSN：1001-0920

年,卷(期)：2024.39(6)

浏览量1