改进MetaFormer的轻量化模型设计

Design of Lightweight Model with Improved MetaFormer

扫码查看

原文链接

维普
万方数据

中文摘要：为了提升传统CNN的远距离特征提取能力,将深度可分离卷积操作嵌入到改进MetaFormer架构中,添加了通道混洗操作,提出了一种创新的轻量化CNN网络模型:ViT-Net.通过这种融合策略,不仅保留了ViTs的灵活性和扩展性,还增强了CNN模型的图像特征提取能力.ViTNet的轻量化优势,使其能够高效运行于计算资源有限的设备上.实验结果表明:在cifar10上,ViTNet-1.0×比MobileNetV2提高了1.8％准确率,延迟降低了32％,表现出良好的竞争力.

外文摘要：In order to improve the remote feature extraction capability of traditional CNN,this paper presents an innovative lightweight CNN network model known as ViTNet,with the deep detachable convolution operation embedded into the improved MetaFormer architecture and with channel shuffle operation added.This fusion strategy not only maintains the flexibility and scalability of ViTs,but also enhances the image feature extraction capability of the CNN model.Furthermore,lightweight enables ViTNet to operate efficiently on devices with restricted computing resources.The experimental results show that on cifar10,ViTNet-1.0× is more competitive than MobileNetV2 with accuracy improved by 1.8％ and latency reduced by 32％.

外文关键词：

image identificationdepth-wise separable convolutionlightweight networkmodel compression

作者：

徐飞、禹婷婷、张乐怡、张瑞轩

展开 >

作者单位：

西安工业大学计算机科学与工程学院,西安 710021

关键词：

图像识别深度可分离卷积轻量级网络模型压缩

出版年：

2024

DOI：

10.16185/j.jxatu.edu.cn.2024.04.401

西安工业大学学报

西安工业大学

西安工业大学学报

CSTPCDCHSSCD

影响因子：0.381

ISSN：1673-9965

年,卷(期)：2024.44(4)