Adaptive joint compression method for deep neural networks

Home > Archive>Volume 44, Issue 5, 2023 >21-32

Adaptive joint compression method for deep neural networks
DOI:
                        
CSTR:
                        [cstr]
                    
Author:
                        
Affiliation:
Clc Number:TP183 TH89
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Deep neural network compression methods with a single and fixed pattern are difficult to compress the model sufficiently due to the limitation of accuracy loss. As a result, the compressed model still needs to consume costly and limited storage resources when it is deployed, which is a significant barrier to its use in edge devices. To address this problem, this article proposes an adaptive joint compression method, which optimizes model structure and weight bit-width in parallel. Compared with the majority of existing combined compression methods, adequate fusion of sparsity and quantization methods is performed for joint compression training to reduce model parameter redundancy comprehensively. Meanwhile, the layer-wise adaptive sparse ratio and weight bit-width are designed to solve the sub-optimization problem of model accuracy and improve model accuracy loss due to the fixed compression ratio. Experimental results of VGG, ResNet, and MobileNet using the CIFAR-10 dataset show that the proposed method achieves 143. 0 ×, 151. 6 ×, and 19. 7 × parameter compression ratios. The corresponding accuracy loss values are 1. 3% , 2. 4% , and 0. 9% , respectively. In addition, compared with 12 typical compression methods, the proposed method reduces the consumption of hardware memory resources by 15. 3×~ 148. 5×. In addition, the proposed method achieves maximum compression ratio of 284. 2× whilemaintaining accuracy loss within limited range of 1. 2% on the self-built remote sensing optical image dataset.

Reference

Cited by

Get Citation

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:
Revised:
Adopted:
Online: August 17,2023
Published:

Home

Introduction

Current Issue

Editorial Committee

Policy

Contact Us

中文版

Get Citation

Related Videos

Share

Article Metrics

History

Article QR Code