Guaranteed Quantization Error Computation for Neural Network Model Compression

Wesley Cooke, Zihao Mo, Weiming Xiang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Neural network model compression techniques can address the computational limitations of deploying deep neural networks on embedded devices in industrial systems. This paper addresses the guaranteed output error computation problem for neural network compression with quantization. A merged neural network is built from a feedforward neural network and its quantized version so that it produces the exact output difference between the two networks. Optimization-based methods and reachability analysis methods are then applied to the merged neural network to compute the guaranteed quantization error. Finally, a numerical example is presented to validate the applicability and effectiveness of the proposed approach.
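The merged-network idea in the abstract can be illustrated with a small sketch: run a feedforward network and its quantized copy side by side on the same input and output their exact difference. The network sizes, the uniform symmetric quantization scheme, and the sampling-based error estimate below are all illustrative assumptions, not the paper's method; the paper applies optimization and reachability analysis to the merged network to obtain a *sound* (guaranteed) bound, whereas random sampling only yields a lower bound on the worst-case error.

```python
import numpy as np

rng = np.random.default_rng(0)

# A tiny feedforward ReLU network (illustrative stand-in for the paper's model).
W1, b1 = rng.standard_normal((8, 2)), rng.standard_normal(8)
W2, b2 = rng.standard_normal((1, 8)), rng.standard_normal(1)

def quantize(w, n_bits=4):
    """Uniform symmetric weight quantization (an assumed scheme)."""
    scale = np.max(np.abs(w)) / (2 ** (n_bits - 1) - 1)
    return np.round(w / scale) * scale

W1q, b1q = quantize(W1), quantize(b1)
W2q, b2q = quantize(W2), quantize(b2)

def merged(x):
    """'Merged' network: evaluates both copies on the same input
    and returns the exact output difference f(x) - f_q(x)."""
    h  = np.maximum(0.0, W1  @ x + b1)   # original hidden layer
    hq = np.maximum(0.0, W1q @ x + b1q)  # quantized hidden layer
    return (W2 @ h + b2) - (W2q @ hq + b2q)

# Sampling over the input box [-1, 1]^2 estimates the maximum output
# difference; this is only a lower bound on the true worst case, which
# the paper instead bounds from above via optimization / reachability.
xs = rng.uniform(-1.0, 1.0, size=(10_000, 2))
err = max(np.abs(merged(x)).max() for x in xs)
print(f"sampled max quantization error: {err:.4f}")
```

The key point of the construction is that the merged network's output is itself a neural-network function of the input, so any analysis tool that handles feedforward ReLU networks can bound the quantization error directly.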

Original language: English (US)
Title of host publication: 2023 IEEE International Conference on Industrial Technology, ICIT 2023
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798350336504
DOIs
State: Published - 2023
Event: 2023 IEEE International Conference on Industrial Technology, ICIT 2023 - Orlando, United States
Duration: Apr 4 2023 - Apr 6 2023

Publication series

Name: Proceedings of the IEEE International Conference on Industrial Technology
Volume: 2023-April

Conference

Conference: 2023 IEEE International Conference on Industrial Technology, ICIT 2023
Country/Territory: United States
City: Orlando
Period: 4/4/23 - 4/6/23

Keywords

  • model compression
  • neural networks
  • quantization

ASJC Scopus subject areas

  • Computer Science Applications
  • Electrical and Electronic Engineering
