Abstract
It is difficult to achieve real-time performance for deep neural networks (DNNs) in low-profile systems such as edge computing or Internet of Things (IoT) devices due to the large amount of computation required. A commonly used activation function is ReLU (Rectified Linear Unit) due to its simplicity and good performance. A key characteristic of the ReLU function is that it produces sparse vectors within the neural network. In this paper, we propose an optimized DNN accelerator for sparse vector multiplications. In particular, we propose a squeezer unit to detect zeros and then skip feeding that data to the processing elements. In addition, we design a dynamic scheduler to efficiently allocate multiple neuron computations to the processing elements. With our architecture, we achieve reduced hardware resources compared to prior work due to a 62% reduction in the number of required index bits. We also achieve 8.6% better processing speed than a system without the accelerator.
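The zero-skipping idea behind the squeezer unit can be sketched in software. The sketch below is a minimal illustration assuming a simple (index, value) compression format; the function names (`squeeze`, `sparse_dot`) and the index format are illustrative assumptions, not the paper's actual hardware interface.

```python
def relu(x):
    """ReLU zeroes out negative inputs, producing a sparse activation vector."""
    return [v if v > 0 else 0 for v in x]

def squeeze(activations):
    """Drop zeros, keeping (index, value) pairs for the survivors.
    In hardware, only these pairs would be fed to the processing elements,
    so multiplications by zero are never issued."""
    return [(i, v) for i, v in enumerate(activations) if v != 0]

def sparse_dot(squeezed, weights):
    """Multiply-accumulate over the nonzero activations only."""
    return sum(v * weights[i] for i, v in squeezed)

# Example: a 5-element activation vector after ReLU has only 2 nonzeros,
# so the dot product needs 2 multiplies instead of 5.
acts = relu([-1, 3, -2, 0, 2])        # [0, 3, 0, 0, 2]
pairs = squeeze(acts)                  # [(1, 3), (4, 2)]
weights = [1, 2, 3, 4, 5]
result = sparse_dot(pairs, weights)    # 3*2 + 2*5 = 16
```

In a hardware realization, the index width of each (index, value) pair is what the paper's 62% index-bit reduction would target; this sketch only conveys the dataflow.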
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 2023 Congress in Computer Science, Computer Engineering, and Applied Computing, CSCE 2023 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 887-892 |
Number of pages | 6 |
ISBN (Electronic) | 9798350327595 |
DOIs | |
State | Published - 2023 |
Event | 2023 Congress in Computer Science, Computer Engineering, and Applied Computing, CSCE 2023 - Las Vegas, United States Duration: Jul 24 2023 → Jul 27 2023 |
Publication series
Name | Proceedings - 2023 Congress in Computer Science, Computer Engineering, and Applied Computing, CSCE 2023 |
---|
Conference
Conference | 2023 Congress in Computer Science, Computer Engineering, and Applied Computing, CSCE 2023 |
---|---|
Country/Territory | United States |
City | Las Vegas |
Period | 7/24/23 → 7/27/23 |
Bibliographical note
Publisher Copyright: © 2023 IEEE.
Keywords
- Deep neural networks
- DNN accelerator
- IoT
- low-cost
- sparse data multiplication