Tensor Decomposition for Model Reduction in Neural Networks: A Review [Feature]

Xingyi Liu, Keshab K. Parhi

Research output: Contribution to journal › Article › peer-review


Abstract

Modern neural networks have revolutionized the fields of computer vision (CV) and natural language processing (NLP). They are widely used for solving complex CV and NLP tasks such as image classification, image generation, and machine translation. Most state-of-the-art neural networks are over-parameterized and incur high computational costs. One straightforward way to reduce these costs is to replace the layers of a network with low-rank tensor approximations obtained from different tensor decomposition methods. This article reviews six tensor decomposition methods and illustrates their ability to compress the model parameters of convolutional neural networks (CNNs), recurrent neural networks (RNNs), and Transformers. The accuracy of some compressed models can even exceed that of the original versions. Evaluations indicate that tensor decompositions can achieve significant reductions in model size, run time, and energy consumption, and are well suited for implementing neural networks on edge devices.
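To make the layer-replacement idea concrete, the following is a minimal sketch (not taken from the article) of one of the reviewed techniques, canonical polyadic (CP) decomposition, applied to a convolutional layer's 4-D weight tensor using the TensorLy library. The layer dimensions and the rank R are illustrative assumptions chosen only to show the parameter-count reduction.

```python
# Minimal sketch: approximate a conv kernel with a rank-R CP decomposition
# and compare parameter counts. Layer sizes and rank are assumed, not from the paper.
import numpy as np
import tensorly as tl
from tensorly.decomposition import parafac

# Hypothetical conv kernel: (out_channels, in_channels, kernel_h, kernel_w)
W = np.random.randn(64, 32, 3, 3)
R = 16  # assumed CP rank; controls the compression/accuracy trade-off

# CP decomposition returns one factor matrix per tensor mode, each of shape (dim, R)
weights, factors = parafac(tl.tensor(W), rank=R, init="random")

orig_params = W.size                              # 64 * 32 * 3 * 3 = 18432
cp_params = sum(f.shape[0] * R for f in factors)  # (64 + 32 + 3 + 3) * 16 = 1632
print(f"original: {orig_params} params, CP rank-{R}: {cp_params} params")
```

In practice, as the article discusses, the factor matrices are used to replace the original convolution with a sequence of smaller convolutions rather than reconstructing the full kernel, which is where the run-time and energy savings come from.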

Original language: English (US)
Pages (from-to): 8-28
Number of pages: 21
Journal: IEEE Circuits and Systems Magazine
Volume: 23
Issue number: 2
DOIs
State: Published - 2023
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2001-2012 IEEE.

Keywords

  • Tensor decomposition
  • Tucker decomposition
  • block-term decomposition
  • canonical polyadic decomposition
  • convolutional neural network acceleration
  • hierarchical Tucker decomposition
  • model compression
  • recurrent neural network acceleration
  • tensor ring decomposition
  • tensor train decomposition
  • transformer acceleration

