Exactly Uncorrelated Sparse Principal Component Analysis

Oh Ran Kwon; Zhaosong Lu; Hui Zou

doi:10.1080/10618600.2023.2232843

Exactly Uncorrelated Sparse Principal Component Analysis

Oh Ran Kwon, Zhaosong Lu, Hui Zou

Statistics (Twin Cities)

Research output: Contribution to journal › Article › peer-review

Abstract

Sparse principal component analysis (PCA) aims to find principal components as linear combinations of a subset of the original input variables without sacrificing the fidelity of the classical PCA. Most existing sparse PCA methods produce correlated sparse principal components. We argue that many applications of PCA prefer uncorrelated principal components. However, handling sparsity and uncorrelatedness properties in a sparse PCA method is nontrivial. This article proposes an exactly uncorrelated sparse PCA method named EUSPCA, whose formulation is motivated by original views and motivations of PCA as advocated by Pearson and Hotelling. EUSPCA is a non-smooth constrained non-convex manifold optimization problem. We solve it by combining augmented Lagrangian and non-monotone proximal gradient methods. We observe that EUSPCA produces uncorrelated components and maintains a similar or better level of fidelity based on adjusted total variance through simulated and real data examples. In contrast, existing sparse PCA methods produce significantly correlated components. Supplemental materials for this article are available online.

Original language	English (US)
Pages (from-to)	231-241
Number of pages	11
Journal	Journal of Computational and Graphical Statistics
Volume	33
Issue number	1
DOIs	https://doi.org/10.1080/10618600.2023.2232843
State	Published - 2024

Bibliographical note

Publisher Copyright:
© 2023 American Statistical Association and Institute of Mathematical Statistics.

Keywords

Augmented Lagrangian method
Manifold optimization
Non-monotone proximal gradient method
Principal component analysis
Sparse principal component
Uncorrelated component

Access

10.1080/10618600.2023.2232843

OpenUrl availability

Full text

Cite this

@article{2b7f57bc327d49aa8923eb50dbb29b67,

title = "Exactly Uncorrelated Sparse Principal Component Analysis",

abstract = "Sparse principal component analysis (PCA) aims to find principal components as linear combinations of a subset of the original input variables without sacrificing the fidelity of the classical PCA. Most existing sparse PCA methods produce correlated sparse principal components. We argue that many applications of PCA prefer uncorrelated principal components. However, handling sparsity and uncorrelatedness properties in a sparse PCA method is nontrivial. This article proposes an exactly uncorrelated sparse PCA method named EUSPCA, whose formulation is motivated by original views and motivations of PCA as advocated by Pearson and Hotelling. EUSPCA is a non-smooth constrained non-convex manifold optimization problem. We solve it by combining augmented Lagrangian and non-monotone proximal gradient methods. We observe that EUSPCA produces uncorrelated components and maintains a similar or better level of fidelity based on adjusted total variance through simulated and real data examples. In contrast, existing sparse PCA methods produce significantly correlated components. Supplemental materials for this article are available online.",

keywords = "Augmented Lagrangian method, Manifold optimization, Non-monotone proximal gradient method, Principal component analysis, Sparse principal component, Uncorrelated component",

author = "Kwon, {Oh Ran} and Zhaosong Lu and Hui Zou",

note = "Publisher Copyright: {\textcopyright} 2023 American Statistical Association and Institute of Mathematical Statistics.",

year = "2024",

doi = "10.1080/10618600.2023.2232843",

language = "English (US)",

volume = "33",

pages = "231--241",

journal = "Journal of Computational and Graphical Statistics",

issn = "1061-8600",

publisher = "American Statistical Association",

number = "1",

}

TY - JOUR

T1 - Exactly Uncorrelated Sparse Principal Component Analysis

AU - Kwon, Oh Ran

AU - Lu, Zhaosong

AU - Zou, Hui

PY - 2024

Y1 - 2024

N2 - Sparse principal component analysis (PCA) aims to find principal components as linear combinations of a subset of the original input variables without sacrificing the fidelity of the classical PCA. Most existing sparse PCA methods produce correlated sparse principal components. We argue that many applications of PCA prefer uncorrelated principal components. However, handling sparsity and uncorrelatedness properties in a sparse PCA method is nontrivial. This article proposes an exactly uncorrelated sparse PCA method named EUSPCA, whose formulation is motivated by original views and motivations of PCA as advocated by Pearson and Hotelling. EUSPCA is a non-smooth constrained non-convex manifold optimization problem. We solve it by combining augmented Lagrangian and non-monotone proximal gradient methods. We observe that EUSPCA produces uncorrelated components and maintains a similar or better level of fidelity based on adjusted total variance through simulated and real data examples. In contrast, existing sparse PCA methods produce significantly correlated components. Supplemental materials for this article are available online.

AB - Sparse principal component analysis (PCA) aims to find principal components as linear combinations of a subset of the original input variables without sacrificing the fidelity of the classical PCA. Most existing sparse PCA methods produce correlated sparse principal components. We argue that many applications of PCA prefer uncorrelated principal components. However, handling sparsity and uncorrelatedness properties in a sparse PCA method is nontrivial. This article proposes an exactly uncorrelated sparse PCA method named EUSPCA, whose formulation is motivated by original views and motivations of PCA as advocated by Pearson and Hotelling. EUSPCA is a non-smooth constrained non-convex manifold optimization problem. We solve it by combining augmented Lagrangian and non-monotone proximal gradient methods. We observe that EUSPCA produces uncorrelated components and maintains a similar or better level of fidelity based on adjusted total variance through simulated and real data examples. In contrast, existing sparse PCA methods produce significantly correlated components. Supplemental materials for this article are available online.

KW - Augmented Lagrangian method

KW - Manifold optimization

KW - Non-monotone proximal gradient method

KW - Principal component analysis

KW - Sparse principal component

KW - Uncorrelated component

UR - http://www.scopus.com/inward/record.url?scp=85169676925&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85169676925&partnerID=8YFLogxK

U2 - 10.1080/10618600.2023.2232843

DO - 10.1080/10618600.2023.2232843

M3 - Article

AN - SCOPUS:85169676925

SN - 1061-8600

VL - 33

SP - 231

EP - 241

JO - Journal of Computational and Graphical Statistics

JF - Journal of Computational and Graphical Statistics

IS - 1

ER -

Exactly Uncorrelated Sparse Principal Component Analysis

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this