On the sample complexity of robust PCA

Matthew Coudron; Gilad Lerman

Mathematics

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

9 Scopus citations

Abstract

We estimate the rate of convergence and sample complexity of a recent robust estimator for a generalized version of the inverse covariance matrix. This estimator is used in a convex algorithm for robust subspace recovery (i.e., robust PCA). Our model assumes a sub-Gaussian underlying distribution and an i.i.d. sample from it. Our main result shows with high probability that the norm of the difference between the generalized inverse covariance of the underlying distribution and its estimator from an i.i.d. sample of size N is of order O(N^{-0.5+ε}) for arbitrarily small ε > 0 (affecting the probabilistic estimate); this rate of convergence is close to the one of direct covariance estimation, i.e., O(N^{-0.5}). Our precise probabilistic estimate implies for some natural settings that the sample complexity of the generalized inverse covariance estimation when using the Frobenius norm is O(D^{2+δ}) for arbitrarily small δ > 0 (whereas the sample complexity of direct covariance estimation with Frobenius norm is O(D^2)). These results provide similar rates of convergence and sample complexity for the corresponding robust subspace recovery algorithm. To the best of our knowledge, this is the only work analyzing the sample complexity of any robust PCA algorithm.
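As a rough illustration of the baseline rate quoted in the abstract, the sketch below checks empirically that the Frobenius-norm error of the ordinary sample covariance shrinks like O(N^{-0.5}). It is not the robust estimator analyzed in the paper. The setup is an assumption chosen for simplicity: standard Gaussian data with identity covariance, a known zero mean, dimension D = 3, and hypothetical helper names.

```python
import math
import random


def sample_covariance(samples):
    """Sample covariance of data whose mean is assumed known to be zero."""
    n = len(samples)
    d = len(samples[0])
    cov = [[0.0] * d for _ in range(d)]
    for x in samples:
        for i in range(d):
            for j in range(d):
                cov[i][j] += x[i] * x[j]
    return [[c / n for c in row] for row in cov]


def frobenius_error_to_identity(cov):
    """Frobenius norm of (cov - I), where I is the assumed true covariance."""
    d = len(cov)
    return math.sqrt(sum((cov[i][j] - (1.0 if i == j else 0.0)) ** 2
                         for i in range(d) for j in range(d)))


def mean_error(n, d, trials, rng):
    """Average the estimation error over several independent samples of size n."""
    total = 0.0
    for _ in range(trials):
        samples = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
        total += frobenius_error_to_identity(sample_covariance(samples))
    return total / trials


rng = random.Random(0)  # fixed seed so the run is reproducible
D = 3                   # illustrative dimension (assumption)
errors = {n: mean_error(n, D, trials=20, rng=rng) for n in (100, 400, 1600)}

# If the error decays like N^{-0.5}, then error * sqrt(N) should stay roughly flat.
for n, err in errors.items():
    print(n, round(err, 4), round(err * math.sqrt(n), 3))
```

Quadrupling N should roughly halve the error, so the last printed column should stay near a constant. The paper's result says the robust estimator attains nearly the same exponent, at the cost of an arbitrarily small ε.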

Original language	English (US)
Title of host publication	Advances in Neural Information Processing Systems 25
Subtitle of host publication	26th Annual Conference on Neural Information Processing Systems 2012, NIPS 2012
Pages	3221-3229
Number of pages	9
State	Published - Dec 1 2012
Event	26th Annual Conference on Neural Information Processing Systems 2012, NIPS 2012 - Lake Tahoe, NV, United States Duration: Dec 3 2012 → Dec 6 2012

Publication series

Name	Advances in Neural Information Processing Systems
Volume	4
ISSN (Print)	1049-5258

Other

Other	26th Annual Conference on Neural Information Processing Systems 2012, NIPS 2012
Country/Territory	United States
City	Lake Tahoe, NV
Period	12/3/12 → 12/6/12

Cite this

On the sample complexity of robust PCA. / Coudron, Matthew; Lerman, Gilad.
Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012, NIPS 2012. 2012. p. 3221-3229 (Advances in Neural Information Processing Systems; Vol. 4).

Coudron, M & Lerman, G 2012, On the sample complexity of robust PCA. in Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012, NIPS 2012. Advances in Neural Information Processing Systems, vol. 4, pp. 3221-3229, 26th Annual Conference on Neural Information Processing Systems 2012, NIPS 2012, Lake Tahoe, NV, United States, 12/3/12.

@inproceedings{977bad1aab2342dba927453ede41d7a5,

title = "On the sample complexity of robust PCA",

abstract = "We estimate the rate of convergence and sample complexity of a recent robust estimator for a generalized version of the inverse covariance matrix. This estimator is used in a convex algorithm for robust subspace recovery (i.e., robust PCA). Our model assumes a sub-Gaussian underlying distribution and an i.i.d. sample from it. Our main result shows with high probability that the norm of the difference between the generalized inverse covariance of the underlying distribution and its estimator from an i.i.d. sample of size $N$ is of order $O(N^{-0.5+\epsilon})$ for arbitrarily small $\epsilon > 0$ (affecting the probabilistic estimate); this rate of convergence is close to the one of direct covariance estimation, i.e., $O(N^{-0.5})$. Our precise probabilistic estimate implies for some natural settings that the sample complexity of the generalized inverse covariance estimation when using the Frobenius norm is $O(D^{2+\delta})$ for arbitrarily small $\delta > 0$ (whereas the sample complexity of direct covariance estimation with Frobenius norm is $O(D^2)$). These results provide similar rates of convergence and sample complexity for the corresponding robust subspace recovery algorithm. To the best of our knowledge, this is the only work analyzing the sample complexity of any robust PCA algorithm.",

author = "Matthew Coudron and Gilad Lerman",

year = "2012",

month = dec,

day = "1",

language = "English (US)",

isbn = "9781627480031",

series = "Advances in Neural Information Processing Systems",

pages = "3221--3229",

booktitle = "Advances in Neural Information Processing Systems 25",

note = "26th Annual Conference on Neural Information Processing Systems 2012, NIPS 2012 ; Conference date: 03-12-2012 Through 06-12-2012",

}

TY - GEN

T1 - On the sample complexity of robust PCA

AU - Coudron, Matthew

AU - Lerman, Gilad

PY - 2012/12/1

Y1 - 2012/12/1

AB - We estimate the rate of convergence and sample complexity of a recent robust estimator for a generalized version of the inverse covariance matrix. This estimator is used in a convex algorithm for robust subspace recovery (i.e., robust PCA). Our model assumes a sub-Gaussian underlying distribution and an i.i.d. sample from it. Our main result shows with high probability that the norm of the difference between the generalized inverse covariance of the underlying distribution and its estimator from an i.i.d. sample of size N is of order O(N^{-0.5+ε}) for arbitrarily small ε > 0 (affecting the probabilistic estimate); this rate of convergence is close to the one of direct covariance estimation, i.e., O(N^{-0.5}). Our precise probabilistic estimate implies for some natural settings that the sample complexity of the generalized inverse covariance estimation when using the Frobenius norm is O(D^{2+δ}) for arbitrarily small δ > 0 (whereas the sample complexity of direct covariance estimation with Frobenius norm is O(D^2)). These results provide similar rates of convergence and sample complexity for the corresponding robust subspace recovery algorithm. To the best of our knowledge, this is the only work analyzing the sample complexity of any robust PCA algorithm.

UR - http://www.scopus.com/inward/record.url?scp=84877726395&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84877726395&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84877726395

SN - 9781627480031

T3 - Advances in Neural Information Processing Systems

SP - 3221

EP - 3229

BT - Advances in Neural Information Processing Systems 25

T2 - 26th Annual Conference on Neural Information Processing Systems 2012, NIPS 2012

Y2 - 3 December 2012 through 6 December 2012

ER -
