On the distribution of the largest eigenvalue in principal components analysis

Iain M. Johnstone

Type: Article

Publication Date: 2001-04-01

Citations: 1945

DOI: https://doi.org/10.1214/aos/1009210544

Abstract

Let x(1) denote the square of the largest singular value of an n × p matrix X, all of whose entries are independent standard Gaussian variates. Equivalently, x(1) is the largest principal component variance of the covariance matrix $X'X$, or the largest eigenvalue of a p-variate Wishart distribution on n degrees of freedom with identity covariance. Consider the limit of large p and n with $n/p = \gamma \ge 1$. When centered by $\mu_p = (\sqrt{n-1} + \sqrt{p})^2$ and scaled by $\sigma_p = (\sqrt{n-1} + \sqrt{p})(1/\sqrt{n-1} + 1/\sqrt{p})^{1/3}$, the distribution of x(1) approaches the Tracy-Widom law of order 1, which is defined in terms of the Painlevé II differential equation and can be numerically evaluated and tabulated in software. Simulations show the approximation to be informative for n and p as small as 5. The limit is derived via a corresponding result for complex Wishart matrices using methods from random matrix theory. The result suggests that some aspects of large p multivariate distribution theory may be easier to apply in practice than their fixed p counterparts.
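The centering and scaling described in the abstract can be tried numerically. The following is a minimal sketch, not the paper's own code: it assumes NumPy is available, and the function names `tw_center_scale` and `simulate_standardized_x1` are hypothetical, chosen only for illustration. It draws Gaussian matrices, takes the square of the largest singular value, and standardizes it with $\mu_p$ and $\sigma_p$.

```python
import numpy as np


def tw_center_scale(n, p):
    """Return (mu_p, sigma_p) as defined in the abstract; requires n >= 2, p >= 1."""
    a = np.sqrt(n - 1.0)
    b = np.sqrt(p)
    mu = (a + b) ** 2
    sigma = (a + b) * (1.0 / a + 1.0 / b) ** (1.0 / 3.0)
    return mu, sigma


def simulate_standardized_x1(n, p, reps=1000, seed=0):
    """Simulate (x(1) - mu_p) / sigma_p for n x p standard Gaussian matrices."""
    rng = np.random.default_rng(seed)
    mu, sigma = tw_center_scale(n, p)
    out = np.empty(reps)
    for i in range(reps):
        X = rng.standard_normal((n, p))
        # Singular values come back in descending order, so index 0 is the largest.
        s_max = np.linalg.svd(X, compute_uv=False)[0]
        out[i] = (s_max ** 2 - mu) / sigma
    return out


if __name__ == "__main__":
    z = simulate_standardized_x1(n=40, p=10, reps=1000, seed=1)
    print("sample mean:", z.mean())
    print("sample std: ", z.std(ddof=1))
```

If the approximation is working, the sample mean and standard deviation of the standardized values should sit near those of the Tracy-Widom law of order 1 (roughly -1.21 and 1.27), with some finite-sample discrepancy when n and p are small.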

Locations

The Annals of Statistics

Similar Works

Title	Year	Authors
On the largest eigenvalue of Wishart matrices with identity covariance when n, p and p/n tend to infinity	2003	Noureddine El Karoui
Accuracy of the Tracy-Widom limit for the largest eigenvalue in white Wishart matrices	2008	Zongming Ma
Asymptotic Distribution of the Smallest Eigenvalue of Wishart(N, n) When N, n → ∞ Such That N/n → 0	2011	Debashis Paul
Accuracy of the Tracy–Widom limits for the extreme eigenvalues in white Wishart matrices	2012	Zongming Ma
Number of Relevant Directions in Principal Component Analysis and Wishart Random Matrices	2012	Satya N. Majumdar Pierpaolo Vivo
Multivariate analysis and Jacobi ensembles: Largest eigenvalue, Tracy–Widom limits and rates of convergence	2008	Iain M. Johnstone
On the distribution of the ratio of the largest eigenvalue to the trace of a Wishart matrix	2010	Boaz Nadler
Selecting the number of principal components: estimation of the true rank of a noisy matrix	2014	Yunjin Choi Jonathan Taylor Robert Tibshirani
Selecting the number of principal components: estimation of the true rank of a noisy matrix	2014	Yun‐Jin Choi Jonathan Taylor Robert Tibshirani
Tracy-Widom limit for the largest eigenvalue of high-dimensional covariance matrices in elliptical distributions	2022	Wen Jun Xie Jiahui Long Yu Zhou Wang
The 70th anniversary of the distribution of random matrices: A survey	2002	Ingram Olkin
Principal Component Analysis of Large Dispersion Matrices	1991	C. R. Narayanaswamy D. Raghavarao
Number of relevant directions in Principal Component Analysis and Wishart random matrices	2011	Satya N. Majumdar Pierpaolo Vivo
Tracy–Widom limit for the largest eigenvalue of a large class of complex sample covariance matrices	2007	Noureddine El Karoui
Recent Results About the Largest Eigenvalue of Random Covariance Matrices and Statistical Application	2005	Noureddine El Karoui
Properties of the Extreme Points of the Joint Eigenvalue Probability Density Function of the Wishart Matrix	2021	Asaph Keikara Muhumuza Karl Lundengård Sergei Silvestrov John Magero Mango Godwin Kakuba
Convergence rate to the Tracy–Widom laws for the largest eigenvalue of sample covariance matrices	2023	Kevin Schnelli Yuanyuan Xu
On the behaviour of the smallest eigenvalue of a high-dimensional sample covariance matrix	2013	Pavel Yaskov
Exploring Multivariate Statistics: Unveiling the Power of Eigenvalues in Wishart Distribution Analysis	2024	Randa A. Makled Weihu Cheng

Works That Cite This (1541)

Title	Year	Authors
High Dimensional Correlation Matrices: The Central Limit Theorem and Its Applications	2016	Jiti Gao Xiao Han Guangming Pan Yanrong Yang
On the estimation of integrated covariance matrices of high dimensional diffusion processes	2011	Xinghua Zheng Yingying Li
Robust estimation of precision matrices under cellwise contamination	2015	Garth Tarr Samuel Müller N. C. Weber
Estimation of low-rank matrices via approximate message passing	2021	Andrea Montanari Ramji Venkataramanan
Supervised singular value decomposition and its asymptotic properties	2015	Gen Li Dan Yang Andrew B. Nobel Haipeng Shen
Poisson Statistics for the Largest Eigenvalues in Random Matrix Ensembles	2006	Alexander Soshnikov
Permutation methods for factor analysis and PCA	2020	Edgar Dobriban
Edge universality of correlation matrices	2012	Natesh S. Pillai Jun Yin
Central limit theorem for Hotelling’s $T^2$ statistic under large dimension	2011	Guangming Pan Zhou Wang
Extreme eigenvalues of large-dimensional spiked Fisher matrices with application	2017	Qinwen Wang Jianfeng Yao

Works Cited by This (42)

Title	Year	Authors
Asymptotic Forms for Laguerre Polynomials	1960	A. Erdélyi
Introduction to the Theory of Linear Nonselfadjoint Operators in Hilbert Space	1969	Israel Gohberg M. G. Krein
Painlevé transcendent evaluation of the scaled distribution of the smallest eigenvalue in the Laguerre orthogonal and symplectic ensembles	2000	Peter J. Forrester
Orthogonal Polynomials and Random Matrices: A Riemann-Hilbert Approach	2000	Percy Deift
The Distribution of the Largest Eigenvalue in the Gaussian Ensembles: β = 1, 2, 4	2000	Craig A. Tracy Harold Widom
On the distribution of the length of the longest increasing subsequence of random permutations	1999	Jinho Baik Percy Deift Kurt Johansson
On the relation between orthogonal, symplectic and unitary matrix ensembles	1998	Harold Widom
None	2002	Alexander Soshnikov
Integrable systems and combinatorial theory	2000	Percy Deift
Powers of the largest latent root test of Σ = I	1974	Robb J. Muirhead