Fundamental limits in structured principal component analysis and how to reach them.
Barbier, Jean; Camilli, Francesco; Mondelli, Marco; Sáenz, Manuel.
Affiliation
  • Barbier J; Quantitative Life Sciences and Mathematics Sections, International Centre for Theoretical Physics, Trieste 34151, Italy.
  • Camilli F; Quantitative Life Sciences and Mathematics Sections, International Centre for Theoretical Physics, Trieste 34151, Italy.
  • Mondelli M; Institute of Science and Technology Austria, Klosterneuburg 3400, Austria.
  • Sáenz M; Centro de Matemática, Universidad de La República, Montevideo 11400, Uruguay.
Proc Natl Acad Sci U S A ; 120(30): e2302028120, 2023 Jul 25.
Article in English | MEDLINE | ID: mdl-37463204
ABSTRACT
How do statistical dependencies in measurement noise influence high-dimensional inference? To answer this, we study the paradigmatic spiked matrix model of principal components analysis (PCA), where a rank-one matrix is corrupted by additive noise. We go beyond the usual independence assumption on the noise entries, by drawing the noise from a low-order polynomial orthogonal matrix ensemble. The resulting noise correlations make the setting relevant for applications but analytically challenging. We provide a characterization of the Bayes optimal limits of inference in this model. If the spike is rotation invariant, we show that standard spectral PCA is optimal. However, for more general priors, both PCA and the existing approximate message-passing algorithm (AMP) fall short of achieving the information-theoretic limits, which we compute using the replica method from statistical physics. We thus propose an AMP, inspired by the theory of adaptive Thouless-Anderson-Palmer equations, which is empirically observed to saturate the conjectured theoretical limit. This AMP comes with a rigorous state evolution analysis tracking its performance. Although we focus on specific noise distributions, our methodology can be generalized to a wide class of trace matrix ensembles at the cost of more involved expressions. Finally, despite the seemingly strong assumption of rotation-invariant noise, our theory empirically predicts algorithmic performance on real data, pointing at strong universality properties.
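The spiked matrix model and spectral PCA mentioned in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the paper: the noise here is plain symmetric Gaussian (independent entries), not the structured rotation-invariant ensemble the authors study; the spike has Rademacher (±1) entries; and the function names `spiked_matrix`, `pca_estimate`, `overlap` and the values `n=400`, `snr=3.0` are hypothetical choices for illustration. This is only the baseline PCA estimator, not the AMP algorithm proposed by the authors.

```python
import numpy as np

def spiked_matrix(n, snr, rng):
    """Rank-one spike plus symmetric Gaussian noise (illustrative assumption)."""
    # Assumed prior: Rademacher spike with entries in {-1, +1}.
    x = rng.choice([-1.0, 1.0], size=n)
    # Symmetric Gaussian noise with off-diagonal variance 1/n.
    g = rng.standard_normal((n, n))
    w = (g + g.T) / np.sqrt(2 * n)
    # Observation: scaled rank-one signal corrupted by additive noise.
    y = (snr / n) * np.outer(x, x) + w
    return x, y

def pca_estimate(y):
    """Spectral PCA: unit-norm eigenvector of the largest eigenvalue."""
    eigvals, eigvecs = np.linalg.eigh(y)  # eigenvalues in ascending order
    return eigvecs[:, -1]

def overlap(x, v):
    """Absolute cosine similarity between the spike and its estimate."""
    return abs(x @ v) / (np.linalg.norm(x) * np.linalg.norm(v))

rng = np.random.default_rng(0)
x, y = spiked_matrix(n=400, snr=3.0, rng=rng)
v = pca_estimate(y)
rho = overlap(x, v)
print(f"overlap between spike and top eigenvector: {rho:.3f}")
```

The overlap is computed up to sign because the eigenvector of a rank-one spike `x xᵀ` is only identified up to a global sign flip.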
Full text: 1 Collections: 01-international Database: MEDLINE Study type: Prognostic_studies Language: En Journal: Proc Natl Acad Sci U S A Publication year: 2023 Document type: Article Affiliation country: Italy