Your browser doesn't support javascript.
loading
Spectral neighbor joining for reconstruction of latent tree Models.
Jaffe, Ariel; Amsel, Noah; Aizenbud, Yariv; Nadler, Boaz; Chang, Joseph T; Kluger, Yuval.
Afiliação
  • Jaffe A; Program in Applied Mathematics, Yale University, New Haven, CT 06511.
  • Amsel N; Program in Applied Mathematics, Yale University, New Haven, CT 06511.
  • Aizenbud Y; Program in Applied Mathematics, Yale University, New Haven, CT 06511.
  • Nadler B; Department of Computer Science, Weizmann Institute of Science, Rehovot, 76100, Israel.
  • Chang JT; Department of Statistics, Yale University, New Haven, CT 06520, USA.
  • Kluger Y; Program in Applied Mathematics, Yale University, New Haven, CT 06511.
SIAM J Math Data Sci ; 3(1): 113-141, 2021.
Article em En | MEDLINE | ID: mdl-34124606
ABSTRACT
A common assumption in multiple scientific applications is that the distribution of observed data can be modeled by a latent tree graphical model. An important example is phylogenetics, where the tree models the evolutionary lineages of a set of observed organisms. Given a set of independent realizations of the random variables at the leaves of the tree, a key challenge is to infer the underlying tree topology. In this work we develop Spectral Neighbor Joining (SNJ), a novel method to recover the structure of latent tree graphical models. Given a matrix that contains a measure of similarity between all pairs of observed variables, SNJ computes a spectral measure of cohesion between groups of observed variables. We prove that SNJ is consistent, and derive a sufficient condition for correct tree recovery from an estimated similarity matrix. Combining this condition with a concentration of measure result on the similarity matrix, we bound the number of samples required to recover the tree with high probability. We illustrate via extensive simulations that in comparison to several other reconstruction methods, SNJ requires fewer samples to accurately recover trees with a large number of leaves or long edges.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2021 Tipo de documento: Article