Your browser doesn't support javascript.
loading
Computationally Efficient Demographic History Inference from Allele Frequencies with Supervised Machine Learning.
Tran, Linh N; Sun, Connie K; Struck, Travis J; Sajan, Mathews; Gutenkunst, Ryan N.
Afiliação
  • Tran LN; Genetics Graduate Interdisciplinary Program, University of Arizona, Tucson, AZ 85721, USA.
  • Sun CK; Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA.
  • Struck TJ; Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA.
  • Sajan M; Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA.
  • Gutenkunst RN; Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA.
Mol Biol Evol ; 41(5)2024 May 03.
Article em En | MEDLINE | ID: mdl-38636507
ABSTRACT
Inferring past demographic history of natural populations from genomic data is of central concern in many studies across research fields. Previously, our group had developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite-likelihood optimization. However, dadi's optimization procedure can be computationally expensive. Here, we present donni (demography optimization via neural network inference), a new inference method based on dadi that is more efficient while maintaining comparable inference accuracy. For each dadi-supported demographic model, donni simulates the expected AFS for a range of model parameters then trains a set of Mean Variance Estimation neural networks using the simulated AFS. Trained networks can then be used to instantaneously infer the model parameters from future genomic data summarized by an AFS. We demonstrate that for many demographic models, donni can infer some parameters, such as population size changes, very well and other parameters, such as migration rates and times of demographic events, fairly well. Importantly, donni provides both parameter and confidence interval estimates from input AFS with accuracy comparable to parameters inferred by dadi's likelihood optimization while bypassing its long and computationally intensive evaluation process. donni's performance demonstrates that supervised machine learning algorithms may be a promising avenue for developing more sustainable and computationally efficient demographic history inference methods.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article