Your browser doesn't support javascript.
loading
Excluding Loci With Substitution Saturation Improves Inferences From Phylogenomic Data.
Duchêne, David A; Mather, Niklas; Van Der Wal, Cara; Ho, Simon Y W.
Afiliación
  • Duchêne DA; Centre for Evolutionary Hologenomics, University of Copenhagen, Øster Farimagsgade 5A, 1352 Copenhagen, Denmark.
  • Mather N; School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia.
  • Van Der Wal C; School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia.
  • Ho SYW; School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia.
Syst Biol ; 71(3): 676-689, 2022 04 19.
Article en En | MEDLINE | ID: mdl-34508605
ABSTRACT
The historical signal in nucleotide sequences becomes eroded over time by substitutions occurring repeatedly at the same sites. This phenomenon, known as substitution saturation, is recognized as one of the primary obstacles to deep-time phylogenetic inference using genome-scale data sets. We present a new test of substitution saturation and demonstrate its performance in simulated and empirical data. For some of the 36 empirical phylogenomic data sets that we examined, we detect substitution saturation in around 50% of loci. We found that saturation tends to be flagged as problematic in loci with highly discordant phylogenetic signals across sites. Within each data set, the loci with smaller numbers of informative sites are more likely to be flagged as containing problematic levels of saturation. The entropy saturation test proposed here is sensitive to high evolutionary rates relative to the evolutionary timeframe, while also being sensitive to several factors known to mislead phylogenetic inference, including short internal branches relative to external branches, short nucleotide sequences, and tree imbalance. Our study demonstrates that excluding loci with substitution saturation can be an effective means of mitigating the negative impact of multiple substitutions on phylogenetic inferences. [Phylogenetic model performance; phylogenomics; substitution model; substitution saturation; test statistics.].
Asunto(s)

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Genoma / Evolución Biológica Tipo de estudio: Prognostic_studies Idioma: En Revista: Syst Biol Asunto de la revista: BIOLOGIA Año: 2022 Tipo del documento: Article País de afiliación: Dinamarca

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Genoma / Evolución Biológica Tipo de estudio: Prognostic_studies Idioma: En Revista: Syst Biol Asunto de la revista: BIOLOGIA Año: 2022 Tipo del documento: Article País de afiliación: Dinamarca