Your browser doesn't support javascript.
loading
Evolutionary Shortcuts via Multinucleotide Substitutions and Their Impact on Natural Selection Analyses.
Lucaci, Alexander G; Zehr, Jordan D; Enard, David; Thornton, Joseph W; Kosakovsky Pond, Sergei L.
Afiliación
  • Lucaci AG; Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA.
  • Zehr JD; Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA.
  • Enard D; Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona.
  • Thornton JW; Department of Human Genetics, University of Chicago, Chicago, Illinois.
  • Kosakovsky Pond SL; Department of Ecology & Evolution, University of Chicago, Chicago, Illinois.
Mol Biol Evol ; 40(7)2023 07 05.
Article en En | MEDLINE | ID: mdl-37395787
Inference and interpretation of evolutionary processes, in particular of the types and targets of natural selection affecting coding sequences, are critically influenced by the assumptions built into statistical models and tests. If certain aspects of the substitution process (even when they are not of direct interest) are presumed absent or are modeled with too crude of a simplification, estimates of key model parameters can become biased, often systematically, and lead to poor statistical performance. Previous work established that failing to accommodate multinucleotide (or multihit, MH) substitutions strongly biases dN/dS-based inference towards false-positive inferences of diversifying episodic selection, as does failing to model variation in the rate of synonymous substitution (SRV) among sites. Here, we develop an integrated analytical framework and software tools to simultaneously incorporate these sources of evolutionary complexity into selection analyses. We found that both MH and SRV are ubiquitous in empirical alignments, and incorporating them has a strong effect on whether or not positive selection is detected (1.4-fold reduction) and on the distributions of inferred evolutionary rates. With simulation studies, we show that this effect is not attributable to reduced statistical power caused by using a more complex model. After a detailed examination of 21 benchmark alignments and a new high-resolution analysis showing which parts of the alignment provide support for positive selection, we show that MH substitutions occurring along shorter branches in the tree explain a significant fraction of discrepant results in selection detection. Our results add to the growing body of literature which examines decades-old modeling assumptions (including MH) and finds them to be problematic for comparative genomic data analysis. Because multinucleotide substitutions have a significant impact on natural selection detection even at the level of an entire gene, we recommend that selection analyses of this type consider their inclusion as a matter of routine. To facilitate this procedure, we developed, implemented, and benchmarked a simple and well-performing model testing selection detection framework able to screen an alignment for positive selection with two biologically important confounding processes: site-to-site synonymous rate variation, and multinucleotide instantaneous substitutions.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Evolución Molecular / Modelos Genéticos Tipo de estudio: Risk_factors_studies Límite: Animals / Humans Idioma: En Revista: Mol Biol Evol Asunto de la revista: BIOLOGIA MOLECULAR Año: 2023 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Evolución Molecular / Modelos Genéticos Tipo de estudio: Risk_factors_studies Límite: Animals / Humans Idioma: En Revista: Mol Biol Evol Asunto de la revista: BIOLOGIA MOLECULAR Año: 2023 Tipo del documento: Article País de afiliación: Estados Unidos