Analysis of selection in protein-coding sequences accounting for common biases.
Brief Bioinform
; 22(5)2021 09 02.
Article
em En
| MEDLINE
| ID: mdl-33479739
The evolution of protein-coding genes is usually driven by selective processes, which favor some evolutionary trajectories over others, optimizing the subsequent protein stability and activity. The analysis of selection in this type of genetic data is broadly performed with the metric nonsynonymous/synonymous substitution rate ratio (dN/dS). However, most of the well-established methodologies to estimate this metric make crucial assumptions, such as lack of recombination or invariable codon frequencies along genes, which can bias the estimation. Here, we review the most relevant biases in the dN/dS estimation and provide a detailed guide to estimate this metric using state-of-the-art procedures that account for such biases, along with illustrative practical examples and recommendations. We also discuss the traditional interpretation of the estimated dN/dS emphasizing the importance of considering complementary biological information such as the role of the observed substitutions on the stability and function of proteins. This review is oriented to help evolutionary biologists that aim to accurately estimate selection in protein-coding sequences.
Palavras-chave
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Seleção Genética
/
Proteínas
/
Fases de Leitura Aberta
/
Evolução Molecular
/
Mutação de Sentido Incorreto
/
Modelos Genéticos
Tipo de estudo:
Guideline
Idioma:
En
Ano de publicação:
2021
Tipo de documento:
Article