Exploiting public databases of genomic variation to quantify evolutionary constraint on the branch point sequence in 30 plant and animal species.
Nucleic Acids Res
; 51(22): 12069-12075, 2023 Dec 11.
Article
em En
| MEDLINE
| ID: mdl-37953306
ABSTRACT
The branch point sequence is a degenerate intronic heptamer required for the assembly of the spliceosome during pre-mRNA splicing. Disruption of this motif may promote alternative splicing and eventually cause phenotype variation. Despite its functional relevance, the branch point sequence is not included in most genome annotations. Here, we predict branch point sequences in 30 plant and animal species and attempt to quantify their evolutionary constraints using public variant databases. We find an implausible variant distribution in the databases from 16 of 30 examined species. Comparative analysis of variants from whole-genome sequencing shows that variants submitted from exome sequencing or false positive variants are widespread in public databases and cause these irregularities. We then investigate evolutionary constraint with largely unbiased public variant databases in 14 species and find that the fourth and sixth position of the branch point sequence are more constrained than coding nucleotides. Our findings show that public variant databases should be scrutinized for possible biases before they qualify to analyze evolutionary constraint.
Texto completo:
1
Bases de dados:
MEDLINE
Assunto principal:
Plantas
/
Splicing de RNA
/
Evolução Biológica
Limite:
Animals
Idioma:
En
Revista:
Nucleic Acids Res
Ano de publicação:
2023
Tipo de documento:
Article
País de afiliação:
Suíça