The Protein-Coding Human Genome: Annotating High-Hanging Fruits.
Bioessays
; 41(11): e1900066, 2019 11.
Article
em En
| MEDLINE
| ID: mdl-31544971
The major transcript variants of human protein-coding genes are annotated to a certain degree of accuracy combining manual curation, transcript data, and proteomics evidence. However, there is considerable disagreement on the annotation of about 2000 genes-they can be protein-coding, noncoding, or pseudogenes-and on the annotation of most of the predicted alternative transcripts. Pure transcriptome mapping approaches seem to be limited in discriminating functional expression from noise. These limitations have partially been overcome by dedicated algorithms to detect alternative spliced micro-exons and wobble splice variants. Recently, knowledge about splice mechanism and protein structure are incorporated into an algorithm to predict neighboring homologous exons, often spliced in a mutually exclusive manner. Predicted exons are evaluated by transcript data, structural compatibility, and evolutionary conservation, revealing hundreds of novel coding exons and splice mechanism re-assignments. The emerging human pan-genome is necessitating distinctive annotations incorporating differences between individuals and between populations.
Palavras-chave
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Proteínas
/
Genoma Humano
Limite:
Animals
/
Humans
Idioma:
En
Revista:
Bioessays
Assunto da revista:
BIOLOGIA
/
BIOLOGIA MOLECULAR
Ano de publicação:
2019
Tipo de documento:
Article
País de afiliação:
Suíça