Your browser doesn't support javascript.
loading
Identification of Novel Bacterial Microproteins Encoded by Small Open Reading Frames Using a Computational Proteogenomics Workflow.
de Souza, Eduardo Vieira; Bizarro, Cristiano Valim.
Afiliação
  • de Souza EV; Centro de Pesquisas em Biologia Molecular e Funcional (CPBMF) and Instituto Nacional de Ciência e Tecnologia em Tuberculose (INCT-TB), Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, Brazil.
  • Bizarro CV; Programa de Pós-Graduação em Biologia Celular e Molecular, Pontifícia Universidade Católica do Rio Grande do Sul, Porto Alegre, Rio Grande do Sul, Brazil.
Methods Mol Biol ; 2836: 19-34, 2024.
Article em En | MEDLINE | ID: mdl-38995533
ABSTRACT
Genome annotation has historically ignored small open reading frames (smORFs), which encode a class of proteins shorter than 100 amino acids, collectively referred to as microproteins. This cutoff was established to avoid thousands of false positives due to limitations of pure genomics pipelines. Proteogenomics, a computational approach that combines genomics, transcriptomics, and proteomics, makes it possible to accurately identify these short sequences by overlaying different levels of omics evidence. In this chapter, we showcase the use of µProteInS, a bioinformatics pipeline developed for the identification of unannotated microproteins encoded by smORFs in bacteria. The workflow covers all the steps from quality control and transcriptome assembly to the scoring and post-processing of mass spectrometry data. Additionally, we provide an example on how to apply the pipeline's machine learning method to identify high-confidence spectra and pinpoint the most reliable identifications from large datasets.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Proteínas de Bactérias / Fases de Leitura Aberta / Biologia Computacional / Fluxo de Trabalho / Proteogenômica Idioma: En Revista: Methods Mol Biol Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Proteínas de Bactérias / Fases de Leitura Aberta / Biologia Computacional / Fluxo de Trabalho / Proteogenômica Idioma: En Revista: Methods Mol Biol Ano de publicação: 2024 Tipo de documento: Article