Búsqueda | Portal de Búsqueda de la BVS

Spectral Prediction Features as a Solution for the Search Space Size Problem in Proteogenomics.

Verbruggen, Steven; Gessulat, Siegfried; Gabriels, Ralf; Matsaroki, Anna; Van de Voorde, Hendrik; Kuster, Bernhard; Degroeve, Sven; Martens, Lennart; Van Criekinge, Wim; Wilhelm, Mathias; Menschaert, Gerben.

Mol Cell Proteomics ; 20: 100076, 2021.

Artículo en Inglés | MEDLINE | ID: mdl-33823297

RESUMEN

Proteogenomics approaches often struggle with the distinction between true and false peptide-to-spectrum matches as the database size enlarges. However, features extracted from tandem mass spectrometry intensity predictors can enhance the peptide identification rate and can provide extra confidence for peptide-to-spectrum matching in a proteogenomics context. To that end, features from the spectral intensity pattern predictors MS2PIP and Prosit were combined with the canonical scores from MaxQuant in the Percolator postprocessing tool for protein sequence databases constructed out of ribosome profiling and nanopore RNA-Seq analyses. The presented results provide evidence that this approach enhances both the identification rate as well as the validation stringency in a proteogenomic setting.

Asunto(s)

Proteogenómica/métodos , Bases de Datos de Proteínas , Células HCT116 , Humanos , Aprendizaje Automático , RNA-Seq , Ribosomas

Discovery of noncanonical translation initiation sites through mass spectrometric analysis of protein N termini.

Na, Chan Hyun; Barbhuiya, Mustafa A; Kim, Min-Sik; Verbruggen, Steven; Eacker, Stephen M; Pletnikova, Olga; Troncoso, Juan C; Halushka, Marc K; Menschaert, Gerben; Overall, Christopher M; Pandey, Akhilesh.

Genome Res ; 28(1): 25-36, 2018 01.

Artículo en Inglés | MEDLINE | ID: mdl-29162641

RESUMEN

Translation initiation generally occurs at AUG codons in eukaryotes, although it has been shown that non-AUG or noncanonical translation initiation can also occur. However, the evidence for noncanonical translation initiation sites (TISs) is largely indirect and based on ribosome profiling (Ribo-seq) studies. Here, using a strategy specifically designed to enrich N termini of proteins, we demonstrate that many human proteins are translated at noncanonical TISs. The large majority of TISs that mapped to 5' untranslated regions were noncanonical and led to N-terminal extension of annotated proteins or translation of upstream small open reading frames (uORF). It has been controversial whether the amino acid corresponding to the start codon is incorporated at the TIS or methionine is still incorporated. We found that methionine was incorporated at almost all noncanonical TISs identified in this study. Comparison of the TISs determined through mass spectrometry with ribosome profiling data revealed that about two-thirds of the novel annotations were indeed supported by the available ribosome profiling data. Sequence conservation across species and a higher abundance of noncanonical TISs than canonical ones in some cases suggests that the noncanonical TISs can have biological functions. Overall, this study provides evidence of protein translation initiation at noncanonical TISs and argues that further studies are required for elucidation of functional implications of such noncanonical translation initiation.

Asunto(s)

Regiones no Traducidas 5' , Espectrometría de Masas , Sistemas de Lectura Abierta , Iniciación de la Cadena Peptídica Traduccional , Ribosomas/metabolismo , Células HEK293 , Células Endoteliales de la Vena Umbilical Humana/metabolismo , Humanos , Dominios Proteicos , Ribosomas/genética

PROTEOFORMER 2.0: Further Developments in the Ribosome Profiling-assisted Proteogenomic Hunt for New Proteoforms.

Verbruggen, Steven; Ndah, Elvis; Van Criekinge, Wim; Gessulat, Siegfried; Kuster, Bernhard; Wilhelm, Mathias; Van Damme, Petra; Menschaert, Gerben.

Mol Cell Proteomics ; 18(8 suppl 1): S126-S140, 2019 08 09.

Artículo en Inglés | MEDLINE | ID: mdl-31040227

RESUMEN

PROTEOFORMER is a pipeline that enables the automated processing of data derived from ribosome profiling (RIBO-seq, i.e. the sequencing of ribosome-protected mRNA fragments). As such, genome-wide ribosome occupancies lead to the delineation of data-specific translation product candidates and these can improve the mass spectrometry-based identification. Since its first publication, different upgrades, new features and extensions have been added to the PROTEOFORMER pipeline. Some of the most important upgrades include P-site offset calculation during mapping, comprehensive data pre-exploration, the introduction of two alternative proteoform calling strategies and extended pipeline output features. These novelties are illustrated by analyzing ribosome profiling data of human HCT116 and Jurkat data. The different proteoform calling strategies are used alongside one another and in the end combined together with reference sequences from UniProt. Matching mass spectrometry data are searched against this extended search space with MaxQuant. Overall, besides annotated proteoforms, this pipeline leads to the identification and validation of different categories of new proteoforms, including translation products of up- and downstream open reading frames, 5' and 3' extended and truncated proteoforms, single amino acid variants, splice variants and translation products of so-called noncoding regions. Further, proof-of-concept is reported for the improvement of spectrum matching by including Prosit, a deep neural network strategy that adds extra fragmentation spectrum intensity features to the analysis. In the light of ribosome profiling-driven proteogenomics, it is shown that this allows validating the spectrum matches of newly identified proteoforms with elevated stringency. These updates and novel conclusions provide new insights and lessons for the ribosome profiling-based proteogenomic research field. More practical information on the pipeline, raw code, the user manual (README) and explanations on the different modes of availability can be found at the GitHub repository of PROTEOFORMER: https://github.com/Biobix/proteoformer.

Asunto(s)

Proteogenómica/métodos , Ribosomas/metabolismo , Cromatografía Liquida , Células HCT116 , Humanos , Células Jurkat , Espectrometría de Masas en Tándem

eIF1 modulates the recognition of suboptimal translation initiation sites and steers gene expression via uORFs.

Fijalkowska, Daria; Verbruggen, Steven; Ndah, Elvis; Jonckheere, Veronique; Menschaert, Gerben; Van Damme, Petra.

Nucleic Acids Res ; 45(13): 7997-8013, 2017 Jul 27.

Artículo en Inglés | MEDLINE | ID: mdl-28541577

RESUMEN

Alternative translation initiation mechanisms such as leaky scanning and reinitiation potentiate the polycistronic nature of human transcripts. By allowing for reprogrammed translation, these mechanisms can mediate biological responses to stimuli. We combined proteomics with ribosome profiling and mRNA sequencing to identify the biological targets of translation control triggered by the eukaryotic translation initiation factor 1 (eIF1), a protein implicated in the stringency of start codon selection. We quantified expression changes of over 4000 proteins and 10 000 actively translated transcripts, leading to the identification of 245 transcripts undergoing translational control mediated by upstream open reading frames (uORFs) upon eIF1 deprivation. Here, the stringency of start codon selection and preference for an optimal nucleotide context were largely diminished leading to translational upregulation of uORFs with suboptimal start. Interestingly, genes affected by eIF1 deprivation were implicated in energy production and sensing of metabolic stress.

Asunto(s)

Factores Eucarióticos de Iniciación/metabolismo , Proteínas de Neoplasias/metabolismo , Proteínas del Tejido Nervioso/metabolismo , Iniciación de la Cadena Peptídica Traduccional , Línea Celular , Codón Iniciador , Metabolismo Energético/genética , Factores Eucarióticos de Iniciación/antagonistas & inhibidores , Factores Eucarióticos de Iniciación/genética , Expresión Génica , Técnicas de Silenciamiento del Gen , Células HCT116 , Humanos , Proteínas de Neoplasias/antagonistas & inhibidores , Proteínas de Neoplasias/genética , Proteínas del Tejido Nervioso/antagonistas & inhibidores , Proteínas del Tejido Nervioso/genética , Conformación de Ácido Nucleico , Sistemas de Lectura Abierta , ARN Mensajero/química , ARN Mensajero/genética , ARN Mensajero/metabolismo , Ribosomas/genética , Ribosomas/metabolismo , Estrés Fisiológico/genética

sORFs.org: a repository of small ORFs identified by ribosome profiling.

Olexiouk, Volodimir; Crappé, Jeroen; Verbruggen, Steven; Verhegen, Kenneth; Martens, Lennart; Menschaert, Gerben.

Nucleic Acids Res ; 44(D1): D324-9, 2016 Jan 04.

Artículo en Inglés | MEDLINE | ID: mdl-26527729

RESUMEN

With the advent of ribosome profiling, a next generation sequencing technique providing a "snap-shot'' of translated mRNA in a cell, many short open reading frames (sORFs) with ribosomal activity were identified. Follow-up studies revealed the existence of functional peptides, so-called micropeptides, translated from these 'sORFs', indicating a new class of bio-active peptides. Over the last few years, several micropeptides exhibiting important cellular functions were discovered. However, ribosome occupancy does not necessarily imply an actual function of the translated peptide, leading to the development of various tools assessing the coding potential of sORFs. Here, we introduce sORFs.org (http://www.sorfs.org), a novel database for sORFs identified using ribosome profiling. Starting from ribosome profiling, sORFs.org identifies sORFs, incorporates state-of-the-art tools and metrics and stores results in a public database. Two query interfaces are provided, a default one enabling quick lookup of sORFs and a BioMart interface providing advanced query and export possibilities. At present, sORFs.org harbors 263 354 sORFs that demonstrate ribosome occupancy, originating from three different cell lines: HCT116 (human), E14_mESC (mouse) and S2 (fruit fly). sORFs.org aims to provide an extensive sORFs database accessible to researchers with limited bioinformatics knowledge, thus enabling easy integration into personal projects.

Asunto(s)

Bases de Datos Genéticas , Sistemas de Lectura Abierta , Animales , Secuencia de Bases , Línea Celular , Secuencia Conservada , Drosophila melanogaster/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Internet , Espectrometría de Masas , Ratones , Péptidos/química , ARN Mensajero/química , Ribosomas/metabolismo , Análisis de Secuencia de ARN

mQC: A post-mapping data exploration tool for ribosome profiling.

Verbruggen, Steven; Menschaert, Gerben.

Comput Methods Programs Biomed ; 181: 104806, 2019 Nov.

Artículo en Inglés | MEDLINE | ID: mdl-30401579

RESUMEN

BACKGROUND AND OBJECTIVE: Ribosome profiling is a recent next generation sequencing technique enabling the genome-wide study of gene expression in biomedical research at the translation level. Too often, researchers precipitously start trying to test their hypotheses after alignment of their data, without checking the quality and the general features of their mapped data. Despite the fact that these checks are essential to prevent errors and ensure valid conclusions afterwards, easy-to-use tools for visualizing the quality and overall outlook of mapped ribosome profiling data are lacking. METHODS: We present mQC, a modular tool implemented as a Bioconda package and also available in the Galaxy tool shed. Herewith both bio-informaticians as well as non-experts can easily perform the indispensable visualization of both the quality and the general features of their mapped P-site corrected ribosome profiling reads. The user manual, the raw code and more information can be found on its GitHub repository (https://github.com/Biobix/mQC). RESULTS: mQC was tested on multiple datasets to assess its general applicability and was compared to other tools that partly perform similar tasks. CONCLUSIONS: Our results demonstrate that mQC can accomplish an unfilled but essential position in the ribosome profiling data analysis procedure by performing a thorough RIBO-Seq-specific exploration of aligned and P-site corrected ribosome profiling data.

Asunto(s)

Biología Computacional/métodos , Perfilación de la Expresión Génica , Estudio de Asociación del Genoma Completo , Ribosomas/química , Análisis de Secuencia de ADN , Algoritmos , Línea Celular Tumoral , Neoplasias del Colon/tratamiento farmacológico , Cicloheximida/farmacología , Células HCT116 , Células HEK293 , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Sistemas de Lectura Abierta , Control de Calidad , ARN Mensajero/genética , Reproducibilidad de los Resultados , Análisis de Secuencia de ARN , Programas Informáticos

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA