Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 82
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Mol Cell Proteomics ; 23(5): 100753, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38527648

RESUMEN

Bacterial or viral antigens can contain subdominant protein regions that elicit weak antibody responses upon vaccination or infection although there is accumulating evidence that antibody responses against subdominant regions can enhance the protective immune response. One proposed mechanism for subdominant protein regions is the binding of host proteins that prevent antibody production against epitopes hidden within the protein binding interfaces. Here, we used affinity purification combined with quantitative mass spectrometry (AP-MS) to examine the level of competition between antigen-specific antibodies and host-pathogen protein interaction networks using the M1 protein from Streptococcus pyogenes as a model system. As most humans have circulating antibodies against the M1 protein, we first used AP-MS to show that the M1 protein interspecies protein network formed with human plasma proteins is largely conserved in naïve mice. Immunizing mice with the M1 protein generated a time-dependent increase of anti-M1 antibodies. AP-MS analysis comparing the composition of the M1-plasma protein network from naïve and immunized mice showed significant enrichment of 292 IgG peptides associated with 56 IgG chains in the immune mice. Despite the significant increase of bound IgGs, the levels of interacting plasma proteins were not significantly reduced in the immune mice. The results indicate that the antigen-specific polyclonal IgG against the M1 protein primarily targets epitopes outside the other plasma protein binding interfaces. In conclusion, this study demonstrates that AP-MS is a promising strategy to determine the relationship between antigen-specific antibodies and host-pathogen interaction networks that could be used to define subdominant protein regions of relevance for vaccine development.


Asunto(s)
Antígenos Bacterianos , Inmunoglobulina G , Unión Proteica , Streptococcus pyogenes , Animales , Streptococcus pyogenes/inmunología , Streptococcus pyogenes/metabolismo , Antígenos Bacterianos/inmunología , Antígenos Bacterianos/metabolismo , Ratones , Humanos , Inmunoglobulina G/inmunología , Inmunoglobulina G/metabolismo , Inmunidad Adaptativa , Proteínas de la Membrana Bacteriana Externa/inmunología , Proteínas de la Membrana Bacteriana Externa/metabolismo , Anticuerpos Antibacterianos/inmunología , Mapas de Interacción de Proteínas , Espectrometría de Masas , Proteínas Portadoras/metabolismo , Proteínas Portadoras/inmunología , Femenino , Interacciones Huésped-Patógeno/inmunología
2.
Anal Chem ; 96(22): 9060-9068, 2024 Jun 04.
Artículo en Inglés | MEDLINE | ID: mdl-38701337

RESUMEN

An important element of antibody-guided vaccine design is the use of neutralizing or opsonic monoclonal antibodies to define protective epitopes in their native three-dimensional conformation. Here, we demonstrate a multimodal mass spectrometry-based strategy for in-depth characterization of antigen-antibody complexes to enable the identification of protective epitopes using the cytolytic exotoxin Streptolysin O (SLO) from Streptococcus pyogenes as a showcase. We first discovered a monoclonal antibody with an undisclosed sequence capable of neutralizing SLO-mediated cytolysis. The amino acid sequence of both the antibody light and the heavy chain was determined using mass-spectrometry-based de novo sequencing, followed by chemical cross-linking mass spectrometry to generate distance constraints between the antibody fragment antigen-binding region and SLO. Subsequent integrative computational modeling revealed a discontinuous epitope located in domain 3 of SLO that was experimentally validated by hydrogen-deuterium exchange mass spectrometry and reverse engineering of the targeted epitope. The results show that the antibody inhibits SLO-mediated cytolysis by binding to a discontinuous epitope in domain 3, likely preventing oligomerization and subsequent secondary structure transitions critical for pore-formation. The epitope is highly conserved across >98% of the characterized S. pyogenes isolates, making it an attractive target for antibody-based therapy and vaccine design against severe streptococcal infections.


Asunto(s)
Proteínas Bacterianas , Epítopos , Espectrometría de Masas , Streptococcus pyogenes , Estreptolisinas , Streptococcus pyogenes/inmunología , Streptococcus pyogenes/química , Estreptolisinas/química , Estreptolisinas/inmunología , Estreptolisinas/metabolismo , Proteínas Bacterianas/inmunología , Proteínas Bacterianas/química , Epítopos/inmunología , Epítopos/química , Anticuerpos Monoclonales/inmunología , Anticuerpos Monoclonales/química , Secuencia de Aminoácidos , Modelos Moleculares
3.
PLoS Comput Biol ; 19(1): e1010457, 2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-36668672

RESUMEN

Generating and analyzing overlapping peptides through multienzymatic digestion is an efficient procedure for de novo protein using from bottom-up mass spectrometry (MS). Despite improved instrumentation and software, de novo MS data analysis remains challenging. In recent years, deep learning models have represented a performance breakthrough. Incorporating that technology into de novo protein sequencing workflows require machine-learning models capable of handling highly diverse MS data. In this study, we analyzed the requirements for assembling such generalizable deep learning models by systemcally varying the composition and size of the training set. We assessed the generated models' performances using two test sets composed of peptides originating from the multienzyme digestion of samples from various species. The peptide recall values on the test sets showed that the deep learning models generated from a collection of highly N- and C-termini diverse peptides generalized 76% more over the termini-restricted ones. Moreover, expanding the training set's size by adding peptides from the multienzymatic digestion with five proteases of several species samples led to a 2-3 fold generalizability gain. Furthermore, we tested the applicability of these multienzyme deep learning (MEM) models by fully de novo sequencing the heavy and light monomeric chains of five commercial antibodies (mAbs). MEMs extracted over 10000 matching and overlapped peptides across six different proteases mAb samples, achieving a 100% sequence coverage for 8 of the ten polypeptide chains. We foretell that the MEMs' proven improvements to de novo analysis will positively impact several applications, such as analyzing samples of high complexity, unknown nature, or the peptidomics field.


Asunto(s)
Aprendizaje Profundo , Proteómica , Proteómica/métodos , Espectrometría de Masas en Tándem/métodos , Péptidos/química , Análisis de Secuencia de Proteína/métodos , Péptido Hidrolasas , Anticuerpos Monoclonales
4.
Bioinformatics ; 37(24): 4871-4872, 2021 12 11.
Artículo en Inglés | MEDLINE | ID: mdl-34128979

RESUMEN

SUMMARY: Protein-protein interactions (PPIs) are central in many biological processes but difficult to characterize, especially in complex, unfractionated samples. Chemical cross-linking combined with mass spectrometry (MS) and computational modeling is gaining recognition as a viable tool in protein interaction studies. Here, we introduce Cheetah-MS, a web server for predicting the PPIs in a complex mixture of samples. It combines the capability and sensitivity of MS to analyze complex samples with the power and resolution of protein-protein docking. It produces the quaternary structure of the PPI of interest by analyzing tandem MS/MS data (also called MS2). Combining MS analysis and modeling increases the sensitivity and, importantly, facilitates the interpretation of the results. AVAILABILITY AND IMPLEMENTATION: Cheetah-MS is freely available as a web server at https://www.txms.org.


Asunto(s)
Acinonyx , Animales , Acinonyx/metabolismo , Espectrometría de Masas en Tándem , Computadores , Proteínas/química , Simulación por Computador
5.
PLoS Comput Biol ; 17(1): e1008169, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33411763

RESUMEN

Streptococcus pyogenes (Group A streptococcus; GAS) is an important human pathogen responsible for mild to severe, life-threatening infections. GAS expresses a wide range of virulence factors, including the M family proteins. The M proteins allow the bacteria to evade parts of the human immune defenses by triggering the formation of a dense coat of plasma proteins surrounding the bacteria, including IgGs. However, the molecular level details of the M1-IgG interaction have remained unclear. Here, we characterized the structure and dynamics of this interaction interface in human plasma on the surface of live bacteria using integrative structural biology, combining cross-linking mass spectrometry and molecular dynamics (MD) simulations. We show that the primary interaction is formed between the S-domain of M1 and the conserved IgG Fc-domain. In addition, we show evidence for a so far uncharacterized interaction between the A-domain and the IgG Fc-domain. Both these interactions mimic the protein G-IgG interface of group C and G streptococcus. These findings underline a conserved scavenging mechanism used by GAS surface proteins that block the IgG-receptor (FcγR) to inhibit phagocytic killing. We additionally show that we can capture Fab-bound IgGs in a complex background and identify XLs between the constant region of the Fab-domain and certain regions of the M1 protein engaged in the Fab-mediated binding. Our results elucidate the M1-IgG interaction network involved in inhibition of phagocytosis and reveal important M1 peptides that can be further investigated as future vaccine targets.


Asunto(s)
Antígenos Bacterianos , Proteínas de la Membrana Bacteriana Externa , Proteínas Portadoras , Inmunoglobulina G , Streptococcus pyogenes , Antígenos Bacterianos/química , Antígenos Bacterianos/metabolismo , Proteínas de la Membrana Bacteriana Externa/química , Proteínas de la Membrana Bacteriana Externa/metabolismo , Proteínas Portadoras/química , Proteínas Portadoras/metabolismo , Interacciones Huésped-Patógeno , Humanos , Inmunoglobulina G/química , Inmunoglobulina G/metabolismo , Espectrometría de Masas , Simulación de Dinámica Molecular , Fagocitosis , Unión Proteica , Streptococcus pyogenes/química , Streptococcus pyogenes/metabolismo , Factores de Virulencia/química , Factores de Virulencia/metabolismo
6.
J Proteome Res ; 20(5): 2983-3001, 2021 05 07.
Artículo en Inglés | MEDLINE | ID: mdl-33855848

RESUMEN

Proteogenomic approaches have enabled the generat̲ion of novel information levels when compared to single omics studies although burdened by extensive experimental efforts. Here, we improved a data-independent acquisition mass spectrometry proteogenomic workflow to reveal distinct molecular features related to mammographic appearances in breast cancer. Our results reveal splicing processes detectable at the protein level and highlight quantitation and pathway complementarity between RNA and protein data. Furthermore, we confirm previously detected enrichments of molecular pathways associated with estrogen receptor-dependent activity and provide novel evidence of epithelial-to-mesenchymal activity in mammography-detected spiculated tumors. Several transcript-protein pairs displayed radically different abundances depending on the overall clinical properties of the tumor. These results demonstrate that there are differentially regulated protein networks in clinically relevant tumor subgroups, which in turn alter both cancer biology and the abundance of biomarker candidates and drug targets.


Asunto(s)
Neoplasias de la Mama , Proteogenómica , Neoplasias de la Mama/diagnóstico por imagen , Neoplasias de la Mama/genética , Femenino , Humanos , Mamografía , Fenotipo , Flujo de Trabajo
7.
Nat Methods ; 14(12): 1141-1152, 2017 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-29083403

RESUMEN

We present a combined report on the results of three editions of the Cell Tracking Challenge, an ongoing initiative aimed at promoting the development and objective evaluation of cell segmentation and tracking algorithms. With 21 participating algorithms and a data repository consisting of 13 data sets from various microscopy modalities, the challenge displays today's state-of-the-art methodology in the field. We analyzed the challenge results using performance measures for segmentation and tracking that rank all participating methods. We also analyzed the performance of all of the algorithms in terms of biological measures and practical usability. Although some methods scored high in all technical aspects, none obtained fully correct solutions. We found that methods that either take prior information into account using learning strategies or analyze cells in a global spatiotemporal video context performed better than other methods under the segmentation and tracking scenarios included in the challenge.


Asunto(s)
Algoritmos , Rastreo Celular/métodos , Interpretación de Imagen Asistida por Computador , Benchmarking , Línea Celular , Humanos
8.
Med Microbiol Immunol ; 209(3): 265-275, 2020 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-32072248

RESUMEN

A central challenge in infection medicine is to determine the structure and function of host-pathogen protein-protein interactions to understand how these interactions facilitate bacterial adhesion, dissemination and survival. In this review, we focus on proteomics, electron cryo-microscopy and structural modeling to showcase instances where affinity-purification (AP) and cross-linking (XL) mass spectrometry (MS) has advanced our understanding of host-pathogen interactions. We highlight cases where XL-MS in combination with structural modeling has provided insight into the quaternary structure of interspecies protein complexes. We further exemplify how electron cryo-tomography has been used to visualize bacterial-human interactions during attachment and infection. Lastly, we discuss how AP-MS, XL-MS and electron cryo-microscopy and -tomography together with structural modeling approaches can be used in future studies to broaden our knowledge regarding the function, dynamics and evolution of such interactions. This knowledge will be of relevance for future drug and vaccine development programs.


Asunto(s)
Interacciones Microbiota-Huesped , Modelos Moleculares , Mapeo de Interacción de Proteínas , Proteómica , Proteínas Bacterianas/química , Microscopía por Crioelectrón , Humanos , Espectrometría de Masas , Mapas de Interacción de Proteínas , Estructura Cuaternaria de Proteína
9.
EMBO Rep ; 19(8)2018 08.
Artículo en Inglés | MEDLINE | ID: mdl-29954836

RESUMEN

Despite recent mass spectrometry (MS)-based breakthroughs, comprehensive ADP-ribose (ADPr)-acceptor amino acid identification and ADPr-site localization remain challenging. Here, we report the establishment of an unbiased, multistep ADP-ribosylome data analysis workflow that led to the identification of tyrosine as a novel ARTD1/PARP1-dependent in vivo ADPr-acceptor amino acid. MS analyses of in vitro ADP-ribosylated proteins confirmed tyrosine as an ADPr-acceptor amino acid in RPS3A (Y155) and HPF1 (Y238) and demonstrated that trans-modification of RPS3A is dependent on HPF1. We provide an ADPr-site Localization Spectra Database (ADPr-LSD), which contains 288 high-quality ADPr-modified peptide spectra, to serve as ADPr spectral references for correct ADPr-site localizations.


Asunto(s)
ADP-Ribosilación , Adenosina Difosfato Ribosa/metabolismo , Tirosina/metabolismo , Secuencia de Aminoácidos , Proteínas Portadoras/metabolismo , Daño del ADN , Células HeLa , Humanos , Espectrometría de Masas , Proteínas Nucleares/metabolismo , Péptidos/química , Péptidos/metabolismo , Fosfoproteínas/metabolismo , Poli(ADP-Ribosa) Polimerasa-1/metabolismo , Proteoma/metabolismo , ARN Interferente Pequeño/metabolismo , Reproducibilidad de los Resultados
10.
BMC Bioinformatics ; 20(Suppl 4): 141, 2019 Apr 18.
Artículo en Inglés | MEDLINE | ID: mdl-30999854

RESUMEN

BACKGROUND: Bacterial surfaces are complex systems, constructed from membranes, peptidoglycan and, importantly, proteins. The proteins play crucial roles as critical regulators of how the bacterium interacts with and survive in its environment. A full catalog of the motifs in protein families and their relative conservation grade is a prerequisite to target the protein-protein interaction that bacterial surface protein makes to host proteins. RESULTS: In this paper, we propose a greedy approach to identify conserved motifs in large sequence families iteratively. Each iteration discovers a motif de novo and masks all occurrences of that motif. Remaining unmasked sequences are subjected to the next round of motif detection until no more significant motifs can be found. We demonstrate the utility of the method through the construction of a proteome-wide motif repository for Group A Streptococcus (GAS), a significant human pathogen. GAS produce numerous surface proteins that interact with over 100 human plasma proteins, helping the bacteria to evade the host immune response. We used the repository to find that proteins part of the bacterial surface has motif architectures that differ from intracellular proteins. CONCLUSIONS: We elucidate that the M protein, a coiled-coil homodimer that extends over 500 A from the cell wall, has a motif architecture that differs between various GAS strains. As the M protein is known to bind a variety of different plasma proteins, the results indicate that the different motif architectures are responsible for the quantitative differences of plasma proteins that various strains bind. The speed and applicability of the method enable its application to all major human pathogens.


Asunto(s)
Proteínas Bacterianas/química , Proteínas Bacterianas/metabolismo , Biología Computacional/métodos , Proteoma/metabolismo , Algoritmos , Secuencias de Aminoácidos , Secuencia Conservada , Genoma Bacteriano , Streptococcus pyogenes/genética
11.
Nat Methods ; 13(9): 777-83, 2016 09.
Artículo en Inglés | MEDLINE | ID: mdl-27479329

RESUMEN

Next-generation mass spectrometric (MS) techniques such as SWATH-MS have substantially increased the throughput and reproducibility of proteomic analysis, but ensuring consistent quantification of thousands of peptide analytes across multiple liquid chromatography-tandem MS (LC-MS/MS) runs remains a challenging and laborious manual process. To produce highly consistent and quantitatively accurate proteomics data matrices in an automated fashion, we developed TRIC (http://proteomics.ethz.ch/tric/), a software tool that utilizes fragment-ion data to perform cross-run alignment, consistent peak-picking and quantification for high-throughput targeted proteomics. TRIC reduced the identification error compared to a state-of-the-art SWATH-MS analysis without alignment by more than threefold at constant recall while correcting for highly nonlinear chromatographic effects. On a pulsed-SILAC experiment performed on human induced pluripotent stem cells, TRIC was able to automatically align and quantify thousands of light and heavy isotopic peak groups. Thus, TRIC fills a gap in the pipeline for automated analysis of massively parallel targeted proteomics data sets.


Asunto(s)
Procesamiento Automatizado de Datos/métodos , Péptidos/análisis , Proteómica/métodos , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína/métodos , Programas Informáticos , Algoritmos , Procesamiento Automatizado de Datos/instrumentación , Humanos , Espectrometría de Masas , Péptidos/metabolismo , Células Madre Pluripotentes/metabolismo , Precursores de Proteínas/análisis , Precursores de Proteínas/metabolismo , Proteolisis , Proteómica/instrumentación , Reproducibilidad de los Resultados , Alineación de Secuencia/instrumentación , Análisis de Secuencia de Proteína/instrumentación , Streptococcus pyogenes/metabolismo
12.
Nat Methods ; 13(9): 741-8, 2016 08 30.
Artículo en Inglés | MEDLINE | ID: mdl-27575624

RESUMEN

High-resolution mass spectrometry (MS) has become an important tool in the life sciences, contributing to the diagnosis and understanding of human diseases, elucidating biomolecular structural information and characterizing cellular signaling networks. However, the rapid growth in the volume and complexity of MS data makes transparent, accurate and reproducible analysis difficult. We present OpenMS 2.0 (http://www.openms.de), a robust, open-source, cross-platform software specifically designed for the flexible and reproducible analysis of high-throughput MS data. The extensible OpenMS software implements common mass spectrometric data processing tasks through a well-defined application programming interface in C++ and Python and through standardized open data formats. OpenMS additionally provides a set of 185 tools and ready-made workflows for common mass spectrometric data processing tasks, which enable users to perform complex quantitative mass spectrometric analyses with ease.


Asunto(s)
Biología Computacional/métodos , Procesamiento Automatizado de Datos , Espectrometría de Masas/métodos , Proteómica/métodos , Programas Informáticos , Envejecimiento/sangre , Proteínas Sanguíneas/química , Humanos , Anotación de Secuencia Molecular , Proteogenómica/métodos , Flujo de Trabajo
13.
Mol Cell Proteomics ; 16(4 suppl 1): S29-S41, 2017 04.
Artículo en Inglés | MEDLINE | ID: mdl-28183813

RESUMEN

Sepsis is a systemic immune response responsible for considerable morbidity and mortality. Molecular modeling of host-pathogen interactions in the disease state represents a promising strategy to define molecular events of importance for the transition from superficial to invasive infectious diseases. Here we used the Gram-positive bacterium Streptococcus pyogenes as a model system to establish a mass spectrometry based workflow for the construction of a stoichiometric surface density model between the S. pyogenes surface, the surface virulence factor M-protein, and adhered human blood plasma proteins. The workflow relies on stable isotope labeled reference peptides and selected reaction monitoring mass spectrometry analysis of a wild-type strain and an M-protein deficient mutant strain, to generate absolutely quantified protein stoichiometry ratios between S. pyogenes and interacting plasma proteins. The stoichiometry ratios in combination with a novel targeted mass spectrometry method to measure cell numbers enabled the construction of a stoichiometric surface density model using protein structures available from the protein data bank. The model outlines the topology and density of the host-pathogen protein interaction network on the S. pyogenes bacterial surface, revealing a dense and highly organized protein interaction network. Removal of the M-protein from S. pyogenes introduces a drastic change in the network topology, validated by electron microscopy. We propose that the stoichiometric surface density model of S. pyogenes in human blood plasma represents a scalable framework that can continuously be refined with the emergence of new results. Future integration of new results will improve the understanding of protein-protein interactions and their importance for bacterial virulence. Furthermore, we anticipate that the general properties of the developed workflow will facilitate the production of stoichiometric surface density models for other types of host-pathogen interactions.


Asunto(s)
Antígenos Bacterianos/metabolismo , Proteínas de la Membrana Bacteriana Externa/metabolismo , Proteínas Sanguíneas/metabolismo , Proteínas Portadoras/metabolismo , Espectrometría de Masas/métodos , Proteómica/métodos , Infecciones Estreptocócicas/microbiología , Streptococcus pyogenes/fisiología , Interacciones Huésped-Patógeno , Humanos , Marcaje Isotópico , Proteínas de la Membrana/metabolismo , Modelos Moleculares , Unión Proteica , Mapas de Interacción de Proteínas , Streptococcus pyogenes/metabolismo
14.
Nucleic Acids Res ; 45(D1): D404-D407, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27899646

RESUMEN

The FAIRDOMHub is a repository for publishing FAIR (Findable, Accessible, Interoperable and Reusable) Data, Operating procedures and Models (https://fairdomhub.org/) for the Systems Biology community. It is a web-accessible repository for storing and sharing systems biology research assets. It enables researchers to organize, share and publish data, models and protocols, interlink them in the context of the systems biology investigations that produced them, and to interrogate them via API interfaces. By using the FAIRDOMHub, researchers can achieve more effective exchange with geographically distributed collaborators during projects, ensure results are sustained and preserved and generate reproducible publications that adhere to the FAIR guiding principles of data stewardship.


Asunto(s)
Bases de Datos Factuales , Biología de Sistemas/métodos , Carbono/metabolismo , Curaduría de Datos , Difusión de la Información , Redes y Vías Metabólicas , Investigación
15.
Nat Methods ; 12(12): 1185-90, 2015 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26501516

RESUMEN

Chemical cross-linking in combination with mass spectrometry generates distance restraints of amino acid pairs in close proximity on the surface of native proteins and protein complexes. In this study we used quantitative mass spectrometry and chemical cross-linking to quantify differences in cross-linked peptides obtained from complexes in spatially discrete states. We describe a generic computational pipeline for quantitative cross-linking mass spectrometry consisting of modules for quantitative data extraction and statistical assessment of the obtained results. We used the method to detect conformational changes in two model systems: firefly luciferase and the bovine TRiC complex. Our method discovers and explains the structural heterogeneity of protein complexes using only sparse structural information.


Asunto(s)
Chaperonina con TCP-1/química , Reactivos de Enlaces Cruzados/química , Luciferasas de Luciérnaga/química , Espectrometría de Masas/métodos , Complejos Multiproteicos/química , Programas Informáticos , Algoritmos , Animales , Interpretación Estadística de Datos , Bases de Datos de Proteínas , Modelos Moleculares , Conformación Proteica
16.
Mol Cell Proteomics ; 14(10): 2800-13, 2015 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-26199342

RESUMEN

Accurate knowledge of retention time (RT) in liquid chromatography-based mass spectrometry data facilitates peptide identification, quantification, and multiplexing in targeted and discovery-based workflows. Retention time prediction is particularly important for peptide analysis in emerging data-independent acquisition (DIA) experiments such as SWATH-MS. The indexed RT approach, iRT, uses synthetic spiked-in peptide standards (SiRT) to set RT to a unit-less scale, allowing for normalization of peptide RT between different samples and chromatographic set-ups. The obligatory use of SiRTs can be costly and complicates comparisons and data integration if standards are not included in every sample. Reliance on SiRTs also prevents the inclusion of archived mass spectrometry data for generation of the peptide assay libraries central to targeted DIA-MS data analysis. We have identified a set of peptide sequences that are conserved across most eukaryotic species, termed Common internal Retention Time standards (CiRT). In a series of tests to support the appropriateness of the CiRT-based method, we show: (1) the CiRT peptides normalized RT in human, yeast, and mouse cell lysate derived peptide assay libraries and enabled merging of archived libraries for expanded DIA-MS quantitative applications; (2) CiRTs predicted RT in SWATH-MS data within a 2-min margin of error for the majority of peptides; and (3) normalization of RT using the CiRT peptides enabled the accurate SWATH-MS-based quantification of 340 synthetic isotopically labeled peptides that were spiked into either human or yeast cell lysate. To automate and facilitate the use of these CiRT peptide lists or other custom user-defined internal RT reference peptides in DIA workflows, an algorithm was designed to automatically select a high-quality subset of datapoints for robust linear alignment of RT for use. Implementations of this algorithm are available for the OpenSWATH and Skyline platforms. Thus, CiRT peptides can be used alone or as a complement to SiRTs for RT normalization across peptide spectral libraries and in quantitative DIA-MS studies.


Asunto(s)
Espectrometría de Masas/normas , Péptidos/análisis , Proteómica/normas , Animales , Línea Celular , Cromatografía Liquida , Células HEK293 , Humanos , Espectrometría de Masas/métodos , Ratones , Biblioteca de Péptidos , Proteómica/métodos , Factores de Tiempo , Levaduras
17.
Bioinformatics ; 31(14): 2415-7, 2015 Jul 15.
Artículo en Inglés | MEDLINE | ID: mdl-25788625

RESUMEN

MOTIVATION: Targeted mass spectrometry comprises a set of powerful methods to obtain accurate and consistent protein quantification in complex samples. To fully exploit these techniques, a cross-platform and open-source software stack based on standardized data exchange formats is required. RESULTS: We present TAPIR, a fast and efficient Python visualization software for chromatograms and peaks identified in targeted proteomics experiments. The input formats are open, community-driven standardized data formats (mzML for raw data storage and TraML encoding the hierarchical relationships between transitions, peptides and proteins). TAPIR is scalable to proteome-wide targeted proteomics studies (as enabled by SWATH-MS), allowing researchers to visualize high-throughput datasets. The framework integrates well with existing automated analysis pipelines and can be extended beyond targeted proteomics to other types of analyses. AVAILABILITY AND IMPLEMENTATION: TAPIR is available for all computing platforms under the 3-clause BSD license at https://github.com/msproteomicstools/msproteomicstools.


Asunto(s)
Espectrometría de Masas , Proteómica/métodos , Programas Informáticos , Gráficos por Computador
18.
Bioinformatics ; 31(4): 555-62, 2015 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-25348213

RESUMEN

MOTIVATION: Data independent acquisition mass spectrometry has emerged as a reproducible and sensitive alternative in quantitative proteomics, where parsing the highly complex tandem mass spectra requires dedicated algorithms. Recently, targeted data extraction was proposed as a novel analysis strategy for this type of data, but it is important to further develop these concepts to provide quality-controlled, interference-adjusted and sensitive peptide quantification. RESULTS: We here present the algorithm DIANA and the classifier PyProphet, which are based on new probabilistic sub-scores to classify the chromatographic peaks in targeted data-independent acquisition data analysis. The algorithm is capable of providing accurate quantitative values and increased recall at a controlled false discovery rate, in a complex gold standard dataset. Importantly, we further demonstrate increased confidence gained by the use of two complementary data-independent acquisition targeted analysis algorithms, as well as increased numbers of quantified peptide precursors in complex biological samples. AVAILABILITY AND IMPLEMENTATION: DIANA is implemented in scala and python and available as open source (Apache 2.0 license) or pre-compiled binaries from http://quantitativeproteomics.org/diana. PyProphet can be installed from PyPi (https://pypi.python.org/pypi/pyprophet). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Proteínas Bacterianas/metabolismo , Minería de Datos/métodos , Bases de Datos de Proteínas , Fragmentos de Péptidos/análisis , Proteómica/métodos , Programas Informáticos , Espectrometría de Masas en Tándem/métodos , Proteínas Bacterianas/química , Humanos , Cadenas de Markov , Streptococcus pyogenes/metabolismo
19.
Mol Cell Proteomics ; 13(6): 1537-42, 2014 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-24677029

RESUMEN

The open XML format mzML, used for representation of MS data, is pivotal for the development of platform-independent MS analysis software. Although conversion from vendor formats to mzML must take place on a platform on which the vendor libraries are available (i.e. Windows), once mzML files have been generated, they can be used on any platform. However, the mzML format has turned out to be less efficient than vendor formats. In many cases, the naïve mzML representation is fourfold or even up to 18-fold larger compared with the original vendor file. In disk I/O limited setups, a larger data file also leads to longer processing times, which is a problem given the data production rates of modern mass spectrometers. In an attempt to reduce this problem, we here present a family of numerical compression algorithms called MS-Numpress, intended for efficient compression of MS data. To facilitate ease of adoption, the algorithms target the binary data in the mzML standard, and support in main proteomics tools is already available. Using a test set of 10 representative MS data files we demonstrate typical file size decreases of 90% when combined with traditional compression, as well as read time decreases of up to 50%. It is envisaged that these improvements will be beneficial for data handling within the MS community.


Asunto(s)
Espectrometría de Masas , Proteómica , Programas Informáticos , Algoritmos , Bases de Datos de Proteínas , Análisis Numérico Asistido por Computador
20.
J Proteome Res ; 14(7): 2807-18, 2015 Jul 02.
Artículo en Inglés | MEDLINE | ID: mdl-25944384

RESUMEN

It is of highest importance to find proteins responsible for breast cancer dissemination, for use as biomarkers or treatment targets. We established and performed a combined nontargeted LC-MS/MS and a targeted LC-SRM workflow for discovery and validation of protein biomarkers. Eighty breast tumors, stratified for estrogen receptor status and development of distant recurrence (DR ± ), were collected. After enrichment of N-glycosylated peptides, label-free LC-MS/MS was performed on each individual tumor in triplicate. In total, 1515 glycopeptides from 778 proteins were identified and used to create a map of the breast cancer N-glycosylated proteome. Based on this specific proteome map, we constructed a 92-plex targeted label-free LC-SRM panel. These proteins were quantified across samples by LC-SRM, resulting in 10 proteins consistently differentially regulated between DR+/DR- tumors. Five proteins were further validated in a separate cohort as prognostic biomarkers at the gene expression level. We also compared the LC-SRM results to clinically reported HER2 status, demonstrating its clinical accuracy. In conclusion, we demonstrate a combined mass spectrometry strategy, at large scale on clinical samples, leading to the identification and validation of five proteins as potential biomarkers for breast cancer recurrence. All MS data are available via ProteomeXchange and PASSEL with identifiers PXD001685 and PASS00643.


Asunto(s)
Biomarcadores de Tumor/análisis , Neoplasias de la Mama/diagnóstico , Espectrometría de Masas en Tándem/métodos , Femenino , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA