RESUMEN
Thousands of interactions assemble proteins into modules that impart spatial and functional organization to the cellular proteome. Through affinity-purification mass spectrometry, we have created two proteome-scale, cell-line-specific interaction networks. The first, BioPlex 3.0, results from affinity purification of 10,128 human proteins-half the proteome-in 293T cells and includes 118,162 interactions among 14,586 proteins. The second results from 5,522 immunoprecipitations in HCT116 cells. These networks model the interactome whose structure encodes protein function, localization, and complex membership. Comparison across cell lines validates thousands of interactions and reveals extensive customization. Whereas shared interactions reside in core complexes and involve essential proteins, cell-specific interactions link these complexes, "rewiring" subnetworks within each cell's interactome. Interactions covary among proteins of shared function as the proteome remodels to produce each cell's phenotype. Viewable interactively online through BioPlexExplorer, these networks define principles of proteome organization and enable unknown protein characterization.
Asunto(s)
Mapeo de Interacción de Proteínas/métodos , Mapas de Interacción de Proteínas/genética , Proteoma/genética , Biología Computacional/métodos , Células HCT116/metabolismo , Células HEK293/metabolismo , Humanos , Espectrometría de Masas/métodos , Mapas de Interacción de Proteínas/fisiología , Proteoma/metabolismo , Proteómica/métodosRESUMEN
Proteins are essential agents of biological processes. To date, large-scale profiling of cell line collections including the Cancer Cell Line Encyclopedia (CCLE) has focused primarily on genetic information whereas deep interrogation of the proteome has remained out of reach. Here, we expand the CCLE through quantitative profiling of thousands of proteins by mass spectrometry across 375 cell lines from diverse lineages to reveal information undiscovered by DNA and RNA methods. We observe unexpected correlations within and between pathways that are largely absent from RNA. An analysis of microsatellite instable (MSI) cell lines reveals the dysregulation of specific protein complexes associated with surveillance of mutation and translation. These and other protein complexes were associated with sensitivity to knockdown of several different genes. These data in conjunction with the wider CCLE are a broad resource to explore cellular behavior and facilitate cancer research.
Asunto(s)
Regulación Neoplásica de la Expresión Génica/genética , Neoplasias/metabolismo , Proteoma/metabolismo , Línea Celular Tumoral , Perfilación de la Expresión Génica/métodos , Humanos , Espectrometría de Masas/métodos , Inestabilidad de Microsatélites , Mutación/genética , Proteómica/métodosRESUMEN
Quantitative proteomics employing isobaric reagents has been established as a powerful tool for biological discovery. Current workflows often utilize a dedicated quantitative spectrum to improve quantitative accuracy and precision. A consequence of this approach is a dramatic reduction in the spectral acquisition rate, which necessitates the use of additional instrument time to achieve comprehensive proteomic depth. This work assesses the performance and benefits of online and real-time spectral identification in quantitative multiplexed workflows. A Real-Time Search (RTS) algorithm was implemented to identify fragment spectra within milliseconds as they are acquired using a probabilistic score and to trigger quantitative spectra only upon confident peptide identification. The RTS-MS3 was benchmarked against standard workflows using a complex two-proteome model of interference and a targeted 10-plex comparison of kinase abundance profiles. Applying the RTS-MS3 method provided the comprehensive characterization of a 10-plex proteome in 50% less acquisition time. These data indicate that the RTS-MS3 approach provides dramatic performance improvements for quantitative multiplexed experiments.
Asunto(s)
Péptidos/aislamiento & purificación , Proteoma/aislamiento & purificación , Proteómica/métodos , Algoritmos , Bases de Datos Factuales , Humanos , Péptidos/química , Proteoma/química , Espectrometría de Masas en Tándem , Flujo de TrabajoRESUMEN
Multiplexed quantitation via isobaric chemical tags (e.g., tandem mass tags (TMT) and isobaric tags for relative and absolute quantitation (iTRAQ)) has the potential to revolutionize quantitative proteomics. However, until recently the utility of these tags was questionable due to reporter ion ratio distortion resulting from fragmentation of coisolated interfering species. These interfering signals can be negated through additional gas-phase manipulations (e.g., MS/MS/MS (MS3) and proton-transfer reactions (PTR)). These methods, however, have a significant sensitivity penalty. Using isolation waveforms with multiple frequency notches (i.e., synchronous precursor selection, SPS), we coisolated and cofragmented multiple MS2 fragment ions, thereby increasing the number of reporter ions in the MS3 spectrum 10-fold over the standard MS3 method (i.e., MultiNotch MS3). By increasing the reporter ion signals, this method improves the dynamic range of reporter ion quantitation, reduces reporter ion signal variance, and ultimately produces more high-quality quantitative measurements. To demonstrate utility, we analyzed biological triplicates of eight colon cancer cell lines using the MultiNotch MS3 method. Across all the replicates we quantified 8,378 proteins in union and 6,168 proteins in common. Taking into account that each of these quantified proteins contains eight distinct cell-line measurements, this data set encompasses 174,704 quantitative ratios each measured in triplicate across the biological replicates. Herein, we demonstrate that the MultiNotch MS3 method uniquely combines multiplexing capacity with quantitative sensitivity and accuracy, drastically increasing the informational value obtainable from proteomic experiments.
Asunto(s)
Neoplasias del Colon/metabolismo , Proteómica/métodos , Espectrometría de Masas en Tándem/métodos , Algoritmos , Línea Celular Tumoral , Cromatografía Líquida de Alta Presión/métodos , Células HeLa , Humanos , Iones , Isocitrato Deshidrogenasa/análisis , Isocitrato Deshidrogenasa/metabolismo , Análisis de Componente Principal , Reproducibilidad de los Resultados , Sensibilidad y Especificidad , Proteína Smad4/análisis , Proteína Smad4/metabolismo , Espectrometría de Masas en Tándem/instrumentaciónRESUMEN
MOTIVATION: Proteomics presents the opportunity to provide novel insights about the global biochemical state of a tissue. However, a significant problem with current methods is that shotgun proteomics has limited success at detecting many low abundance proteins, such as transcription factors from complex mixtures of cells and tissues. The ability to assay for these proteins in the context of the entire proteome would be useful in many areas of experimental biology. RESULTS: We used network-based inference in an approach named SNIPE (Software for Network Inference of Proteomics Experiments) that selectively highlights proteins that are more likely to be active but are otherwise undetectable in a shotgun proteomic sample. SNIPE integrates spectral counts from paired case-control samples over a network neighbourhood and assesses the statistical likelihood of enrichment by a permutation test. As an initial application, SNIPE was able to select several proteins required for early murine tooth development. Multiple lines of additional experimental evidence confirm that SNIPE can uncover previously unreported transcription factors in this system. We conclude that SNIPE can enhance the utility of shotgun proteomics data to facilitate the study of poorly detected proteins in complex mixtures. AVAILABILITY AND IMPLEMENTATION: An implementation for the R statistical computing environment named snipeR has been made freely available at http://genetics.bwh.harvard.edu/snipe/. CONTACT: ssunyaev@rics.bwh.harvard.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Algoritmos , Proteoma/análisis , Proteómica/métodos , Programas Informáticos , Animales , Biología Computacional/métodos , Ratones , Diente/metabolismoRESUMEN
Current methods used for measuring amino acid side-chain reactivity lack the throughput needed to screen large chemical libraries for interactions across the proteome. Here we redesigned the workflow for activity-based protein profiling of reactive cysteine residues by using a smaller desthiobiotin-based probe, sample multiplexing, reduced protein starting amounts and software to boost data acquisition in real time on the mass spectrometer. Our method, streamlined cysteine activity-based protein profiling (SLC-ABPP), achieved a 42-fold improvement in sample throughput, corresponding to profiling library members at a depth of >8,000 reactive cysteine sites at 18 min per compound. We applied it to identify proteome-wide targets of covalent inhibitors to mutant Kirsten rat sarcoma (KRAS)G12C and Bruton's tyrosine kinase (BTK). In addition, we created a resource of cysteine reactivity to 285 electrophiles in three human cell lines, which includes >20,000 cysteines from >6,000 proteins per line. The goal of proteome-wide profiling of cysteine reactivity across thousand-member libraries under several cellular contexts is now within reach.
Asunto(s)
Aminoácidos/genética , Elementos de Respuesta Antioxidante/genética , Cisteína/genética , Proteoma/genética , Agammaglobulinemia Tirosina Quinasa/genética , Humanos , Espectrometría de Masas , Proteómica/tendencias , Proteínas Proto-Oncogénicas p21(ras)/genéticaRESUMEN
Governance of protein phosphorylation by kinases and phosphatases constitutes an essential regulatory network in eukaryotic cells. Network dysregulation leads to severe consequences and is often a key factor in disease pathogenesis. Previous studies revealed multiple roles for protein phosphorylation and pathway structures in cellular functions from different perspectives. We seek to understand the roles of kinases and phosphatases from a protein homeostasis point of view. Using a streamlined tandem mass tag (SL-TMT) strategy, we systematically measure proteomic and phosphoproteomic responses to perturbations of phosphorylation signaling networks in yeast deletion strains. Our results emphasize the requirement for protein normalization for more complete interpretation of phosphorylation data. Functional relationships between kinases and phosphatases were characterized at both proteome and phosphoproteome levels in three ways: (1) Gene Ontology enrichment analysis, (2) Δgene-Δgene correlation networks, and (3) molecule covariance networks. This resource illuminates kinase and phosphatase functions and pathway organizations.
Asunto(s)
Fosfoproteínas/metabolismo , Proteómica , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/metabolismo , Transducción de Señal , Eliminación de Gen , Fosfoproteínas/genética , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genéticaRESUMEN
Fewer than half of all tandem mass spectrometry (MS/MS) spectra acquired in shotgun proteomics experiments are typically matched to a peptide with high confidence. Here we determine the identity of unassigned peptides using an ultra-tolerant Sequest database search that allows peptide matching even with modifications of unknown masses up to ± 500 Da. In a proteome-wide data set on HEK293 cells (9,513 proteins and 396,736 peptides), this approach matched an additional 184,000 modified peptides, which were linked to biological and chemical modifications representing 523 distinct mass bins, including phosphorylation, glycosylation and methylation. We localized all unknown modification masses to specific regions within a peptide. Known modifications were assigned to the correct amino acids with frequencies >90%. We conclude that at least one-third of unassigned spectra arise from peptides with substoichiometric modifications.