RESUMO
Single-cell proteomics by mass spectrometry is emerging as a powerful and unbiased method for the characterization of biological heterogeneity. So far, it has been limited to cultured cells, whereas an expansion of the method to complex tissues would greatly enhance biological insights. Here we describe single-cell Deep Visual Proteomics (scDVP), a technology that integrates high-content imaging, laser microdissection and multiplexed mass spectrometry. scDVP resolves the context-dependent, spatial proteome of murine hepatocytes at a current depth of 1,700 proteins from a cell slice. Half of the proteome was differentially regulated in a spatial manner, with protein levels changing dramatically in proximity to the central vein. We applied machine learning to proteome classes and images, which subsequently inferred the spatial proteome from imaging data alone. scDVP is applicable to healthy and diseased tissues and complements other spatial proteomics and spatial omics technologies.
Assuntos
Proteoma , Proteômica , Animais , Camundongos , Proteoma/análise , Espectrometria de Massas/métodos , Proteômica/métodos , Microdissecção e Captura a Laser/métodosRESUMO
Recent advances in mass spectrometry-based proteomics enable the acquisition of increasingly large datasets within relatively short times, which exposes bottlenecks in the bioinformatics pipeline. Although peptide identification is already scalable, most label-free quantification (LFQ) algorithms scale quadratic or cubic with the sample numbers, which may even preclude the analysis of large-scale data. Here we introduce directLFQ, a ratio-based approach for sample normalization and the calculation of protein intensities. It estimates quantities via aligning samples and ion traces by shifting them on top of each other in logarithmic space. Importantly, directLFQ scales linearly with the number of samples, allowing analyses of large studies to finish in minutes instead of days or months. We quantify 10,000 proteomes in 10 min and 100,000 proteomes in less than 2 h, a 1000-fold faster than some implementations of the popular LFQ algorithm MaxLFQ. In-depth characterization of directLFQ reveals excellent normalization properties and benchmark results, comparing favorably to MaxLFQ for both data-dependent acquisition and data-independent acquisition. In addition, directLFQ provides normalized peptide intensity estimates for peptide-level comparisons. It is an important part of an overall quantitative proteomic pipeline that also needs to include high sensitive statistical analysis leading to proteoform resolution. Available as an open-source Python package and a graphical user interface with a one-click installer, it can be used in the AlphaPept ecosystem as well as downstream of most common computational proteomics pipelines.
Assuntos
Proteoma , Proteômica , Proteoma/análise , Proteômica/métodos , Ecossistema , Peptídeos/análise , Espectrometria de Massas/métodos , SoftwareRESUMO
Single-cell proteomics aims to characterize biological function and heterogeneity at the level of proteins in an unbiased manner. It is currently limited in proteomic depth, throughput, and robustness, which we address here by a streamlined multiplexed workflow using data-independent acquisition (mDIA). We demonstrate automated and complete dimethyl labeling of bulk or single-cell samples, without losing proteomic depth. Lys-N digestion enables five-plex quantification at MS1 and MS2 level. Because the multiplexed channels are quantitatively isolated from each other, mDIA accommodates a reference channel that does not interfere with the target channels. Our algorithm RefQuant takes advantage of this and confidently quantifies twice as many proteins per single cell compared to our previous work (Brunner et al, PMID 35226415), while our workflow currently allows routine analysis of 80 single cells per day. Finally, we combined mDIA with spatial proteomics to increase the throughput of Deep Visual Proteomics seven-fold for microdissection and four-fold for MS analysis. Applying this to primary cutaneous melanoma, we discovered proteomic signatures of cells within distinct tumor microenvironments, showcasing its potential for precision oncology.
Assuntos
Melanoma , Neoplasias Cutâneas , Humanos , Proteoma , Proteômica , Medicina de Precisão , Microambiente TumoralRESUMO
BACKGROUND: Amino acid metabolism is crucial for inflammatory processes during atherogenesis. The endogenous amino acid homoarginine is a robust biomarker for cardiovascular outcome and mortality with high levels being protective. However, the underlying mechanisms remain elusive. We investigated the effect of homoarginine supplementation on atherosclerotic plaque development with a particular focus on inflammation. METHODS: Female ApoE-deficient mice were supplemented with homoarginine (14 mg/L) in drinking water starting 2 weeks before and continuing throughout a 6-week period of Western-type diet feeding. Control mice received normal drinking water. Immunohistochemistry and flow cytometry were used for plaque- and immunological phenotyping. T cells were characterized using mass spectrometry-based proteomics, by functional in vitro approaches, for example, proliferation and migration/chemotaxis assays as well as by super-resolution microscopy. RESULTS: Homoarginine supplementation led to a 2-fold increase in circulating homoarginine concentrations. Homoarginine-treated mice exhibited reduced atherosclerosis in the aortic root and brachiocephalic trunk. A substantial decrease in CD3+ T cells in the atherosclerotic lesions suggested a T-cell-related effect of homoarginine supplementation, which was mainly attributed to CD4+ T cells. Macrophages, dendritic cells, and B cells were not affected. CD4+ T-cell proteomics and subsequent pathway analysis together with in vitro studies demonstrated that homoarginine profoundly modulated the spatial organization of the T-cell actin cytoskeleton and increased filopodia formation via inhibition of Myh9 (myosin heavy chain 9). Further mechanistic studies revealed an inhibition of T-cell proliferation as well as a striking impairment of the migratory capacities of T cells in response to relevant chemokines by homoarginine, all of which likely contribute to its atheroprotective effects. CONCLUSIONS: Our study unravels a novel mechanism by which the amino acid homoarginine reduces atherosclerosis, establishing that homoarginine modulates the T-cell cytoskeleton and thereby mitigates T-cell functions important during atherogenesis. These findings provide a molecular explanation for the beneficial effects of homoarginine in atherosclerotic cardiovascular disease.
Assuntos
Aterosclerose , Água Potável , Placa Aterosclerótica , Aminoácidos , Animais , Apolipoproteínas E , Aterosclerose/tratamento farmacológico , Aterosclerose/metabolismo , Aterosclerose/prevenção & controle , Feminino , Homoarginina/farmacologia , Camundongos , Cadeias Pesadas de Miosina , Linfócitos T/metabolismoRESUMO
Bacteria reorganize their physiology upon entry to stationary phase. What part of this reorganization improves starvation survival is a difficult question because the change in physiology includes a global reorganization of the proteome, envelope, and metabolism of the cell. In this work, we used several trade-offs between fast growth and long survival to statistically score over 2,000 Escherichia coli proteins for their global correlation with death rate. The combined ranking allowed us to narrow down the set of proteins that positively correlate with survival and validate the causal role of a subset of proteins. Remarkably, we found that important survival genes are related to the cell envelope, i.e., periplasm and outer membrane, because the maintenance of envelope integrity of E. coli plays a crucial role during starvation. Our results uncover a new protective feature of the outer membrane that adds to the growing evidence that the outer membrane is not only a barrier that prevents abiotic substances from reaching the cytoplasm but also essential for bacterial proliferation and survival.
Assuntos
Escherichia coli , Proteoma , Escherichia coli/genéticaRESUMO
Single-cell technologies are revolutionizing biology but are today mainly limited to imaging and deep sequencing. However, proteins are the main drivers of cellular function and in-depth characterization of individual cells by mass spectrometry (MS)-based proteomics would thus be highly valuable and complementary. Here, we develop a robust workflow combining miniaturized sample preparation, very low flow-rate chromatography, and a novel trapped ion mobility mass spectrometer, resulting in a more than 10-fold improved sensitivity. We precisely and robustly quantify proteomes and their changes in single, FACS-isolated cells. Arresting cells at defined stages of the cell cycle by drug treatment retrieves expected key regulators. Furthermore, it highlights potential novel ones and allows cell phase prediction. Comparing the variability in more than 430 single-cell proteomes to transcriptome data revealed a stable-core proteome despite perturbation, while the transcriptome appears stochastic. Our technology can readily be applied to ultra-high sensitivity analyses of tissue material, posttranslational modifications, and small molecule studies from small cell counts to gain unprecedented insights into cellular heterogeneity in health and disease.
Assuntos
Proteoma , Proteômica , Espectrometria de Massas/métodos , Processamento de Proteína Pós-Traducional , Proteoma/metabolismo , Proteômica/métodos , Fluxo de TrabalhoRESUMO
Mass spectrometry based proteomics is the method of choice for quantifying genome-wide differential changes of protein expression in a wide range of biological and biomedical applications. Protein expression changes need to be reliably derived from many measured peptide intensities and their corresponding peptide fold changes. These peptide fold changes vary considerably for a given protein. Numerous instrumental setups aim to reduce this variability, whereas current computational methods only implicitly account for this problem. We introduce a new method, MS-EmpiRe, which explicitly accounts for the noise underlying peptide fold changes. We derive data set-specific, intensity-dependent empirical error fold change distributions, which are used for individual weighing of peptide fold changes to detect differentially expressed proteins (DEPs).In a recently published proteome-wide benchmarking data set, MS-EmpiRe doubles the number of correctly identified DEPs at an estimated FDR cutoff compared with state-of-the-art tools. We additionally confirm the superior performance of MS-EmpiRe on simulated data. MS-EmpiRe requires only peptide intensities mapped to proteins and, thus, can be applied to any common quantitative proteomics setup. We apply our method to diverse MS data sets and observe consistent increases in sensitivity with more than 1000 additional significant proteins in deep data sets, including a clinical study over multiple patients. At the same time, we observe that even the proteins classified as most insignificant by other methods but significant by MS-EmpiRe show very clear regulation on the peptide intensity level. MS-EmpiRe provides rapid processing (< 2 min for 6 LC-MS/MS runs (3 h gradients)) and is publicly available under github.com/zimmerlab/MS-EmpiRe with a manual including examples.
Assuntos
Espectrometria de Massas/métodos , Peptídeos/análise , Proteoma/análise , Proteômica/métodos , Software , Doença de Alzheimer/metabolismo , Benchmarking , Bases de Dados Factuais , Francisella/metabolismo , Proteínas Fúngicas/análise , Células HeLa , Humanos , Doença de Parkinson/metabolismo , Proteínas de Plantas/análise , Reprodutibilidade dos Testes , Razão Sinal-RuídoRESUMO
Spectral libraries play a central role in the analysis of data-independent-acquisition (DIA) proteomics experiments. A main assumption in current spectral library tools is that a single characteristic intensity pattern (CIP) suffices to describe the fragmentation of a peptide in a particular charge state (peptide charge pair). However, we find that this is often not the case. We carry out a systematic evaluation of spectral variability over public repositories and in-house data sets. We show that spectral variability is widespread and partly occurs under fixed experimental conditions. Using clustering of preprocessed spectra, we derive a limited number of multiple characteristic intensity patterns (MCIPs) for each peptide charge pair, which allow almost complete coverage of our heterogeneous data set without affecting the false discovery rate. We show that a MCIP library derived from public repositories performs in most cases similar to a "custom-made" spectral library, which has been acquired under identical experimental conditions as the query spectra. We apply the MCIP approach to a DIA data set and observe a significant increase in peptide recognition. We propose the MCIP approach as an easy-to-implement addition to current spectral library search engines and as a new way to utilize the data stored in spectral repositories.
Assuntos
Cromatografia Líquida , Bases de Dados de Proteínas , Biblioteca de Peptídeos , Proteômica/métodos , Espectrometria de Massas em Tandem , Algoritmos , Fragmentos de Peptídeos/química , Fragmentos de Peptídeos/genéticaRESUMO
In common with other omics technologies, mass spectrometry (MS)-based proteomics produces ever-increasing amounts of raw data, making efficient analysis a principal challenge. A plethora of different computational tools can process the MS data to derive peptide and protein identification and quantification. However, during the last years there has been dramatic progress in computer science, including collaboration tools that have transformed research and industry. To leverage these advances, we develop AlphaPept, a Python-based open-source framework for efficient processing of large high-resolution MS data sets. Numba for just-in-time compilation on CPU and GPU achieves hundred-fold speed improvements. AlphaPept uses the Python scientific stack of highly optimized packages, reducing the code base to domain-specific tasks while accessing the latest advances. We provide an easy on-ramp for community contributions through the concept of literate programming, implemented in Jupyter Notebooks. Large datasets can rapidly be processed as shown by the analysis of hundreds of proteomes in minutes per file, many-fold faster than acquisition. AlphaPept can be used to build automated processing pipelines with web-serving functionality and compatibility with downstream analysis tools. It provides easy access via one-click installation, a modular Python library for advanced users, and via an open GitHub repository for developers.
Assuntos
Proteômica , Software , Proteômica/métodos , Espectrometria de Massas/métodos , ProteomaRESUMO
Machine learning and in particular deep learning (DL) are increasingly important in mass spectrometry (MS)-based proteomics. Recent DL models can predict the retention time, ion mobility and fragment intensities of a peptide just from the amino acid sequence with good accuracy. However, DL is a very rapidly developing field with new neural network architectures frequently appearing, which are challenging to incorporate for proteomics researchers. Here we introduce AlphaPeptDeep, a modular Python framework built on the PyTorch DL library that learns and predicts the properties of peptides ( https://github.com/MannLabs/alphapeptdeep ). It features a model shop that enables non-specialists to create models in just a few lines of code. AlphaPeptDeep represents post-translational modifications in a generic manner, even if only the chemical composition is known. Extensive use of transfer learning obviates the need for large data sets to refine models for particular experimental conditions. The AlphaPeptDeep models for predicting retention time, collisional cross sections and fragment intensities are at least on par with existing tools. Additional sequence-based properties can also be predicted by AlphaPeptDeep, as demonstrated with a HLA peptide prediction model to improve HLA peptide identification for data-independent acquisition ( https://github.com/MannLabs/PeptDeep-HLA ).
Assuntos
Aprendizado Profundo , Proteômica , Proteômica/métodos , Peptídeos/química , Sequência de Aminoácidos , Redes Neurais de ComputaçãoRESUMO
To break down organismal fitness into molecular contributions, costs and benefits of cellular components must be analyzed in all phases of the organism's life cycle. Here, we establish the required quantitative approach for the death phase of the model bacterium Escherichia coli. We show that in carbon starvation, an exponential decay of viability emerges as a collective phenomenon, with viable cells recycling nutrients from cell carcasses to maintain viability. The observed collective death rate is determined by the maintenance rate of viable cells and the amount of nutrients recovered from dead cells. Using this relation, we study the cost of a wasteful enzyme during starvation and the benefit of the stress response sigma factor RpoS. While the enzyme increases maintenance and thereby the death rate, RpoS improves biomass recycling, decreasing the death rate. Our approach thus enables quantitative analyses of how cellular components affect the survival of non-growing cells.