RESUMO
Single-cell tissue atlases commonly use RNA abundances as surrogates for protein abundances. Yet, protein abundance also depends on the regulation of protein synthesis and degradation rates. To estimate the contributions of such post transcriptional regulation, we quantified the proteomes of 5,883 single cells from human testis using 3 distinct mass spectrometry methods (SCoPE2, pSCoPE, and plexDIA). To distinguish between biological and technical factors contributing to differences between protein and RNA levels, we developed BayesPG, a Bayesian model of transcript and protein abundance that systematically accounts for technical variation and infers biological differences. We use BayesPG to jointly model RNA and protein data collected from 29,709 single cells across different methods and datasets. BayesPG estimated consensus mRNA and protein levels for 3,861 gene products and quantified the relative protein-to-mRNA ratio (rPTR) for each gene across six distinct cell types in samples from human testis. About 28% of the gene products exhibited significant differences at protein and RNA levels and contributed to about 1, 500 significant GO groups. We observe that specialized and context specific functions, such as those related to spermatogenesis are regulated after transcription. Among hundreds of detected post translationally modified peptides, many show significant abundance differences across cell types. Furthermore, some phosphorylated peptides covary with kinases in a cell-type dependent manner, suggesting cell-type specific regulation. Our results demonstrate the potential of inferring protein regulation in from single-cell proteogenomic data and provide a generalizable model, BayesPG, for performing such analyses.
RESUMO
Amino acid substitutions may substantially alter protein stability and function, but the contribution of substitutions arising from alternate translation (deviations from the genetic code) is unknown. To explore it, we analyzed deep proteomic and transcriptomic data from over 1,000 human samples, including 6 cancer types and 26 healthy human tissues. This global analysis identified 60,024 high confidence substitutions corresponding to 8,801 unique sites in proteins derived from 1,990 genes. Some substitutions are shared across samples, while others exhibit strong tissue-type and cancer specificity. Surprisingly, products of alternate translation are more abundant than their canonical counterparts for hundreds of proteins, suggesting sense codon recoding. Recoded proteins include transcription factors, proteases, signaling proteins, and proteins associated with neurodegeneration. Mechanisms contributing to substitution abundance include protein stability, codon frequency, codon-anticodon mismatches, and RNA modifications. We characterize sequence motifs around alternatively translated amino acids and how substitution ratios vary across protein domains, tissue types and cancers. The substitution ratios are positively associated with intrinsically disordered regions and genetic polymorphisms in gnomAD, though the polymorphisms cannot account for the substitutions. Both the sequence and the tissue-specificity of alternatively translated proteins are conserved between human and mouse. These results demonstrate the contribution of alternate translation to diversifying mammalian proteomes, and its association with protein stability, tissue-specific proteomes, and diseases.
RESUMO
Pre-patterning of the embryo, driven by spatially localized factors, is a common feature across several non-mammalian species 1-4 . However, mammals display regulative development and thus it was thought that blastomeres of the embryo do not show such pre-patterning, contributing randomly to the three lineages of the blastocyst: the epiblast, primitive endoderm and trophectoderm that will generate the new organism, the yolk sac and placenta respectively 4-6 . Unexpectedly, early blastomeres of mouse and human embryos have been reported to have distinct developmental fates, potential and heterogeneous abundance of certain transcripts 7-12 . Nevertheless, the extent of the earliest intra-embryo differences remains unclear and controversial. Here, by utilizing multiplexed and label-free single-cell proteomics by mass-spectrometry 13 , we show that 2-cell mouse and human embryos contain an alpha and a beta blastomere as defined by differential abundance of hundreds of proteins exhibiting strong functional enrichment for protein synthesis, transport, and degradation. Such asymmetrically distributed proteins include Gps1 and Nedd8, depletion or overexpression of which in one blastomere of the 2-cell embryo impacts lineage segregation. These protein asymmetries increase at 4-cell stage. Intriguingly, halved mouse zygotes display asymmetric protein abundance that resembles alpha and beta blastomeres, suggesting differential proteome localization already within zygotes. We find that beta blastomeres give rise to a blastocyst with a higher proportion of epiblast cells than alpha blastomeres and that vegetal blastomeres, which are known to have a reduced developmental potential, are more likely to be alpha. Human 2-cell blastomeres also partition into two clusters sharing strong concordance with clusters found in mouse, in terms of differentially abundant proteins and functional enrichment. To our knowledge, this is the first demonstration of intra-zygotic and inter-blastomere proteomic asymmetry in mammals that has a role in lineage segregation.
RESUMO
Single-cell proteomics by mass spectrometry (MS) allows the quantification of proteins with high specificity and sensitivity. To increase its throughput, we developed nano-proteomic sample preparation (nPOP), a method for parallel preparation of thousands of single cells in nanoliter-volume droplets deposited on glass slides. Here, we describe its protocol with emphasis on its flexibility to prepare samples for different multiplexed MS methods. An implementation using the plexDIA MS multiplexing method, which uses non-isobaric mass tags to barcode peptides from different samples for data-independent acquisition, demonstrates accurate quantification of ~3,000-3,700 proteins per human cell. A separate implementation with isobaric mass tags and prioritized data acquisition demonstrates analysis of 1,827 single cells at a rate of >1,000 single cells per day at a depth of 800-1,200 proteins per human cell. The protocol is implemented by using a cell-dispensing and liquid-handling robot-the CellenONE instrument-and uses readily available consumables, which should facilitate broad adoption. nPOP can be applied to all samples that can be processed to a single-cell suspension. It takes 1 or 2 d to prepare >3,000 single cells. We provide metrics and software (the QuantQC R package) for quality control and data exploration. QuantQC supports the robust scaling of nPOP to higher plex reagents for achieving reliable and scalable single-cell proteomics.
RESUMO
Limiting artifacts during sample preparation can significantly increase data quality in single-cell proteomics experiments. Towards this goal, we characterize the impact of protein leakage by analyzing thousands of primary single cells that were prepared either fresh immediately after dissociation or cryopreserved and prepared at a later date. We directly identify permeabilized cells and use the data to define a signature for protein leakage. We use this signature to build a classifier for identifying damaged cells that performs accurately across cell types and species.
RESUMO
The Y-linked gene DDX3Y and its X-linked homolog DDX3X survived the evolution of the human sex chromosomes from ordinary autosomes. DDX3X encodes a multi-functional RNA helicase, with mutations causing developmental disorders and cancers. We find that, among X-linked genes with surviving Y homologs, DDX3X is extraordinarily dosage-sensitive. Studying cells of individuals with sex chromosome aneuploidy, we observe that when the number of Y chromosomes increases, DDX3X transcript levels fall; conversely, when the number of X chromosomes increases, DDX3Y transcript levels fall. In 46,XY cells, CRISPRi knockdown of either DDX3X or DDX3Y causes transcript levels of the homologous gene to rise. In 46,XX cells, chemical inhibition of DDX3X protein activity elicits an increase in DDX3X transcript levels. Thus, perturbation of either DDX3X or DDX3Y expression is buffered - by negative cross-regulation of DDX3X and DDX3Y in 46,XY cells, and by negative auto-regulation of DDX3X in 46,XX cells. DDX3X-DDX3Y cross-regulation is mediated through mRNA destabilization - as shown by metabolic labeling of newly transcribed RNA - and buffers total levels of DDX3X and DDX3Y protein in human cells. We infer that post-transcriptional auto-regulation of the ancestral (autosomal) DDX3 gene transmuted into auto- and cross-regulation of DDX3X and DDX3Y as these sex-linked genes evolved from ordinary alleles of their autosomal precursor.
RESUMO
The physiological response of a cell to stimulation depends on its proteome configuration. Therefore, the abundance variation of regulatory proteins across unstimulated single cells can be associatively linked with their response to stimulation. Here we developed an approach that leverages this association across individual cells and nuclei to systematically identify potential regulators of biological processes, followed by targeted validation. Specifically, we applied this approach to identify regulators of nucleocytoplasmic protein transport in macrophages stimulated with lipopolysaccharide (LPS). To this end, we quantified the proteomes of 3,412 individual nuclei, sampling the dynamic response to LPS treatment, and linking functional variability to proteomic variability. Minutes after the stimulation, the protein transport in individual nuclei correlated strongly with the abundance of known protein transport regulators, thus revealing the impact of natural protein variability on functional cellular response. We found that simple biophysical constraints, such as the quantity of nuclear pores, partially explain the variability in LPS-induced nucleocytoplasmic transport. Among the many proteins newly identified to be associated with the response, we selected 16 for targeted validation by knockdown. The knockdown phenotypes confirmed the inferences derived from natural protein and functional variation of single nuclei, thus demonstrating the potential of (sub-)single-cell proteomics to infer functional regulation. We expect this approach to generalize to broad applications and enhance the functional interpretability of single-cell omics data.
RESUMO
Advanced maternal age is associated with a decline in oocyte quality, which often leads to reproductive failure in humans. However, the mechanisms behind this age-related decline remain unclear. To gain insights into this phenomenon, we applied plexDIA, a multiplexed data-independent acquisition, single-cell mass spectrometry method, to analyze the proteome of oocytes from both young women and women of advanced maternal age. Our findings primarily revealed distinct proteomic profiles between immature fully grown germinal vesicle and mature metaphase II oocytes. Importantly, we further show that a woman's age is associated with changes in her oocyte proteome. Specifically, when compared to oocytes obtained from young women, advanced maternal age oocytes exhibited lower levels of the proteasome and TRiC complex, as well as other key regulators of proteostasis and meiosis. This suggests that aging adversely affects the proteostasis and meiosis networks in human oocytes. The proteins identified in this study hold potential as targets for improving oocyte quality and may guide future studies into the molecular processes underlying oocyte aging.
Assuntos
Idade Materna , Meiose , Oócitos , Proteoma , Proteômica , Proteostase , Análise de Célula Única , Humanos , Oócitos/metabolismo , Oócitos/citologia , Feminino , Meiose/fisiologia , Adulto , Proteômica/métodos , Análise de Célula Única/métodos , Proteoma/metabolismo , Complexo de Endopeptidases do Proteassoma/metabolismo , Pessoa de Meia-IdadeRESUMO
In oxygen (O2)-controlled cell culture, an indispensable tool in biological research, it is presumed that the incubator setpoint equals the O2 tension experienced by cells (i.e., pericellular O2). However, it is discovered that physioxic (5% O2) and hypoxic (1% O2) setpoints regularly induce anoxic (0% O2) pericellular tensions in both adherent and suspension cell cultures. Electron transport chain inhibition ablates this effect, indicating that cellular O2 consumption is the driving factor. RNA-seq analysis revealed that primary human hepatocytes cultured in physioxia experience ischemia-reperfusion injury due to cellular O2 consumption. A reaction-diffusion model is developed to predict pericellular O2 tension a priori, demonstrating that the effect of cellular O2 consumption has the greatest impact in smaller volume culture vessels. By controlling pericellular O2 tension in cell culture, it is found that hypoxia vs. anoxia induce distinct breast cancer transcriptomic and translational responses, including modulation of the hypoxia-inducible factor (HIF) pathway and metabolic reprogramming. Collectively, these findings indicate that breast cancer cells respond non-monotonically to low O2, suggesting that anoxic cell culture is not suitable for modeling hypoxia. Furthermore, it is shown that controlling atmospheric O2 tension in cell culture incubators is insufficient to regulate O2 in cell culture, thus introducing the concept of pericellular O2-controlled cell culture.
Assuntos
Neoplasias da Mama , Técnicas de Cultura de Células , Oxigênio , Humanos , Oxigênio/metabolismo , Neoplasias da Mama/metabolismo , Neoplasias da Mama/genética , Técnicas de Cultura de Células/métodos , Feminino , Hipóxia Celular/fisiologia , Consumo de OxigênioRESUMO
Physiological processes, such as the epithelial-mesenchymal transition (EMT), are mediated by changes in protein interactions. These changes may be better reflected in protein covariation within a cellular cluster than in the temporal dynamics of cluster-average protein abundance. To explore this possibility, we quantified proteins in single human cells undergoing EMT. Covariation analysis of the data revealed that functionally coherent protein clusters dynamically changed their protein-protein correlations without concomitant changes in the cluster-average protein abundance. These dynamics of protein-protein correlations were monotonic in time and delineated protein modules functioning in actin cytoskeleton organization, energy metabolism, and protein transport. These protein modules are defined by protein covariation within the same time point and cluster and, thus, reflect biological regulation masked by the cluster-average protein dynamics. Thus, protein correlation dynamics across single cells offers a window into protein regulation during physiological transitions.
RESUMO
Physiological processes, such as epithelial-mesenchymal transition (EMT), are mediated by changes in protein interactions. These changes may be better reflected in protein covariation within cellular cluster than in the temporal dynamics of cluster-average protein abundance. To explore this possibility, we quantified proteins in single human cells undergoing EMT. Covariation analysis of the data revealed that functionally coherent protein clusters dynamically changed their protein-protein correlations without concomitant changes in cluster-average protein abundance. These dynamics of protein-protein correlations were monotonic in time and delineated protein modules functioning in actin cytoskeleton organization, energy metabolism and protein transport. These protein modules are defined by protein covariation within the same time point and cluster and thus reflect biological regulation masked by the cluster-average protein dynamics. Thus, protein correlation dynamics across single cells offer a window into protein regulation during physiological transitions.
RESUMO
Single-cell proteomics by mass spectrometry (MS) allows quantifying proteins with high specificity and sensitivity. To increase its throughput, we developed nPOP, a method for parallel preparation of thousands of single cells in nanoliter volume droplets deposited on glass slides. Here, we describe its protocol with emphasis on its flexibility to prepare samples for different multiplexed MS methods. An implementation with plexDIA demonstrates accurate quantification of about 3,000 - 3,700 proteins per human cell. The protocol is implemented on the CellenONE instrument and uses readily available consumables, which should facilitate broad adoption. nPOP can be applied to all samples that can be processed to a single-cell suspension. It takes 1 or 2 days to prepare over 3,000 single cells. We provide metrics and software for quality control that can support the robust scaling of nPOP to higher plex reagents for achieving reliable high-throughput single-cell protein analysis.
RESUMO
Oxygen (O2) tension plays a key role in tissue function and pathophysiology. O2-controlled cell culture, in which the O2 concentration in an incubator's gas phase is controlled, is an indispensable tool to study the role of O2 in vivo. For this technique, it is presumed that the incubator setpoint is equal to the O2 tension that cells experience (i.e., pericellular O2). We discovered that physioxic (5% O2) and hypoxic (1% O2) setpoints regularly induce anoxic (0.0% O2) pericellular tensions in both adherent and suspension cell cultures. Electron transport chain inhibition ablates this effect, indicating that cellular O2 consumption is the driving factor. RNA-seq revealed that primary human hepatocytes cultured in physioxia experience ischemia-reperfusion injury due to anoxic exposure followed by rapid reoxygenation. To better understand the relationship between incubator gas phase and pericellular O2 tensions, we developed a reaction-diffusion model that predicts pericellular O2 tension a priori. This model revealed that the effect of cellular O2 consumption is greatest in smaller volume culture vessels (e.g., 96-well plate). By controlling pericellular O2 tension in cell culture, we discovered that MCF7 cells have stronger glycolytic and glutamine metabolism responses in anoxia vs. hypoxia. MCF7 also expressed higher levels of HIF2A, CD73, NDUFA4L2, etc. and lower levels of HIF1A, CA9, VEGFA, etc. in response to hypoxia vs. anoxia. Proteomics revealed that 4T1 cells had an upregulated epithelial-to-mesenchymal transition (EMT) response and downregulated reactive oxygen species (ROS) management, glycolysis, and fatty acid metabolism pathways in hypoxia vs. anoxia. Collectively, these results reveal that breast cancer cells respond non-monotonically to low O2, suggesting that anoxic cell culture is not suitable to model hypoxia. We demonstrate that controlling atmospheric O2 tension in cell culture incubators is insufficient to control O2 in cell culture and introduce the concept of pericellular O2-controlled cell culture.
RESUMO
Mass spectrometry (MS) enables specific and accurate quantification of proteins with ever-increasing throughput and sensitivity. Maximizing this potential of MS requires optimizing data acquisition parameters and performing efficient quality control for large datasets. To facilitate these objectives for data-independent acquisition (DIA), we developed a second version of our framework for data-driven optimization of MS methods (DO-MS). The DO-MS app v2.0 (do-ms.slavovlab.net) allows one to optimize and evaluate results from both label-free and multiplexed DIA (plexDIA) and supports optimizations particularly relevant to single-cell proteomics. We demonstrate multiple use cases, including optimization of duty cycle methods, peptide separation, number of survey scans per duty cycle, and quality control of single-cell plexDIA data. DO-MS allows for interactive data display and generation of extensive reports, including publication of quality figures that can be easily shared. The source code is available at github.com/SlavovLab/DO-MS.
Assuntos
Peptídeos , Proteínas , Espectrometria de Massas/métodos , Peptídeos/análise , Proteômica/métodos , SoftwareRESUMO
How ubiquitous circadian clocks orchestrate tissue-specific outputs is not well understood. Pancreatic ß cell-autonomous clocks attune insulin secretion to daily energy cycles, and desynchrony from genetic or behavioral disruptions raises type 2 diabetes risk. We show that the transcription factor DEC1, a clock component induced in adult ß cells, coordinates their glucose responsiveness by synchronizing energy metabolism and secretory gene oscillations. Dec1-ablated mice develop lifelong hypo-insulinemic diabetes, despite normal islet formation and intact circadian Clock and Bmal1 activators. DEC1, but not CLOCK/BMAL1, binds maturity-linked genes that mediate respiratory metabolism and insulin exocytosis, and Dec1 loss disrupts their transcription synchrony. Accordingly, ß-cell Dec1 ablation causes hypo-insulinemia due to immature glucose responsiveness, dampening insulin rhythms. Thus, Dec1 links circadian clockwork to the ß-cell maturation process, aligning metabolism to diurnal energy cycles.
RESUMO
Biological functions stem from coordinated interactions among proteins, nucleic acids and small molecules. Mass spectrometry technologies for reliable, high throughput single-cell proteomics will add a new modality to genomics and enable data-driven modeling of the molecular mechanisms coordinating proteins and nucleic acids at single-cell resolution. This promising potential requires estimating the reliability of measurements and computational analysis so that models can distinguish biological regulation from technical artifacts. We highlight different measurement modes that can support single-cell proteogenomic analysis and how to estimate their reliability. We then discuss approaches for developing both abstract and mechanistic models that aim to biologically interpret the measured differences across modalities, including specific applications to directed stem cell differentiation and to inferring protein interactions in cancer cells from the buffing of DNA copy-number variations. Single-cell proteogenomic data will support mechanistic models of direct molecular interactions that will provide generalizable and predictive representations of biological systems.
RESUMO
Many developmental processes are regulated post-transcriptionally. Such post-transcriptional regulatory mechanisms can now be analyzed by robust single-cell mass spectrometry methods that allow accurate quantification of proteins and their modification in single cells. These methods can enable quantitative exploration of protein synthesis and degradation mechanisms that contribute to developmental cell fate specification. Furthermore, they may support functional analysis of protein conformations and activities in single cells, and thus link protein functions to developmental processes. This Spotlight provides an accessible introduction to single-cell mass spectrometry methods and suggests initial biological questions that are ripe for investigation.
Assuntos
Regulação da Expressão Gênica , Proteômica , Diferenciação Celular , Espectrometria de Massas , Biossíntese de ProteínasRESUMO
Stem cell differentiation is a highly dynamic process involving pervasive changes in gene expression. The large majority of existing studies has characterized differentiation at the level of individual molecular profiles, such as the transcriptome or the proteome. To obtain a more comprehensive view, we measured protein, mRNA and microRNA abundance during retinoic acid-driven differentiation of mouse embryonic stem cells. We found that mRNA and protein abundance are typically only weakly correlated across time. To understand this finding, we developed a hierarchical dynamical model that allowed us to integrate all data sets. This model was able to explain mRNA-protein discordance for most genes and identified instances of potential microRNA-mediated regulation. Overexpression or depletion of microRNAs identified by the model, followed by RNA sequencing and protein quantification, were used to follow up on the predictions of the model. Overall, our study shows how multi-omics integration by a dynamical model could be used to nominate candidate regulators.
Assuntos
MicroRNAs , Multiômica , Animais , Camundongos , Diferenciação Celular/genética , MicroRNAs/genética , Transcriptoma , RNA Mensageiro/genéticaRESUMO
Major aims of single-cell proteomics include increasing the consistency, sensitivity and depth of protein quantification, especially for proteins and modifications of biological interest. Here, to simultaneously advance all these aims, we developed prioritized Single-Cell ProtEomics (pSCoPE). pSCoPE consistently analyzes thousands of prioritized peptides across all single cells (thus increasing data completeness) while maximizing instrument time spent analyzing identifiable peptides, thus increasing proteome depth. These strategies increased the sensitivity, data completeness and proteome coverage over twofold. The gains enabled quantifying protein variation in untreated and lipopolysaccharide-treated primary macrophages. Within each condition, proteins covaried within functional sets, including phagosome maturation and proton transport, similarly across both treatment conditions. This covariation is coupled to phenotypic variability in endocytic activity. pSCoPE also enabled quantifying proteolytic products, suggesting a gradient of cathepsin activities within a treatment condition. pSCoPE is freely available and widely applicable, especially for analyzing proteins of interest without sacrificing proteome coverage. Support for pSCoPE is available at http://scp.slavovlab.net/pSCoPE .