RESUMO
A major scientific drive is to characterize the protein-coding genome as it provides the primary basis for the study of human health. But the fundamental question remains: what has been missed in prior genomic analyses? Over the past decade, the translation of non-canonical open reading frames (ncORFs) has been observed across human cell types and disease states, with major implications for proteomics, genomics, and clinical science. However, the impact of ncORFs has been limited by the absence of a large-scale understanding of their contribution to the human proteome. Here, we report the collaborative efforts of stakeholders in proteomics, immunopeptidomics, Ribo-seq ORF discovery, and gene annotation, to produce a consensus landscape of protein-level evidence for ncORFs. We show that at least 25% of a set of 7,264 ncORFs give rise to translated gene products, yielding over 3,000 peptides in a pan-proteome analysis encompassing 3.8 billion mass spectra from 95,520 experiments. With these data, we developed an annotation framework for ncORFs and created public tools for researchers through GENCODE and PeptideAtlas. This work will provide a platform to advance ncORF-derived proteins in biomedical discovery and, beyond humans, diverse animals and plants where ncORFs are similarly observed.
RESUMO
The detection of mitochondrial DNA (mtDNA) mutations in single cells holds considerable potential to define clonal relationships coupled with information on cell state in humans. Previous methods focused on higher heteroplasmy mutations that are limited in number and can be influenced by functional selection, introducing biases for lineage tracing. Although more challenging to detect, intermediate to low heteroplasmy mtDNA mutations are valuable due to their high diversity, abundance, and lower propensity to selection. To enhance mtDNA mutation detection and facilitate fine-scale lineage tracing, we developed the single-cell Regulatory multi-omics with Deep Mitochondrial mutation profiling (ReDeeM) approach, an integrated experimental and computational framework. Recently, some concerns have been raised about the analytical workflow in the ReDeeM framework. Specifically, it was noted that the mutations detected in a single molecule per cell are enriched on edges of mtDNA molecules, suggesting they resemble artifacts reported in other sequencing approaches. It was then proposed that all mutations found in one molecule per cell should be removed. We detail our error correction method, demonstrating that the observed edge mutations are distinct from previously reported sequencing artifacts. We further show that the proposed removal leads to massive elimination of bona fide and informative mutations. Indeed, mutations accumulating on edges impact a minority of all mutation calls (for example, in hematopoietic stem cells, the excess mutations on the edge account for only 4.3%-7.6% of the total). Recognizing the value of addressing edge mutations even after applying consensus correction, we provide an additional filtering option in the ReDeeM-R package. This approach effectively eliminates the position biases, leads to a mutational signature indistinguishable from bona fide mitochondrial mutations, and removes excess low molecule high connectedness mutations. Importantly, this option preserves the large majority of unique mutations identified by ReDeeM, maintaining the ability of ReDeeM to provide a more than 10-fold increase in variant detection compared to previous methods. Additionally, the cells remain well-connected. While there is room for further refinement in mutation calling strategies, the significant advances and biological insights provided by the ReDeeM framework are unique and remain intact. We hope that this detailed discussion and analysis enables the community to employ this approach and contribute to its further development.
RESUMO
Cells must adapt to environmental changes to maintain homeostasis. One of the most striking environmental adaptations is entry into hibernation during which core body temperature can decrease from 37°C to as low at 4°C. How mammalian cells, which evolved to optimally function within a narrow range of temperatures, adapt to this profound decrease in temperature remains poorly understood. In this study, we conducted the first genome-scale CRISPR-Cas9 screen in cells derived from Syrian hamster, a facultative hibernator, as well as human cells to investigate the genetic basis of cold tolerance in a hibernator and a non-hibernator in an unbiased manner. Both screens independently revealed glutathione peroxidase 4 (GPX4), a selenium-containing enzyme, and associated proteins as critical for cold tolerance. We utilized genetic and pharmacological approaches to demonstrate that GPX4 is active in the cold and its catalytic activity is required for cold tolerance. Furthermore, we show that the role of GPX4 as a suppressor of cold-induced cell death extends across hibernating species, including 13-lined ground squirrels and greater horseshoe bats, highlighting the evolutionary conservation of this mechanism of cold tolerance. This study identifies GPX4 as a central modulator of mammalian cold tolerance and advances our understanding of the evolved mechanisms by which cells mitigate cold-associated damage-one of the most common challenges faced by cells and organisms in nature.
RESUMO
Single cell CRISPR screens such as Perturb-seq enable transcriptomic profiling of genetic perturbations at scale. However, the data produced by these screens are often noisy due to cost and technical constraints, limiting power to detect true effects with conventional differential expression analyses. Here, we introduce TRanscriptome-wide Analysis of Differential Expression (TRADE), a statistical framework which estimates the transcriptome-wide distribution of true differential expression effects from noisy gene-level measurements. Within TRADE, we derive multiple novel, interpretable statistical metrics, including the "transcriptome-wide impact", an estimator of the overall transcriptional effect of a perturbation which is stable across sampling depths. We analyze new and published large-scale Perturb-seq datasets to show that many true transcriptional effects are not statistically significant, but detectable in aggregate with TRADE. In a genome-scale Perturb-seq screen, we find that a typical gene perturbation affects an estimated 45 genes, whereas a typical essential gene perturbation affects over 500 genes. An advantage of our approach is its ability to compare the transcriptomic effects of genetic perturbations across contexts and dosages despite differences in power. We use this ability to identify perturbations with cell-type dependent effects and to find examples of perturbations where transcriptional responses are not only larger in magnitude, but also qualitatively different, as a function of dosage. Lastly, we expand our analysis to case/control comparison of gene expression for neuropsychiatric conditions, finding that transcriptomic effect correlations are greater than genetic correlations for these diagnoses. TRADE lays an analytic foundation for the systematic comparison of genetic perturbation atlases, as well as differential expression experiments more broadly.
RESUMO
Control of cellular identity requires coordination of developmental programs with environmental factors such as nutrient availability, suggesting that perturbing metabolism can alter cell state. Here, we find that nucleotide depletion and DNA replication stress drive differentiation in human and murine normal and transformed hematopoietic systems, including patient-derived acute myeloid leukemia (AML) xenografts. These cell state transitions begin during S phase and are independent of ATR/ATM checkpoint signaling, double-stranded DNA break formation, and changes in cell cycle length. In systems where differentiation is blocked by oncogenic transcription factor expression, replication stress activates primed regulatory loci and induces lineage-appropriate maturation genes despite the persistence of progenitor programs. Altering the baseline cell state by manipulating transcription factor expression causes replication stress to induce genes specific for alternative lineages. The ability of replication stress to selectively activate primed maturation programs across different contexts suggests a general mechanism by which changes in metabolism can promote lineage-appropriate cell state transitions.
Assuntos
Diferenciação Celular , Replicação do DNA , Replicação do DNA/genética , Animais , Humanos , Diferenciação Celular/genética , Camundongos , Nucleotídeos/metabolismo , Nucleotídeos/genética , Linhagem da Célula/genética , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/patologia , Leucemia Mieloide Aguda/metabolismo , Fase S/genética , Transdução de SinaisRESUMO
Prion disease is caused by misfolding of the prion protein (PrP) into pathogenic self-propagating conformations, leading to rapid-onset dementia and death. However, elimination of endogenous PrP halts prion disease progression. In this study, we describe Coupled Histone tail for Autoinhibition Release of Methyltransferase (CHARM), a compact, enzyme-free epigenetic editor capable of silencing transcription through programmable DNA methylation. Using a histone H3 tail-Dnmt3l fusion, CHARM recruits and activates endogenous DNA methyltransferases, thereby reducing transgene size and cytotoxicity. When delivered to the mouse brain by systemic injection of adeno-associated virus (AAV), Prnp-targeted CHARM ablates PrP expression across the brain. Furthermore, we have temporally limited editor expression by implementing a kinetically tuned self-silencing approach. CHARM potentially represents a broadly applicable strategy to suppress pathogenic proteins, including those implicated in other neurodegenerative diseases.
Assuntos
Encéfalo , Metilação de DNA , Dependovirus , Inativação Gênica , Histonas , Proteínas Priônicas , Animais , Humanos , Camundongos , Encéfalo/metabolismo , Dependovirus/genética , DNA (Citosina-5-)-Metiltransferases/metabolismo , DNA (Citosina-5-)-Metiltransferases/genética , Histonas/metabolismo , Doenças Priônicas/genética , Doenças Priônicas/metabolismo , Proteínas Priônicas/genética , Proteínas Priônicas/metabolismo , TransgenesRESUMO
Long noncoding RNAs (lncRNAs) account for the largest portion of RNA from the transcriptome, yet most of their functions remain unknown. Here, we performed two independent high-throughput CRISPRi screens to understand the role of lncRNAs in monocyte function and differentiation. The first was a reporter-based screen to identify lncRNAs that regulate TLR4-NFkB signaling in human monocytes and the second screen identified lncRNAs involved in monocyte to macrophage differentiation. We successfully identified numerous noncoding and protein-coding genes that can positively or negatively regulate inflammation and differentiation. To understand the functional roles of lncRNAs in both processes, we chose to further study the lncRNA LOUP [lncRNA originating from upstream regulatory element of SPI1 (also known as PU.1)], as it emerged as a top hit in both screens. Not only does LOUP regulate its neighboring gene, the myeloid fate-determining factor SPI1, thereby affecting monocyte to macrophage differentiation, but knockdown of LOUP leads to a broad upregulation of NFkB-targeted genes at baseline and upon TLR4-NFkB activation. LOUP also harbors three small open reading frames capable of being translated and are responsible for LOUP's ability to negatively regulate TLR4/NFkB signaling. This work emphasizes the value of high-throughput screening to rapidly identify functional lncRNAs in the innate immune system.
Assuntos
Diferenciação Celular , Inflamação , Macrófagos , Monócitos , RNA Longo não Codificante , Transdução de Sinais , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Humanos , Macrófagos/metabolismo , Macrófagos/citologia , Diferenciação Celular/genética , Monócitos/metabolismo , Monócitos/citologia , Inflamação/genética , Inflamação/metabolismo , Receptor 4 Toll-Like/metabolismo , Receptor 4 Toll-Like/genética , NF-kappa B/metabolismo , Transativadores/metabolismo , Transativadores/genética , Proteínas Proto-Oncogênicas/metabolismo , Proteínas Proto-Oncogênicas/genética , Sistemas CRISPR-Cas , Regulação da Expressão GênicaRESUMO
Learning and memory require activity-induced changes in dendritic translation, but which mRNAs are involved and how they are regulated are unclear. In this study, to monitor how depolarization impacts local dendritic biology, we employed a dendritically targeted proximity labeling approach followed by crosslinking immunoprecipitation, ribosome profiling and mass spectrometry. Depolarization of primary cortical neurons with KCl or the glutamate agonist DHPG caused rapid reprogramming of dendritic protein expression, where changes in dendritic mRNAs and proteins are weakly correlated. For a subset of pre-localized messages, depolarization increased the translation of upstream open reading frames (uORFs) and their downstream coding sequences, enabling localized production of proteins involved in long-term potentiation, cell signaling and energy metabolism. This activity-dependent translation was accompanied by the phosphorylation and recruitment of the non-canonical translation initiation factor eIF4G2, and the translated uORFs were sufficient to confer depolarization-induced, eIF4G2-dependent translational control. These studies uncovered an unanticipated mechanism by which activity-dependent uORF translational control by eIF4G2 couples activity to local dendritic remodeling.
Assuntos
Dendritos , Fator de Iniciação Eucariótico 4G , Fases de Leitura Aberta , Biossíntese de Proteínas , Animais , Camundongos , Células Cultivadas , Dendritos/metabolismo , Fator de Iniciação Eucariótico 4G/metabolismo , Neurônios/metabolismo , Fases de Leitura Aberta/genética , Cloreto de Potássio/farmacologia , Biossíntese de Proteínas/fisiologiaRESUMO
Mitochondrial outer membrane âº-helical proteins play critical roles in mitochondrial-cytoplasmic communication, but the rules governing the targeting and insertion of these biophysically diverse proteins remain unknown. Here, we first defined the complement of required mammalian biogenesis machinery through genome-wide CRISPRi screens using topologically distinct membrane proteins. Systematic analysis of nine identified factors across 21 diverse âº-helical substrates reveals that these components are organized into distinct targeting pathways that act on substrates based on their topology. NAC is required for the efficient targeting of polytopic proteins, whereas signal-anchored proteins require TTC1, a cytosolic chaperone that physically engages substrates. Biochemical and mutational studies reveal that TTC1 employs a conserved TPR domain and a hydrophobic groove in its C-terminal domain to support substrate solubilization and insertion into mitochondria. Thus, the targeting of diverse mitochondrial membrane proteins is achieved through topological triaging in the cytosol using principles with similarities to ER membrane protein biogenesis systems.
Assuntos
Membranas Mitocondriais , Proteínas de Saccharomyces cerevisiae , Animais , Membranas Mitocondriais/metabolismo , Mitocôndrias/genética , Mitocôndrias/metabolismo , Proteínas de Membrana/genética , Proteínas de Membrana/metabolismo , Chaperonas Moleculares/genética , Chaperonas Moleculares/metabolismo , Mutação , Proteínas Mitocondriais/genética , Proteínas Mitocondriais/metabolismo , Transporte Proteico , Proteínas de Saccharomyces cerevisiae/metabolismo , Mamíferos/metabolismoRESUMO
The human blood system is maintained through the differentiation and massive amplification of a limited number of long-lived haematopoietic stem cells (HSCs)1. Perturbations to this process underlie diverse diseases, but the clonal contributions to human haematopoiesis and how this changes with age remain incompletely understood. Although recent insights have emerged from barcoding studies in model systems2-5, simultaneous detection of cell states and phylogenies from natural barcodes in humans remains challenging. Here we introduce an improved, single-cell lineage-tracing system based on deep detection of naturally occurring mitochondrial DNA mutations with simultaneous readout of transcriptional states and chromatin accessibility. We use this system to define the clonal architecture of HSCs and map the physiological state and output of clones. We uncover functional heterogeneity in HSC clones, which is stable over months and manifests as both differences in total HSC output and biases towards the production of different mature cell types. We also find that the diversity of HSC clones decreases markedly with age, leading to an oligoclonal structure with multiple distinct clonal expansions. Our study thus provides a clonally resolved and cell-state-aware atlas of human haematopoiesis at single-cell resolution, showing an unappreciated functional diversity of human HSC clones and, more broadly, paving the way for refined studies of clonal dynamics across a range of tissues in human health and disease.
Assuntos
Linhagem da Célula , Hematopoese , Células-Tronco Hematopoéticas , Humanos , Cromatina/genética , Cromatina/metabolismo , Células Clonais/classificação , Células Clonais/citologia , Células Clonais/metabolismo , DNA Mitocondrial/genética , Células-Tronco Hematopoéticas/classificação , Células-Tronco Hematopoéticas/citologia , Células-Tronco Hematopoéticas/metabolismo , Mutação , Análise de Célula Única , Transcrição Gênica , EnvelhecimentoRESUMO
Genomic and proteomic screens have identified numerous host factors of SARS-CoV-2, but efficient delineation of their molecular roles during infection remains a challenge. Here we use Perturb-seq, combining genetic perturbations with a single-cell readout, to investigate how inactivation of host factors changes the course of SARS-CoV-2 infection and the host response in human lung epithelial cells. Our high-dimensional data resolve complex phenotypes such as shifts in the stages of infection and modulations of the interferon response. However, only a small percentage of host factors showed such phenotypes upon perturbation. We further identified the NF-κB inhibitor IκBα (NFKBIA), as well as the translation factors EIF4E2 and EIF4H as strong host dependency factors acting early in infection. Overall, our study provides massively parallel functional characterization of host factors of SARS-CoV-2 and quantitatively defines their roles both in virus-infected and bystander cells.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , SARS-CoV-2/genética , COVID-19/genética , Proteômica , Pulmão , Células EpiteliaisRESUMO
Mapping genetic interactions is essential for determining gene function and defining novel biological pathways. We report a simple to use CRISPR interference (CRISPRi) based platform, compatible with Fluorescence Activated Cell Sorting (FACS)-based reporter screens, to query epistatic relationships at scale. This is enabled by a flexible dual-sgRNA library design that allows for the simultaneous delivery and selection of a fixed sgRNA and a second randomized guide, comprised of a genome-wide library, with a single transduction. We use this approach to identify epistatic relationships for a defined biological pathway, showing both increased sensitivity and specificity than traditional growth screening approaches.
Assuntos
Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , RNA Guia de Sistemas CRISPR-Cas , Biblioteca Gênica , Genoma , Sistemas CRISPR-CasRESUMO
Mitochondrial outer membrane α-helical proteins play critical roles in mitochondrial-cytoplasmic communication, but the rules governing the targeting and insertion of these biophysically diverse substrates remain unknown. Here, we first defined the complement of required mammalian biogenesis machinery through genome-wide CRISPRi screens using topologically distinct membrane proteins. Systematic analysis of nine identified factors across 21 diverse α-helical substrates reveals that these components are organized into distinct targeting pathways which act on substrates based on their topology. NAC is required for efficient targeting of polytopic proteins whereas signal-anchored proteins require TTC1, a novel cytosolic chaperone which physically engages substrates. Biochemical and mutational studies reveal that TTC1 employs a conserved TPR domain and a hydrophobic groove in its C-terminal domain to support substrate solubilization and insertion into mitochondria. Thus, targeting of diverse mitochondrial membrane proteins is achieved through topological triaging in the cytosol using principles with similarities to ER membrane protein biogenesis systems.
RESUMO
Comparative studies of great apes provide a window into our evolutionary past, but the extent and identity of cellular differences that emerged during hominin evolution remain largely unexplored. We established a comparative loss-of-function approach to evaluate whether human cells exhibit distinct genetic dependencies. By performing genome-wide CRISPR interference screens in human and chimpanzee pluripotent stem cells, we identified 75 genes with species-specific effects on cellular proliferation. These genes comprised coherent processes, including cell-cycle progression and lysosomal signaling, which we determined to be human-derived by comparison with orangutan cells. Human-specific robustness to CDK2 and CCNE1 depletion persisted in neural progenitor cells and cerebral organoids, supporting the G1-phase length hypothesis as a potential evolutionary mechanism in human brain expansion. Our findings demonstrate that evolutionary changes in human cells reshaped the landscape of essential genes and establish a platform for systematically uncovering latent cellular and molecular differences between species.
Assuntos
Hominidae , Células-Tronco Neurais , Células-Tronco Pluripotentes , Células-Tronco , Animais , Humanos , Pan troglodytes/genéticaRESUMO
Translation of mRNAs containing premature termination codons (PTCs) results in truncated protein products with deleterious effects. Nonsense-mediated decay (NMD) is a surveillance pathway responsible for detecting PTC containing transcripts. Although the molecular mechanisms governing mRNA degradation have been extensively studied, the fate of the nascent protein product remains largely uncharacterized. Here, we use a fluorescent reporter system in mammalian cells to reveal a selective degradation pathway specifically targeting the protein product of an NMD mRNA. We show that this process is post-translational and dependent on the ubiquitin proteasome system. To systematically uncover factors involved in NMD-linked protein quality control, we conducted genome-wide flow cytometry-based screens. Our screens recovered known NMD factors but suggested that protein degradation did not depend on the canonical ribosome-quality control (RQC) pathway. A subsequent arrayed screen demonstrated that protein and mRNA branches of NMD rely on a shared recognition event. Our results establish the existence of a targeted pathway for nascent protein degradation from PTC containing mRNAs, and provide a reference for the field to identify and characterize required factors.
Assuntos
Mamíferos , Degradação do RNAm Mediada por Códon sem Sentido , Animais , Degradação do RNAm Mediada por Códon sem Sentido/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Mamíferos/metabolismoRESUMO
The translation initiation machinery and the ribosome orchestrate a highly dynamic scanning process to distinguish proper start codons from surrounding nucleotide sequences. Here, we performed genome-wide CRISPRi screens in human K562 cells to systematically identify modulators of the frequency of translation initiation at near-cognate start codons. We observed that depletion of any eIF3 core subunit promoted near-cognate start codon usage, though sensitivity thresholds of each subunit to sgRNA-mediated depletion varied considerably. Double sgRNA depletion experiments suggested that enhanced near-cognate usage in eIF3D depleted cells required canonical eIF4E cap-binding and was not driven by eIF2A or eIF2D-dependent leucine tRNA initiation. We further characterized the effects of eIF3D depletion and found that the N-terminus of eIF3D was strictly required for accurate start codon selection, whereas disruption of the cap-binding properties of eIF3D had no effect. Lastly, depletion of eIF3D activated TNFα signaling via NF-κB and the interferon gamma response. Similar transcriptional profiles were observed upon knockdown of eIF1A and eIF4G2, which also promoted near-cognate start codon usage, suggesting that enhanced near-cognate usage could potentially contribute to NF-κB activation. Our study thus provides new avenues to study the mechanisms and consequences of alternative start codon usage.
Assuntos
Fator de Iniciação 3 em Eucariotos , RNA Guia de Sistemas CRISPR-Cas , Humanos , Códon de Iniciação/metabolismo , Fator de Iniciação 3 em Eucariotos/genética , Fator de Iniciação 3 em Eucariotos/metabolismo , NF-kappa B/genética , NF-kappa B/metabolismo , Iniciação Traducional da Cadeia Peptídica , Biossíntese de Proteínas , Ribossomos/metabolismoRESUMO
Comparative studies of great apes provide a window into our evolutionary past, but the extent and identity of cellular differences that emerged during hominin evolution remain largely unexplored. We established a comparative loss-of-function approach to evaluate whether changes in human cells alter requirements for essential genes. By performing genome-wide CRISPR interference screens in human and chimpanzee pluripotent stem cells, we identified 75 genes with species-specific effects on cellular proliferation. These genes comprised coherent processes, including cell cycle progression and lysosomal signaling, which we determined to be human-derived by comparison with orangutan cells. Human-specific robustness to CDK2 and CCNE1 depletion persisted in neural progenitor cells, providing support for the G1-phase length hypothesis as a potential evolutionary mechanism in human brain expansion. Our findings demonstrate that evolutionary changes in human cells can reshape the landscape of essential genes and establish a platform for systematically uncovering latent cellular and molecular differences between species.
RESUMO
Current single-cell RNA-sequencing approaches have limitations that stem from the microfluidic devices or fluid handling steps required for sample processing. We develop a method that does not require specialized microfluidic devices, expertise or hardware. Our approach is based on particle-templated emulsification, which allows single-cell encapsulation and barcoding of cDNA in uniform droplet emulsions with only a vortexer. Particle-templated instant partition sequencing (PIP-seq) accommodates a wide range of emulsification formats, including microwell plates and large-volume conical tubes, enabling thousands of samples or millions of cells to be processed in minutes. We demonstrate that PIP-seq produces high-purity transcriptomes in mouse-human mixing studies, is compatible with multiomics measurements and can accurately characterize cell types in human breast tissue compared to a commercial microfluidic platform. Single-cell transcriptional profiling of mixed phenotype acute leukemia using PIP-seq reveals the emergence of heterogeneity within chemotherapy-resistant cell subsets that were hidden by standard immunophenotyping. PIP-seq is a simple, flexible and scalable next-generation workflow that extends single-cell sequencing to new applications.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Microfluídica , Humanos , Animais , Camundongos , Microfluídica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Célula Única/métodos , Genômica/métodos , Transcriptoma/genéticaRESUMO
Mapping genetic interactions is essential for determining gene function and defining novel biological pathways. We report a simple to use CRISPR interference (CRISPRi) based platform, compatible with Fluorescence Activated Cell Sorting (FACS)-based reporter screens, to query epistatic relationships at scale. This is enabled by a flexible dual-sgRNA library design that allows for the simultaneous delivery and selection of a fixed sgRNA and a second randomized guide, comprised of a genome-wide library, with a single transduction. We use this approach to identify epistatic relationships for a defined biological pathway, showing both increased sensitivity and specificity than traditional growth screening approaches.