RESUMEN
A central goal of genetics is to define the relationships between genotypes and phenotypes. High-content phenotypic screens such as Perturb-seq (CRISPR-based screens with single-cell RNA-sequencing readouts) enable massively parallel functional genomic mapping but, to date, have been used at limited scales. Here, we perform genome-scale Perturb-seq targeting all expressed genes with CRISPR interference (CRISPRi) across >2.5 million human cells. We use transcriptional phenotypes to predict the function of poorly characterized genes, uncovering new regulators of ribosome biogenesis (including CCDC86, ZNF236, and SPATA5L1), transcription (C7orf26), and mitochondrial respiration (TMEM242). In addition to assigning gene function, single-cell transcriptional phenotypes allow for in-depth dissection of complex cellular phenomena-from RNA processing to differentiation. We leverage this ability to systematically identify genetic drivers and consequences of aneuploidy and to discover an unanticipated layer of stress-specific regulation of the mitochondrial genome. Our information-rich genotype-phenotype map reveals a multidimensional portrait of gene and cellular function.
Asunto(s)
Genómica , Análisis de la Célula Individual , Sistemas CRISPR-Cas/genética , Mapeo Cromosómico , Genotipo , Fenotipo , Análisis de la Célula Individual/métodosRESUMEN
Inflammation of non-barrier immunologically quiescent tissues is associated with a massive influx of blood-borne innate and adaptive immune cells. Cues from the latter are likely to alter and expand activated states of the resident cells. However, local communications between immigrant and resident cell types in human inflammatory disease remain poorly understood. Here, we explored drivers of fibroblast-like synoviocyte (FLS) heterogeneity in inflamed joints of patients with rheumatoid arthritis using paired single-cell RNA and ATAC sequencing, multiplexed imaging and spatial transcriptomics along with in vitro modeling of cell-extrinsic factor signaling. These analyses suggest that local exposures to myeloid and T cell-derived cytokines, TNF, IFN-γ, IL-1ß or lack thereof, drive four distinct FLS states some of which closely resemble fibroblast states in other disease-affected tissues including skin and colon. Our results highlight a role for concurrent, spatially distributed cytokine signaling within the inflamed synovium.
Asunto(s)
Artritis Reumatoide , Humanos , Células Cultivadas , Artritis Reumatoide/genética , Membrana Sinovial , Citocinas/metabolismo , FibroblastosRESUMEN
Genetic screens help infer gene function in mammalian cells, but it has remained difficult to assay complex phenotypes-such as transcriptional profiles-at scale. Here, we develop Perturb-seq, combining single-cell RNA sequencing (RNA-seq) and clustered regularly interspaced short palindromic repeats (CRISPR)-based perturbations to perform many such assays in a pool. We demonstrate Perturb-seq by analyzing 200,000 cells in immune cells and cell lines, focusing on transcription factors regulating the response of dendritic cells to lipopolysaccharide (LPS). Perturb-seq accurately identifies individual gene targets, gene signatures, and cell states affected by individual perturbations and their genetic interactions. We posit new functions for regulators of differentiation, the anti-viral response, and mitochondrial function during immune activation. By decomposing many high content measurements into the effects of perturbations, their interactions, and diverse cell metadata, Perturb-seq dramatically increases the scope of pooled genomic assays.
Asunto(s)
Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Animales , Ciclo Celular , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Retroalimentación , Perfilación de la Expresión Génica , Técnicas de Silenciamiento del Gen , Humanos , Células K562 , Ratones , Ratones Transgénicos , Factores de Transcripción/metabolismoRESUMEN
Functional genomics efforts face tradeoffs between number of perturbations examined and complexity of phenotypes measured. We bridge this gap with Perturb-seq, which combines droplet-based single-cell RNA-seq with a strategy for barcoding CRISPR-mediated perturbations, allowing many perturbations to be profiled in pooled format. We applied Perturb-seq to dissect the mammalian unfolded protein response (UPR) using single and combinatorial CRISPR perturbations. Two genome-scale CRISPR interference (CRISPRi) screens identified genes whose repression perturbs ER homeostasis. Subjecting â¼100 hits to Perturb-seq enabled high-precision functional clustering of genes. Single-cell analyses decoupled the three UPR branches, revealed bifurcated UPR branch activation among cells subject to the same perturbation, and uncovered differential activation of the branches across hits, including an isolated feedback loop between the translocon and IRE1α. These studies provide insight into how the three sensors of ER homeostasis monitor distinct types of stress and highlight the ability of Perturb-seq to dissect complex cellular responses.
Asunto(s)
Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Animales , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Endorribonucleasas , Retroalimentación , Humanos , Modelos Moleculares , Proteínas Serina-Treonina Quinasas , ARN Guía de Kinetoplastida/metabolismo , Transcripción Genética , Respuesta de Proteína DesplegadaRESUMEN
Ontogeny describes the emergence of complex multicellular organisms from single totipotent cells. This field is particularly challenging in mammals, owing to the indeterminate relationship between self-renewal and differentiation, variation in progenitor field sizes, and internal gestation in these animals. Here we present a flexible, high-information, multi-channel molecular recorder with a single-cell readout and apply it as an evolving lineage tracer to assemble mouse cell-fate maps from fertilization through gastrulation. By combining lineage information with single-cell RNA sequencing profiles, we recapitulate canonical developmental relationships between different tissue types and reveal the nearly complete transcriptional convergence of endodermal cells of extra-embryonic and embryonic origins. Finally, we apply our cell-fate maps to estimate the number of embryonic progenitor cells and their degree of asymmetric partitioning during specification. Our approach enables massively parallel, high-resolution recording of lineage and other information in mammalian systems, which will facilitate the construction of a quantitative framework for understanding developmental processes.
Asunto(s)
Embrión de Mamíferos/embriología , Embrión de Mamíferos/metabolismo , Desarrollo Embrionario/genética , Animales , Diferenciación Celular/genética , Linaje de la Célula/genética , Embrión de Mamíferos/citología , Células Madre Embrionarias/citología , Células Madre Embrionarias/metabolismo , Endodermo/embriología , Endodermo/metabolismo , Femenino , Fertilización , Gastrulación , Regulación del Desarrollo de la Expresión Génica/genética , Masculino , Ratones , Especificidad de Órganos/genética , Fenotipo , Análisis de Secuencia de ARN , Análisis de la Célula IndividualRESUMEN
Microbes transiently differentiate into distinct, specialized cell types to generate functional diversity and cope with changing environmental conditions. Though alternate programs often entail radically different physiological and morphological states, recent single-cell studies have revealed that these crucial decisions are often left to chance. In these cases, the underlying genetic circuits leverage the intrinsic stochasticity of intracellular chemistry to drive transition between states. Understanding how these circuits transform transient gene expression fluctuations into lasting phenotypic programs will require a combination of quantitative modeling and extensive, time-resolved observation of switching events in single cells. In this article, we survey microbial cell fate decisions demonstrated to involve a random element, describe theoretical frameworks for understanding stochastic switching between states, and highlight recent advances in microfluidics that will enable characterization of key dynamic features of these circuits.
Asunto(s)
Bacterias/citología , Fenómenos Fisiológicos Bacterianos , Bacterias/química , Bacterias/clasificación , Bacterias/genética , Análisis de la Célula IndividualRESUMEN
Genetically identical cells sharing an environment can display markedly different phenotypes. It is often unclear how much of this variation derives from chance, external signals, or attempts by individual cells to exert autonomous phenotypic programs. By observing thousands of cells for hundreds of consecutive generations under constant conditions, we dissect the stochastic decision between a solitary, motile state and a chained, sessile state in Bacillus subtilis. We show that the motile state is 'memoryless', exhibiting no autonomous control over the time spent in the state. In contrast, the time spent as connected chains of cells is tightly controlled, enforcing coordination among related cells in the multicellular state. We show that the three-protein regulatory circuit governing the decision is modular, as initiation and maintenance of chaining are genetically separable functions. As stimulation of the same initiating pathway triggers biofilm formation, we argue that autonomous timing allows a trial commitment to multicellularity that external signals could extend.
Asunto(s)
Bacillus subtilis/citología , Bacillus subtilis/fisiología , Bacillus subtilis/genética , Modelos Biológicos , Movimiento , Fenotipo , Procesos Estocásticos , Factores de TiempoRESUMEN
Biochemical processes are inherently stochastic, creating molecular fluctuations in otherwise identical cells. Such "noise" is widespread but has proven difficult to analyze because most systems are sparsely characterized at the single cell level and because nonlinear stochastic models are analytically intractable. Here, we exactly relate average abundances, lifetimes, step sizes, and covariances for any pair of components in complex stochastic reaction systems even when the dynamics of other components are left unspecified. Using basic mathematical inequalities, we then establish bounds for whole classes of systems. These bounds highlight fundamental trade-offs that show how efficient assembly processes must invariably exhibit large fluctuations in subunit levels and how eliminating fluctuations in one cellular component requires creating heterogeneity in another.
Asunto(s)
Modelos Biológicos , Biología de Sistemas , Dinámicas no Lineales , ARN Mensajero/genética , ARN Mensajero/metabolismo , Procesos EstocásticosRESUMEN
Single-cell CRISPR screens link genetic perturbations to transcriptional states, but high-throughput methods connecting these induced changes to their regulatory foundations are limited. Here we introduce Multiome Perturb-seq, extending single-cell CRISPR screens to simultaneously measure perturbation-induced changes in gene expression and chromatin accessibility. We apply Multiome Perturb-seq in a CRISPRi screen of 13 chromatin remodelers in human RPE-1 cells, achieving efficient assignment of sgRNA identities to single nuclei via an improved method for capturing barcode transcripts from nuclear RNA. We organize expression and accessibility measurements into coherent programs describing the integrated effects of perturbations on cell state, finding that ARID1A and SUZ12 knockdowns induce programs enriched for developmental features. Pseudotime analysis of perturbations connects accessibility changes to changes in gene expression, highlighting the value of multimodal profiling. Overall, our method provides a scalable and simply implemented system to dissect the regulatory logic underpinning cell state.
RESUMEN
Cell atlas projects have nominated recurrent transcriptional states as drivers of biological processes and disease, but their origins, regulation, and properties remain unclear. To enable complementary functional studies, we developed a scalable approach for recapitulating cell states in vitro using CRISPR activation (CRISPRa) Perturb-seq. Aided by a novel multiplexing method, we activated 1,836 transcription factors in two cell types. Measuring 21,958 perturbations showed that CRISPRa activated targets within physiological ranges, that epigenetic features predicted activatable genes, and that the protospacer seed region drove an off-target effect. Perturbations recapitulated in vivo fibroblast states, including universal and inflammatory states, and identified KLF4 and KLF5 as key regulators of the universal state. Inducing the universal state suppressed disease-associated states, highlighting its therapeutic potential. Our findings cement CRISPRa as a tool for perturbing differentiated cells and indicate that in vivo states can be elicited via perturbation, enabling studies of clinically relevant states ex vivo.
RESUMEN
A model system for investigating how developmental regulatory networks determine cell fate is spore formation in Bacillus subtilis. The master regulator for sporulation is Spo0A, which is activated by phosphorylation via a phosphorelay that is subject to three positive feedback loops. The ultimate decision to sporulate is, however, stochastic in that only a portion of the population sporulates even under optimal conditions. It was previously assumed that activation of Spo0A and hence entry into sporulation is subject to a bistable switch mediated by one or more feedback loops. Here we reinvestigate the basis for bimodality in sporulation. We show that none of the feedback loops is rate limiting for the synthesis and phosphorylation of Spo0A. Instead, the loops ensure a just-in-time supply of relay components for rising levels of phosphorylated Spo0A, with phosphate flux through the relay being limiting for Spo0A activation and sporulation. In addition, genes under Spo0A control did not exhibit a bimodal pattern of expression as expected for a bistable switch. In contrast, we observed a highly heterogeneous pattern of Spo0A activation that increased in a nonlinear manner with time. We present a computational model for the nonlinear increase and propose that the phosphorelay is a noise generator and that only cells that attain a threshold level of phosphorylated Spo0A sporulate.
Asunto(s)
Bacillus subtilis/metabolismo , Bacillus subtilis/genética , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Biología Computacional , Regulación Bacteriana de la Expresión Génica , Fosfatos/metabolismo , Esporas Bacterianas/genética , Esporas Bacterianas/metabolismoRESUMEN
A fundamental question in biology is how a limited number of genes combinatorially govern cellular responses to environmental changes. While the prevailing hypothesis is that relationships between genes, processes, and ontologies could be plastic to achieve this adaptability, quantitatively comparing human gene functional connections between specific environmental conditions at scale is very challenging. Therefore, it remains unclear whether and how human genetic interaction networks are rewired in response to changing environmental conditions. Here, we developed a framework for mapping context-specific genetic interactions, enabling us to measure the plasticity of human genetic architecture upon environmental challenge for ~250,000 interactions, using cell cycle interruption, genotoxic perturbation, and nutrient deprivation as archetypes. We discover large-scale rewiring of human gene relationships across conditions, highlighted by dramatic shifts in the functional connections of epigenetic regulators (TIP60), cell cycle regulators (PP2A), and glycolysis metabolism. Our study demonstrates that upon environmental perturbation, intra-complex genetic rewiring is rare while inter-complex rewiring is common, suggesting a modular and flexible evolutionary genetic strategy that allows a limited number of human genes to enable adaptation to a large number of environmental conditions.
RESUMEN
CRISPR screens are a powerful source of biological discovery, enabling the unbiased interrogation of gene function in a wide range of applications and species. In pooled CRISPR screens, various genetically encoded perturbations are introduced into pools of cells. The targeted cells proliferate under a biological challenge such as cell competition, drug treatment or viral infection. Subsequently, the perturbation-induced effects are evaluated by sequencing-based counting of the guide RNAs that specify each perturbation. The typical results of such screens are ranked lists of genes that confer sensitivity or resistance to the biological challenge of interest. Contributing to the broad utility of CRISPR screens, adaptations of the core CRISPR technology make it possible to activate, silence or otherwise manipulate the target genes. Moreover, high-content read-outs such as single-cell RNA sequencing and spatial imaging help characterize screened cells with unprecedented detail. Dedicated software tools facilitate bioinformatic analysis and enhance reproducibility. CRISPR screening has unravelled various molecular mechanisms in basic biology, medical genetics, cancer research, immunology, infectious diseases, microbiology and other fields. This Primer describes the basic and advanced concepts of CRISPR screening and its application as a flexible and reliable method for biological discovery, biomedical research and drug development - with a special emphasis on high-content methods that make it possible to obtain detailed biological insights directly as part of the screen.
RESUMEN
CRISPR interference (CRISPRi) enables programmable, reversible, and titratable repression of gene expression (knockdown) in mammalian cells. Initial CRISPRi-mediated genetic screens have showcased the potential to address basic questions in cell biology, genetics, and biotechnology, but wider deployment of CRISPRi screening has been constrained by the large size of single guide RNA (sgRNA) libraries and challenges in generating cell models with consistent CRISPRi-mediated knockdown. Here, we present next-generation CRISPRi sgRNA libraries and effector expression constructs that enable strong and consistent knockdown across mammalian cell models. First, we combine empirical sgRNA selection with a dual-sgRNA library design to generate an ultra-compact (1-3 elements per gene), highly active CRISPRi sgRNA library. Next, we compare CRISPRi effectors to show that the recently published Zim3-dCas9 provides an excellent balance between strong on-target knockdown and minimal non-specific effects on cell growth or the transcriptome. Finally, we engineer a suite of cell lines with stable expression of Zim3-dCas9 and robust on-target knockdown. Our results and publicly available reagents establish best practices for CRISPRi genetic screening.
Asunto(s)
Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , ARN Guía de Sistemas CRISPR-Cas , Línea Celular , Sistemas CRISPR-CasRESUMEN
A lack of tools to precisely control gene expression has limited our ability to evaluate relationships between expression levels and phenotypes. Here, we describe an approach to titrate expression of human genes using CRISPR interference and series of single-guide RNAs (sgRNAs) with systematically modulated activities. We used large-scale measurements across multiple cell models to characterize activities of sgRNAs containing mismatches to their target sites and derived rules governing mismatched sgRNA activity using deep learning. These rules enabled us to synthesize a compact sgRNA library to titrate expression of ~2,400 genes essential for robust cell growth and to construct an in silico sgRNA library spanning the human genome. Staging cells along a continuum of gene expression levels combined with single-cell RNA-seq readout revealed sharp transitions in cellular behaviors at gene-specific expression thresholds. Our work provides a general tool to control gene expression, with applications ranging from tuning biochemical pathways to identifying suppressors for diseases of dysregulated gene expression.
Asunto(s)
Biología Computacional/métodos , Expresión Génica , ARN Guía de Kinetoplastida/genética , Análisis de la Célula Individual/métodos , Sistemas CRISPR-Cas , Aprendizaje Profundo , Edición Génica , Biblioteca Genómica , Células HeLa , Humanos , Células K562 , Fenotipo , Análisis de Secuencia de ARNRESUMEN
Single-cell CRISPR screens enable the exploration of mammalian gene function and genetic regulatory networks. However, use of this technology has been limited by reliance on indirect indexing of single-guide RNAs (sgRNAs). Here we present direct-capture Perturb-seq, a versatile screening approach in which expressed sgRNAs are sequenced alongside single-cell transcriptomes. Direct-capture Perturb-seq enables detection of multiple distinct sgRNA sequences from individual cells and thus allows pooled single-cell CRISPR screens to be easily paired with combinatorial perturbation libraries that contain dual-guide expression vectors. We demonstrate the utility of this approach for high-throughput investigations of genetic interactions and, leveraging this ability, dissect epistatic interactions between cholesterol biogenesis and DNA repair. Using direct capture Perturb-seq, we also show that targeting individual genes with multiple sgRNAs per cell improves efficacy of CRISPR interference and activation, facilitating the use of compact, highly active CRISPR libraries for single-cell screens. Last, we show that hybridization-based target enrichment permits sensitive, specific sequencing of informative transcripts from single-cell RNA-seq experiments.
Asunto(s)
Sistemas CRISPR-Cas , Técnicas de Amplificación de Ácido Nucleico/métodos , ARN Guía de Kinetoplastida/genética , Regulación de la Expresión Génica , Marcación de Gen , Células HEK293 , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Análisis de la Célula Individual , TranscriptomaRESUMEN
Cell fate decision circuits must be variable enough for genetically identical cells to adopt a multitude of fates, yet ensure that these states are distinct, stably maintained, and coordinated with neighboring cells. A long-standing view is that this is achieved by regulatory networks involving self-stabilizing feedback loops that convert small differences into long-lived cell types. We combined regulatory mutants and in vivo reconstitution with theory for stochastic processes to show that the marquee features of a cell fate switch in Bacillus subtilis-discrete states, multigenerational inheritance, and timing of commitments-can instead be explained by simple stochastic competition between two constitutively produced proteins that form an inactive complex. Such antagonistic interactions are commonplace in cells and could provide powerful mechanisms for cell fate determination more broadly.
Asunto(s)
Bacillus subtilis/fisiología , Proteínas Bacterianas/metabolismo , Bacillus subtilis/citología , Bacillus subtilis/metabolismo , Proteínas Bacterianas/genética , Escherichia coli/genética , Escherichia coli/fisiología , Retroalimentación Fisiológica , Regulación Bacteriana de la Expresión Génica , Cinética , Modelos Biológicos , Modelos Estadísticos , Movimiento , Procesos Estocásticos , Transformación BacterianaRESUMEN
How cellular and organismal complexity emerges from combinatorial expression of genes is a central question in biology. High-content phenotyping approaches such as Perturb-seq (single-cell RNA-sequencing pooled CRISPR screens) present an opportunity for exploring such genetic interactions (GIs) at scale. Here, we present an analytical framework for interpreting high-dimensional landscapes of cell states (manifolds) constructed from transcriptional phenotypes. We applied this approach to Perturb-seq profiling of strong GIs mined from a growth-based, gain-of-function GI map. Exploration of this manifold enabled ordering of regulatory pathways, principled classification of GIs (e.g., identifying suppressors), and mechanistic elucidation of synergistic interactions, including an unexpected synergy between CBL and CNN1 driving erythroid differentiation. Finally, we applied recommender system machine learning to predict interactions, facilitating exploration of vastly larger GI manifolds.
Asunto(s)
Epistasis Genética , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Apoptosis/genética , Sistemas CRISPR-Cas , Proteínas de Unión al Calcio/genética , Puntos de Control del Ciclo Celular/genética , Línea Celular Tumoral , Células Eritroides/citología , Eritropoyesis/genética , Femenino , Perfilación de la Expresión Génica , Granulocitos/citología , Humanos , Proteínas de Microfilamentos/genética , Proteínas Proto-Oncogénicas c-cbl/genética , CalponinasRESUMEN
From biochemistry to ecology, many biological systems are stochastic, complex, and sparsely characterized. In such systems, each component may respond to changes in any directly or indirectly connected components, thus requiring knowledge of the whole to predict the dynamics of the parts. Here, we address this challenge by deriving relations between properties of fluctuations that only reflect local interactions between a subset of components but are invariant to all indirectly connected dynamics. This greatly reduces the number of assumptions when evaluating dynamic models experimentally. We illustrate the approach by revisiting systematic single-cell gene expression data, and we show that the observed fluctuations contradict the assumptions made in most published models of stochastic gene expression, even when accounting for the possibility of systematic experimental artifacts.