RESUMO
Chronic stimulation can cause T cell dysfunction and limit the efficacy of cellular immunotherapies. Improved methods are required to compare large numbers of synthetic knockin (KI) sequences to reprogram cell functions. Here, we developed modular pooled KI screening (ModPoKI), an adaptable platform for modular construction of DNA KI libraries using barcoded multicistronic adaptors. We built two ModPoKI libraries of 100 transcription factors (TFs) and 129 natural and synthetic surface receptors (SRs). Over 30 ModPoKI screens across human TCR- and CAR-T cells in diverse conditions identified a transcription factor AP4 (TFAP4) construct that enhanced fitness of chronically stimulated CAR-T cells and anti-cancer function in vitro and in vivo. ModPoKI's modularity allowed us to generate an â¼10,000-member library of TF combinations. Non-viral KI of a combined BATF-TFAP4 polycistronic construct enhanced fitness. Overexpressed BATF and TFAP4 co-occupy and regulate key gene targets to reprogram T cell function. ModPoKI facilitates the discovery of complex gene constructs to program cellular functions.
Assuntos
Terapia Baseada em Transplante de Células e Tecidos , Exercício Físico , Humanos , Biblioteca Gênica , Imunoterapia , PesquisaRESUMO
Defining long-term protective immunity to SARS-CoV-2 is one of the most pressing questions of our time and will require a detailed understanding of potential ways this virus can evolve to escape immune protection. Immune protection will most likely be mediated by antibodies that bind to the viral entry protein, spike (S). Here, we used Phage-DMS, an approach that comprehensively interrogates the effect of all possible mutations on binding to a protein of interest, to define the profile of antibody escape to the SARS-CoV-2 S protein using coronavirus disease 2019 (COVID-19) convalescent plasma. Antibody binding was common in two regions, the fusion peptide and the linker region upstream of the heptad repeat region 2. However, escape mutations were variable within these immunodominant regions. There was also individual variation in less commonly targeted epitopes. This study provides a granular view of potential antibody escape pathways and suggests there will be individual variation in antibody-mediated virus evolution.
Assuntos
Anticorpos Neutralizantes/imunologia , Anticorpos Antivirais/imunologia , COVID-19/imunologia , Epitopos/imunologia , SARS-CoV-2/imunologia , Glicoproteína da Espícula de Coronavírus/imunologia , Algoritmos , COVID-19/terapia , COVID-19/virologia , Linhagem Celular , Biblioteca Gênica , Humanos , Imunização Passiva , Mutação , Domínios Proteicos , SARS-CoV-2/genética , Software , Glicoproteína da Espícula de Coronavírus/química , Glicoproteína da Espícula de Coronavírus/genética , Soroterapia para COVID-19RESUMO
Despite remarkable clinical efficacy of immune checkpoint blockade (ICB) in cancer treatment, ICB benefits for triple-negative breast cancer (TNBC) remain limited. Through pooled in vivo CRISPR knockout (KO) screens in syngeneic TNBC mouse models, we found that deletion of the E3 ubiquitin ligase Cop1 in cancer cells decreases secretion of macrophage-associated chemokines, reduces tumor macrophage infiltration, enhances anti-tumor immunity, and strengthens ICB response. Transcriptomics, epigenomics, and proteomics analyses revealed that Cop1 functions through proteasomal degradation of the C/ebpδ protein. The Cop1 substrate Trib2 functions as a scaffold linking Cop1 and C/ebpδ, which leads to polyubiquitination of C/ebpδ. In addition, deletion of the E3 ubiquitin ligase Cop1 in cancer cells stabilizes C/ebpδ to suppress expression of macrophage chemoattractant genes. Our integrated approach implicates Cop1 as a target for improving cancer immunotherapy efficacy in TNBC by regulating chemokine secretion and macrophage infiltration in the tumor microenvironment.
Assuntos
Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas/genética , Imunoterapia , Macrófagos/enzimologia , Neoplasias/imunologia , Neoplasias/terapia , Proteínas Nucleares/metabolismo , Ubiquitina-Proteína Ligases/metabolismo , Animais , Proteína delta de Ligação ao Facilitador CCAAT/metabolismo , Proteína 9 Associada à CRISPR/metabolismo , Linhagem Celular Tumoral , Quimiocinas/metabolismo , Quimiotaxia , Modelos Animais de Doenças , Biblioteca Gênica , Humanos , Evasão da Resposta Imune , Camundongos Endogâmicos BALB C , Camundongos Endogâmicos C57BL , Proteólise , Especificidade por Substrato , Neoplasias de Mama Triplo Negativas/imunologia , Neoplasias de Mama Triplo Negativas/terapiaRESUMO
Although base editors are widely used to install targeted point mutations, the factors that determine base editing outcomes are not well understood. We characterized sequence-activity relationships of 11 cytosine and adenine base editors (CBEs and ABEs) on 38,538 genomically integrated targets in mammalian cells and used the resulting outcomes to train BE-Hive, a machine learning model that accurately predicts base editing genotypic outcomes (R ≈ 0.9) and efficiency (R ≈ 0.7). We corrected 3,388 disease-associated SNVs with ≥90% precision, including 675 alleles with bystander nucleotides that BE-Hive correctly predicted would not be edited. We discovered determinants of previously unpredictable C-to-G, or C-to-A editing and used these discoveries to correct coding sequences of 174 pathogenic transversion SNVs with ≥90% precision. Finally, we used insights from BE-Hive to engineer novel CBE variants that modulate editing outcomes. These discoveries illuminate base editing, enable editing at previously intractable targets, and provide new base editors with improved editing capabilities.
Assuntos
Edição de Genes/métodos , Aprendizado de Máquina , Animais , Biblioteca Gênica , Humanos , Camundongos , Células-Tronco Embrionárias Murinas/citologia , Células-Tronco Embrionárias Murinas/metabolismo , Mutação Puntual , RNA Guia de Cinetoplastídeos/metabolismoRESUMO
Drug resistance and relapse remain key challenges in pancreatic cancer. Here, we have used RNA sequencing (RNA-seq), chromatin immunoprecipitation (ChIP)-seq, and genome-wide CRISPR analysis to map the molecular dependencies of pancreatic cancer stem cells, highly therapy-resistant cells that preferentially drive tumorigenesis and progression. This integrated genomic approach revealed an unexpected utilization of immuno-regulatory signals by pancreatic cancer epithelial cells. In particular, the nuclear hormone receptor retinoic-acid-receptor-related orphan receptor gamma (RORγ), known to drive inflammation and T cell differentiation, was upregulated during pancreatic cancer progression, and its genetic or pharmacologic inhibition led to a striking defect in pancreatic cancer growth and a marked improvement in survival. Further, a large-scale retrospective analysis in patients revealed that RORγ expression may predict pancreatic cancer aggressiveness, as it positively correlated with advanced disease and metastasis. Collectively, these data identify an orthogonal co-option of immuno-regulatory signals by pancreatic cancer stem cells, suggesting that autoimmune drugs should be evaluated as novel treatment strategies for pancreatic cancer patients.
Assuntos
Adenocarcinoma/patologia , Células-Tronco Neoplásicas/metabolismo , Neoplasias Pancreáticas/patologia , Adenocarcinoma/genética , Adenocarcinoma/metabolismo , Animais , Moléculas de Adesão Celular/genética , Moléculas de Adesão Celular/metabolismo , Diferenciação Celular , Epigênese Genética , Biblioteca Gênica , Humanos , Camundongos , Camundongos Knockout , Camundongos SCID , Células-Tronco Neoplásicas/citologia , Membro 3 do Grupo F da Subfamília 1 de Receptores Nucleares/antagonistas & inibidores , Membro 3 do Grupo F da Subfamília 1 de Receptores Nucleares/genética , Membro 3 do Grupo F da Subfamília 1 de Receptores Nucleares/metabolismo , Neoplasias Pancreáticas/genética , Neoplasias Pancreáticas/metabolismo , Interferência de RNA , RNA Interferente Pequeno/metabolismo , Receptores Acoplados a Proteínas G/antagonistas & inibidores , Receptores Acoplados a Proteínas G/genética , Receptores Acoplados a Proteínas G/metabolismo , Receptores de Interleucina-10/antagonistas & inibidores , Receptores de Interleucina-10/genética , Receptores de Interleucina-10/metabolismo , Linfócitos T/citologia , Linfócitos T/imunologia , Linfócitos T/metabolismo , Transcriptoma , Células Tumorais CultivadasRESUMO
The discovery of organic ligands that bind specifically to proteins is a central problem in chemistry, biology, and the biomedical sciences. The encoding of individual organic molecules with distinctive DNA tags, serving as amplifiable identification bar codes, allows the construction and screening of combinatorial libraries of unprecedented size, thus facilitating the discovery of ligands to many different protein targets. Fundamentally, one links powers of genetics and chemical synthesis. After the initial description of DNA-encoded chemical libraries in 1992, several experimental embodiments of the technology have been reduced to practice. This review provides a historical account of important milestones in the development of DNA-encoded chemical libraries, a survey of relevant ongoing research activities, and a glimpse into the future.
Assuntos
Descoberta de Drogas/métodos , Biblioteca Gênica , Bibliotecas de Moléculas Pequenas , Animais , Técnicas de Química Combinatória , Humanos , Ligantes , Biblioteca de PeptídeosRESUMO
RNAs fold into defined tertiary structures to function in critical biological processes. While quantitative models can predict RNA secondary structure stability, we are still unable to predict the thermodynamic stability of RNA tertiary structure. Here, we probe conformational preferences of diverse RNA two-way junctions to develop a predictive model for the formation of RNA tertiary structure. We quantitatively measured tertiary assembly energetics of >1,000 of RNA junctions inserted in multiple structural scaffolds to generate a "thermodynamic fingerprint" for each junction. Thermodynamic fingerprints enabled comparison of junction conformational preferences, revealing principles for how sequence influences 3-dimensional conformations. Utilizing fingerprints of junctions with known crystal structures, we generated ensembles for related junctions that predicted their thermodynamic effects on assembly formation. This work reveals sequence-structure-energetic relationships in RNA, demonstrates the capacity for diverse compensation strategies within tertiary structures, and provides a path to quantitative modeling of RNA folding energetics based on "ensemble modularity."
Assuntos
RNA/metabolismo , Pareamento Incorreto de Bases , Biblioteca Gênica , Conformação de Ácido Nucleico , Fotodegradação , RNA/química , Dobramento de RNA , Estabilidade de RNA , TermodinâmicaRESUMO
Although animals have evolved multiple mechanisms to suppress transposons, "leaky" mobilizations that cause mutations and diseases still occur. This suggests that transposons employ specific tactics to accomplish robust propagation. By directly tracking mobilization, we show that, during a short and specific time window of oogenesis, retrotransposons achieve massive amplification via a cell-type-specific targeting strategy. Retrotransposons rarely mobilize in undifferentiated germline stem cells. However, as oogenesis proceeds, they utilize supporting nurse cells-which are highly polyploid and eventually undergo apoptosis-as factories to massively manufacture invading products. Moreover, retrotransposons rarely integrate into nurse cells themselves but, instead, via microtubule-mediated transport, they preferentially target the DNA of the interconnected oocytes. Blocking microtubule-dependent intercellular transport from nurse cells significantly alleviates damage to the oocyte genome. Our data reveal that parasitic genomic elements can efficiently hijack a host developmental process to propagate robustly, thereby driving evolutionary change and causing disease.
Assuntos
Drosophila melanogaster/genética , Elementos Nucleotídeos Longos e Dispersos , Oogênese , RNA Interferente Pequeno , Retroelementos , Retroviridae/genética , Animais , Proteínas de Drosophila , Feminino , Biblioteca Gênica , Inativação Gênica , Células Germinativas , Proteínas de Fluorescência Verde/metabolismo , Hibridização in Situ Fluorescente , Masculino , Oócitos/metabolismo , Células-Tronco/metabolismoRESUMO
Synthetic multicellular systems hold promise as models for understanding natural development of biofilms and higher organisms and as tools for engineering complex multi-component metabolic pathways and materials. However, such efforts require tools to adhere cells into defined morphologies and patterns, and these tools are currently lacking. Here, we report a 100% genetically encoded synthetic platform for modular cell-cell adhesion in Escherichia coli, which provides control over multicellular self-assembly. Adhesive selectivity is provided by a library of outer membrane-displayed nanobodies and antigens with orthogonal intra-library specificities, while affinity is controlled by intrinsic adhesin affinity, competitive inhibition, and inducible expression. We demonstrate the resulting capabilities for quantitative rational design of well-defined morphologies and patterns through homophilic and heterophilic interactions, lattice-like self-assembly, phase separation, differential adhesion, and sequential layering. Compatible with synthetic biology standards, this adhesion toolbox will enable construction of high-level multicellular designs and shed light on the evolutionary transition to multicellularity.
Assuntos
Adesão Celular/fisiologia , Engenharia Metabólica/métodos , Biologia Sintética/métodos , Fenômenos Fisiológicos Bacterianos , Evolução Biológica , Adesão Celular/genética , Diferenciação Celular/genética , Diferenciação Celular/fisiologia , Escherichia coli/genética , Biblioteca Gênica , Redes e Vias Metabólicas , Anticorpos de Domínio Único/genética , Anticorpos de Domínio Único/imunologia , Anticorpos de Domínio Único/fisiologiaRESUMO
Genomics has provided a detailed structural description of the cancer genome. Identifying oncogenic drivers that work primarily through dosage changes is a current challenge. Unrestrained proliferation is a critical hallmark of cancer. We constructed modular, barcoded libraries of human open reading frames (ORFs) and performed screens for proliferation regulators in multiple cell types. Approximately 10% of genes regulate proliferation, with most performing in an unexpectedly highly tissue-specific manner. Proliferation drivers in a given cell type showed specific enrichment in somatic copy number changes (SCNAs) from cognate tumors and helped predict aneuploidy patterns in those tumors, implying that tissue-type-specific genetic network architectures underlie SCNA and driver selection in different cancers. In vivo screening confirmed these results. We report a substantial contribution to the catalog of SCNA-associated cancer drivers, identifying 147 amplified and 107 deleted genes as potential drivers, and derive insights about the genetic network architecture of aneuploidy in tumors.
Assuntos
Aneuploidia , Neoplasias/patologia , Animais , Linhagem Celular Tumoral , Proliferação de Células , Mapeamento Cromossômico , Cromossomos/genética , Fator de Transcrição E2F1/antagonistas & inibidores , Fator de Transcrição E2F1/genética , Fator de Transcrição E2F1/metabolismo , Feminino , Biblioteca Gênica , Genômica , Humanos , Queratinas/metabolismo , Camundongos , Camundongos Endogâmicos NOD , Camundongos SCID , Oncogenes , Fases de Leitura Aberta/genética , Interferência de RNA , RNA Interferente Pequeno/metabolismoRESUMO
Elucidation of the mutational landscape of human cancer has progressed rapidly and been accompanied by the development of therapeutics targeting mutant oncogenes. However, a comprehensive mapping of cancer dependencies has lagged behind and the discovery of therapeutic targets for counteracting tumor suppressor gene loss is needed. To identify vulnerabilities relevant to specific cancer subtypes, we conducted a large-scale RNAi screen in which viability effects of mRNA knockdown were assessed for 7,837 genes using an average of 20 shRNAs per gene in 398 cancer cell lines. We describe findings of this screen, outlining the classes of cancer dependency genes and their relationships to genetic, expression, and lineage features. In addition, we describe robust gene-interaction networks recapitulating both protein complexes and functional cooperation among complexes and pathways. This dataset along with a web portal is provided to the community to assist in the discovery and translation of new therapeutic approaches for cancer.
Assuntos
Neoplasias/genética , Neoplasias/patologia , Interferência de RNA , Linhagem Celular Tumoral , Biblioteca Gênica , Redes Reguladoras de Genes , Humanos , Complexos Multiproteicos/metabolismo , Neoplasias/metabolismo , Oncogenes , RNA Interferente Pequeno , Transdução de Sinais , Fatores de Transcrição/metabolismoRESUMO
Genome-wide association studies (GWAS) have successfully identified thousands of associations between common genetic variants and human disease phenotypes, but the majority of these variants are non-coding, often requiring genetic fine-mapping, epigenomic profiling, and individual reporter assays to delineate potential causal variants. We employ a massively parallel reporter assay (MPRA) to simultaneously screen 2,756 variants in strong linkage disequilibrium with 75 sentinel variants associated with red blood cell traits. We show that this assay identifies elements with endogenous erythroid regulatory activity. Across 23 sentinel variants, we conservatively identified 32 MPRA functional variants (MFVs). We used targeted genome editing to demonstrate endogenous enhancer activity across 3 MFVs that predominantly affect the transcription of SMIM1, RBM38, and CD164. Functional follow-up of RBM38 delineates a key role for this gene in the alternative splicing program occurring during terminal erythropoiesis. Finally, we provide evidence for how common GWAS-nominated variants can disrupt cell-type-specific transcriptional regulatory pathways.
Assuntos
Eritrócitos , Técnicas Genéticas , Variação Genética , Processamento Alternativo , Linhagem Celular , Linhagem da Célula/genética , Eritropoese/genética , Biblioteca Gênica , Genes Reporter , Humanos , Sequências Reguladoras de Ácido Nucleico , Transcrição GênicaRESUMO
Essential gene functions underpin the core reactions required for cell viability, but their contributions and relationships are poorly studied in vivo. Using CRISPR interference, we created knockdowns of every essential gene in Bacillus subtilis and probed their phenotypes. Our high-confidence essential gene network, established using chemical genomics, showed extensive interconnections among distantly related processes and identified modes of action for uncharacterized antibiotics. Importantly, mild knockdown of essential gene functions significantly reduced stationary-phase survival without affecting maximal growth rate, suggesting that essential protein levels are set to maximize outgrowth from stationary phase. Finally, high-throughput microscopy indicated that cell morphology is relatively insensitive to mild knockdown but profoundly affected by depletion of gene function, revealing intimate connections between cell growth and shape. Our results provide a framework for systematic investigation of essential gene functions in vivo broadly applicable to diverse microorganisms and amenable to comparative analysis.
Assuntos
Bacillus subtilis/genética , Genes Bacterianos , Genes Essenciais , Sistemas CRISPR-Cas , Técnicas de Silenciamento de Genes , Biblioteca Gênica , Redes Reguladoras de Genes , Terapia de Alvo MolecularRESUMO
Although studies have identified hundreds of loci associated with human traits and diseases, pinpointing causal alleles remains difficult, particularly for non-coding variants. To address this challenge, we adapted the massively parallel reporter assay (MPRA) to identify variants that directly modulate gene expression. We applied it to 32,373 variants from 3,642 cis-expression quantitative trait loci and control regions. Detection by MPRA was strongly correlated with measures of regulatory function. We demonstrate MPRA's capabilities for pinpointing causal alleles, using it to identify 842 variants showing differential expression between alleles, including 53 well-annotated variants associated with diseases and traits. We investigated one in detail, a risk allele for ankylosing spondylitis, and provide direct evidence of a non-coding variant that alters expression of the prostaglandin EP4 receptor. These results create a resource of concrete leads and illustrate the promise of this approach for comprehensively interrogating how non-coding polymorphism shapes human biology.
Assuntos
Regulação da Expressão Gênica , Genes Reporter , Doenças Genéticas Inatas/genética , Técnicas Genéticas , Variação Genética , Alelos , Biblioteca Gênica , Células Hep G2 , Humanos , Locos de Características Quantitativas , Sensibilidade e Especificidade , Espondilite Anquilosante/genéticaRESUMO
Data of gene expression levels across individuals, cell types, and disease states is expanding, yet our understanding of how expression levels impact phenotype is limited. Here, we present a massively parallel system for assaying the effect of gene expression levels on fitness in Saccharomyces cerevisiae by systematically altering the expression level of â¼100 genes at â¼100 distinct levels spanning a 500-fold range at high resolution. We show that the relationship between expression levels and growth is gene and environment specific and provides information on the function, stoichiometry, and interactions of genes. Wild-type expression levels in some conditions are not optimal for growth, and genes whose fitness is greatly affected by small changes in expression level tend to exhibit lower cell-to-cell variability in expression. Our study addresses a fundamental gap in understanding the functional significance of gene expression regulation and offers a framework for evaluating the phenotypic effects of expression variation.
Assuntos
Regulação Fúngica da Expressão Gênica , Interação Gene-Ambiente , Aptidão Genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/crescimento & desenvolvimento , Saccharomyces cerevisiae/genética , Código de Barras de DNA Taxonômico , Biblioteca Gênica , Genes Fúngicos , Sequenciamento de Nucleotídeos em Larga EscalaRESUMO
The ability to perturb genes in human cells is crucial for elucidating gene function and holds great potential for finding therapeutic targets for diseases such as cancer. To extend the catalog of human core and context-dependent fitness genes, we have developed a high-complexity second-generation genome-scale CRISPR-Cas9 gRNA library and applied it to fitness screens in five human cell lines. Using an improved Bayesian analytical approach, we consistently discover 5-fold more fitness genes than were previously observed. We present a list of 1,580 human core fitness genes and describe their general properties. Moreover, we demonstrate that context-dependent fitness genes accurately recapitulate pathway-specific genetic vulnerabilities induced by known oncogenes and reveal cell-type-specific dependencies for specific receptor tyrosine kinases, even in oncogenic KRAS backgrounds. Thus, rigorous identification of human cell line fitness genes using a high-complexity CRISPR-Cas9 library affords a high-resolution view of the genetic vulnerabilities of a cell.
Assuntos
Genes Essenciais , Teorema de Bayes , Sistemas CRISPR-Cas , Linhagem Celular Tumoral , Técnicas de Inativação de Genes , Biblioteca Gênica , Humanos , MutaçãoRESUMO
Bacteria have adapted to phage predation by evolving a vast assortment of defence systems1. Although anti-phage immunity genes can be identified using bioinformatic tools, the discovery of novel systems is restricted to the available prokaryotic sequence data2. Here, to overcome this limitation, we infected Escherichia coli carrying a soil metagenomic DNA library3 with the lytic coliphage T4 to isolate clones carrying protective genes. Following this approach, we identified Brig1, a DNA glycosylase that excises α-glucosyl-hydroxymethylcytosine nucleobases from the bacteriophage T4 genome to generate abasic sites and inhibit viral replication. Brig1 homologues that provide immunity against T-even phages are present in multiple phage defence loci across distinct clades of bacteria. Our study highlights the benefits of screening unsequenced DNA and reveals prokaryotic DNA glycosylases as important players in the bacteria-phage arms race.
Assuntos
Bactérias , Bacteriófago T4 , DNA Glicosilases , Bactérias/classificação , Bactérias/enzimologia , Bactérias/genética , Bactérias/imunologia , Bactérias/virologia , Bacteriófago T4/crescimento & desenvolvimento , Bacteriófago T4/imunologia , Bacteriófago T4/metabolismo , DNA Glicosilases/genética , DNA Glicosilases/metabolismo , Escherichia coli/genética , Escherichia coli/virologia , Biblioteca Gênica , Metagenômica/métodos , Microbiologia do Solo , Replicação ViralRESUMO
Deconvolution of regulatory mechanisms that drive transcriptional programs in cancer cells is key to understanding tumor biology. Herein, we present matched transcriptome (scRNA-seq) and chromatin accessibility (scATAC-seq) profiles at single-cell resolution from human ovarian and endometrial tumors processed immediately following surgical resection. This dataset reveals the complex cellular heterogeneity of these tumors and enabled us to quantitatively link variation in chromatin accessibility to gene expression. We show that malignant cells acquire previously unannotated regulatory elements to drive hallmark cancer pathways. Moreover, malignant cells from within the same patients show substantial variation in chromatin accessibility linked to transcriptional output, highlighting the importance of intratumoral heterogeneity. Finally, we infer the malignant cell type-specific activity of transcription factors. By defining the regulatory logic of cancer cells, this work reveals an important reliance on oncogenic regulatory elements and highlights the ability of matched scRNA-seq/scATAC-seq to uncover clinically relevant mechanisms of tumorigenesis in gynecologic cancers.
Assuntos
Neoplasias Ovarianas/genética , Neoplasias Ovarianas/metabolismo , RNA Citoplasmático Pequeno/genética , Idoso , Carcinogênese , Cromatina/metabolismo , Elementos Facilitadores Genéticos , Transição Epitelial-Mesenquimal , Feminino , Tumores do Estroma Gastrointestinal/genética , Biblioteca Gênica , Técnicas Genéticas , Genômica , Humanos , Estimativa de Kaplan-Meier , Pessoa de Meia-Idade , Oncogenes , Ovário/metabolismo , Proteômica , RNA-Seq , Elementos Reguladores de Transcrição , Fatores de Transcrição/metabolismo , TranscriptomaRESUMO
Gene transcription occurs via a cycle of linked events, including initiation, promoter-proximal pausing, and elongation of RNA polymerase II (Pol II). A key question is how transcriptional enhancers influence these events to control gene expression. Here, we present an approach that evaluates the level and change in promoter-proximal transcription (initiation and pausing) in the context of differential gene expression, genome-wide. This combinatorial approach shows that in primary cells, control of gene expression during differentiation is achieved predominantly via changes in transcription initiation rather than via release of Pol II pausing. Using genetically engineered mouse models, deleted for functionally validated enhancers of the α- and ß-globin loci, we confirm that these elements regulate Pol II recruitment and/or initiation to modulate gene expression. Together, our data show that gene expression during differentiation is regulated predominantly at the level of initiation and that enhancers are key effectors of this process.
Assuntos
Elementos Facilitadores Genéticos , Regiões Promotoras Genéticas , RNA Polimerase II/genética , Iniciação da Transcrição Genética , alfa-Globinas/genética , Globinas beta/genética , Animais , Diferenciação Celular , Éxons , Feto , Regulação da Expressão Gênica , Biblioteca Gênica , Proteínas de Choque Térmico HSP70/genética , Proteínas de Choque Térmico HSP70/metabolismo , Humanos , Íntrons , Células K562 , Fígado/citologia , Fígado/metabolismo , Camundongos , Camundongos Knockout , Proteínas Proto-Oncogênicas c-myc/genética , Proteínas Proto-Oncogênicas c-myc/metabolismo , RNA Polimerase II/metabolismo , Transdução de Sinais , alfa-Globinas/deficiência , Globinas beta/deficiênciaRESUMO
To address how genetic variation alters gene expression in complex cell mixtures, we developed direct nuclear tagmentation and RNA sequencing (DNTR-seq), which enables whole-genome and mRNA sequencing jointly in single cells. DNTR-seq readily identified minor subclones within leukemia patients. In a large-scale DNA damage screen, DNTR-seq was used to detect regions under purifying selection and identified genes where mRNA abundance was resistant to copy-number alteration, suggesting strong genetic compensation. mRNA sequencing (mRNA-seq) quality equals RNA-only methods, and the low positional bias of genomic libraries allowed detection of sub-megabase aberrations at ultra-low coverage. Each cell library is individually addressable and can be re-sequenced at increased depth, allowing multi-tiered study designs. Additionally, the direct tagmentation protocol enables coverage-independent estimation of ploidy, which can be used to identify cell singlets. Thus, DNTR-seq directly links each cell's state to its corresponding genome at scale, enabling routine analysis of heterogeneous tumors and other complex tissues.