Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 389
Filter
Add more filters

Publication year range
1.
Cell ; 187(10): 2411-2427.e25, 2024 May 09.
Article in English | MEDLINE | ID: mdl-38608704

ABSTRACT

We set out to exhaustively characterize the impact of the cis-chromatin environment on prime editing, a precise genome engineering tool. Using a highly sensitive method for mapping the genomic locations of randomly integrated reporters, we discover massive position effects, exemplified by editing efficiencies ranging from ∼0% to 94% for an identical target site and edit. Position effects on prime editing efficiency are well predicted by chromatin marks, e.g., positively by H3K79me2 and negatively by H3K9me3. Next, we developed a multiplex perturbational framework to assess the interaction of trans-acting factors with the cis-chromatin environment on editing outcomes. Applying this framework to DNA repair factors, we identify HLTF as a context-dependent repressor of prime editing. Finally, several lines of evidence suggest that active transcriptional elongation enhances prime editing. Consistent with this, we show we can robustly decrease or increase the efficiency of prime editing by preceding it with CRISPR-mediated silencing or activation, respectively.


Subject(s)
CRISPR-Cas Systems , Chromatin , Epigenesis, Genetic , Gene Editing , Humans , Chromatin/metabolism , Chromatin/genetics , CRISPR-Cas Systems/genetics , Gene Editing/methods , Histones/metabolism , Transcription Factors/metabolism , Histone Code
2.
Cell ; 2024 Jun 07.
Article in English | MEDLINE | ID: mdl-38861993

ABSTRACT

Many growth factors and cytokines signal by binding to the extracellular domains of their receptors and driving association and transphosphorylation of the receptor intracellular tyrosine kinase domains, initiating downstream signaling cascades. To enable systematic exploration of how receptor valency and geometry affect signaling outcomes, we designed cyclic homo-oligomers with up to 8 subunits using repeat protein building blocks that can be modularly extended. By incorporating a de novo-designed fibroblast growth factor receptor (FGFR)-binding module into these scaffolds, we generated a series of synthetic signaling ligands that exhibit potent valency- and geometry-dependent Ca2+ release and mitogen-activated protein kinase (MAPK) pathway activation. The high specificity of the designed agonists reveals distinct roles for two FGFR splice variants in driving arterial endothelium and perivascular cell fates during early vascular development. Our designed modular assemblies should be broadly useful for unraveling the complexities of signaling in key developmental transitions and for developing future therapeutic applications.

3.
Cell ; 186(6): 1103-1114, 2023 03 16.
Article in English | MEDLINE | ID: mdl-36931241

ABSTRACT

Single-cell biology is facing a crisis of sorts. Vast numbers of single-cell molecular profiles are being generated, clustered and annotated. However, this is overwhelmingly ad hoc, and we continue to lack a principled, unified, and well-moored system for defining, naming, and organizing cell types. In this perspective, we argue against an atlas or periodic table-like discretization as the right metaphor for a reference taxonomy of cell types. In its place, we advocate for a data-driven, tree-based nomenclature that is rooted in a "consensus ontogeny" spanning the life cycle of a given species. We explore how such a reference cell tree, inclusive of both lineage histories and molecular states, could be constructed, represented, and segmented in practice. Analogous to the taxonomic classification of species, a consensus ontogeny would provide a universal, stable, and extendable framework for precise scientific communication, both contemporaneously and across the ages.


Subject(s)
Cytology , Communication , Life Cycle Stages , Phylogeny , Single-Cell Analysis
4.
Cell ; 186(23): 5015-5027.e12, 2023 11 09.
Article in English | MEDLINE | ID: mdl-37949057

ABSTRACT

Embryonic development is remarkably robust, but temperature stress can degrade its ability to generate animals with invariant anatomy. Phenotypes associated with environmental stress suggest that some cell types are more sensitive to stress than others, but the basis of this sensitivity is unknown. Here, we characterize hundreds of individual zebrafish embryos under temperature stress using whole-animal single-cell RNA sequencing (RNA-seq) to identify cell types and molecular programs driving phenotypic variability. We find that temperature perturbs the normal proportions and gene expression programs of numerous cell types and also introduces asynchrony in developmental timing. The notochord is particularly sensitive to temperature, which we map to a specialized cell type: sheath cells. These cells accumulate misfolded protein at elevated temperature, leading to a cascading structural failure of the notochord and anatomic defects. Our study demonstrates that whole-animal single-cell RNA-seq can identify mechanisms for developmental robustness and pinpoint cell types that constitute key failure points.


Subject(s)
Proteostasis , Zebrafish , Animals , Embryonic Development , Gene Expression Regulation, Developmental , Temperature , Zebrafish/growth & development
5.
Cell ; 177(1): 45-57, 2019 03 21.
Article in English | MEDLINE | ID: mdl-30901547

ABSTRACT

In the wake of the Human Genome Project (HGP), strong expectations were set for the timeline and impact of genomics on medicine-an anticipated transformation in the diagnosis, treatment, and prevention of disease. In this Perspective, we take stock of the nascent field of genomic medicine. In what areas, if any, is genomics delivering on this promise, or is the path to success clear? Where are we falling short, and why? What have been the unanticipated developments? Overall, we argue that the optimism surrounding the transformational potential of genomics on medicine remains justified, albeit with a considerably different form and timescale than originally projected. We also argue that the field needs to pivot back to basics, as understanding the entirety of the genotype-to-phenotype equation is a likely prerequisite for delivering on the full potential of the human genome to advance the human condition.


Subject(s)
Genome, Human/genetics , Precision Medicine/methods , Precision Medicine/trends , Genetic Testing , Genomics/methods , Genomics/trends , Human Genome Project , Humans
6.
Cell ; 176(1-2): 377-390.e19, 2019 01 10.
Article in English | MEDLINE | ID: mdl-30612741

ABSTRACT

Over one million candidate regulatory elements have been identified across the human genome, but nearly all are unvalidated and their target genes uncertain. Approaches based on human genetics are limited in scope to common variants and in resolution by linkage disequilibrium. We present a multiplex, expression quantitative trait locus (eQTL)-inspired framework for mapping enhancer-gene pairs by introducing random combinations of CRISPR/Cas9-mediated perturbations to each of many cells, followed by single-cell RNA sequencing (RNA-seq). Across two experiments, we used dCas9-KRAB to perturb 5,920 candidate enhancers with no strong a priori hypothesis as to their target gene(s), measuring effects by profiling 254,974 single-cell transcriptomes. We identified 664 (470 high-confidence) cis enhancer-gene pairs, which were enriched for specific transcription factors, non-housekeeping status, and genomic and 3D conformational proximity to their target genes. This framework will facilitate the large-scale mapping of enhancer-gene regulatory interactions, a critical yet largely uncharted component of the cis-regulatory landscape of the human genome.


Subject(s)
Chromosome Mapping/methods , Enhancer Elements, Genetic/genetics , Gene Expression Regulation/genetics , CRISPR-Cas Systems/genetics , Clustered Regularly Interspaced Short Palindromic Repeats/genetics , Gene Expression Profiling , Gene Regulatory Networks/genetics , Genome, Human , Genome-Wide Association Study , Genomics , Humans , Quantitative Trait Loci , Transcription Factors/genetics
7.
Immunity ; 57(2): 271-286.e13, 2024 Feb 13.
Article in English | MEDLINE | ID: mdl-38301652

ABSTRACT

The immune system encodes information about the severity of a pathogenic threat in the quantity and type of memory cells it forms. This encoding emerges from lymphocyte decisions to maintain or lose self-renewal and memory potential during a challenge. By tracking CD8+ T cells at the single-cell and clonal lineage level using time-resolved transcriptomics, quantitative live imaging, and an acute infection model, we find that T cells will maintain or lose memory potential early after antigen recognition. However, following pathogen clearance, T cells may regain memory potential if initially lost. Mechanistically, this flexibility is implemented by a stochastic cis-epigenetic switch that tunably and reversibly silences the memory regulator, TCF1, in response to stimulation. Mathematical modeling shows how this flexibility allows memory T cell numbers to scale robustly with pathogen virulence and immune response magnitudes. We propose that flexibility and stochasticity in cellular decisions ensure optimal immune responses against diverse threats.


Subject(s)
CD8-Positive T-Lymphocytes , Memory T Cells , Epigenesis, Genetic , Clone Cells , Immunologic Memory , Cell Differentiation
8.
Cell ; 174(5): 1309-1324.e18, 2018 08 23.
Article in English | MEDLINE | ID: mdl-30078704

ABSTRACT

We applied a combinatorial indexing assay, sci-ATAC-seq, to profile genome-wide chromatin accessibility in ∼100,000 single cells from 13 adult mouse tissues. We identify 85 distinct patterns of chromatin accessibility, most of which can be assigned to cell types, and ∼400,000 differentially accessible elements. We use these data to link regulatory elements to their target genes, to define the transcription factor grammar specifying each cell type, and to discover in vivo correlates of heterogeneity in accessibility within cell types. We develop a technique for mapping single cell gene expression data to single-cell chromatin accessibility data, facilitating the comparison of atlases. By intersecting mouse chromatin accessibility with human genome-wide association summary statistics, we identify cell-type-specific enrichments of the heritability signal for hundreds of complex traits. These data define the in vivo landscape of the regulatory genome for common mammalian cell types at single-cell resolution.


Subject(s)
Chromatin/chemistry , Single-Cell Analysis/methods , Animals , Cluster Analysis , Epigenesis, Genetic , Epigenomics , Gene Expression Regulation , Genome, Human , Genome-Wide Association Study , Humans , Male , Mammals , Mice , Mice, Inbred C57BL , Transcription Factors
9.
Cell ; 175(2): 347-359.e14, 2018 10 04.
Article in English | MEDLINE | ID: mdl-30290141

ABSTRACT

We analyze whole-genome sequencing data from 141,431 Chinese women generated for non-invasive prenatal testing (NIPT). We use these data to characterize the population genetic structure and to investigate genetic associations with maternal and infectious traits. We show that the present day distribution of alleles is a function of both ancient migration and very recent population movements. We reveal novel phenotype-genotype associations, including several replicated associations with height and BMI, an association between maternal age and EMB, and between twin pregnancy and NRG1. Finally, we identify a unique pattern of circulating viral DNA in plasma with high prevalence of hepatitis B and other clinically relevant maternal infections. A GWAS for viral infections identifies an exceptionally strong association between integrated herpesvirus 6 and MOV10L1, which affects piwi-interacting RNA (piRNA) processing and PIWI protein function. These findings demonstrate the great value and potential of accumulating NIPT data for worldwide medical and genetic analyses.


Subject(s)
Asian People/genetics , Prenatal Diagnosis/methods , Adult , Alleles , China , DNA/genetics , Ethnicity/genetics , Female , Gene Frequency/genetics , Genetic Testing , Genetic Variation/genetics , Genetics, Population/methods , Genome-Wide Association Study/methods , Genomics/methods , Human Migration , Humans , Pregnancy , Sequence Analysis, DNA
10.
Cell ; 164(1-2): 57-68, 2016 Jan 14.
Article in English | MEDLINE | ID: mdl-26771485

ABSTRACT

Nucleosome positioning varies between cell types. By deep sequencing cell-free DNA (cfDNA), isolated from circulating blood plasma, we generated maps of genome-wide in vivo nucleosome occupancy and found that short cfDNA fragments harbor footprints of transcription factors. The cfDNA nucleosome occupancies correlate well with the nuclear architecture, gene structure, and expression observed in cells, suggesting that they could inform the cell type of origin. Nucleosome spacing inferred from cfDNA in healthy individuals correlates most strongly with epigenetic features of lymphoid and myeloid cells, consistent with hematopoietic cell death as the normal source of cfDNA. We build on this observation to show how nucleosome footprints can be used to infer cell types contributing to cfDNA in pathological states such as cancer. Since this strategy does not rely on genetic differences to distinguish between contributing tissues, it may enable the noninvasive monitoring of a much broader set of clinical conditions than currently possible.


Subject(s)
DNA/chemistry , Nucleosomes/chemistry , Organ Specificity , CCCTC-Binding Factor , Cell Line , Chromatin Assembly and Disassembly , DNA/metabolism , DNA Footprinting , Genome, Human , Genome-Wide Association Study , Humans , Neoplasms/genetics , Repressor Proteins/metabolism , Sequence Analysis, DNA
11.
Mol Cell ; 83(15): 2624-2640, 2023 08 03.
Article in English | MEDLINE | ID: mdl-37419111

ABSTRACT

The four-dimensional nucleome (4DN) consortium studies the architecture of the genome and the nucleus in space and time. We summarize progress by the consortium and highlight the development of technologies for (1) mapping genome folding and identifying roles of nuclear components and bodies, proteins, and RNA, (2) characterizing nuclear organization with time or single-cell resolution, and (3) imaging of nuclear organization. With these tools, the consortium has provided over 2,000 public datasets. Integrative computational models based on these data are starting to reveal connections between genome structure and function. We then present a forward-looking perspective and outline current aims to (1) delineate dynamics of nuclear architecture at different timescales, from minutes to weeks as cells differentiate, in populations and in single cells, (2) characterize cis-determinants and trans-modulators of genome organization, (3) test functional consequences of changes in cis- and trans-regulators, and (4) develop predictive models of genome structure and function.


Subject(s)
Cell Nucleus , Genome , Genome/genetics , Cell Nucleus/genetics , Cell Nucleus/metabolism , Chromatin/metabolism
12.
Cell ; 163(3): 698-711, 2015 Oct 22.
Article in English | MEDLINE | ID: mdl-26496609

ABSTRACT

Most human transcripts are alternatively spliced, and many disease-causing mutations affect RNA splicing. Toward better modeling the sequence determinants of alternative splicing, we measured the splicing patterns of over two million (M) synthetic mini-genes, which include degenerate subsequences totaling over 100 M bases of variation. The massive size of these training data allowed us to improve upon current models of splicing, as well as to gain new mechanistic insights. Our results show that the vast majority of hexamer sequence motifs measurably influence splice site selection when positioned within alternative exons, with multiple motifs acting additively rather than cooperatively. Intriguingly, motifs that enhance (suppress) exon inclusion in alternative 5' splicing also enhance (suppress) exon inclusion in alternative 3' or cassette exon splicing, suggesting a universal mechanism for alternative exon recognition. Finally, our empirically trained models are highly predictive of the effects of naturally occurring variants on alternative splicing in vivo.


Subject(s)
Alternative Splicing , Genome, Human , Models, Genetic , Polymorphism, Single Nucleotide , Base Sequence , Humans , Molecular Sequence Data , Nucleotide Motifs , RNA Splice Sites
13.
Nature ; 626(8001): 1084-1093, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38355799

ABSTRACT

The house mouse (Mus musculus) is an exceptional model system, combining genetic tractability with close evolutionary affinity to humans1,2. Mouse gestation lasts only 3 weeks, during which the genome orchestrates the astonishing transformation of a single-cell zygote into a free-living pup composed of more than 500 million cells. Here, to establish a global framework for exploring mammalian development, we applied optimized single-cell combinatorial indexing3 to profile the transcriptional states of 12.4 million nuclei from 83 embryos, precisely staged at 2- to 6-hour intervals spanning late gastrulation (embryonic day 8) to birth (postnatal day 0). From these data, we annotate hundreds of cell types and explore the ontogenesis of the posterior embryo during somitogenesis and of kidney, mesenchyme, retina and early neurons. We leverage the temporal resolution and sampling depth of these whole-embryo snapshots, together with published data4-8 from earlier timepoints, to construct a rooted tree of cell-type relationships that spans the entirety of prenatal development, from zygote to birth. Throughout this tree, we systematically nominate genes encoding transcription factors and other proteins as candidate drivers of the in vivo differentiation of hundreds of cell types. Remarkably, the most marked temporal shifts in cell states are observed within one hour of birth and presumably underlie the massive physiological adaptations that must accompany the successful transition of a mammalian fetus to life outside the womb.


Subject(s)
Animals, Newborn , Embryo, Mammalian , Embryonic Development , Gastrula , Single-Cell Analysis , Time-Lapse Imaging , Animals , Female , Mice , Pregnancy , Animals, Newborn/embryology , Animals, Newborn/genetics , Cell Differentiation/genetics , Embryo, Mammalian/cytology , Embryo, Mammalian/embryology , Embryonic Development/genetics , Gastrula/cytology , Gastrula/embryology , Gastrulation/genetics , Kidney/cytology , Kidney/embryology , Mesoderm/cytology , Mesoderm/enzymology , Neurons/cytology , Neurons/metabolism , Retina/cytology , Retina/embryology , Somites/cytology , Somites/embryology , Time Factors , Transcription Factors/genetics , Transcription, Genetic , Organ Specificity/genetics
14.
Cell ; 159(5): 973-974, 2014 Nov 20.
Article in English | MEDLINE | ID: mdl-25416936

ABSTRACT

Genome replication programs are highly orchestrated. In this issue, Koren and colleagues leverage whole-genome sequencing data to discover that DNA replication timing patterns differ between individuals and are associated with genetic variants. These so-called rtQTLs represent a new form of quantitative trait loci that may influence gene dosage and mutational frequency and provide mechanistic insights into the regulation of DNA replication.


Subject(s)
Polymorphism, Genetic , Quantitative Trait Loci , Humans
16.
Cell ; 158(2): 263-276, 2014 Jul 17.
Article in English | MEDLINE | ID: mdl-24998929

ABSTRACT

Autism spectrum disorder (ASD) is a heterogeneous disease in which efforts to define subtypes behaviorally have met with limited success. Hypothesizing that genetically based subtype identification may prove more productive, we resequenced the ASD-associated gene CHD8 in 3,730 children with developmental delay or ASD. We identified a total of 15 independent mutations; no truncating events were identified in 8,792 controls, including 2,289 unaffected siblings. In addition to a high likelihood of an ASD diagnosis among patients bearing CHD8 mutations, characteristics enriched in this group included macrocephaly, distinct faces, and gastrointestinal complaints. chd8 disruption in zebrafish recapitulates features of the human phenotype, including increased head size as a result of expansion of the forebrain/midbrain and impairment of gastrointestinal motility due to a reduction in postmitotic enteric neurons. Our findings indicate that CHD8 disruptions define a distinct ASD subtype and reveal unexpected comorbidities between brain development and enteric innervation.


Subject(s)
Child Development Disorders, Pervasive/genetics , Child Development Disorders, Pervasive/physiopathology , DNA-Binding Proteins/genetics , Transcription Factors/genetics , Adolescent , Amino Acid Sequence , Animals , Brain/growth & development , Brain/pathology , Child , Child Development Disorders, Pervasive/classification , Child Development Disorders, Pervasive/pathology , Child, Preschool , DNA-Binding Proteins/metabolism , Female , Gastrointestinal Tract/innervation , Gastrointestinal Tract/physiopathology , Humans , Macaca mulatta , Male , Megalencephaly/pathology , Molecular Sequence Data , Mutation , Sequence Alignment , Transcription Factors/metabolism , Zebrafish , Zebrafish Proteins/genetics , Zebrafish Proteins/metabolism
17.
Nature ; 622(7983): 584-593, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37369347

ABSTRACT

The human embryo undergoes morphogenetic transformations following implantation into the uterus, but our knowledge of this crucial stage is limited by the inability to observe the embryo in vivo. Models of the embryo derived from stem cells are important tools for interrogating developmental events and tissue-tissue crosstalk during these stages1. Here we establish a model of the human post-implantation embryo, a human embryoid, comprising embryonic and extraembryonic tissues. We combine two types of extraembryonic-like cell generated by overexpression of transcription factors with wild-type embryonic stem cells and promote their self-organization into structures that mimic several aspects of the post-implantation human embryo. These self-organized aggregates contain a pluripotent epiblast-like domain surrounded by extraembryonic-like tissues. Our functional studies demonstrate that the epiblast-like domain robustly differentiates into amnion, extraembryonic mesenchyme and primordial germ cell-like cells in response to bone morphogenetic protein cues. In addition, we identify an inhibitory role for SOX17 in the specification of anterior hypoblast-like cells2. Modulation of the subpopulations in the hypoblast-like compartment demonstrates that extraembryonic-like cells influence epiblast-like domain differentiation, highlighting functional tissue-tissue crosstalk. In conclusion, we present a modular, tractable, integrated3 model of the human embryo that will enable us to probe key questions of human post-implantation development, a critical window during which substantial numbers of pregnancies fail.


Subject(s)
Embryo Implantation , Embryo, Mammalian , Embryonic Development , Models, Biological , Pluripotent Stem Cells , Female , Humans , Pregnancy , Bone Morphogenetic Proteins , Cell Differentiation , Embryo, Mammalian/cytology , Embryo, Mammalian/embryology , Embryoid Bodies/cytology , Germ Layers/cytology , Germ Layers/embryology , Human Embryonic Stem Cells/cytology , Transcription Factors/genetics , Transcription Factors/metabolism , Pluripotent Stem Cells/cytology
18.
Nature ; 623(7988): 782-791, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37968389

ABSTRACT

The maturation of single-cell transcriptomic technologies has facilitated the generation of comprehensive cellular atlases from whole embryos1-4. A majority of these data, however, has been collected from wild-type embryos without an appreciation for the latent variation that is present in development. Here we present the 'zebrafish single-cell atlas of perturbed embryos': single-cell transcriptomic data from 1,812 individually resolved developing zebrafish embryos, encompassing 19 timepoints, 23 genetic perturbations and a total of 3.2 million cells. The high degree of replication in our study (eight or more embryos per condition) enables us to estimate the variance in cell type abundance organism-wide and to detect perturbation-dependent deviance in cell type composition relative to wild-type embryos. Our approach is sensitive to rare cell types, resolving developmental trajectories and genetic dependencies in the cranial ganglia neurons, a cell population that comprises less than 1% of the embryo. Additionally, time-series profiling of individual mutants identified a group of brachyury-independent cells with strikingly similar transcriptomes to notochord sheath cells, leading to new hypotheses about early origins of the skull. We anticipate that standardized collection of high-resolution, organism-scale single-cell data from large numbers of individual embryos will enable mapping of the genetic dependencies of zebrafish cell types, while also addressing longstanding challenges in developmental genetics, including the cellular and transcriptional plasticity underlying phenotypic diversity across individuals.


Subject(s)
Embryo, Mammalian , Reverse Genetics , Single-Cell Analysis , Zebrafish , Animals , Embryo, Mammalian/embryology , Embryo, Mammalian/metabolism , Gene Expression Profiling , Gene Expression Regulation, Developmental , Reverse Genetics/methods , Transcriptome/genetics , Zebrafish/embryology , Zebrafish/genetics , Mutation , Single-Cell Analysis/methods , Notochord/cytology , Notochord/embryology
19.
Nature ; 623(7988): 772-781, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37968388

ABSTRACT

Mouse models are a critical tool for studying human diseases, particularly developmental disorders1. However, conventional approaches for phenotyping may fail to detect subtle defects throughout the developing mouse2. Here we set out to establish single-cell RNA sequencing of the whole embryo as a scalable platform for the systematic phenotyping of mouse genetic models. We applied combinatorial indexing-based single-cell RNA sequencing3 to profile 101 embryos of 22 mutant and 4 wild-type genotypes at embryonic day 13.5, altogether profiling more than 1.6 million nuclei. The 22 mutants represent a range of anticipated phenotypic severities, from established multisystem disorders to deletions of individual regulatory regions4,5. We developed and applied several analytical frameworks for detecting differences in composition and/or gene expression across 52 cell types or trajectories. Some mutants exhibit changes in dozens of trajectories whereas others exhibit changes in only a few cell types. We also identify differences between widely used wild-type strains, compare phenotyping of gain- versus loss-of-function mutants and characterize deletions of topological associating domain boundaries. Notably, some changes are shared among mutants, suggesting that developmental pleiotropy might be 'decomposable' through further scaling of this approach. Overall, our findings show how single-cell profiling of whole embryos can enable the systematic molecular and cellular phenotypic characterization of mouse mutants with unprecedented breadth and resolution.


Subject(s)
Developmental Disabilities , Embryo, Mammalian , Mutation , Phenotype , Single-Cell Gene Expression Analysis , Animals , Mice , Cell Nucleus/genetics , Developmental Disabilities/genetics , Developmental Disabilities/pathology , Embryo, Mammalian/metabolism , Embryo, Mammalian/pathology , Gain of Function Mutation , Genotype , Loss of Function Mutation , Models, Genetic , Disease Models, Animal
20.
Nature ; 608(7921): 98-107, 2022 08.
Article in English | MEDLINE | ID: mdl-35794474

ABSTRACT

DNA is naturally well suited to serve as a digital medium for in vivo molecular recording. However, contemporary DNA-based memory devices are constrained in terms of the number of distinct 'symbols' that can be concurrently recorded and/or by a failure to capture the order in which events occur1. Here we describe DNA Typewriter, a general system for in vivo molecular recording that overcomes these and other limitations. For DNA Typewriter, the blank recording medium ('DNA Tape') consists of a tandem array of partial CRISPR-Cas9 target sites, with all but the first site truncated at their 5' ends and therefore inactive. Short insertional edits serve as symbols that record the identity of the prime editing guide RNA2 mediating the edit while also shifting the position of the 'type guide' by one unit along the DNA Tape, that is, sequential genome editing. In this proof of concept of DNA Typewriter, we demonstrate recording and decoding of thousands of symbols, complex event histories and short text messages; evaluate the performance of dozens of orthogonal tapes; and construct 'long tape' potentially capable of recording as many as 20 serial events. Finally, we leverage DNA Typewriter in conjunction with single-cell RNA-seq to reconstruct a monophyletic lineage of 3,257 cells and find that the Poisson-like accumulation of sequential edits to multicopy DNA tape can be maintained across at least 20 generations and 25 days of in vitro clonal expansion.


Subject(s)
DNA , Gene Editing , Genome , CRISPR-Cas Systems/genetics , DNA/genetics , Gene Editing/methods , Genome/genetics , RNA, Guide, Kinetoplastida/genetics , RNA-Seq , Single-Cell Analysis , Time Factors
SELECTION OF CITATIONS
SEARCH DETAIL