RESUMEN
Bacterial transposons are pervasive mobile genetic elements that use distinct DNA-binding proteins for horizontal transmission. For example, Escherichia coli Tn7 homes to a specific attachment site using TnsD1, whereas CRISPR-associated transposons use type I or type V Cas effectors to insert downstream of target sites specified by guide RNAs2,3. Despite this targeting diversity, transposition invariably requires TnsB, a DDE-family transposase that catalyses DNA excision and insertion, and TnsC, a AAA+ ATPase that is thought to communicate between transposase and targeting proteins4. How TnsC mediates this communication and thereby regulates transposition fidelity has remained unclear. Here we use chromatin immunoprecipitation with sequencing to monitor in vivo formation of the type I-F RNA-guided transpososome, enabling us to resolve distinct protein recruitment events before integration. DNA targeting by the TniQ-Cascade complex is surprisingly promiscuous-hundreds of genomic off-target sites are sampled, but only a subset of those sites is licensed for TnsC and TnsB recruitment, revealing a crucial proofreading checkpoint. To advance the mechanistic understanding of interactions responsible for transpososome assembly, we determined structures of TnsC using cryogenic electron microscopy and found that ATP binding drives the formation of heptameric rings that thread DNA through the central pore, thereby positioning the substrate for downstream integration. Collectively, our results highlight the molecular specificity imparted by consecutive factor binding to genomic target sites during RNA-guided transposition, and provide a structural roadmap to guide future engineering efforts.
Asunto(s)
Adenosina Trifosfatasas , Elementos Transponibles de ADN , Proteínas de Unión al ADN , Proteínas de Escherichia coli , ARN Bacteriano , Adenosina Trifosfatasas/metabolismo , Secuenciación de Inmunoprecipitación de Cromatina , Elementos Transponibles de ADN/genética , ADN Bacteriano/química , ADN Bacteriano/metabolismo , Proteínas de Unión al ADN/metabolismo , Escherichia coli/enzimología , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/metabolismo , ARN Bacteriano/genética , ARN Bacteriano/metabolismo , Especificidad por Sustrato , Transposasas/metabolismoRESUMEN
Clustered regularly interspaced short palindromic repeats (CRISPR)-associated transposases have the potential to transform the technology landscape for kilobase-scale genome engineering, by virtue of their ability to integrate large genetic payloads with high accuracy, easy programmability and no requirement for homologous recombination machinery. These transposons encode efficient, CRISPR RNA-guided transposases that execute genomic insertions in Escherichia coli at efficiencies approaching ~100%. Moreover, they generate multiplexed edits when programmed with multiple guides, and function robustly in diverse Gram-negative bacterial species. Here we present a detailed protocol for engineering bacterial genomes using CRISPR-associated transposase (CAST) systems, including guidelines on the available vectors, customization of guide RNAs and DNA payloads, selection of common delivery methods, and genotypic analysis of integration events. We further describe a computational CRISPR RNA design algorithm to avoid potential off-targets, and a CRISPR array cloning pipeline for performing multiplexed DNA insertions. The method presented here allows the isolation of clonal strains containing a novel genomic integration event of interest within 1-2 weeks using available plasmid constructs and standard molecular biology techniques.
Asunto(s)
Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Transposasas , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas/genética , Transposasas/genética , ARN Guía de Sistemas CRISPR-Cas , Genoma Bacteriano , ADN , Escherichia coli/genética , Sistemas CRISPR-Cas/genética , Ingeniería Genética/métodos , Edición GénicaRESUMEN
Conventional genome engineering with CRISPR-Cas9 creates double-strand breaks (DSBs) that lead to undesirable byproducts and reduce product purity. Here we report an approach for programmable integration of large DNA sequences in human cells that avoids the generation of DSBs by using Type I-F CRISPR-associated transposases (CASTs). We optimized DNA targeting by the QCascade complex through protein design and developed potent transcriptional activators by exploiting the multi-valent recruitment of the AAA+ ATPase TnsC to genomic sites targeted by QCascade. After initial detection of plasmid-based integration, we screened 15 additional CAST systems from a wide range of bacterial hosts, identified a homolog from Pseudoalteromonas that exhibits improved activity and further increased integration efficiencies. Finally, we discovered that bacterial ClpX enhances genomic integration by multiple orders of magnitude, likely by promoting active disassembly of the post-integration CAST complex, akin to its known role in Mu transposition. Our work highlights the ability to reconstitute complex, multi-component machineries in human cells and establishes a strong foundation to exploit CRISPR-associated transposases for eukaryotic genome engineering.
Asunto(s)
Sistemas CRISPR-Cas , Transposasas , Humanos , Sistemas CRISPR-Cas/genética , Transposasas/genética , Plásmidos , ADN , Genoma , Edición GénicaRESUMEN
CRISPR-associated transposons (CASTs) have the potential to transform the technology landscape for kilobase-scale genome engineering, by virtue of their ability to integrate large genetic payloads with high accuracy, easy programmability, and no requirement for homologous recombination machinery. These transposons encode efficient, CRISPR RNA-guided transposases that execute genomic insertions in E. coli at efficiencies approaching ~100%, generate multiplexed edits when programmed with multiple guides, and function robustly in diverse Gram-negative bacterial species. Here we present a detailed protocol for engineering bacterial genomes using CAST systems, including guidelines on the available homologs and vectors, customization of guide RNAs and DNA payloads, selection of common delivery methods, and genotypic analysis of integration events. We further describe a computational crRNA design algorithm to avoid potential off-targets and CRISPR array cloning pipeline for DNA insertion multiplexing. Starting from available plasmid constructs, the isolation of clonal strains containing a novel genomic integration event-of-interest can be achieved in 1 week using standard molecular biology techniques.
RESUMEN
Traditional genome-editing reagents such as CRISPR-Cas9 achieve targeted DNA modification by introducing double-strand breaks (DSBs), thereby stimulating localized DNA repair by endogenous cellular repair factors. While highly effective at generating heterogenous knockout mutations, this approach suffers from undesirable byproducts and an inability to control product purity. Here we develop a system in human cells for programmable, DSB-free DNA integration using Type I CRISPR-associated transposons (CASTs). To adapt our previously described CAST systems, we optimized DNA targeting by the QCascade complex through a comprehensive assessment of protein design, and we developed potent transcriptional activators by exploiting the multi-valent recruitment of the AAA+ ATPase, TnsC, to genomic sites targeted by QCascade. After initial detection of plasmid-based transposition, we screened 15 homologous CAST systems from a wide range of bacterial hosts, identified a CAST homolog from Pseudoalteromonas that exhibited improved activity, and increased integration efficiencies through parameter optimization. We further discovered that bacterial ClpX enhances genomic integration by multiple orders of magnitude, and we propose that this critical accessory factor functions to drive active disassembly of the post-transposition CAST complex, akin to its demonstrated role in Mu transposition. Our work highlights the ability to functionally reconstitute complex, multi-component machineries in human cells, and establishes a strong foundation to realize the full potential of CRISPR-associated transposons for human genome engineering.
RESUMEN
Bacterial transposons propagate through either non-replicative (cut-and-paste) or replicative (copy-and-paste) pathways, depending on how the mobile element is excised from its donor source. In the well-characterized E. coli transposon Tn7, a heteromeric TnsA-TnsB transposase directs cut-and-paste transposition by cleaving both strands at each transposon end during the excision step. Whether a similar pathway is involved for RNA-guided transposons, in which CRISPR-Cas systems confer DNA target specificity, has not been determined. Here, we apply long-read, population-based whole-genome sequencing (WGS) to unambiguously resolve transposition products for two evolutionarily distinct transposon types that employ either Cascade or Cas12k for RNA-guided DNA integration. Our results show that RNA-guided transposon systems lacking functional TnsA primarily undergo copy-and-paste transposition, generating cointegrate products that comprise duplicated transposon copies and genomic insertion of the vector backbone. Finally, we report natural and engineered transposon variants encoding a TnsAB fusion protein, revealing a novel strategy for achieving RNA-guided transposition with fewer molecular components.
RESUMEN
Existing technologies for site-specific integration of kilobase-sized DNA sequences in bacteria are limited by low efficiency, a reliance on recombination, the need for multiple vectors, and challenges in multiplexing. To address these shortcomings, we introduce a substantially improved version of our previously reported Tn7-like transposon from Vibrio cholerae, which uses a Type I-F CRISPR-Cas system for programmable, RNA-guided transposition. The optimized insertion of transposable elements by guide RNA-assisted targeting (INTEGRATE) system achieves highly accurate and marker-free DNA integration of up to 10 kilobases at ~100% efficiency in bacteria. Using multi-spacer CRISPR arrays, we achieved simultaneous multiplexed insertions in three genomic loci and facile, multi-loci deletions by combining orthogonal integrases and recombinases. Finally, we demonstrated robust function in biomedically and industrially relevant bacteria and achieved target- and species-specific integration in a complex bacterial community. This work establishes INTEGRATE as a versatile tool for multiplexed, kilobase-scale genome engineering.