RESUMO
Site-specific recombinase Int mediates integration of the bacteriophage λ genome into the Escherichia coli chromosome. Integration occurs once the Int tetramer, assisted by the integration host factor IHF, forms the intasome, a higher order structure, within which Int, a heterobivalent protein, interacts with two nonhomologous DNA sequences: the core recombination sites and the accessory arm sites. The binding to these sites is mediated by the catalytic C-terminal domain (CTD) and the regulatory N-terminal domain (NTD) of Int, respectively. Within Int, the NTD can activate or inhibit the recombination activity of the CTD depending on whether the NTD is bound to the arm sites. The CTD alone cannot mediate recombination, and even when the NTD and the CTD are mixed together as individual polypeptides, the NTD cannot trigger recombination in the CTD. In this work, we set to determine what modifications can unlock the recombination activity in the CTD alone and how the CTD can be modified to respond to recombination-triggering signals from the NTD. For this, we performed a series of genetic analyses, which showed that a single mutation that stabilizes the CTD on DNA, E174K, allows the CTD to recombine the core DNA sequences. When the NTD is paired with the CTD (E174K) that also bears a short polypeptide from the C terminus of the NTD, the resulting binary Int can recombine arm-bearing substrates. Our results provide insights into the molecular basis of the regulation of the Int activity and suggest how binary recombinases of the integrase type can be engineered.
Assuntos
Bacteriófago lambda , Integrases , Integrases/metabolismo , Bacteriófago lambda/genética , Recombinases/genética , Domínio Catalítico , Sítios de Ligação , Recombinação Genética , Escherichia coli/genética , Escherichia coli/metabolismoRESUMO
Genome engineering is a rapidly evolving field that benefits from the availability of different tools that can be used to perform genome manipulation tasks. We describe here the development of the Flp-TAL recombinases that can target genomic FRT-like sequences in their native chromosomal locations. Flp-TAL recombinases are hybrid enzymes that are composed of two functional modules: a variant of site-specific tyrosine recombinase Flp, which can have either narrow or broad target specificity, and the DNA-binding domain of the transcription activator-like effector, TAL. In Flp-TAL, the TAL module is responsible for delivering and stabilizing the Flp module onto the desired genomic FRT-like sequence where the Flp module mediates recombination. We demonstrate the functionality of the Flp-TAL recombinases by performing integration and deletion experiments in human HEK-293 cells. In the integration experiments we targeted a vector to three genomic FRT-like sequences located in the ß-globin locus. In the deletion experiments we excised ~ 15 kilobases of DNA that contained a fragment of the integrated vector sequence and the neighboring genome sequence. On average, the efficiency of the integration and deletion reactions was about 0.1% and 20%, respectively.
Assuntos
DNA Nucleotidiltransferases/metabolismo , Engenharia Genética/métodos , Recombinases/metabolismo , Tirosina/química , Catálise , DNA , Escherichia coli/genética , Deleção de Genes , Biblioteca Gênica , Terapia Genética/métodos , Genômica , Células HEK293 , Humanos , Mutação , Reação em Cadeia da Polimerase , Recombinação Genética , Biologia de Sistemas , Globinas beta/genéticaRESUMO
Recombinase-mediated cassette exchange, or RMCE, is a genome engineering tool that can be used to swap DNA fragments of interest between two DNA molecules. In a variation of RMCE, called dual RMCE, the exchange of DNA fragments is mediated by two recombinases in contrast to one recombinase in the classic RMCE reaction. Under optimal conditions, the efficiency of dual RMCE can be quite high: up to ~45% of the transfected cells depending on the recombinase pair used to mediate the replacement reaction. Here we describe protocols for preparing for, performing, and optimizing the parameters of dual RMCE.
Assuntos
DNA Nucleotidiltransferases/genética , Marcação de Genes/métodos , Integrases/genética , Mutagênese Sítio-Dirigida/métodos , Recombinação Genética , Tirosina/metabolismo , DNA Nucleotidiltransferases/metabolismo , Genes Reporter , Loci Gênicos , Vetores Genéticos/química , Vetores Genéticos/metabolismo , Proteínas de Fluorescência Verde/genética , Proteínas de Fluorescência Verde/metabolismo , Células HEK293 , Humanos , Integrases/metabolismo , Proteínas Luminescentes/genética , Proteínas Luminescentes/metabolismo , Regiões Promotoras Genéticas , Transfecção , Tirosina/químicaRESUMO
Genome engineering benefits from the availability of DNA modifying enzymes that have different target specificities and have optimized performance in different cell types. This variety of site-specific enzymes can be used to develop complex genome engineering applications at multiple loci. Although eight yeast site-specific tyrosine recombinases are known, only Flp is actively used in genome engineering. To expand the pool of the yeast site-specific tyrosine recombinases capable of mediating genome manipulations in mammalian cells, we engineered and analyzed variants of two tyrosine recombinases: R and TD. The activity of the evolved variants, unlike the activity of the native R and TD recombinases, is suitable for genome engineering in Escherichia coli and mammalian cells. Unexpectedly, we found that R recombinase benefits from the shortening of its C-terminus. We also found that the activity of wild-type R can be modulated by its non-consensus "head" sequence but this modulation became not apparent in the evolved R variants. The engineered recombinase variants were found to be active in all recombination reactions tested: excision, integration, and dual recombinase-mediated cassette exchange. The analysis of the latter reaction catalyzed by the R/TD recombinase pair shows that the condition supporting the most efficient replacement reaction favors efficient TD-mediated integration reaction while favoring efficient R-mediated integration and deletion reactions.
Assuntos
Marcação de Genes/métodos , Engenharia Genética/métodos , Recombinases/metabolismo , Saccharomyces cerevisiae/enzimologia , Animais , Escherichia coli/genética , Mamíferos , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Recombinases/genética , Sorbitol/análogos & derivados , Tiramina/análogos & derivadosRESUMO
Tyrosine site-specific recombinases (YRs) are widely distributed among prokaryotes and their viruses, and were thought to be confined to the budding yeast lineage among eukaryotes. However, YR-harboring retrotransposons (the DIRS and PAT families) and DNA transposons (Cryptons) have been identified in a variety of eukaryotes. The YRs utilize a common chemical mechanism, analogous to that of type IB topoisomerases, to bring about a plethora of genetic rearrangements with important physiological consequences in their respective biological contexts. A subset of the tyrosine recombinases has provided model systems for analyzing the chemical mechanisms and conformational features of the recombination reaction using chemical, biochemical, topological, structural, and single molecule-biophysical approaches. YRs with simple reaction requirements have been utilized to bring about programmed DNA rearrangements for addressing fundamental questions in developmental biology. They have also been employed to trace the topological features of DNA within high-order DNA interactions established by protein machines. The directed evolution of altered specificity YRs, combined with their spatially and temporally regulated expression, heralds their emergence as vital tools in genome engineering projects with wide-ranging biotechnological and medical applications.
Assuntos
Bactérias/enzimologia , Proteínas de Bactérias/metabolismo , DNA Nucleotidiltransferases/metabolismo , Proteínas Fúngicas/metabolismo , Recombinação Genética , Saccharomycetales/enzimologia , Tirosina/metabolismo , Bactérias/química , Bactérias/genética , Proteínas de Bactérias/química , Proteínas de Bactérias/genética , DNA Nucleotidiltransferases/química , DNA Nucleotidiltransferases/genética , Proteínas Fúngicas/química , Proteínas Fúngicas/genética , Saccharomycetales/química , Saccharomycetales/genéticaRESUMO
Genome engineering relies on DNA-modifying enzymes that are able to locate a DNA sequence of interest and initiate a desired genome rearrangement. Currently, the field predominantly utilizes site-specific DNA nucleases that depend on the host DNA repair machinery to complete a genome modification task. We show here that genome engineering approaches that employ target-specific variants of the self-sufficient, versatile site-specific DNA recombinase Flp can be developed into promising alternatives. We demonstrate that the Flp variant evolved to recombine an FRT-like sequence, FL-IL10A, which is located upstream of the human interleukin-10 gene, and can target this sequence in the model setting of Chinese hamster ovary and human embryonic kidney 293 cells. This target-specific Flp variant is able to perform the integration reaction and, when paired with another recombinase, the dual recombinase-mediated cassette exchange reaction. The efficiency of the integration reaction in human cells can be enhanced by 'humanizing' the Flp variant gene and by adding the nuclear localization sequence to the recombinase.
Assuntos
DNA Nucleotidiltransferases/genética , Engenharia Genética/métodos , Genoma , Plasmídeos/metabolismo , Recombinação Genética , Animais , Sequência de Bases , Células CHO , Códon , Cricetulus , DNA Nucleotidiltransferases/metabolismo , Evolução Molecular , Expressão Gênica , Genes Reporter , Proteínas de Fluorescência Verde/genética , Proteínas de Fluorescência Verde/metabolismo , Células HEK293 , Humanos , Integrases/genética , Integrases/metabolismo , Interleucina-10/genética , Interleucina-10/metabolismo , Proteínas Luminescentes/genética , Proteínas Luminescentes/metabolismo , Dados de Sequência Molecular , Plasmídeos/química , Proteínas Recombinantes de Fusão/genética , Proteínas Recombinantes de Fusão/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismoRESUMO
Recombinase-mediated cassette exchange, or RMCE, is a clean approach of gene delivery into a desired chromosomal location, as it is able to insert only the required sequences, leaving behind the unwanted ones. RMCE can be mediated by a single site-specific DNA recombinase or by two recombinases with different target specificities (dual RMCE). Recently, using the Flp-Cre recombinase pair, dual RMCE proved to be efficient, provided the relative ratio of the enzymes during the reaction is optimal. In the present report, we analyzed how the efficiency of dual RMCE mediated by the Flp-Int (HK022) pair depends on the variable input of the recombinases-the amount of the recombinase expression vectors added at transfection-and on the order of the addition of these vectors: sequential or simultaneous. We found that both in the sequential and the simultaneous modes, the efficiency of dual RMCE was critically dependent on the absolute and the relative concentrations of the Flp and Int expression vectors. Under optimal conditions, the efficiency of 'simultaneous' dual RMCE reached â¼12% of the transfected cells. Our results underline the importance of fine-tuning the reaction conditions for achieving the highest levels of dual RMCE.
Assuntos
DNA Nucleotidiltransferases/metabolismo , Integrases/metabolismo , Recombinação Genética , Animais , Bacteriófago HK022/enzimologia , Células CHO , Cricetinae , Cricetulus , Genes ReporterRESUMO
Recombinase-mediated cassette exchange (RMCE) is a powerful tool for unidirectional integration of DNA fragments of interest into a pre-determined genome locale. In this report, we examined how the efficiency of dual RMCE catalyzed by Flp and Cre depends on the nature of transcription units that express the recombinases. The following recombinase transcription units were analyzed: (i) Flp and Cre genes expressed as individual transcription units located on different vectors, (ii) Flp and Cre genes expressed as individual transcription units located on the same vector, (iii) Flp and Cre genes expressed from a single promoter and separated by internal ribosome entry sequence and (iv) Flp and Cre coding sequences separated by the 2A peptide and expressed as a single gene. We found that the highest level of dual RMCE (35-45% of the transfected cells) can be achieved when Flp and Cre recombinases are expressed as Flp-2A-Cre and Flp-IRES-Cre transcription units. In contrast, the lowest level of dual RMCE (â¼1% of the transfected cells) is achieved when Flp and Cre are expressed as individual transcription units. The analysis shows that it is the relative Flp-to-Cre ratio that critically affects the efficiency of dual RMCE. Our results will be helpful for maximizing the efficiency of dual RMCE aimed to engineer and re-engineer genomes.
Assuntos
DNA Nucleotidiltransferases/metabolismo , Integrases/metabolismo , Recombinação Genética , Animais , Células CHO , Cricetinae , Cricetulus , DNA Nucleotidiltransferases/genética , Vetores Genéticos , Integrases/genética , Peptídeos/genética , Transcrição Gênica , Transfecção , Regiões não TraduzidasRESUMO
Efficient and precise genome manipulations can be achieved by the Flp/FRT system of site-specific DNA recombination. Applications of this system are limited, however, to cases when target sites for Flp recombinase, FRT sites, are pre-introduced into a genome locale of interest. To expand use of the Flp/FRT system in genome engineering, variants of Flp recombinase can be evolved to recognize pre-existing genomic sequences that resemble FRT and thus can serve as recombination sites. To understand the distribution and sequence properties of genomic FRT-like sites, we performed a genome-wide analysis of FRT-like sites in the human genome using the experimentally-derived parameters. Out of 642,151 identified FRT-like sequences, 581,157 sequences were unique and 12,452 sequences had at least one exact duplicate. Duplicated FRT-like sequences are located mostly within LINE1, but also within LTRs of endogenous retroviruses, Alu repeats and other repetitive DNA sequences. The unique FRT-like sequences were classified based on the number of matches to FRT within the first four proximal bases pairs of the Flp binding elements of FRT and the nature of mismatched base pairs in the same region. The data obtained will be useful for the emerging field of genome engineering.
Assuntos
Genoma Humano/genética , Análise de Sequência de DNA/métodos , Sequência de Bases , Linhagem Celular , Cromossomos Humanos/genética , DNA Nucleotidiltransferases/metabolismo , Humanos , Dados de Sequência Molecular , Sequências Repetitivas de Ácido Nucleico/genética , Homologia de Sequência do Ácido Nucleico , SoftwareRESUMO
A comparison between the efficiency of recombinase-mediated cassette exchange (RMCE) reactions catalyzed in Escherichia coli by the site-specific recombinases Flp of yeast and Int of coliphage HK022 has revealed that an Flp-catalyzed RMCE reaction is more efficient than an Int-HK022 catalyzed reaction. In contrast, an RMCE reaction with 1 pair of frt sites and 1 pair of att sites catalyzed in the presence of both recombinases is very inefficient. However, the same reaction catalyzed by each recombinase individually supplied in a sequential order is very efficient, regardless of the order. Atomic force microscopy images of Flp with its DNA substrates show that only 1 pair of recombination sites forms a synaptic complex with the recombinase. The results suggest that the RMCE reaction is sequential.
Assuntos
Escherichia coli/genética , Escherichia coli/metabolismo , Recombinases/metabolismo , Recombinação Genética , DNA Nucleotidiltransferases/metabolismo , Microscopia de Força Atômica , Recombinases/genéticaRESUMO
The catalytic pentad of tyrosine recombinases, that assists the tyrosine nucleophile, includes a conserved histidine/tryptophan (His/Trp-III). Flp and Cre harbor tryptophan at this position; most of their kin recombinases display histidine. Contrary to the conservation rule, Flp(W330F) is a much stronger recombinase than Flp(W330H). The hydrophobicity of Trp330 or Phe330 is utilized in correctly positioning Tyr343 during the strand cleavage step of recombination. Why then is phenylalanine almost never encountered in the recombinase family at this conserved position? Using exogenous nucleophiles and synthetic methylphosphonate or 5'-thiolate substrates, we decipher that Trp330 also assists in the activation of the scissile phosphate and the departure of the 5'-hydroxyl leaving group. These two functions are consistent with the hydrogen bonding property of Trp330 as well as its location in structures of the Flp recombination complexes. However, van der Waals contact between Trp330 and Arg308 may also be important for the phosphate activation step. A structure based suppression strategy permits the inactive variant Flp(W330A) to be rescued by a second site mutation A339M. Modeling alanine and methionine at positions 330 and 339, respectively, in the Flp crystal structure suggests a plausible mechanism for active site restoration. Successful suppression suggests the possibility of evolving, by design, new active site configurations for tyrosine recombination.
Assuntos
Alanina/fisiologia , Domínio Catalítico , DNA Nucleotidiltransferases/química , Histidina/fisiologia , Triptofano/fisiologia , Alanina/química , Sequência de Aminoácidos , Domínio Catalítico/efeitos dos fármacos , Sequência Conservada/fisiologia , DNA Nucleotidiltransferases/genética , Histidina/química , Peróxido de Hidrogênio/farmacologia , Modelos Genéticos , Modelos Moleculares , Mutação , Recombinases/química , Recombinases/genética , Recombinases/metabolismo , Especificidade por Substrato , Triptofano/química , Triptofano/genética , Tiramina/farmacologia , Tirosina/metabolismoRESUMO
As a tool in directed genome manipulations, site-specific recombination is a double-edged sword. Exquisite specificity, while highly desirable, makes it imperative that the target site be first inserted at the desired genomic locale before it can be manipulated. We describe a combination of computational and experimental strategies, based on the tyrosine recombinase Flp and its target site FRT, to overcome this impediment. We document the systematic evolution of Flp variants that can utilize, in a bacterial assay, two sites from the human interleukin 10 gene, IL10, as recombination substrates. Recombination competence on an end target site is acquired via chimeric sites containing mixed sequences from FRT and the genomic locus. This is the first time that a tyrosine site-specific recombinase has been coaxed successfully to perform DNA exchange within naturally occurring sequences derived from a foreign genomic context. We demonstrate the ability of an Flp variant to mediate integration of a reporter cassette in Escherichia coli via recombination at one of the IL10-derived sites.
Assuntos
DNA Nucleotidiltransferases/metabolismo , Recombinação Genética , Sequência de Bases , Sítios de Ligação , DNA/química , DNA/metabolismo , DNA Nucleotidiltransferases/química , Evolução Molecular Direcionada , Escherichia coli/genética , Genômica , Humanos , Interleucina-10/genética , Plasmídeos/genética , Leveduras/genéticaAssuntos
DNA Nucleotidiltransferases/metabolismo , Proteínas Fúngicas/metabolismo , Engenharia Genética , Plasmídeos/metabolismo , Leveduras/genética , Proteínas de Ciclo Celular , Proteínas Cromossômicas não Histona , Proteínas Nucleares/metabolismo , Estrutura Terciária de Proteína , Recombinação Genética , Leveduras/metabolismo , CoesinasRESUMO
Strategies of directed evolution and combinatorial mutagenesis applied to the Flp site-specific recombinase have yielded recombination systems that utilize bi-specific hybrid target sites. A hybrid site is assembled from two half-sites, each harboring a distinct binding specificity. Satisfying the two specificities by a binary combination of Flp variants, while necessary, may not be sufficient to elicit recombination. We have identified amino acid substitutions that foster interprotomer collaboration between partner Flp variants to potentiate strand exchange in hybrid sites. One such substitution, A35T, acts specifically in cis with one of the two partners of a variant pair, Flp(K82M) and Flp(A35T, R281V). The same A35T mutation is also present within a group of mutations that rescue a Flp variant, Flp(Y60S), that is defective in establishing monomer-monomer interactions on the native Flp target site. Strikingly, these mutations are localized to peptide regions involved in interdomain and interprotomer interactions within the recombination complex. The same group of mutations, when transferred to the context of wild-type Flp, can relax its specificity to include non-native target sites. The hybrid Flp systems described here mimic the naturally occurring XerC/XerD recombination system that utilizes two recombinases with distinct DNA binding specificities. The ability to overcome the constraints of binding site symmetry in Flp recombination has important implications in the targeted manipulations of genomes.
Assuntos
DNA Nucleotidiltransferases/metabolismo , Mutação , Sequência de Bases , DNA , DNA Nucleotidiltransferases/química , DNA Nucleotidiltransferases/genética , Evolução Molecular , Hidrólise , Modelos Moleculares , Dados de Sequência Molecular , Mutagênese , Especificidade por SubstratoRESUMO
The Flp protein from Saccharomyces cerevisiae is one of the site-specific tyrosine family recombinases that are used widely in genomic engineering. As a first step towards mediating directed DNA rearrangements at non-native Flp recombination targets (mFRTs), we have evolved three separate groups of Flp variants that preferentially act on mFRTs containing substitutions at the first, seventh or both positions of the Flp-binding elements. The variants that recombine the double-mutant mFRT contain a subset of the mutations present in those that are active on the single-mutant mFRTs, plus additional mutations. Specificity for and discrimination between target sites, effected primarily by amino acid residues that contact DNA, can be modulated by those that do not interact with DNA or with a DNA-contacting residue. The degree of modulation can range from relaxed DNA specificity to almost completely altered specificity. Our results suggest that combined DNA shuffling and mutagenesis of libraries of Flp variants active on distinct mFRTs can yield variants that can recombine mFRTs containing combinations of the individual mutations.
Assuntos
DNA Nucleotidiltransferases/genética , DNA Nucleotidiltransferases/metabolismo , DNA/genética , DNA/metabolismo , Evolução Molecular Direcionada , Mutação/genética , Recombinação Genética , Pareamento de Bases , Sequência de Bases , Catálise , DNA/química , DNA Nucleotidiltransferases/química , Genes Reporter/genética , Lisina/genética , Modelos Moleculares , Dados de Sequência Molecular , Conformação Proteica , Engenharia de Proteínas , Saccharomyces cerevisiae/enzimologia , Saccharomyces cerevisiae/genética , Relação Estrutura-Atividade , Especificidade por SubstratoRESUMO
We have developed a dual reporter screen in Escherichia coli for identifying variants of the Flp site-specific recombinase that have acquired reactivity at an altered target site (mFRT). In one reporter, the lacZalpha gene segment is flanked by mFRTs in direct orientation. In the other, the red fluorescence protein (RFP) gene is flanked by the native FRTs. Hence, the color of a colony on an X-gal indicator plate indicates the recombination potential of the variant Flp protein expressed in it: blue if no recombination or only FRT recombination occurs, red if only mFRT recombination occurs and white if both FRT and mFRT recombinations occur. The scheme was validated by identification and in vivo characterization of Flp variants that show either relaxed specificity (active on FRT and mFRT) or moderately shifted specificity toward mFRT. We find that alteration of Lys-82 to Met, Thr, Arg or His enables the corresponding Flp variants to recombine FRT sites as well as altered FRT sites containing a substitution of G-C by C-G at position 1 of the Flp binding element (mFRT11). In contrast, wild-type Flp has no detectable activity on mFRT11. When Lys-82 is replaced by Tyr, the resulting Flp variant shows a small but reproducible preference for mFRT11 over FRT. However, this preference for mFRT11 is nearly lost when Tyr-82 is substituted by Phe.