Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 36
Filter
1.
Bioinformatics ; 2024 Jul 22.
Article in English | MEDLINE | ID: mdl-39037932

ABSTRACT

SUMMARY: Analysing protein structure similarities is an important step in protein engineering and drug discovery. Methodologies that are more advanced than simple RMSD are available but often require extensive mathematical or computational knowledge for implementation. Grouping and optimising such tools in an efficient open-source library increases accessibility and encourages the adoption of more advanced metrics. Melodia is a Python library with a complete set of components devised for describing, comparing and analysing the shape of protein structures using differential geometry of three-dimensional curves and knot theory. It can generate robust geometric descriptors for thousands of shapes in just a few minutes. Those descriptors are more sensitive to structural feature variation than RMSD deviation. Melodia also incorporates sequence structural annotation and three-dimensional visualisations. AVAILABILITY AND IMPLEMENTATION: Melodia is an open-source Python library freely available on https://github.com/rwmontalvao/Melodia_py, along with interactive Jupyter Notebook tutorials. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

2.
Nucleic Acids Res ; 50(16): 9568-9579, 2022 09 09.
Article in English | MEDLINE | ID: mdl-36018798

ABSTRACT

Plasmids of the ColE1 family are among the most frequently used in molecular biology. They were adopted early for many biotechnology applications, and as models to study plasmid biology. Their mechanism of replication is well understood, involving specific interactions between a plasmid encoded sense-antisense gene pair (RNAI and RNAII). Due to such mechanism, two plasmids with the same origin cannot be stably maintained in cells-a process known as incompatibility. While mutations in RNAI and RNAII can make colE1 more compatible, there has been no systematic effort to engineer new compatible colE1 origins, which could bypass technical design constraints for multi-plasmid applications. Here, we show that by diversifying loop regions in RNAI (and RNAII), it is possible to select new viable colE1 origins compatible with the wild-type one. We demonstrate that sequence divergence is not sufficient to enable compatibility and pairwise interactions are not an accurate guide for higher order interactions. We identify potential principles to engineer plasmid copy number independently from other regulatory strategies and we propose plasmid compatibility as a tractable model to study biological orthogonality.


Subject(s)
DNA Replication , RNA, Bacterial , RNA, Bacterial/genetics , DNA Replication/genetics , Escherichia coli/genetics , Base Sequence , Plasmids/genetics
3.
Nature ; 518(7539): 427-30, 2015 Feb 19.
Article in English | MEDLINE | ID: mdl-25470036

ABSTRACT

The emergence of catalysis in early genetic polymers such as RNA is considered a key transition in the origin of life, pre-dating the appearance of protein enzymes. DNA also demonstrates the capacity to fold into three-dimensional structures and form catalysts in vitro. However, to what degree these natural biopolymers comprise functionally privileged chemical scaffolds for folding or the evolution of catalysis is not known. The ability of synthetic genetic polymers (XNAs) with alternative backbone chemistries not found in nature to fold into defined structures and bind ligands raises the possibility that these too might be capable of forming catalysts (XNAzymes). Here we report the discovery of such XNAzymes, elaborated in four different chemistries (arabino nucleic acids, ANA; 2'-fluoroarabino nucleic acids, FANA; hexitol nucleic acids, HNA; and cyclohexene nucleic acids, CeNA) directly from random XNA oligomer pools, exhibiting in trans RNA endonuclease and ligase activities. We also describe an XNA-XNA ligase metalloenzyme in the FANA framework, establishing catalysis in an entirely synthetic system and enabling the synthesis of FANA oligomers and an active RNA endonuclease FANAzyme from its constituent parts. These results extend catalysis beyond biopolymers and establish technologies for the discovery of catalysts in a wide range of polymer scaffolds not found in nature. Evolution of catalysis independent of any natural polymer has implications for the definition of chemical boundary conditions for the emergence of life on Earth and elsewhere in the Universe.


Subject(s)
Nucleic Acids/chemical synthesis , Nucleic Acids/metabolism , Polymers/chemistry , Polymers/chemical synthesis , Base Sequence , Catalysis , Endonucleases/metabolism , Ligases/metabolism , Nucleic Acids/chemistry , Polymers/metabolism , RNA/metabolism
4.
Nucleic Acids Res ; 47(13): 7130-7142, 2019 07 26.
Article in English | MEDLINE | ID: mdl-31334814

ABSTRACT

Xenobiotic nucleic acids (XNA) are nucleic acid analogues not present in nature that can be used for the storage of genetic information. In vivo XNA applications could be developed into novel biocontainment strategies, but are currently limited by the challenge of developing XNA processing enzymes such as polymerases, ligases and nucleases. Here, we present a structure-guided modelling-based strategy for the rational design of those enzymes essential for the development of XNA molecular biology. Docking of protein domains to unbound double-stranded nucleic acids is used to generate a first approximation of the extensive interaction of nucleic acid processing enzymes with their substrate. Molecular dynamics is used to optimise that prediction allowing, for the first time, the accurate prediction of how proteins that form toroidal complexes with nucleic acids interact with their substrate. Using the Chlorella virus DNA ligase as a proof of principle, we recapitulate the ligase's substrate specificity and successfully predict how to convert it into an XNA-templated XNA ligase.


Subject(s)
DNA Ligases/metabolism , Viral Proteins/metabolism , Computer Simulation , DNA Ligases/chemistry , DNA Viruses/enzymology , DNA, Viral/metabolism , Deoxyribonuclease BamHI/metabolism , Models, Chemical , Molecular Docking Simulation , Mutagenesis, Site-Directed , Nucleic Acid Conformation , Protein Binding , Protein Conformation , Structure-Activity Relationship , Substrate Specificity , Templates, Genetic , Viral Proteins/chemistry
5.
Nucleic Acids Res ; 47(5): 2160-2168, 2019 03 18.
Article in English | MEDLINE | ID: mdl-30698800

ABSTRACT

Six 1',5'-anhydrohexitol uridine triphosphates were synthesized with aromatic substitutions appended via a carboxamide linker to the 5-position of their bases. An improved method for obtaining such 5-substituted hexitol nucleosides and nucleotides is described. The incorporation profile of the nucleotide analogues into a DNA duplex overhang using recently evolved XNA polymerases is compared. Long, mixed HNA sequences featuring the base modifications are generated. The apparent binding affinity of four of the nucleotides to the enzyme, the rate of the chemical step and of product release, plus the specificity constant for the incorporation of these modified nucleotides into a DNA duplex overhang using the HNA polymerase T6G12_I521L are determined via pre-steady-state kinetics. HNA polymers displaying aromatic functional groups could have significant impact on the isolation of stable and high-affinity binders and catalysts, or on the design of nanomaterials.


Subject(s)
DNA-Directed DNA Polymerase/metabolism , Nucleotides/chemical synthesis , Nucleotides/metabolism , Sugar Alcohols/chemistry , Sugar Alcohols/metabolism , Kinetics , Nucleotides/chemistry , Protein Engineering , Substrate Specificity
6.
Chembiochem ; 21(19): 2844-2853, 2020 10 01.
Article in English | MEDLINE | ID: mdl-32413179

ABSTRACT

Directed evolution has been remarkably successful at expanding the chemical and functional boundaries of biology. That progress is heavily dependent on the robustness and flexibility of the available selection platforms, given the significant cost to (re)develop a given platform to target a new desired function. Bacterial cell display has a significant track record as a viable strategy for the engineering of mesophilic enzymes, as enzyme activity can be probed directly and free from interference from the cellular milieu, but its adoption has lagged behind other display-based methods. Herein, we report the development of SNAP as a quantitative reporter for bacterial cell display, which enables fast troubleshooting and the systematic development of the display-based selection platform, thus improving its robustness. In addition, we demonstrate that even weak interactions between displayed proteins and nucleic acids can be harnessed for the specific labelling of bacterial cells, allowing functional characterisation of DNA binding proteins and enzymes, thus making it a highly flexible platform for these biochemical functions. Together, this establishes bacterial display as a robust and flexible platform, ideally suited for the systematic engineering of ligands and enzymes needed for XNA molecular biology.


Subject(s)
Alkyl and Aryl Transferases/metabolism , Bacteria/metabolism , Bioengineering , Alkyl and Aryl Transferases/genetics , Bacteria/cytology , Bacteria/genetics , Humans , Ligands , Nucleic Acids/metabolism , Xenobiotics/metabolism
7.
Nucleic Acids Res ; 46(8): e51, 2018 05 04.
Article in English | MEDLINE | ID: mdl-29409059

ABSTRACT

Engineering proteins for designer functions and biotechnological applications almost invariably requires (or at least benefits from) multiple mutations to non-contiguous residues. Several methods for multiple site-directed mutagenesis exist, but there remains a need for fast and simple methods to efficiently introduce such mutations - particularly for generating large, high quality libraries for directed evolution. Here, we present Darwin Assembly, which can deliver high quality libraries of >108 transformants, targeting multiple (>10) distal sites with minimal wild-type contamination (<0.25% of total population) and which takes a single working day from purified plasmid to library transformation. We demonstrate its efficacy with whole gene codon reassignment of chloramphenicol acetyl transferase, mutating 19 codons in a single reaction in KOD DNA polymerase and generating high quality, multiple-site libraries in T7 RNA polymerase and Tgo DNA polymerase. Darwin Assembly uses commercially available enzymes, can be readily automated, and offers a cost-effective route to highly complex and customizable library generation.


Subject(s)
Directed Molecular Evolution/methods , Mutagenesis, Site-Directed/methods , Peptide Library , Protein Engineering/methods , Chloramphenicol O-Acetyltransferase/genetics , Codon/genetics , INDEL Mutation , Plasmids/genetics , Point Mutation , Synthetic Biology/methods
8.
J Am Chem Soc ; 140(21): 6690-6699, 2018 05 30.
Article in English | MEDLINE | ID: mdl-29722977

ABSTRACT

Although several synthetic or xenobiotic nucleic acids (XNAs) have been shown to be viable genetic materials in vitro, major hurdles remain for their in vivo applications, particularly orthogonality. The availability of XNAs that do not interact with natural nucleic acids and are not affected by natural DNA processing enzymes, as well as specialized XNA processing enzymes that do not interact with natural nucleic acids, is essential. Here, we report 3'-2' phosphonomethyl-threosyl nucleic acid (tPhoNA) as a novel XNA genetic material and a prime candidate for in vivo XNA applications. We established routes for the chemical synthesis of phosphonate nucleic acids and phosphorylated monomeric building blocks, and we demonstrated that DNA duplexes were destabilized upon replacement with tPhoNA. We engineered a novel tPhoNA synthetase enzyme and, with a previously reported XNA reverse transcriptase, demonstrated that tPhoNA is a viable genetic material (with an aggregate error rate of approximately 17 × 10-3 per base) compatible with the isolation of functional XNAs. In vivo experiments to test tPhoNA orthogonality showed that the E. coli cellular machinery had only very limited potential to access genetic information in tPhoNA. Our work is the first report of a synthetic genetic material modified in both sugar and phosphate backbone moieties and represents a significant advance in biorthogonality toward the introduction of XNA systems in vivo.


Subject(s)
DNA/chemistry , Organophosphonates/chemistry , Polymers/metabolism , Xenobiotics/metabolism , DNA/metabolism , Ligases/chemistry , Ligases/metabolism , Models, Molecular , Molecular Structure , Organophosphonates/metabolism , Polymers/chemistry , Protein Engineering , Xenobiotics/chemistry
9.
Chemistry ; 24(48): 12695-12707, 2018 Aug 27.
Article in English | MEDLINE | ID: mdl-29883012

ABSTRACT

The synthesis, base pairing properties and in vitro (polymerase) and in vivo (E. coli) recognition of 2'-deoxynucleotides with a 2-amino-6-methyl-8-oxo-7,8-dihydro-purine (X), a 2-methyl-6-thiopurine (Y) and a 6-methyl-4-pyrimidone (Z) base moiety are described. As demonstrated by Tm measurements, the X and Y bases fail to form a self-complementary base pair. Despite this failure, enzymatic incorporation experiments show that selected DNA polymerases recognize the X nucleotide and incorporate this modified nucleotide versus X in the template. In vivo, X is mainly recognized as a A/G or C base; Y is recognized as a G or C base and Z is mostly recognized as T or C. Replacing functional groups in nucleobases normally involved in W-C recognition (6-carbonyl and 2-amino group of purine; 6-carbonyl of pyrimidine) readily leads to orthogonality (absence of base pairing with natural bases).

10.
Biochem Soc Trans ; 44(4): 1165-75, 2016 08 15.
Article in English | MEDLINE | ID: mdl-27528765

ABSTRACT

Life on Earth is incredibly diverse. Yet, underneath that diversity, there are a number of constants and highly conserved processes: all life is based on DNA and RNA; the genetic code is universal; biology is limited to a small subset of potential chemistries. A vast amount of knowledge has been accrued through describing and characterizing enzymes, biological processes and organisms. Nevertheless, much remains to be understood about the natural world. One of the goals in Synthetic Biology is to recapitulate biological complexity from simple systems made from biological molecules-gaining a deeper understanding of life in the process. Directed evolution is a powerful tool in Synthetic Biology, able to bypass gaps in knowledge and capable of engineering even the most highly conserved biological processes. It encompasses a range of methodologies to create variation in a population and to select individual variants with the desired function-be it a ligand, enzyme, pathway or even whole organisms. Here, we present some of the basic frameworks that underpin all evolution platforms and review some of the recent contributions from directed evolution to synthetic biology, in particular methods that have been used to engineer the Central Dogma and the genetic code.


Subject(s)
Directed Molecular Evolution , Genetic Code/genetics , Nucleic Acids/genetics , Synthetic Biology/methods , Animals , Evolution, Molecular , Genetic Variation , Humans , Origin of Life , Selection, Genetic
12.
Chembiochem ; 16(6): 899-901, 2015 Apr 13.
Article in English | MEDLINE | ID: mdl-25739380

ABSTRACT

Origin of life: Although current life is homochiral (with D nucleic acids), little is known about how homochirality emerged or even if it was a necessary step. The isolation of cross-chiral nucleic acid ligases demonstrate that an early heterochiral life could have been possible.


Subject(s)
DNA-Directed RNA Polymerases/metabolism , RNA, Catalytic/chemistry , RNA, Catalytic/metabolism , RNA/biosynthesis , RNA/chemistry
13.
Chemistry ; 21(13): 5009-22, 2015 Mar 23.
Article in English | MEDLINE | ID: mdl-25684598

ABSTRACT

The synthesis, base-pairing properties and in vitro and in vivo characteristics of 5-methyl-isocytosine (isoC(Me) ) and isoguanine (isoG) nucleosides, incorporated in an HNA(h) (hexitol nucleic acid)-DNA(d) mosaic backbone, are described. The required h-isoG phosphoramidite was prepared by a selective deamination as a key step. As demonstrated by Tm measurements the hexitol sugar showed slightly better mismatch discrimination against dT. The d-isoG base mispairing follows the order T>G>C while the h-isoG base mispairing follows the order G>C>T. The h- and d-isoC(Me) bases mainly mispair with G. Enzymatic incorporation experiments show that the hexitol backbone has a variable effect on selectivity. In the enzymatic assays, isoG misincorporates mainly with T, and isoC(Me) misincorporates mainly with A. Further analysis in vivo confirmed the patterns of base-pair interpretation for the deoxyribose and hexitol isoC(Me) /isoG bases in a cellular context, through incorporation of the bases into plasmidic DNA. Results in vivo demonstrated that mispairing and misincorporation was dependent on the backbone scaffold of the base, which indicates rational advances towards orthogonality.


Subject(s)
5-Methylcytosine/analogs & derivatives , Guanine/chemistry , Nucleosides/chemistry , 5-Methylcytosine/chemistry , Molecular Structure
14.
Bioessays ; 35(2): 113-22, 2013 Feb.
Article in English | MEDLINE | ID: mdl-23281109

ABSTRACT

DNA and RNA are the only known natural genetic materials. Systematic modification of each of their chemical building blocks (nucleobase, sugar, and phosphate) has enabled the study of the key properties that make those nucleic acids genetic materials. All three moieties contribute to replication and, significantly, all three moieties can be replaced by synthetic analogs without loss of function. Synthetic nucleic acid polymers capable of storing and propagating information not only expand the central dogma, but also highlight that DNA and RNA are not unique chemical solutions for genetic information storage. By considering replication as a question of information transfer, we propose that any polymer that can be replicated could serve as a genetic material.


Subject(s)
Carbohydrates/chemical synthesis , DNA/chemistry , Polymers/chemical synthesis , Purines/chemistry , Pyrimidines/chemistry , RNA/chemistry , Base Pairing , Carbohydrates/genetics , DNA/genetics , DNA Replication , Evolution, Molecular , Information Theory , Oligonucleotides/chemistry , Oligonucleotides/genetics , RNA/genetics , Thermodynamics
15.
Proc Natl Acad Sci U S A ; 109(21): 8067-72, 2012 May 22.
Article in English | MEDLINE | ID: mdl-22566643

ABSTRACT

DNA polymerase substrate specificity is fundamental to genome integrity and to polymerase applications in biotechnology. In the current paradigm, active site geometry is the main site of specificity control. Here, we describe the discovery of a distinct specificity checkpoint located over 25 Å from the active site in the polymerase thumb subdomain. In Tgo, the replicative DNA polymerase from Thermococcus gorgonarius, we identify a single mutation (E664K) within this region that enables translesion synthesis across a template abasic site or a cyclobutane thymidine dimer. In conjunction with a classic "steric-gate" mutation (Y409G) in the active site, E664K transforms Tgo DNA polymerase into an RNA polymerase capable of synthesizing RNAs up to 1.7 kb long as well as fully pseudouridine-, 5-methyl-C-, 2'-fluoro-, or 2'-azido-modified RNAs primed from a wide range of primer chemistries comprising DNA, RNA, locked nucleic acid (LNA), or 2'O-methyl-DNA. We find that E664K enables RNA synthesis by selectively increasing polymerase affinity for the noncognate RNA/DNA duplex as well as lowering the K(m) for ribonucleotide triphosphate incorporation. This gatekeeper mutation therefore identifies a key missing step in the adaptive path from DNA to RNA polymerases and defines a previously unknown postsynthetic determinant of polymerase substrate specificity with implications for the synthesis and replication of noncognate nucleic acid polymers.


Subject(s)
Archaeal Proteins/genetics , DNA Replication/genetics , DNA-Directed DNA Polymerase/genetics , DNA-Directed RNA Polymerases/genetics , Thermococcus/enzymology , Thermococcus/genetics , Amino Acid Sequence , DNA, Archaeal/genetics , Molecular Sequence Data , Point Mutation/genetics , Protein Engineering/methods , Substrate Specificity
16.
Protein Sci ; 33(4): e4940, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38511482

ABSTRACT

Estrogen receptor α is commonly used in synthetic biology to control the activity of genome editing tools. The activating ligands, estrogens, however, interfere with various cellular processes, thereby limiting the applicability of this receptor. Altering its ligand preference to chemicals of choice solves this hurdle but requires adaptation of unspecified ligand-interacting residues. Here, we provide a solution by combining rational protein design with multi-site-directed mutagenesis and directed evolution of stably integrated variants in Saccharomyces cerevisiae. This method yielded an estrogen receptor variant, named TERRA, that lost its estrogen responsiveness and became activated by tamoxifen, an anti-estrogenic drug used for breast cancer treatment. This tamoxifen preference of TERRA was maintained in mammalian cells and mice, even when fused to Cre recombinase, expanding the mammalian synthetic biology toolbox. Not only is our platform transferable to engineer ligand preference of any steroid receptor, it can also profile drug-resistance landscapes for steroid receptor-targeted therapies.


Subject(s)
Estradiol , Estrogen Receptor alpha , Animals , Mice , Estrogen Receptor alpha/genetics , Estrogen Receptor alpha/chemistry , Estrogen Receptor alpha/metabolism , Estradiol/chemistry , Estradiol/metabolism , Ligands , Tamoxifen/pharmacology , Tamoxifen/metabolism , Receptors, Estrogen/genetics , Receptors, Estrogen/chemistry , Receptors, Estrogen/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Mammals
17.
ISME Commun ; 2(1): 40, 2022 Apr 25.
Article in English | MEDLINE | ID: mdl-37938658

ABSTRACT

A bottleneck for microbial community experiments with many samples and/or replicates is the fast quantification of individual taxon abundances, which is commonly achieved through sequencing marker genes such as the 16S rRNA gene. Here, we propose a new approach for high-throughput and high-quality enumeration of human gut bacteria in a defined community, combining flow cytometry and supervised classification to identify and quantify species mixed in silico and in defined communities in vitro. We identified species in a 5-species in silico community with an F1 score of 71%. In addition, we demonstrate in vitro that our method performs equally well or better than 16S rRNA gene sequencing in two-species cocultures and agrees with 16S rRNA gene sequencing data on the most abundant species in a four-species community. We found that shape and size differences alone are insufficient to distinguish species, and that it is thus necessary to exploit the multivariate nature of flow cytometry data. Finally, we observed that variability of flow cytometry data across replicates differs between gut bacterial species. In conclusion, the performance of supervised classification of gut species in flow cytometry data is species-dependent, but is for some combinations accurate enough to serve as a faster alternative to 16S rRNA gene sequencing.

18.
Sci Rep ; 11(1): 9134, 2021 04 28.
Article in English | MEDLINE | ID: mdl-33911147

ABSTRACT

Insertions and deletions (indels) are known to affect function, biophysical properties and substrate specificity of enzymes, and they play a central role in evolution. Despite such clear significance, this class of mutation remains an underexploited tool in protein engineering with few available platforms capable of systematically generating and analysing libraries of varying sequence composition and length. We present a novel DNA assembly platform (InDel assembly), based on cycles of endonuclease restriction digestion and ligation of standardised dsDNA building blocks, that can generate libraries exploring both composition and sequence length variation. In addition, we developed a framework to analyse the output of selection from InDel-generated libraries, combining next generation sequencing and alignment-free strategies for sequence analysis. We demonstrate the approach by engineering the well-characterized TEM-1 ß-lactamase Ω-loop, involved in substrate specificity, identifying multiple novel extended spectrum ß-lactamases with loops of modified length and composition-areas of the sequence space not previously explored. Together, the InDel assembly and analysis platforms provide an efficient route to engineer protein loops or linkers where sequence length and composition are both essential functional parameters.


Subject(s)
Protein Engineering , beta-Lactamases/metabolism , Amino Acid Sequence , DNA/chemistry , DNA/metabolism , Gene Library , High-Throughput Nucleotide Sequencing , INDEL Mutation , Protein Structure, Secondary , Sequence Analysis, DNA , Substrate Specificity , beta-Lactamases/genetics
19.
Biomolecules ; 10(12)2020 12 08.
Article in English | MEDLINE | ID: mdl-33302546

ABSTRACT

Archaeal DNA polymerases from the B-family (polB) have found essential applications in biotechnology. In addition, some of their variants can accept a wide range of modified nucleotides or xenobiotic nucleotides, such as 1,5-anhydrohexitol nucleic acid (HNA), which has the unique ability to selectively cross-pair with DNA and RNA. This capacity is essential to allow the transmission of information between different chemistries of nucleic acid molecules. Variants of the archaeal polymerase from Thermococcus gorgonarius, TgoT, that can either generate HNA from DNA (TgoT_6G12) or DNA from HNA (TgoT_RT521) have been previously identified. To understand how DNA and HNA are recognized and selected by these two laboratory-evolved polymerases, we report six X-ray structures of these variants, as well as an in silico model of a ternary complex with HNA. Structural comparisons of the apo form of TgoT_6G12 together with its binary and ternary complexes with a DNA duplex highlight an ensemble of interactions and conformational changes required to promote DNA or HNA synthesis. MD simulations of the ternary complex suggest that the HNA-DNA hybrid duplex remains stable in the A-DNA helical form and help explain the presence of mutations in regions that would normally not be in contact with the DNA if it were not in the A-helical form. One complex with two incorporated HNA nucleotides is surprisingly found in a one nucleotide-backtracked form, which is new for a DNA polymerase. This information can be used for engineering a new generation of more efficient HNA polymerase variants.


Subject(s)
Archaeal Proteins/chemistry , DNA Polymerase beta/chemistry , DNA, Archaeal/chemistry , Hexosephosphates/chemistry , Nucleotides/chemistry , RNA, Archaeal/chemistry , Thermococcus/chemistry , Archaeal Proteins/genetics , Archaeal Proteins/metabolism , Binding Sites , Cloning, Molecular , Crystallography, X-Ray , DNA Polymerase beta/genetics , DNA Polymerase beta/metabolism , DNA, Archaeal/genetics , DNA, Archaeal/metabolism , Directed Molecular Evolution/methods , Escherichia coli/genetics , Escherichia coli/metabolism , Gene Expression , Genetic Vectors/chemistry , Genetic Vectors/metabolism , Hexosephosphates/metabolism , Kinetics , Molecular Dynamics Simulation , Mutation , Nucleic Acid Conformation , Nucleotides/genetics , Nucleotides/metabolism , Protein Binding , Protein Conformation, alpha-Helical , Protein Conformation, beta-Strand , Protein Engineering/methods , Protein Interaction Domains and Motifs , RNA, Archaeal/genetics , RNA, Archaeal/metabolism , Substrate Specificity , Thermococcus/enzymology
20.
J Am Chem Soc ; 131(41): 14827-37, 2009 Oct 21.
Article in English | MEDLINE | ID: mdl-19778048

ABSTRACT

Hydrophobic base analogues (HBAs) have shown great promise for the expansion of the chemical and coding potential of nucleic acids but are generally poor polymerase substrates. While extensive synthetic efforts have yielded examples of HBAs with favorable substrate properties, their discovery has remained challenging. Here we describe a complementary strategy for improving HBA substrate properties by directed evolution of a dedicated polymerase using compartmentalized self-replication (CSR) with the archetypal HBA 5-nitroindole (d5NI) and its derivative 5-nitroindole-3-carboxamide (d5NIC) as selection substrates. Starting from a repertoire of chimeric polymerases generated by molecular breeding of DNA polymerase genes from the genus Thermus, we isolated a polymerase (5D4) with a generically enhanced ability to utilize HBAs. The selected polymerase. 5D4 was able to form and extend d5NI and d5NIC (d5NI(C)) self-pairs as well as d5NI(C) heteropairs with all four bases with efficiencies approaching, or exceeding, those of the cognate Watson-Crick pairs, despite significant distortions caused by the intercalation of the d5NI(C) heterocycles into the opposing strand base stack, as shown by nuclear magnetic resonance spectroscopy (NMR). Unlike Taq polymerase, 5D4 was also able to extend HBA pairs such as Pyrene: varphi (abasic site), d5NI: varphi, and isocarbostyril (ICS): 7-azaindole (7AI), allowed bypass of a chemically diverse spectrum of HBAs, and enabled PCR amplification with primers comprising multiple d5NI(C)-substitutions, while maintaining high levels of catalytic activity and fidelity. The selected polymerase 5D4 promises to expand the range of nucleobase analogues amenable to replication and should find numerous applications, including the synthesis and replication of nucleic acid polymers with expanded chemical and functional diversity.


Subject(s)
DNA-Directed DNA Polymerase/metabolism , DNA/chemistry , DNA/metabolism , Hydrophobic and Hydrophilic Interactions , Amides/chemistry , Bacteria/enzymology , Base Pairing , Base Sequence , DNA/genetics , DNA Primers/chemistry , DNA Primers/metabolism , DNA-Directed DNA Polymerase/chemistry , Indoles/chemistry , Kinetics , Models, Molecular , Protein Conformation , Substrate Specificity
SELECTION OF CITATIONS
SEARCH DETAIL