RESUMEN
Hyperthermophilic organisms thrive in extreme environments prone to high levels of DNA damage. Growth at high temperature stimulates DNA base hydrolysis resulting in apurinic/apyrimidinic (AP) sites that destabilize the genome. Organisms across all domains have evolved enzymes to recognize and repair AP sites to maintain genome stability. The hyperthermophilic archaeon Thermococcus kodakarensis encodes several enzymes to repair AP site damage including the essential AP endonuclease TK endonuclease IV. Recently, using functional genomic screening, we discovered a new family of AP lyases typified by TK0353. Here, using biochemistry, structural analysis, and genetic deletion, we have characterized the TK0353 structure and function. TK0353 lacks glycosylase activity on a variety of damaged bases and is therefore either a monofunctional AP lyase or may be a glycosylase-lyase on a yet unidentified substrate. The crystal structure of TK0353 revealed a novel fold, which does not resemble other known DNA repair enzymes. The TK0353 gene is not essential for T. kodakarensis viability presumably because of redundant base excision repair enzymes involved in AP site processing. In summary, TK0353 is a novel AP lyase unique to hyperthermophiles that provides redundant repair activity necessary for genome maintenance.
Asunto(s)
ADN-(Sitio Apurínico o Apirimidínico) Liasa , Thermococcus , Desoxirribonucleasa IV (Fago T4-Inducido) , Daño del ADN , Reparación del ADN , ADN-(Sitio Apurínico o Apirimidínico) Liasa/química , ADN-(Sitio Apurínico o Apirimidínico) Liasa/genética , ADN-(Sitio Apurínico o Apirimidínico) Liasa/metabolismo , Thermococcus/enzimología , Thermococcus/genéticaRESUMEN
Nucleic acid sequencing technologies have gone through extraordinary advancements in the past several decades, significantly increasing throughput while reducing cost. To create similar advancement in proteomics, numerous approaches are being investigated to advance protein sequencing. One of the promising approaches uses N-terminal amino acid binders (NAABs), also referred to as recognizers, that selectively can identify amino acids at the N-terminus of a peptide. However, there are only a few engineered NAABs currently available that bind to specific amino acids and meet the requirements of a biotechnology reagent. Therefore, additional NAABs need to be identified and engineered to enable confident identification and, ultimately, de novo protein sequencing. To fill this gap, a human protein GID4 was engineered to create a NAAB for N-terminal proline (Nt-Pro). While native GID4 binds Nt-Pro, its binding is weak (µmol/L) and greatly influenced by the identity of residues following the Nt-Pro. Through directed evolution, yeast-surface display, and fluorescence-activated cell sorting, we identified sequence variants of GID4 with increased binding response to Nt-Pro. Moreover, variants with an A252V mutation showed a reduced influence from residues in the second and third positions of the target peptide when binding to Nt-Pro. The workflow outlined here is shown to be a viable strategy for engineering NAABs, even when starting from native Nt-binding proteins whose binding is strongly impacted by the identity of residues following Nt-amino acid.
RESUMEN
It has been predicted that 30 to 80% of archaeal genomes remain annotated as hypothetical proteins with no assigned gene function. Further, many archaeal organisms are difficult to grow or are unculturable. To overcome these technical and experimental hurdles, we developed a high-throughput functional genomics screen that utilizes capillary electrophoresis (CE) to identify nucleic acid modifying enzymes based on activity rather than sequence homology. Here, we describe a functional genomics screening workflow to find DNA modifying enzyme activities encoded by the hyperthermophile Thermococcus kodakarensis (T. kodakarensis). Large DNA insert fosmid libraries representing an â¼5-fold average coverage of the T. kodakarensis genome were prepared in Escherichia coli. RNA-seq showed a high fraction (84%) of T. kodakarensis genes were transcribed in E. coli despite differences in promoter structure and translational machinery. Our high-throughput screening workflow used fluorescently labeled DNA substrates directly in heat-treated lysates of fosmid clones with capillary electrophoresis detection of reaction products. Using this method, we identified both a new DNA endonuclease activity for a previously described RNA endonuclease (Nob1) and a novel AP lyase DNA repair enzyme family (termed 'TK0353') that is found only in a small subset of Thermococcales. The screening methodology described provides a fast and efficient way to explore the T. kodakarensis genome for a variety of nucleic acid modifying activities and may have implications for similar exploration of enzymes and pathways that underlie core cellular processes in other Archaea. IMPORTANCE This study provides a rapid, simple, high-throughput method to discover novel archaeal nucleic acid modifying enzymes by utilizing a fosmid genomic library, next-generation sequencing, and capillary electrophoresis. The method described here provides the details necessary to create 384-well fosmid library plates from Thermococcus kodakarensis genomic DNA, sequence 384-well fosmids plates using Illumina next-generation sequencing, and perform high-throughput functional read-out assays using capillary electrophoresis to identify a variety of nucleic acid modifying activities, including DNA cleavage and ligation. We used this approach to identify a new DNA endonuclease activity for a previously described RNA endonuclease (Nob1) and identify a novel AP lyase enzyme (TK0353) that lacks sequence homology to known nucleic acid modifying enzymes.
Asunto(s)
Proteínas Arqueales , Thermococcus , Proteínas Arqueales/metabolismo , ADN de Archaea/genética , ADN de Archaea/metabolismo , Electroforesis Capilar , Escherichia coli/genética , Escherichia coli/metabolismo , GenómicaRESUMEN
Family D DNA polymerase (PolD) is the essential replicative DNA polymerase for duplication of most archaeal genomes. PolD contains a unique two-barrel catalytic core absent from all other DNA polymerase families but found in RNA polymerases (RNAPs). While PolD has an ancestral RNA polymerase catalytic core, its active site has evolved the ability to discriminate against ribonucleotides. Until now, the mechanism evolved by PolD to prevent ribonucleotide incorporation was unknown. In all other DNA polymerase families, an active site steric gate residue prevents ribonucleotide incorporation. In this work, we identify two consensus active site acidic (a) and basic (b) motifs shared across the entire two-barrel nucleotide polymerase superfamily, and a nucleotide selectivity (s) motif specific to PolD versus RNAPs. A novel steric gate histidine residue (H931 in Thermococcus sp. 9°N PolD) in the PolD s-motif both prevents ribonucleotide incorporation and promotes efficient dNTP incorporation. Further, a PolD H931A steric gate mutant abolishes ribonucleotide discrimination and readily incorporates a variety of 2' modified nucleotides. Taken together, we construct the first putative nucleotide bound PolD active site model and provide structural and functional evidence for the emergence of DNA replication through the evolution of an ancestral RNAP two-barrel catalytic core.
Asunto(s)
Proteínas Arqueales/genética , ADN de Archaea/genética , ADN Polimerasa Dirigida por ADN/genética , Regulación de la Expresión Génica Arqueal , Genoma Arqueal , Ribonucleótidos/genética , Thermococcus/genética , Secuencia de Aminoácidos , Proteínas Arqueales/química , Proteínas Arqueales/metabolismo , Sitios de Unión , Dominio Catalítico , Clonación Molecular , Replicación del ADN , ADN de Archaea/metabolismo , ADN Polimerasa Dirigida por ADN/química , ADN Polimerasa Dirigida por ADN/metabolismo , Expresión Génica , Histidina/química , Histidina/metabolismo , Cinética , Modelos Moleculares , Mutación , Unión Proteica , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Dominios y Motivos de Interacción de Proteínas , Proteínas Recombinantes/química , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Ribonucleótidos/química , Ribonucleótidos/metabolismo , Alineación de Secuencia , Homología de Secuencia de Aminoácido , Especificidad por Sustrato , Thermococcus/enzimologíaRESUMEN
Inteins (intervening proteins), mobile genetic elements removed through protein splicing, often interrupt proteins required for DNA replication, recombination, and repair. An abundance of in vitro evidence implies that inteins may act as regulatory elements, whereby reduced splicing inhibits production of the mature protein lacking the intein, but in vivo evidence of regulatory intein excision in the native host is absent. The model archaeon Thermococcus kodakarensis encodes 15 inteins, and we establish the impacts of intein splicing inhibition on host physiology and replication in vivo. We report that a decrease in intein splicing efficiency of the recombinase RadA, a Rad51/RecA homolog, has widespread physiological consequences, including a general growth defect, increased sensitivity to DNA damage, and a switch in the mode of DNA replication from recombination-dependent replication toward origin-dependent replication.
Asunto(s)
Proteínas Arqueales , Replicación del ADN , Proteínas de Unión al ADN , Inteínas , Thermococcus , Proteínas Arqueales/metabolismo , Proteínas Arqueales/genética , Daño del ADN , ADN de Archaea/metabolismo , ADN de Archaea/genética , Proteínas de Unión al ADN/metabolismo , Proteínas de Unión al ADN/genética , Inteínas/genética , Empalme de Proteína , Recombinación Genética , Thermococcus/genética , Thermococcus/metabolismoRESUMEN
Replicative DNA polymerases duplicate entire genomes at high fidelity. This feature is shared among the three domains of life and is facilitated by their dual polymerase and exonuclease activities. Family D replicative DNA polymerases (PolD), found exclusively in Archaea, contain an unusual RNA polymerase-like catalytic core, and a unique Mre11-like proofreading active site. Here, we present cryo-EM structures of PolD trapped in a proofreading mode, revealing an unanticipated correction mechanism that extends the repertoire of protein domains known to be involved in DNA proofreading. Based on our experimental structures, mutants of PolD were designed and their contribution to mismatch bypass and exonuclease kinetics was determined. This study sheds light on the convergent evolution of structurally distinct families of DNA polymerases, and the domain acquisition and exchange mechanism that occurred during the evolution of the replisome in the three domains of life.
Asunto(s)
ADN Polimerasa Dirigida por ADN , Exonucleasas , Exonucleasas/genética , Exonucleasas/metabolismo , ADN Polimerasa Dirigida por ADN/metabolismo , Replicación del ADN/genética , Dominio Catalítico , Dominios ProteicosRESUMEN
CRISPR-Cas systems provide heritable acquired immunity against viruses to archaea and bacteria. Cas3 is a CRISPR-associated protein that is common to all Type I systems, possesses both nuclease and helicase activities, and is responsible for degradation of invading DNA. Involvement of Cas3 in DNA repair had been suggested in the past, but then set aside when the role of CRISPR-Cas as an adaptive immune system was realized. Here we show that in the model archaeon Haloferax volcanii a cas3 deletion mutant exhibits increased resistance to DNA damaging agents compared with the wild-type strain, but its ability to recover quickly from such damage is reduced. Analysis of cas3 point mutants revealed that the helicase domain of the protein is responsible for the DNA damage sensitivity phenotype. Epistasis analysis indicated that cas3 operates with mre11 and rad50 in restraining the homologous recombination pathway of DNA repair. Mutants deleted for Cas3 or deficient in its helicase activity showed higher rates of homologous recombination, as measured in pop-in assays using non-replicating plasmids. These results demonstrate that Cas proteins act in DNA repair, in addition to their role in defense against selfish elements and are an integral part of the cellular response to DNA damage.
RESUMEN
The formation and persistence of DNA damage can impact biological processes such as DNA replication and transcription. To maintain genome stability and integrity, organisms rely on robust DNA damage repair pathways. Techniques to detect and locate DNA damage sites across a genome enable an understanding of the consequences of DNA damage as well as how damage is repaired, which can have key diagnostic and therapeutic implications. Importantly, advancements in technology have enabled the development of high-throughput sequencing-based DNA damage detection methods. These methods require DNA enrichment or amplification steps that limit the ability to quantitate the DNA damage sites. Further, each of these methods is typically tailored to detect only a specific type of damage. RAre DAmage and Repair (RADAR) sequencing is a DNA sequencing workflow that overcomes these limitations and enables detection and quantitation of DNA damage sites in any organism on a genome-wide scale. RADAR-seq works by replacing DNA damage sites with a patch of modified bases that can be directly detected by Pacific Biosciences Single-Molecule Real Time sequencing. Here, we present three protocols that enable detection of thymine dimers and ribonucleotides in bacterial and archaeal genomes. Basic Protocol 1 enables construction of a reference genome required for RADAR-seq analyses. Basic Protocol 2 describes how to locate, quantitate, and compare thymine dimer levels in Escherichia coli exposed to varying amounts of UV light. Basic Protocol 3 describes how to locate, quantitate, and compare ribonucleotide levels in wild-type and ΔRNaseH2 Thermococcus kodakarensis. Importantly, all three protocols provide in-depth steps for data analysis. Together they serve as proof-of-principle experiments that will allow users to adapt the protocols to locate and quantitate a wide variety of DNA damage sites in any organism. © 2022 New England Biolabs. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Constructing a reference genome utilizing SMRT sequencing Basic Protocol 2: Mapping and quantitating genomic thymine dimer formation in untreated versus UV-irradiated E. coli using RADAR-seq Basic Protocol 3: Mapping and quantitating genomic ribonucleotide incorporation in wildtype versus ΔRNaseH2 T. kodakarensis using RADAR-seq.
Asunto(s)
Reparación del ADN , Dímeros de Pirimidina , Dímeros de Pirimidina/genética , Reparación del ADN/genética , Escherichia coli/genética , Daño del ADN/genética , Ribonucleótidos , Genoma ArquealRESUMEN
Thermococcus kodakarensis (T. kodakarensis), a hyperthermophilic, genetically accessible model archaeon, encodes two putative restriction modification (R-M) defense systems, TkoI and TkoII. TkoI is encoded by TK1460 while TkoII is encoded by TK1158. Bioinformative analysis suggests both R-M enzymes are large, fused methyltransferase (MTase)-endonuclease polypeptides that contain both restriction endonuclease (REase) activity to degrade foreign invading DNA and MTase activity to methylate host genomic DNA at specific recognition sites. In this work, we demonsrate T. kodakarensis strains deleted for either or both R-M enzymes grow more slowly but display significantly increased competency compared to strains with intact R-M systems, suggesting that both TkoI and TkoII assist in maintenance of genomic integrity in vivo and likely protect against viral- or plasmid-based DNA transfers. Pacific Biosciences single molecule real-time (SMRT) sequencing of T. kodakarensis strains containing both, one or neither R-M systems permitted assignment of the recognition sites for TkoI and TkoII and demonstrated that both R-M enzymes are TypeIIL; TkoI and TkoII methylate the N6 position of adenine on one strand of the recognition sequences GTGAAG and TTCAAG, respectively. Further in vitro biochemical characterization of the REase activities reveal TkoI and TkoII cleave the DNA backbone GTGAAG(N)20/(N)18 and TTCAAG(N)10/(N)8, respectively, away from the recognition sequences, while in vitro characterization of the MTase activities reveal transfer of tritiated S-adenosyl methionine by TkoI and TkoII to their respective recognition sites. Together these results demonstrate TkoI and TkoII restriction systems are important for protecting T. kodakarensis genome integrity from invading foreign DNA.
RESUMEN
Reactive oxygen species drive the oxidation of guanine to 8-oxoguanine (8oxoG), which threatens genome integrity. The repair of 8oxoG is carried out by base excision repair enzymes in Bacteria and Eukarya, however, little is known about archaeal 8oxoG repair. This study identifies a member of the Ogg-subfamily archaeal GO glycosylase (AGOG) in Thermococcus kodakarensis, an anaerobic, hyperthermophilic archaeon, and delineates its mechanism, kinetics, and substrate specificity. TkoAGOG is the major 8oxoG glycosylase in T. kodakarensis, but is non-essential. In addition to TkoAGOG, the major apurinic/apyrimidinic (AP) endonuclease (TkoEndoIV) required for archaeal base excision repair and cell viability was identified and characterized. Enzymes required for the archaeal oxidative damage base excision repair pathway were identified and the complete pathway was reconstituted. This study illustrates the conservation of oxidative damage repair across all Domains of life.
Asunto(s)
ADN Glicosilasas/metabolismo , Reparación del ADN , Thermococcus/metabolismo , Proteínas Arqueales/genética , Proteínas Arqueales/metabolismo , Daño del ADN , ADN Glicosilasas/genética , Guanina/análogos & derivados , Guanina/metabolismo , Estrés Oxidativo , Thermococcus/genéticaRESUMEN
RAre DAmage and Repair sequencing (RADAR-seq) is a highly adaptable sequencing method that enables the identification and detection of rare DNA damage events for a wide variety of DNA lesions at single-molecule resolution on a genome-wide scale. In RADAR-seq, DNA lesions are replaced with a patch of modified bases that can be directly detected by Pacific Biosciences Single Molecule Real-Time (SMRT) sequencing. RADAR-seq enables dynamic detection over a wide range of DNA damage frequencies, including low physiological levels. Furthermore, without the need for DNA amplification and enrichment steps, RADAR-seq provides sequencing coverage of damaged and undamaged DNA across an entire genome. Here, we use RADAR-seq to measure the frequency and map the location of ribonucleotides in wild-type and RNaseH2-deficient E. coli and Thermococcus kodakarensis strains. Additionally, by tracking ribonucleotides incorporated during in vivo lagging strand DNA synthesis, we determined the replication initiation point in E. coli, and its relation to the origin of replication (oriC). RADAR-seq was also used to map cyclobutane pyrimidine dimers (CPDs) in Escherichia coli (E. coli) genomic DNA exposed to UV-radiation. On a broader scale, RADAR-seq can be applied to understand formation and repair of DNA damage, the correlation between DNA damage and disease initiation and progression, and complex biological pathways, including DNA replication.
Asunto(s)
Daño del ADN , Reparación del ADN , Genoma Arqueal , Genoma Bacteriano , Pruebas de Mutagenicidad/métodos , Análisis de Secuencia de ADN/métodos , Replicación del ADN , ADN de Archaea , ADN Bacteriano/efectos de la radiación , Escherichia coli/genética , Escherichia coli/efectos de la radiación , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Dímeros de Pirimidina , Ribonucleótidos , Thermococcus/genética , Rayos UltravioletaRESUMEN
A variant of 9°N DNA polymerase [Genbank ID (AAA88769.1)] with three mutations (D141A, E143A, A485L) and commercialized under the name "Therminator DNA polymerase" has the ability to incorporate a variety of modified nucleotide classes. This Review focuses on how Therminator DNA Polymerase has enabled new technologies in synthetic biology and DNA sequencing. In addition, we discuss mechanisms for increased modified nucleotide incorporation.
RESUMEN
DNA replication and repair are essential biological processes needed for the survival of all organisms. Although these processes are fundamentally conserved in the three domains, archaea, bacteria and eukarya, the proteins and complexes involved differ. The genetic and biophysical tools developed for archaea in the last several years have accelerated the study of DNA replication and repair in this domain. In this review, the current knowledge of DNA replication and repair processes in archaea will be summarized, with emphasis on the contribution of genetics and other recently developed biophysical and molecular tools, including capillary gel electrophoresis, next-generation sequencing and single-molecule approaches. How these new tools will continue to drive archaeal DNA replication and repair research will also be discussed.