Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 172
Filter
Add more filters

Publication year range
1.
Annu Rev Microbiol ; 75: 129-149, 2021 10 08.
Article in English | MEDLINE | ID: mdl-34314594

ABSTRACT

The amount of bacterial and archaeal genome sequence and methylome data has greatly increased over the last decade, enabling new insights into the functional roles of DNA methylation in these organisms. Methyltransferases (MTases), the enzymes responsible for DNA methylation, are exchanged between prokaryotes through horizontal gene transfer and can function either as part of restriction-modification systems or in apparent isolation as single (orphan) genes. The patterns of DNA methylation they confer on the host chromosome can have significant effects on gene expression, DNA replication, and other cellular processes. Some processes require very stable patterns of methylation, resulting in conservation of persistent MTases in a particular lineage. Other processes require patterns that are more dynamic yet more predictable than what is afforded by horizontal gene transfer and gene loss, resulting in phase-variable or recombination-driven MTase alleles. In this review, we discuss what is currently known about the functions of DNA methylation in prokaryotes in light of these evolutionary patterns.


Subject(s)
DNA Methylation , Epigenomics , DNA Restriction-Modification Enzymes/genetics , DNA Restriction-Modification Enzymes/metabolism , Methyltransferases/genetics , Methyltransferases/metabolism , Prokaryotic Cells/metabolism
2.
Nucleic Acids Res ; 51(D1): D629-D630, 2023 01 06.
Article in English | MEDLINE | ID: mdl-36318248

ABSTRACT

REBASE is a comprehensive and extensively curated database of information about the components of restriction-modification (RM) systems. It is fully referenced and provides information about the recognition and cleavage sites for both restriction enzymes and DNA methyltransferases together with their commercial availability, methylation sensitivity, crystal and sequence data. All completely sequenced genomes and select shotgun sequences are analyzed for RM system components. When PacBio sequence data is available, the recognition sequences of many DNA methyltransferases (MTases) can be determined. This has led to an explosive growth in the number of well-characterized MTases in REBASE. The contents of REBASE may be browsed from the web rebase.neb.com and selected compilations can be downloaded by FTP (ftp.neb.com). Monthly updates are also available via email.


Subject(s)
DNA Methylation , DNA Modification Methylases , Databases, Factual , DNA Restriction Enzymes/metabolism , DNA Modification Methylases/metabolism , DNA/genetics , DNA Restriction-Modification Enzymes/genetics
3.
PLoS Biol ; 18(11): e3000999, 2020 11.
Article in English | MEDLINE | ID: mdl-33253151

ABSTRACT

How do we scale biological science to the demand of next generation biology and medicine to keep track of the facts, predictions, and hypotheses? These days, enormous amounts of DNA sequence and other omics data are generated. Since these data contain the blueprint for life, it is imperative that we interpret it accurately. The abundance of DNA is only one part of the challenge. Artificial Intelligence (AI) and network methods routinely build on large screens, single cell technologies, proteomics, and other modalities to infer or predict biological functions and phenotypes associated with proteins, pathways, and organisms. As a first step, how do we systematically trace the provenance of knowledge from experimental ground truth to gene function predictions and annotations? Here, we review the main challenges in tracking the evolution of biological knowledge and propose several specific solutions to provenance and computational tracing of evidence in functional linkage networks.


Subject(s)
Big Data , Gene Regulatory Networks , Genomics/statistics & numerical data , Algorithms , Artificial Intelligence , Computational Biology , Genetic Linkage , High-Throughput Nucleotide Sequencing , Humans , Models, Genetic , Proteomics/statistics & numerical data , Synthetic Biology , Systems Biology
4.
Nucleic Acids Res ; 49(19): e113, 2021 11 08.
Article in English | MEDLINE | ID: mdl-34417598

ABSTRACT

DNA methylation is widespread amongst eukaryotes and prokaryotes to modulate gene expression and confer viral resistance. 5-Methylcytosine (m5C) methylation has been described in genomes of a large fraction of bacterial species as part of restriction-modification systems, each composed of a methyltransferase and cognate restriction enzyme. Methylases are site-specific and target sequences vary across organisms. High-throughput methods, such as bisulfite-sequencing can identify m5C at base resolution but require specialized library preparations and single molecule, real-time (SMRT) sequencing usually misses m5C. Here, we present a new method called RIMS-seq (rapid identification of methylase specificity) to simultaneously sequence bacterial genomes and determine m5C methylase specificities using a simple experimental protocol that closely resembles the DNA-seq protocol for Illumina. Importantly, the resulting sequencing quality is identical to DNA-seq, enabling RIMS-seq to substitute standard sequencing of bacterial genomes. Applied to bacteria and synthetic mixed communities, RIMS-seq reveals new methylase specificities, supporting routine study of m5C methylation while sequencing new genomes.


Subject(s)
5-Methylcytosine/metabolism , DNA Modification Methylases/metabolism , DNA Restriction Enzymes/metabolism , Escherichia coli K12/genetics , Genome, Bacterial , High-Throughput Nucleotide Sequencing/methods , Acinetobacter calcoaceticus/enzymology , Acinetobacter calcoaceticus/genetics , Aeromonas hydrophila/enzymology , Aeromonas hydrophila/genetics , Bacillus amyloliquefaciens/enzymology , Bacillus amyloliquefaciens/genetics , Base Sequence , Clostridium acetobutylicum/enzymology , Clostridium acetobutylicum/genetics , DNA Methylation , DNA Modification Methylases/genetics , DNA Restriction Enzymes/genetics , Escherichia coli K12/enzymology , Gene Expression Regulation, Bacterial , Haemophilus/enzymology , Haemophilus/genetics , Haemophilus influenzae/enzymology , Haemophilus influenzae/genetics , Humans , Microbiota/genetics , Sequence Analysis, DNA , Skin/microbiology
5.
Nature ; 591(7849): 202, 2021 03.
Article in English | MEDLINE | ID: mdl-33664515
6.
Nucleic Acids Res ; 48(22): 12858-12873, 2020 12 16.
Article in English | MEDLINE | ID: mdl-33270887

ABSTRACT

Analysis of genomic DNA from pathogenic strains of Burkholderia cenocepacia J2315 and Escherichia coli O104:H4 revealed the presence of two unusual MTase genes. Both are plasmid-borne ORFs, carried by pBCA072 for B. cenocepacia J2315 and pESBL for E. coli O104:H4. Pacific Biosciences SMRT sequencing was used to investigate DNA methyltransferases M.BceJIII and M.EcoGIX, using artificial constructs. Mating properties of engineered pESBL derivatives were also investigated. Both MTases yield promiscuous m6A modification of single strands, in the context SAY (where S = C or G and Y = C or T). Strikingly, this methylation is asymmetric in vivo, detected almost exclusively on one DNA strand, and is incomplete: typically, around 40% of susceptible motifs are modified. Genetic and biochemical studies suggest that enzyme action depends on replication mode: DNA Polymerase I (PolI)-dependent ColE1 and p15A origins support asymmetric modification, while the PolI-independent pSC101 origin does not. An MTase-PolI complex may enable discrimination of PolI-dependent and independent plasmid origins. M.EcoGIX helps to establish pESBL in new hosts by blocking the action of restriction enzymes, in an orientation-dependent fashion. Expression and action appear to occur on the entering single strand in the recipient, early in conjugal transfer, until lagging-strand replication creates the double-stranded form.


Subject(s)
DNA Methylation/genetics , DNA Polymerase I/genetics , DNA, Single-Stranded/genetics , Methyltransferases/genetics , Bacterial Proteins/genetics , Burkholderia cenocepacia/genetics , DNA Replication/genetics , Escherichia coli O104/genetics , Escherichia coli Proteins/genetics , Genome, Bacterial/genetics , Plasmids/genetics , Ribosomal Proteins/genetics
7.
Nucleic Acids Res ; 48(3): 1466-1478, 2020 02 20.
Article in English | MEDLINE | ID: mdl-31879785

ABSTRACT

HhaI, a Type II restriction endonuclease, recognizes the symmetric sequence 5'-GCG↓C-3' in duplex DNA and cleaves ('↓') to produce fragments with 2-base, 3'-overhangs. We determined the structure of HhaI in complex with cognate DNA at an ultra-high atomic resolution of 1.0 Å. Most restriction enzymes act as dimers with two catalytic sites, and cleave the two strands of duplex DNA simultaneously, in a single binding event. HhaI, in contrast, acts as a monomer with only one catalytic site, and cleaves the DNA strands sequentially, one after the other. HhaI comprises three domains, each consisting of a mixed five-stranded ß sheet with a defined function. The first domain contains the catalytic-site; the second contains residues for sequence recognition; and the third contributes to non-specific DNA binding. The active-site belongs to the 'PD-D/EXK' superfamily of nucleases and contains the motif SD-X11-EAK. The first two domains are similar in structure to two other monomeric restriction enzymes, HinP1I (G↓CGC) and MspI (C↓CGG), which produce fragments with 5'-overhangs. The third domain, present only in HhaI, shifts the positions of the recognition residues relative to the catalytic site enabling this enzyme to cleave the recognition sequence at a different position. The structure of M.HhaI, the biological methyltransferase partner of HhaI, was determined earlier. Together, these two structures represent the first natural pair of restriction-modification enzymes to be characterized in atomic detail.


Subject(s)
DNA/ultrastructure , Deoxyribonucleases, Type II Site-Specific/ultrastructure , Nucleic Acid Conformation , Protein Conformation , Catalytic Domain , Crystallography, X-Ray , DNA/chemistry , DNA/genetics , DNA Restriction Enzymes/chemistry , DNA Restriction Enzymes/genetics , DNA Restriction Enzymes/ultrastructure , DNA-Binding Proteins/chemistry , DNA-Binding Proteins/genetics , DNA-Binding Proteins/ultrastructure , Deoxyribonucleases, Type II Site-Specific/chemistry , Deoxyribonucleases, Type II Site-Specific/genetics , Haemophilus/chemistry , Haemophilus/enzymology , Protein Binding/genetics
8.
Nucleic Acids Res ; 48(19): 11040-11053, 2020 11 04.
Article in English | MEDLINE | ID: mdl-33045731

ABSTRACT

The genomes of gut Bacteroidales contain numerous invertible regions, many of which contain promoters that dictate phase-variable synthesis of surface molecules such as polysaccharides, fimbriae, and outer surface proteins. Here, we characterize a different type of phase-variable system of Bacteroides fragilis, a Type I restriction modification system (R-M). We show that reversible DNA inversions within this R-M locus leads to the generation of eight specificity proteins with distinct recognition sites. In vitro grown bacteria have a different proportion of specificity gene combinations at the expression locus than bacteria isolated from the mammalian gut. By creating mutants, each able to produce only one specificity protein from this region, we identified the R-M recognition sites of four of these S-proteins using SMRT sequencing. Transcriptome analysis revealed that the locked specificity mutants, whether grown in vitro or isolated from the mammalian gut, have distinct transcriptional profiles, likely creating different phenotypes, one of which was confirmed. Genomic analyses of diverse strains of Bacteroidetes from both host-associated and environmental sources reveal the ubiquity of phase-variable R-M systems in this phylum.


Subject(s)
Bacterial Proteins/metabolism , Bacteroides fragilis/enzymology , DNA Restriction-Modification Enzymes/metabolism , Gastrointestinal Microbiome , Animals , Bacterial Proteins/genetics , DNA Restriction-Modification Enzymes/genetics , Humans , Mice , Mutation , Transcriptome
9.
Nucleic Acids Res ; 48(20): 11468-11485, 2020 11 18.
Article in English | MEDLINE | ID: mdl-33119758

ABSTRACT

Type I restriction-modification (R-M) systems consist of a DNA endonuclease (HsdR, HsdM and HsdS subunits) and methyltransferase (HsdM and HsdS subunits). The hsdS sequences flanked by inverted repeats (referred to as epigenetic invertons) in certain Type I R-M systems undergo invertase-catalyzed inversions. Previous studies in Streptococcus pneumoniae have shown that hsdS inversions within clonal populations produce subpopulations with profound differences in the methylome, cellular physiology and virulence. In this study, we bioinformatically identified six major clades of the tyrosine and serine family invertases homologs from 16 bacterial phyla, which potentially catalyze hsdS inversions in the epigenetic invertons. In particular, the epigenetic invertons are highly enriched in host-associated bacteria. We further verified hsdS inversions in the Type I R-M systems of four representative host-associated bacteria and found that each of the resultant hsdS allelic variants specifies methylation of a unique DNA sequence. In addition, transcriptome analysis revealed that hsdS allelic variations in Enterococcus faecalis exert significant impact on gene expression. These findings indicate that epigenetic switches driven by invertases in the epigenetic invertons broadly operate in the host-associated bacteria, which may broadly contribute to bacterial host adaptation and virulence beyond the role of the Type I R-M systems against phage infection.


Subject(s)
Bacterial Proteins/genetics , DNA Restriction-Modification Enzymes/genetics , Epigenesis, Genetic , Gene Expression Regulation, Bacterial , Bacteroides fragilis/genetics , DNA Methylation , DNA, Bacterial/chemistry , Enterococcus faecalis/genetics , Inverted Repeat Sequences , Streptococcus agalactiae/genetics , Treponema denticola/genetics
10.
Nucleic Acids Res ; 46(2): 840-848, 2018 01 25.
Article in English | MEDLINE | ID: mdl-29228259

ABSTRACT

We describe the cloning, expression and characterization of the first truly non-specific adenine DNA methyltransferase, M.EcoGII. It is encoded in the genome of the pathogenic strain Escherichia coli O104:H4 C227-11, where it appears to reside on a cryptic prophage, but is not expressed. However, when the gene encoding M.EcoGII is expressed in vivo - using a high copy pRRS plasmid vector and a methylation-deficient E. coli host-extensive in vivo adenine methylation activity is revealed. M.EcoGII methylates adenine residues in any DNA sequence context and this activity extends to dA and rA bases in either strand of a DNA:RNA-hybrid oligonucleotide duplex and to rA bases in RNAs prepared by in vitro transcription. Using oligonucleotide and bacteriophage M13mp18 virion DNA substrates, we find that M.EcoGII also methylates single-stranded DNA in vitro and that this activity is only slightly less robust than that observed using equivalent double-stranded DNAs. In vitro assays, using purified recombinant M.EcoGII enzyme, demonstrate that up to 99% of dA bases in duplex DNA substrates can be methylated thereby rendering them insensitive to cleavage by multiple restriction endonucleases. These properties suggest that the enzyme could also be used for high resolution mapping of protein binding sites in DNA and RNA substrates.


Subject(s)
DNA Restriction Enzymes/metabolism , Escherichia coli/genetics , Prophages/enzymology , Site-Specific DNA-Methyltransferase (Adenine-Specific)/metabolism , Adenine/metabolism , Base Sequence , DNA Methylation , DNA Restriction Enzymes/genetics , DNA, Single-Stranded/genetics , DNA, Single-Stranded/metabolism , Electrophoresis, Polyacrylamide Gel , Escherichia coli/virology , Prophages/genetics , Protein Binding , RNA, Double-Stranded/genetics , RNA, Double-Stranded/metabolism , Site-Specific DNA-Methyltransferase (Adenine-Specific)/genetics , Substrate Specificity
11.
PLoS Biol ; 14(4): e1002442, 2016 Apr.
Article in English | MEDLINE | ID: mdl-27082731

ABSTRACT

The creation of restriction enzymes with programmable DNA-binding and -cleavage specificities has long been a goal of modern biology. The recently discovered Type IIL MmeI family of restriction-and-modification (RM) enzymes that possess a shared target recognition domain provides a framework for engineering such new specificities. However, a lack of structural information on Type IIL enzymes has limited the repertoire that can be rationally engineered. We report here a crystal structure of MmeI in complex with its DNA substrate and an S-adenosylmethionine analog (Sinefungin). The structure uncovers for the first time the interactions that underlie MmeI-DNA recognition and methylation (5'-TCCRAC-3'; R = purine) and provides a molecular basis for changing specificity at four of the six base pairs of the recognition sequence (5'-TCCRAC-3'). Surprisingly, the enzyme is resilient to specificity changes at the first position of the recognition sequence (5'-TCCRAC-3'). Collectively, the structure provides a basis for engineering further derivatives of MmeI and delineates which base pairs of the recognition sequence are more amenable to alterations than others.


Subject(s)
DNA/chemistry , Deoxyribonucleases, Type II Site-Specific/chemistry , Base Sequence , DNA Methylation , Hydrolysis , Molecular Sequence Data
12.
Nucleic Acids Res ; 45(6): 3395-3406, 2017 04 07.
Article in English | MEDLINE | ID: mdl-28180279

ABSTRACT

Staphylococcus aureus displays a clonal population structure in which horizontal gene transfer between different lineages is extremely rare. This is due, in part, to the presence of a Type I DNA restriction-modification (RM) system given the generic name of Sau1, which maintains different patterns of methylation on specific target sequences on the genomes of different lineages. We have determined the target sequences recognized by the Sau1 Type I RM systems present in a wide range of the most prevalent S. aureus lineages and assigned the sequences recognized to particular target recognition domains within the RM enzymes. We used a range of biochemical assays on purified enzymes and single molecule real-time sequencing on genomic DNA to determine these target sequences and their patterns of methylation. Knowledge of the main target sequences for Sau1 will facilitate the synthesis of new vectors for transformation of the most prevalent lineages of this 'untransformable' bacterium.


Subject(s)
DNA Modification Methylases/chemistry , DNA Modification Methylases/metabolism , Deoxyribonucleases, Type I Site-Specific/chemistry , Deoxyribonucleases, Type I Site-Specific/metabolism , Staphylococcus aureus/enzymology , Amino Acid Sequence , DNA/chemistry , DNA/metabolism , Protein Domains , Sequence Analysis, DNA , Staphylococcus aureus/genetics , Transformation, Bacterial
13.
Nucleic Acids Res ; 45(15): 9005-9018, 2017 Sep 06.
Article in English | MEDLINE | ID: mdl-28911108

ABSTRACT

Two restriction-modification systems have been previously discovered in Thermus aquaticus YT-1. TaqI is a 263-amino acid (aa) Type IIP restriction enzyme that recognizes and cleaves within the symmetric sequence 5'-TCGA-3'. TaqII, in contrast, is a 1105-aa Type IIC restriction-and-modification enzyme, one of a family of Thermus homologs. TaqII was originally reported to recognize two different asymmetric sequences: 5'-GACCGA-3' and 5'-CACCCA-3'. We previously cloned the taqIIRM gene, purified the recombinant protein from Escherichia coli, and showed that TaqII recognizes the 5'-GACCGA-3' sequence only. Here, we report the discovery, isolation, and characterization of TaqIII, the third R-M system from T. aquaticus YT-1. TaqIII is a 1101-aa Type IIC/IIL enzyme and recognizes the 5'-CACCCA-3' sequence previously attributed to TaqII. The cleavage site is 11/9 nucleotides downstream of the A residue. The enzyme exhibits striking biochemical similarity to TaqII. The 93% identity between their aa sequences suggests that they have a common evolutionary origin. The genes are located on two separate plasmids, and are probably paralogs or pseudoparalogs. Putative positions and aa that specify DNA recognition were identified and recognition motifs for 6 uncharacterized Thermus-family enzymes were predicted.


Subject(s)
Bacterial Proteins/genetics , Deoxyribonucleases, Type II Site-Specific/genetics , Nucleotide Motifs , Plasmids/metabolism , Thermus/enzymology , Amino Acid Sequence , Bacterial Proteins/metabolism , Cloning, Molecular , DNA Cleavage , Deoxyribonucleases, Type II Site-Specific/metabolism , Escherichia coli/genetics , Escherichia coli/metabolism , Gene Expression , Isoenzymes/genetics , Isoenzymes/metabolism , Molecular Weight , Plasmids/chemistry , Recombinant Proteins/genetics , Recombinant Proteins/metabolism , Sequence Alignment , Sequence Homology, Amino Acid , Substrate Specificity , Thermus/genetics
14.
PLoS Genet ; 12(2): e1005854, 2016 Feb.
Article in English | MEDLINE | ID: mdl-26870957

ABSTRACT

DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities of 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active 'orphan' MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.


Subject(s)
Epigenomics , Prokaryotic Cells/metabolism , Conserved Sequence , DNA Methylation/genetics , DNA Replication/genetics , DNA Restriction-Modification Enzymes/classification , DNA Restriction-Modification Enzymes/metabolism , Evolution, Molecular , Gene Expression Regulation , Genome , Methyltransferases/metabolism , Molecular Sequence Annotation , Multigene Family , Nucleotide Motifs/genetics , Phylogeny , Substrate Specificity
15.
Int J Syst Evol Microbiol ; 68(10): 3175-3183, 2018 Oct.
Article in English | MEDLINE | ID: mdl-30156530

ABSTRACT

A Gram-stain-positive, catalase-positive and pleomorphic rod organism was isolated from malted barley in Finland, classified initially by partial 16S rRNA gene sequencing and originally deposited in the VTT Culture Collection as a strain of Propionibacterium acidipropionici (currently Acidipropionibacterium acidipropionici). The subsequent comparison of the whole 16S rRNA gene with other representatives of the genus Acidipropionibacterium revealed that the strain belongs to a novel species, most closely related to Acidipropionibacterium microaerophilum and Acidipropionibacterium acidipropionici, with similarity values of 98.46 and 98.31 %, respectively. The whole genome sequencing using PacBio RS II platform allowed further comparison of the genome with all of the other DNA sequences available for the type strains of the Acidipropionibacterium species. Those comparisons revealed the highest similarity of strain JS278T to A. acidipropionici, which was confirmed by the average nucleotide identity analysis. The genome of strain JS278T is intermediate in size compared to the A. acidipropionici and Acidipropionibacterium jensenii at 3 432 872 bp, the G+C content is 68.4 mol%. The strain fermented a wide range of carbon sources, and produced propionic acid as the major fermentation product. Besides its poor ability to grow at 37 °C and positive catalase reaction, the observed phenotype was almost indistinguishable from those of A. acidipropionici and A. jensenii. Based on our findings, we conclude that the organism represents a novel member of the genus Acidipropionibacterium, for which we propose the name Acidipropionibacteriumvirtanenii sp. nov. The type strain is JS278T (=VTT E-113202T=DSM 106790T).


Subject(s)
Hordeum/microbiology , Phylogeny , Propionibacterium/classification , Bacterial Typing Techniques , Base Composition , DNA, Bacterial/genetics , Fermentation , Finland , Propionibacterium/genetics , Propionibacterium/isolation & purification , RNA, Ribosomal, 16S/genetics , Sequence Analysis, DNA
16.
Nature ; 546(7660): 600, 2017 06 28.
Article in English | MEDLINE | ID: mdl-28658236
17.
Nucleic Acids Res ; 44(D1): D330-5, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26635392

ABSTRACT

The COMBREX database (COMBREX-DB; combrex.bu.edu) is an online repository of information related to (i) experimentally determined protein function, (ii) predicted protein function, (iii) relationships among proteins of unknown function and various types of experimental data, including molecular function, protein structure, and associated phenotypes. The database was created as part of the novel COMBREX (COMputational BRidges to EXperiments) effort aimed at accelerating the rate of gene function validation. It currently holds information on ∼ 3.3 million known and predicted proteins from over 1000 completely sequenced bacterial and archaeal genomes. The database also contains a prototype recommendation system for helping users identify those proteins whose experimental determination of function would be most informative for predicting function for other proteins within protein families. The emphasis on documenting experimental evidence for function predictions, and the prioritization of uncharacterized proteins for experimental testing distinguish COMBREX from other publicly available microbial genomics resources. This article describes updates to COMBREX-DB since an initial description in the 2011 NAR Database Issue.


Subject(s)
Archaeal Proteins/physiology , Bacterial Proteins/physiology , Databases, Protein , Archaeal Proteins/chemistry , Archaeal Proteins/genetics , Bacterial Proteins/chemistry , Bacterial Proteins/genetics , Molecular Sequence Annotation
18.
Nucleic Acids Res ; 44(19): 9413-9425, 2016 Nov 02.
Article in English | MEDLINE | ID: mdl-27580720

ABSTRACT

We identify a new subgroup of Type I Restriction-Modification enzymes that modify cytosine in one DNA strand and adenine in the opposite strand for host protection. Recognition specificity has been determined for ten systems using SMRT sequencing and each recognizes a novel DNA sequence motif. Previously characterized Type I systems use two identical copies of a single methyltransferase (MTase) subunit, with one bound at each half site of the specificity (S) subunit to form the MTase. The new m4C-producing Type I systems we describe have two separate yet highly similar MTase subunits that form a heterodimeric M1M2S MTase. The MTase subunits from these systems group into two families, one of which has NPPF in the highly conserved catalytic motif IV and modifies adenine to m6A, and one having an NPPY catalytic motif IV and modifying cytosine to m4C. The high degree of similarity among their cytosine-recognizing components (MTase and S) suggest they have recently evolved, most likely from the far more common m6A Type I systems. Type I enzymes that modify cytosine exclusively were formed by replacing the adenine target recognition domain (TRD) with a cytosine-recognizing TRD. These are the first examples of m4C modification in Type I RM systems.


Subject(s)
Cytosine/metabolism , DNA Restriction-Modification Enzymes/metabolism , DNA/metabolism , Adenine/metabolism , Amino Acid Sequence , Catalysis , Computational Biology/methods , DNA/chemistry , DNA Restriction-Modification Enzymes/chemistry , DNA Restriction-Modification Enzymes/genetics , Methylation , Methyltransferases/chemistry , Methyltransferases/metabolism , Mutation , Nucleotide Motifs , Protein Subunits/chemistry , Protein Subunits/metabolism , Substrate Specificity
19.
Proc Natl Acad Sci U S A ; 112(14): 4316-21, 2015 Apr 07.
Article in English | MEDLINE | ID: mdl-25831492

ABSTRACT

Modified DNA bases in mammalian genomes, such as 5-methylcytosine ((5m)C) and its oxidized forms, are implicated in important epigenetic regulation processes. In human or mouse, successive enzymatic conversion of (5m)C to its oxidized forms is carried out by the ten-eleven translocation (TET) proteins. Previously we reported the structure of a TET-like (5m)C oxygenase (NgTET1) from Naegleria gruberi, a single-celled protist evolutionarily distant from vertebrates. Here we show that NgTET1 is a 5-methylpyrimidine oxygenase, with activity on both (5m)C (major activity) and thymidine (T) (minor activity) in all DNA forms tested, and provide unprecedented evidence for the formation of 5-formyluridine ((5f)U) and 5-carboxyuridine ((5ca)U) in vitro. Mutagenesis studies reveal a delicate balance between choice of (5m)C or T as the preferred substrate. Furthermore, our results suggest substrate preference by NgTET1 to (5m)CpG and TpG dinucleotide sites in DNA. Intriguingly, NgTET1 displays higher T-oxidation activity in vitro than mammalian TET1, supporting a closer evolutionary relationship between NgTET1 and the base J-binding proteins from trypanosomes. Finally, we demonstrate that NgTET1 can be readily used as a tool in (5m)C sequencing technologies such as single molecule, real-time sequencing to map (5m)C in bacterial genomes at base resolution.


Subject(s)
5-Methylcytosine/chemistry , Naegleria/enzymology , Oxygenases/chemistry , Protozoan Proteins/chemistry , Algorithms , Animals , Cytosine/chemistry , DNA/chemistry , DNA-Binding Proteins/chemistry , Epigenesis, Genetic , Epigenomics , Humans , Mice , Mixed Function Oxygenases/chemistry , Mutation , Oxygen/chemistry , Phylogeny , Proto-Oncogene Proteins/chemistry , Sequence Analysis, DNA , Thymidine/chemistry
20.
BMC Genomics ; 18(1): 790, 2017 Oct 16.
Article in English | MEDLINE | ID: mdl-29037147

ABSTRACT

BACKGROUND: Propionibacterium freudenreichii is an industrially important bacterium granted the Generally Recognized as Safe (the GRAS) status, due to its long safe use in food bioprocesses. Despite the recognized role in the food industry and in the production of vitamin B12, as well as its documented health-promoting potential, P. freudenreichii remained poorly characterised at the genomic level. At present, only three complete genome sequences are available for the species. RESULTS: We used the PacBio RS II sequencing platform to generate complete genomes of 20 P. freudenreichii strains and compared them in detail. Comparative analyses revealed both sequence conservation and genome organisational diversity among the strains. Assembly from long reads resulted in the discovery of additional circular elements: two putative conjugative plasmids and three active, lysogenic bacteriophages. It also permitted characterisation of the CRISPR-Cas systems. The use of the PacBio sequencing platform allowed identification of DNA modifications, which in turn allowed characterisation of the restriction-modification systems together with their recognition motifs. The observed genomic differences suggested strain variation in surface piliation and specific mucus binding, which were validated by experimental studies. The phenotypic characterisation displayed large diversity between the strains in ability to utilise a range of carbohydrates, to grow at unfavourable conditions and to form a biofilm. CONCLUSION: The complete genome sequencing allowed detailed characterisation of the industrially important species, P. freudenreichii by facilitating the discovery of previously unknown features. The results presented here lay a solid foundation for future genetic and functional genomic investigations of this actinobacterial species.


Subject(s)
Genomics/methods , Propionibacterium freudenreichii/genetics , Adaptation, Physiological/genetics , Genome, Bacterial/genetics , Genomic Islands/genetics , Plasmids/genetics , Sequence Analysis
SELECTION OF CITATIONS
SEARCH DETAIL