Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Bioinformatics ; 40(4)2024 Mar 29.
Artículo en Inglés | MEDLINE | ID: mdl-38507682

RESUMEN

MOTIVATION: Reliable prediction of protein thermostability from its sequence is valuable for both academic and industrial research. This prediction problem can be tackled using machine learning and by taking advantage of the recent blossoming of deep learning methods for sequence analysis. These methods can facilitate training on more data and, possibly, enable the development of more versatile thermostability predictors for multiple ranges of temperatures. RESULTS: We applied the principle of transfer learning to predict protein thermostability using embeddings generated by protein language models (pLMs) from an input protein sequence. We used large pLMs that were pre-trained on hundreds of millions of known sequences. The embeddings from such models allowed us to efficiently train and validate a high-performing prediction method using over one million sequences that we collected from organisms with annotated growth temperatures. Our method, TemStaPro (Temperatures of Stability for Proteins), was used to predict thermostability of CRISPR-Cas Class II effector proteins (C2EPs). Predictions indicated sharp differences among groups of C2EPs in terms of thermostability and were largely in tune with previously published and our newly obtained experimental data. AVAILABILITY AND IMPLEMENTATION: TemStaPro software and the related data are freely available from https://github.com/ievapudz/TemStaPro and https://doi.org/10.5281/zenodo.7743637.


Asunto(s)
Aprendizaje Automático , Proteínas , Proteínas/metabolismo , Programas Informáticos , Secuencia de Aminoácidos , Lenguaje
2.
Nucleic Acids Res ; 52(6): 3234-3248, 2024 Apr 12.
Artículo en Inglés | MEDLINE | ID: mdl-38261981

RESUMEN

Cas9 and Cas12 nucleases of class 2 CRISPR-Cas systems provide immunity in prokaryotes through RNA-guided cleavage of foreign DNA. Here we characterize a set of compact CRISPR-Cas12m (subtype V-M) effector proteins and show that they provide protection against bacteriophages and plasmids through the targeted DNA binding rather than DNA cleavage. Biochemical assays suggest that Cas12m effectors can act as roadblocks inhibiting DNA transcription and/or replication, thereby triggering interference against invaders. Cryo-EM structure of Gordonia otitidis (Go) Cas12m ternary complex provided here reveals the structural mechanism of DNA binding ensuring interference. Harnessing GoCas12m innate ability to bind DNA target we fused it with adenine deaminase TadA-8e and showed an efficient A-to-G editing in Escherichia coli and human cells. Overall, this study expands our understanding of the functionally diverse Cas12 protein family, revealing DNA-binding dependent interference mechanism of Cas12m effectors that could be harnessed for engineering of compact base-editing tools.


Asunto(s)
Sistemas CRISPR-Cas , Edición Génica , Humanos , ADN/genética , Endonucleasas/metabolismo , Plásmidos/genética , Escherichia coli/genética , Escherichia coli/metabolismo
3.
Nature ; 616(7956): 384-389, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-37020015

RESUMEN

The widespread TnpB proteins of IS200/IS605 transposon family have recently emerged as the smallest RNA-guided nucleases capable of targeted genome editing in eukaryotic cells1,2. Bioinformatic analysis identified TnpB proteins as the likely predecessors of Cas12 nucleases3-5, which along with Cas9 are widely used for targeted genome manipulation. Whereas Cas12 family nucleases are well characterized both biochemically and structurally6, the molecular mechanism of TnpB remains unknown. Here we present the cryogenic-electron microscopy structures of the Deinococcus radiodurans TnpB-reRNA (right-end transposon element-derived RNA) complex in DNA-bound and -free forms. The structures reveal the basic architecture of TnpB nuclease and the molecular mechanism for DNA target recognition and cleavage that is supported by biochemical experiments. Collectively, these results demonstrate that TnpB represents the minimal structural and functional core of the Cas12 protein family and provide a framework for developing TnpB-based genome editing tools.


Asunto(s)
Proteínas Asociadas a CRISPR , Elementos Transponibles de ADN , Deinococcus , Endonucleasas , Edición Génica , Proteínas Asociadas a CRISPR/química , Proteínas Asociadas a CRISPR/clasificación , Proteínas Asociadas a CRISPR/metabolismo , Proteínas Asociadas a CRISPR/ultraestructura , Sistemas CRISPR-Cas/genética , Microscopía por Crioelectrón , Deinococcus/enzimología , Deinococcus/genética , ADN/química , ADN/genética , ADN/metabolismo , ADN/ultraestructura , Elementos Transponibles de ADN/genética , Endonucleasas/química , Endonucleasas/clasificación , Endonucleasas/metabolismo , Endonucleasas/ultraestructura , Evolución Molecular , Edición Génica/métodos , ARN Guía de Sistemas CRISPR-Cas
4.
Cell ; 185(21): 4023-4037.e18, 2022 10 13.
Artículo en Inglés | MEDLINE | ID: mdl-36174579

RESUMEN

High-throughput RNA sequencing offers broad opportunities to explore the Earth RNA virome. Mining 5,150 diverse metatranscriptomes uncovered >2.5 million RNA virus contigs. Analysis of >330,000 RNA-dependent RNA polymerases (RdRPs) shows that this expansion corresponds to a 5-fold increase of the known RNA virus diversity. Gene content analysis revealed multiple protein domains previously not found in RNA viruses and implicated in virus-host interactions. Extended RdRP phylogeny supports the monophyly of the five established phyla and reveals two putative additional bacteriophage phyla and numerous putative additional classes and orders. The dramatically expanded phylum Lenarviricota, consisting of bacterial and related eukaryotic viruses, now accounts for a third of the RNA virome. Identification of CRISPR spacer matches and bacteriolytic proteins suggests that subsets of picobirnaviruses and partitiviruses, previously associated with eukaryotes, infect prokaryotic hosts.


Asunto(s)
Bacteriófagos , Virus ARN , Bacteriófagos/genética , ARN Polimerasas Dirigidas por ADN/genética , Genoma Viral , Filogenia , ARN , Virus ARN/genética , ARN Polimerasa Dependiente del ARN/genética , Viroma
5.
PLoS Biol ; 19(11): e3001442, 2021 11.
Artículo en Inglés | MEDLINE | ID: mdl-34752450

RESUMEN

The archaeal tailed viruses (arTV), evolutionarily related to tailed double-stranded DNA (dsDNA) bacteriophages of the class Caudoviricetes, represent the most common isolates infecting halophilic archaea. Only a handful of these viruses have been genomically characterized, limiting our appreciation of their ecological impacts and evolution. Here, we present 37 new genomes of haloarchaeal tailed virus isolates, more than doubling the current number of sequenced arTVs. Analysis of all 63 available complete genomes of arTVs, which we propose to classify into 14 new families and 3 orders, suggests ancient divergence of archaeal and bacterial tailed viruses and points to an extensive sharing of genes involved in DNA metabolism and counterdefense mechanisms, illuminating common strategies of virus-host interactions with tailed bacteriophages. Coupling of the comparative genomics with the host range analysis on a broad panel of haloarchaeal species uncovered 4 distinct groups of viral tail fiber adhesins controlling the host range expansion. The survey of metagenomes using viral hallmark genes suggests that the global architecture of the arTV community is shaped through recurrent transfers between different biomes, including hypersaline, marine, and anoxic environments.


Asunto(s)
Virus de Archaea/clasificación , Virus de Archaea/genética , Evolución Biológica , Variación Genética , Virus de Archaea/metabolismo , ADN/genética , ADN Viral/genética , Genoma Viral , Especificidad del Huésped , Mutación/genética , Filogenia , Células Procariotas/virología , Proteínas Virales/genética
6.
Nature ; 599(7886): 692-696, 2021 11.
Artículo en Inglés | MEDLINE | ID: mdl-34619744

RESUMEN

Transposition has a key role in reshaping genomes of all living organisms1. Insertion sequences of IS200/IS605 and IS607 families2 are among the simplest mobile genetic elements and contain only the genes that are required for their transposition and its regulation. These elements encode tnpA transposase, which is essential for mobilization, and often carry an accessory tnpB gene, which is dispensable for transposition. Although the role of TnpA in transposon mobilization of IS200/IS605 is well documented, the function of TnpB has remained largely unknown. It had been suggested that TnpB has a role in the regulation of transposition, although no mechanism for this has been established3-5. A bioinformatic analysis indicated that TnpB might be a predecessor of the CRISPR-Cas9/Cas12 nucleases6-8. However, no biochemical activities have been ascribed to TnpB. Here we show that TnpB of Deinococcus radiodurans ISDra2 is an RNA-directed nuclease that is guided by an RNA, derived from the right-end element of a transposon, to cleave DNA next to the 5'-TTGAT transposon-associated motif. We also show that TnpB could be reprogrammed to cleave DNA target sites in human cells. Together, this study expands our understanding of transposition mechanisms by highlighting the role of TnpB in transposition, experimentally confirms that TnpB is a functional progenitor of CRISPR-Cas nucleases and establishes TnpB as a prototype of a new system for genome editing.


Asunto(s)
Elementos Transponibles de ADN/genética , Deinococcus/enzimología , Deinococcus/genética , Desoxirribonucleasa I/genética , Desoxirribonucleasa I/metabolismo , ARN/genética , Secuencia de Bases , Proteínas Asociadas a CRISPR/metabolismo , Sistemas CRISPR-Cas , Escherichia coli/genética , Edición Génica , Células HEK293 , Humanos , Motivos de Nucleótidos
7.
Front Microbiol ; 12: 699140, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34267740

RESUMEN

Bam35 and related betatectiviruses are tail-less bacteriophages that prey on members of the Bacillus cereus group. These temperate viruses replicate their linear genome by a protein-primed mechanism. In this work, we have identified and characterized the product of the viral ORF2 as a single-stranded DNA binding protein (hereafter B35SSB). B35SSB binds ssDNA with great preference over dsDNA or RNA in a sequence-independent, highly cooperative manner that results in a non-specific stimulation of DNA replication. We have also identified several aromatic and basic residues, involved in base-stacking and electrostatic interactions, respectively, that are required for effective protein-ssDNA interaction. Although SSBs are essential for DNA replication in all domains of life as well as many viruses, they are very diverse proteins. However, most SSBs share a common structural domain, named OB-fold. Protein-primed viruses could constitute an exception, as no OB-fold DNA binding protein has been reported. Based on databases searches as well as phylogenetic and structural analyses, we showed that B35SSB belongs to a novel and independent group of SSBs. This group contains proteins encoded by protein-primed viral genomes from unrelated viruses, spanning betatectiviruses and Φ29 and close podoviruses, and they share a conserved pattern of secondary structure. Sensitive searches and structural predictions indicate that B35SSB contains a conserved domain resembling a divergent OB-fold, which would constitute the first occurrence of an OB-fold-like domain in a protein-primed genome.

8.
Int J Mol Sci ; 22(14)2021 Jul 08.
Artículo en Inglés | MEDLINE | ID: mdl-34298953

RESUMEN

A novel siphovirus, vB_PagS_MED16 (MED16) was isolated in Lithuania using Pantoea agglomerans strain BSL for the phage propagation. The double-stranded DNA genome of MED16 (46,103 bp) contains 73 predicted open reading frames (ORFs) encoding proteins, but no tRNA. Our comparative sequence analysis revealed that 26 of these ORFs code for unique proteins that have no reliable identity when compared to database entries. Based on phylogenetic analysis, MED16 represents a new genus with siphovirus morphology. In total, 35 MED16 ORFs were given a putative functional annotation, including those coding for the proteins responsible for virion morphogenesis, phage-host interactions, and DNA metabolism. In addition, a gene encoding a preQ0 DNA deoxyribosyltransferase (DpdA) is present in the genome of MED16 and the LC-MS/MS analysis indicates 2'-deoxy-7-amido-7-deazaguanosine (dADG)-modified phage DNA, which, to our knowledge, has never been experimentally validated in genomes of Pantoea phages. Thus, the data presented in this study provide new information on Pantoea-infecting viruses and offer novel insights into the diversity of DNA modifications in bacteriophages.


Asunto(s)
ADN Viral , Genoma Viral , Guanosina , Sistemas de Lectura Abierta , Pantoea/virología , Siphoviridae , Proteínas Virales , ADN Viral/genética , ADN Viral/metabolismo , Guanosina/análogos & derivados , Guanosina/química , Guanosina/metabolismo , Siphoviridae/genética , Siphoviridae/metabolismo , Proteínas Virales/genética , Proteínas Virales/metabolismo
9.
Biochim Biophys Acta Gen Subj ; 1865(10): 129967, 2021 10.
Artículo en Inglés | MEDLINE | ID: mdl-34324954

RESUMEN

BACKGROUND: Bacterial viruses (bacteriophages or phages) have a lot of uncharacterized genes, which hinders the progress of their applied research. Functional characterization of these genes is often hampered by a lack of suitable methods for engineering of phage genomes. METHODS: Phages vB_EcoM_Alf5 (Alf5) and VB_EcoM_VpaE1 (VpaE1) were used as the model phages of Felixounovirus genus. The phage-coded properties were predicted by bioinformatics analysis. The 'pull-down' assay was used for detection of protein-protein interactions. Primer extension analysis was used for the DNA polymerase (DNAP) activity testing. Bacteriophage lambda Redγßα-assisted homologous recombination was used for construction of phage mutants. RESULTS: Bioinformatics analysis showed that felixounoviruses encode DNA polymerase, which is homologous to the T7 DNAP. We found that the Escherichia coli thioredoxin A (TrxA) in vitro interacts with the predicted DNAP of Alf5 phage (gp096) and enhances its activity. Phages Alf5 and VpaE1 do not grow on E. coli strains lacking trxA gene unless it is provided in trans. This feature was used for construction of the deletion/insertion mutants of non-essential genes of felixounoviruses. CONCLUSION: DNA replication of phages from Felixonuvirus genus depends on the host trxA, which therefore may be used as a molecular marker for their genome engineering. GENERAL SIGNIFICANCE: We present a proof-of-principle of a strategy for targeted engineering of bacteriophages of Felixounovirus genus. The method developed here will facilitate the basic and applied research of this unexplored phage group. Furthermore, detected functional interactions between the phage and host proteins will be significant for basic research of DNA replication.


Asunto(s)
Bacteriófagos/genética , Proteínas de Escherichia coli/genética , Escherichia coli/genética , Ingeniería Genética , Tiorredoxinas/genética , Biomarcadores
10.
Nat Commun ; 11(1): 5512, 2020 11 02.
Artículo en Inglés | MEDLINE | ID: mdl-33139742

RESUMEN

Bacterial Cas9 nucleases from type II CRISPR-Cas antiviral defence systems have been repurposed as genome editing tools. Although these proteins are found in many microbes, only a handful of variants are used for these applications. Here, we use bioinformatic and biochemical analyses to explore this largely uncharacterized diversity. We apply cell-free biochemical screens to assess the protospacer adjacent motif (PAM) and guide RNA (gRNA) requirements of 79 Cas9 proteins, thus identifying at least 7 distinct gRNA classes and 50 different PAM sequence requirements. PAM recognition spans the entire spectrum of T-, A-, C-, and G-rich nucleotides, from single nucleotide recognition to sequence strings longer than 4 nucleotides. Characterization of a subset of Cas9 orthologs using purified components reveals additional biochemical diversity, including both narrow and broad ranges of temperature dependence, staggered-end DNA target cleavage, and a requirement for long stretches of homology between gRNA and DNA target. Our results expand the available toolset of RNA-programmable CRISPR-associated nucleases.


Asunto(s)
Proteína 9 Asociada a CRISPR/genética , Sistemas CRISPR-Cas/genética , Edición Génica/métodos , ARN Guía de Kinetoplastida/genética , Secuencia de Bases , Proteína 9 Asociada a CRISPR/metabolismo , Biología Computacional , División del ADN , ARN Guía de Kinetoplastida/metabolismo , Homología de Secuencia de Ácido Nucleico
11.
Nucleic Acids Res ; 48(18): 10142-10156, 2020 10 09.
Artículo en Inglés | MEDLINE | ID: mdl-32976577

RESUMEN

B-family DNA polymerases (PolBs) represent the most common replicases. PolB enzymes that require RNA (or DNA) primed templates for DNA synthesis are found in all domains of life and many DNA viruses. Despite extensive research on PolBs, their origins and evolution remain enigmatic. Massive accumulation of new genomic and metagenomic data from diverse habitats as well as availability of new structural information prompted us to conduct a comprehensive analysis of the PolB sequences, structures, domain organizations, taxonomic distribution and co-occurrence in genomes. Based on phylogenetic analysis, we identified a new, widespread group of bacterial PolBs that are more closely related to the catalytically active N-terminal half of the eukaryotic PolEpsilon (PolEpsilonN) than to Escherichia coli Pol II. In Archaea, we characterized six new groups of PolBs. Two of them show close relationships with eukaryotic PolBs, the first one with PolEpsilonN, and the second one with PolAlpha, PolDelta and PolZeta. In addition, structure comparisons suggested common origin of the catalytically inactive C-terminal half of PolEpsilon (PolEpsilonC) and PolAlpha. Finally, in certain archaeal PolBs we discovered C-terminal Zn-binding domains closely related to those of PolAlpha and PolEpsilonC. Collectively, the obtained results allowed us to propose a scenario for the evolution of eukaryotic PolBs.


Asunto(s)
ADN Polimerasa beta/química , ADN Polimerasa beta/clasificación , Eucariontes/enzimología , Evolución Molecular , Archaea/enzimología , Bacterias/enzimología , Virus ADN/enzimología , Bases de Datos de Proteínas
12.
Nat Microbiol ; 5(10): 1262-1270, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-32690954

RESUMEN

RNA viruses in aquatic environments remain poorly studied. Here, we analysed the RNA virome from approximately 10 l water from Yangshan Deep-Water Harbour near the Yangtze River estuary in China and identified more than 4,500 distinct RNA viruses, doubling the previously known set of viruses. Phylogenomic analysis identified several major lineages, roughly, at the taxonomic ranks of class, order and family. The 719-member-strong Yangshan virus assemblage is the sister clade to the expansive class Alsuviricetes and consists of viruses with simple genomes that typically encode only RNA-dependent RNA polymerase (RdRP), capping enzyme and capsid protein. Several clades within the Yangshan assemblage independently evolved domain permutation in the RdRP. Another previously unknown clade shares ancestry with Potyviridae, the largest known plant virus family. The 'Aquatic picorna-like viruses/Marnaviridae' clade was greatly expanded, with more than 800 added viruses. Several RdRP-linked protein domains not previously detected in any RNA viruses were identified, such as the small ubiquitin-like modifier (SUMO) domain, phospholipase A2 and PrsW-family protease domain. Multiple viruses utilize alternative genetic codes implying protist (especially ciliate) hosts. The results reveal a vast RNA virome that includes many previously unknown groups. However, phylogenetic analysis of the RdRPs supports the previously established five-branch structure of the RNA virus evolutionary tree, with no additional phyla.


Asunto(s)
Genoma Viral , Metagenoma , Metagenómica , Virus ARN/clasificación , Virus ARN/genética , Secuencia de Aminoácidos , Biodiversidad , China , Biología Computacional/métodos , Evolución Molecular , Orden Génico , Metagenómica/métodos , Filogenia , Proteínas Virales/química , Proteínas Virales/genética , Microbiología del Agua
14.
Nat Commun ; 10(1): 3425, 2019 07 31.
Artículo en Inglés | MEDLINE | ID: mdl-31366885

RESUMEN

Single-stranded (ss) DNA viruses are a major component of the earth virome. In particular, the circular, Rep-encoding ssDNA (CRESS-DNA) viruses show high diversity and abundance in various habitats. By combining sequence similarity network and phylogenetic analyses of the replication proteins (Rep) belonging to the HUH endonuclease superfamily, we show that the replication machinery of the CRESS-DNA viruses evolved, on three independent occasions, from the Reps of bacterial rolling circle-replicating plasmids. The CRESS-DNA viruses emerged via recombination between such plasmids and cDNA copies of capsid genes of eukaryotic positive-sense RNA viruses. Similarly, the rep genes of prokaryotic DNA viruses appear to have evolved from HUH endonuclease genes of various bacterial and archaeal plasmids. Our findings also suggest that eukaryotic polyomaviruses and papillomaviruses with dsDNA genomes have evolved via parvoviruses from CRESS-DNA viruses. Collectively, our results shed light on the complex evolutionary history of a major class of viruses revealing its polyphyletic origins.


Asunto(s)
Archaea/genética , Bacterias/genética , ADN Viral/genética , Evolución Molecular , Plásmidos/genética , Secuencia de Bases , ADN Helicasas/genética , ADN de Cadena Simple/genética , Genoma Viral/genética , Papillomaviridae/genética , Parvovirus/genética , Poliomavirus/genética , Alineación de Secuencia
16.
mBio ; 9(6)2018 11 27.
Artículo en Inglés | MEDLINE | ID: mdl-30482837

RESUMEN

Viruses with RNA genomes dominate the eukaryotic virome, reaching enormous diversity in animals and plants. The recent advances of metaviromics prompted us to perform a detailed phylogenomic reconstruction of the evolution of the dramatically expanded global RNA virome. The only universal gene among RNA viruses is the gene encoding the RNA-dependent RNA polymerase (RdRp). We developed an iterative computational procedure that alternates the RdRp phylogenetic tree construction with refinement of the underlying multiple-sequence alignments. The resulting tree encompasses 4,617 RNA virus RdRps and consists of 5 major branches; 2 of the branches include positive-sense RNA viruses, 1 is a mix of positive-sense (+) RNA and double-stranded RNA (dsRNA) viruses, and 2 consist of dsRNA and negative-sense (-) RNA viruses, respectively. This tree topology implies that dsRNA viruses evolved from +RNA viruses on at least two independent occasions, whereas -RNA viruses evolved from dsRNA viruses. Reconstruction of RNA virus evolution using the RdRp tree as the scaffold suggests that the last common ancestors of the major branches of +RNA viruses encoded only the RdRp and a single jelly-roll capsid protein. Subsequent evolution involved independent capture of additional genes, in particular, those encoding distinct RNA helicases, enabling replication of larger RNA genomes and facilitating virus genome expression and virus-host interactions. Phylogenomic analysis reveals extensive gene module exchange among diverse viruses and horizontal virus transfer between distantly related hosts. Although the network of evolutionary relationships within the RNA virome is bound to further expand, the present results call for a thorough reevaluation of the RNA virus taxonomy.IMPORTANCE The majority of the diverse viruses infecting eukaryotes have RNA genomes, including numerous human, animal, and plant pathogens. Recent advances of metagenomics have led to the discovery of many new groups of RNA viruses in a wide range of hosts. These findings enable a far more complete reconstruction of the evolution of RNA viruses than was attainable previously. This reconstruction reveals the relationships between different Baltimore classes of viruses and indicates extensive transfer of viruses between distantly related hosts, such as plants and animals. These results call for a major revision of the existing taxonomy of RNA viruses.


Asunto(s)
Evolución Molecular , Filogenia , Virus ARN/clasificación , Virus ARN/genética , Animales , Análisis por Conglomerados , Biología Computacional/métodos , Plantas , ARN Polimerasa Dependiente del ARN/genética , Homología de Secuencia de Aminoácido , Proteínas Virales/genética
17.
Viruses ; 10(4)2018 04 10.
Artículo en Inglés | MEDLINE | ID: mdl-29642587

RESUMEN

Numerous metagenomic studies have uncovered a remarkable diversity of circular replication-associated protein (Rep)-encoding single-stranded (CRESS) DNA viruses, the majority of which are uncultured and unclassified. Unlike capsid proteins, the Reps show significant similarity across different groups of CRESS DNA viruses and have conserved domain organization with the N-terminal nuclease and the C-terminal helicase domain. Consequently, Rep is widely used as a marker for identification, classification and assessment of the diversity of CRESS DNA viruses. However, it has been shown that in certain viruses the Rep nuclease and helicase domains display incongruent evolutionary histories. Here, we systematically evaluated the co-evolutionary patterns of the two Rep domains across classified and unclassified CRESS DNA viruses. Our analysis indicates that the Reps encoded by members of the families Bacilladnaviridae, Circoviridae, Geminiviridae, Genomoviridae, Nanoviridae and Smacoviridae display largely congruent evolutionary patterns in the two domains. By contrast, among the unclassified CRESS DNA viruses, 71% appear to have chimeric Reps. Such massive chimerism suggests that unclassified CRESS DNA viruses represent a dynamic population in which exchange of gene fragments encoding the nuclease and helicase domains is extremely common. Furthermore, purging of the chimeric sequences uncovered six monophyletic Rep groups that may represent new families of CRESS DNA viruses.


Asunto(s)
Quimerismo , Virus ADN/clasificación , Virus ADN/genética , ADN de Cadena Simple/genética , Filogenia , Evolución Molecular , Genoma Viral/genética , Metagenómica , Dominios Proteicos/genética , Proteínas Virales/genética
18.
J Mol Biol ; 430(5): 737-750, 2018 03 02.
Artículo en Inglés | MEDLINE | ID: mdl-29198957

RESUMEN

Cellular organisms in different domains of life employ structurally unrelated, non-homologous DNA primases for synthesis of a primer for DNA replication. Archaea and eukaryotes encode enzymes of the archaeo-eukaryotic primase (AEP) superfamily, whereas bacteria uniformly use primases of the DnaG family. However, AEP genes are widespread in bacterial genomes raising questions regarding their provenance and function. Here, using an archaeal primase-polymerase PolpTN2 encoded by pTN2 plasmid as a seed for sequence similarity searches, we recovered over 800 AEP homologs from bacteria belonging to 12 highly diverse phyla. These sequences formed a supergroup, PrimPol-PV1, and could be classified into five novel AEP families which are characterized by a conserved motif containing an arginine residue likely to be involved in nucleotide binding. Functional assays confirm the essentiality of this motif for catalytic activity of the PolpTN2 primase-polymerase. Further analyses showed that bacterial AEPs display a range of domain organizations and uncovered several candidates for novel families of helicases. Furthermore, sequence and structure comparisons suggest that PriCT-1 and PriCT-2 domains frequently fused to the AEP domains are related to each other as well as to the non-catalytic, large subunit of archaeal and eukaryotic primases, and to the recently discovered PriX subunit of archaeal primases. Finally, genomic neighborhood analysis indicates that the identified AEPs encoded in bacterial genomes are nearly exclusively associated with highly diverse integrated mobile genetic elements, including integrative conjugative plasmids and prophages.


Asunto(s)
Archaea/enzimología , Archaea/genética , Bacterias/genética , ADN Primasa/metabolismo , Eucariontes/enzimología , Eucariontes/genética , Secuencia de Aminoácidos , Bacterias/enzimología , Dominio Catalítico , ADN Helicasas/metabolismo , ADN Primasa/clasificación , Replicación del ADN , Evolución Molecular , Secuencias Repetitivas Esparcidas , Modelos Moleculares , Conformación Proteica , Dominios Proteicos , Alineación de Secuencia , Thermococcus/genética
19.
Virology ; 504: 114-121, 2017 04.
Artículo en Inglés | MEDLINE | ID: mdl-28189969

RESUMEN

Bacilladnaviruses have single-stranded (ss) DNA genomes and infect diatoms, a major group of unicellular algae widespread in aquatic habitats. Despite their ecological importance, the provenance and relationships of bacilladnaviruses to other eukaryotic viruses remain unclear. Accordingly, they are currently classified into the 'floating' genus Bacilladnavirus. Here we present three new bacilladnavirus genomes recovered from a mollusc Amphibola crenata and benthic sediments from the Avon-Heathcote estuary in New Zealand. Our analysis shows that the rolling-circle replication-initiation proteins of bacilladnaviruses display unique conserved motifs and in phylogenetic trees form a monophyletic clade separated from other groups of ssDNA viruses. Unexpectedly, distant homology detection combined with structural modeling indicates that bacilladnavirus capsid proteins are homologous to those of ssRNA viruses from the Nodaviridae family. Considering the sequence diversity within the expanding Bacilladnavirus genus, we argue that classification of these viruses has to be revised and the current genus upgraded to the family level.


Asunto(s)
Proteínas de la Cápside/genética , Virus ADN/clasificación , Virus ADN/genética , Gastrópodos/virología , Animales , Virus ADN/aislamiento & purificación , Virus ADN/ultraestructura , ADN Circular/genética , ADN de Cadena Simple/genética , ADN Viral/genética , Evolución Molecular , Variación Genética/genética , Genoma Viral/genética , Nueva Zelanda , Nodaviridae/genética , Replicación Viral/genética
20.
Nucleic Acids Res ; 44(10): 4551-64, 2016 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-27112572

RESUMEN

Genomic DNA replication is a complex process that involves multiple proteins. Cellular DNA replication systems are broadly classified into only two types, bacterial and archaeo-eukaryotic. In contrast, double-stranded (ds) DNA viruses feature a much broader diversity of DNA replication machineries. Viruses differ greatly in both completeness and composition of their sets of DNA replication proteins. In this study, we explored whether there are common patterns underlying this extreme diversity. We identified and analyzed all major functional groups of DNA replication proteins in all available proteomes of dsDNA viruses. Our results show that some proteins are common to viruses infecting all domains of life and likely represent components of the ancestral core set. These include B-family polymerases, SF3 helicases, archaeo-eukaryotic primases, clamps and clamp loaders of the archaeo-eukaryotic type, RNase H and ATP-dependent DNA ligases. We also discovered a clear correlation between genome size and self-sufficiency of viral DNA replication, the unanticipated dominance of replicative helicases and pervasive functional associations among certain groups of DNA replication proteins. Altogether, our results provide a comprehensive view on the diversity and evolution of replication systems in the DNA virome and uncover fundamental principles underlying the orchestration of viral DNA replication.


Asunto(s)
Replicación del ADN , Virus ADN/genética , Genoma Viral , Proteínas Virales/genética , ADN , ADN Helicasas/genética , ADN Helicasas/metabolismo , ADN Primasa/genética , ADN Primasa/metabolismo , ADN-Topoisomerasas/genética , ADN-Topoisomerasas/metabolismo , Transferencia de Gen Horizontal , Tamaño del Genoma , Interacciones Huésped-Patógeno/genética , Filogenia , ARN Polimerasa Dependiente del ARN/genética , ARN Polimerasa Dependiente del ARN/metabolismo , Proteínas Virales/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...