Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 46
Filter
1.
Proc Natl Acad Sci U S A ; 120(48): e2308224120, 2023 Nov 28.
Article in English | MEDLINE | ID: mdl-37983496

ABSTRACT

The TnpB proteins are transposon-associated RNA-guided nucleases that are among the most abundant proteins encoded in bacterial and archaeal genomes, but whose functions in the transposon life cycle remain unknown. TnpB appears to be the evolutionary ancestor of Cas12, the effector nuclease of type V CRISPR-Cas systems. We performed a comprehensive census of TnpBs in archaeal and bacterial genomes and constructed a phylogenetic tree on which we mapped various features of these proteins. In multiple branches of the tree, the catalytic site of the TnpB nuclease is rearranged, demonstrating structural and probably biochemical malleability of this enzyme. We identified numerous cases of apparent recruitment of TnpB for other functions of which the most common is the evolution of type V CRISPR-Cas effectors on about 50 independent occasions. In many other cases of more radical exaptation, the catalytic site of the TnpB nuclease is apparently inactivated, suggesting a regulatory function, whereas in others, the activity appears to be retained, indicating that the recruited TnpB functions as a nuclease, for example, as a toxin. These findings demonstrate remarkable evolutionary malleability of the TnpB scaffold and provide extensive opportunities for further exploration of RNA-guided biological systems as well as multiple applications.


Subject(s)
Bacteria , Ribonucleases , Ribonucleases/metabolism , Phylogeny , Bacteria/metabolism , Archaea/metabolism , Endonucleases/metabolism , CRISPR-Cas Systems , RNA
2.
Nucleic Acids Res ; 51(15): 8150-8168, 2023 08 25.
Article in English | MEDLINE | ID: mdl-37283088

ABSTRACT

CRISPR-cas loci typically contain CRISPR arrays with unique spacers separating direct repeats. Spacers along with portions of adjacent repeats are transcribed and processed into CRISPR(cr) RNAs that target complementary sequences (protospacers) in mobile genetic elements, resulting in cleavage of the target DNA or RNA. Additional, standalone repeats in some CRISPR-cas loci produce distinct cr-like RNAs implicated in regulatory or other functions. We developed a computational pipeline to systematically predict crRNA-like elements by scanning for standalone repeat sequences that are conserved in closely related CRISPR-cas loci. Numerous crRNA-like elements were detected in diverse CRISPR-Cas systems, mostly, of type I, but also subtype V-A. Standalone repeats often form mini-arrays containing two repeat-like sequence separated by a spacer that is partially complementary to promoter regions of cas genes, in particular cas8, or cargo genes located within CRISPR-Cas loci, such as toxins-antitoxins. We show experimentally that a mini-array from a type I-F1 CRISPR-Cas system functions as a regulatory guide. We also identified mini-arrays in bacteriophages that could abrogate CRISPR immunity by inhibiting effector expression. Thus, recruitment of CRISPR effectors for regulatory functions via spacers with partial complementarity to the target is a common feature of diverse CRISPR-Cas systems.


Subject(s)
CRISPR-Cas Systems , RNA , Repetitive Sequences, Nucleic Acid
3.
bioRxiv ; 2023 Mar 03.
Article in English | MEDLINE | ID: mdl-37090614

ABSTRACT

CRISPR- cas loci typically contain CRISPR arrays with unique spacers separating direct repeats. Spacers along with portions of adjacent repeats are transcribed and processed into CRISPR(cr) RNAs that target complementary sequences (protospacers) in mobile genetic elements, resulting in cleavage of the target DNA or RNA. Additional, standalone repeats in some CRISPR- cas loci produce distinct cr-like RNAs implicated in regulatory or other functions. We developed a computational pipeline to systematically predict crRNA-like elements by scanning for standalone repeat sequences that are conserved in closely related CRISPR- cas loci. Numerous crRNA-like elements were detected in diverse CRISPR-Cas systems, mostly, of type I, but also subtype V-A. Standalone repeats often form mini-arrays containing two repeat-like sequence separated by a spacer that is partially complementary to promoter regions of cas genes, in particular cas8 , or cargo genes located within CRISPR-Cas loci, such as toxins-antitoxins. We show experimentally that a mini-array from a type I-F1 CRISPR-Cas system functions as a regulatory guide. We also identified mini-arrays in bacteriophages that could abrogate CRISPR immunity by inhibiting effector expression. Thus, recruitment of CRISPR effectors for regulatory functions via spacers with partial complementarity to the target is a common feature of diverse CRISPR-Cas systems.

5.
mBio ; 12(6): e0293821, 2021 12 21.
Article in English | MEDLINE | ID: mdl-34872347

ABSTRACT

Transposition is a major mechanism of horizontal gene mobility in prokaryotes. However, exploration of the genes mobilized by transposons (cargo) is hampered by the difficulty in delineating integrated transposons from their surrounding genetic context. Here, we present a computational approach that allowed us to identify the boundaries of 6,549 Tn7-like transposons. We found that 96% of these transposons carry at least one cargo gene. Delineation of distinct communities in a gene-sharing network demonstrates how transposons function as a conduit of genes between phylogenetically distant hosts. Comparative analysis of the cargo genes reveals significant enrichment of mobile genetic elements (MGEs) nested within Tn7-like transposons, such as insertion sequences and toxin-antitoxin modules, and of genes involved in recombination, anti-MGE defense, and antibiotic resistance. More unexpectedly, cargo also includes genes encoding central carbon metabolism enzymes. Twenty-two Tn7-like transposons carry both an anti-MGE defense system and antibiotic resistance genes, illustrating how bacteria can overcome these combined pressures upon acquisition of a single transposon. This work substantially expands the distribution of Tn7-like transposons, defines their evolutionary relationships, and provides a large-scale functional classification of prokaryotic genes mobilized by transposition. IMPORTANCE Transposons are major vehicles of horizontal gene transfer that, in addition to genes directly involved in transposition, carry cargo genes. However, characterization of these genes is hampered by the difficulty of identification of transposon boundaries. We developed a computational approach for detecting transposon ends and applied it to perform a comprehensive census of the cargo genes of Tn7-like transposons, a large class of bacterial mobile genetic elements (MGE), many of which employ a unique, CRISPR-mediated mechanism of site-specific transposition. The cargo genes encompass a striking diversity of MGE, defense, and antibiotic resistance systems. Unexpectedly, we also identified cargo genes encoding metabolic enzymes. Thus, Tn7-like transposons mobilize a vast repertoire of genes that can have multiple effects on the host bacteria.


Subject(s)
Anti-Bacterial Agents/pharmacology , Bacteria/drug effects , Bacteria/genetics , Bacterial Proteins/genetics , DNA Transposable Elements , Drug Resistance, Bacterial , Bacteria/classification , Bacterial Proteins/metabolism , Phylogeny , Recombination, Genetic
6.
CRISPR J ; 4(5): 656-672, 2021 10.
Article in English | MEDLINE | ID: mdl-34582696

ABSTRACT

Type IV CRISPR-Cas are a distinct variety of highly derived CRISPR-Cas systems that appear to have evolved from type III systems through the loss of the target-cleaving nuclease and partial deterioration of the large subunit of the effector complex. All known type IV CRISPR-Cas systems are encoded on plasmids, integrative and conjugative elements (ICEs), or prophages, and are thought to contribute to competition between these elements, although the mechanistic details of their function remain unknown. There is a clear parallel between the compositions and likely origin of type IV and type I systems recruited by Tn7-like transposons and mediating RNA-guided transposition. We investigated the diversity and evolutionary relationships of type IV systems, with a focus on those in Acidithiobacillia, where this variety of CRISPR is particularly abundant and always found on ICEs. Our analysis revealed remarkable evolutionary plasticity of type IV CRISPR-Cas systems, with adaptation and ancillary genes originating from different ancestral CRISPR-Cas varieties, and extensive gene shuffling within the type IV loci. The adaptation module and the CRISPR array apparently were lost in the type IV ancestor but were subsequently recaptured by type IV systems on several independent occasions. We demonstrate a high level of heterogeneity among the repeats with type IV CRISPR arrays, which far exceed the heterogeneity of any other known CRISPR repeats and suggest a unique adaptation mechanism. The spacers in the type IV arrays, for which protospacers could be identified, match plasmid genes, in particular those encoding the conjugation apparatus components. Both the biochemical mechanism of type IV CRISPR-Cas function and their role in the competition among mobile genetic elements remain to be investigated.


Subject(s)
CRISPR-Cas Systems/genetics , Evolution, Molecular , Proteobacteria/genetics , Genes, Bacterial , Phylogeny , Polymorphism, Genetic , Proteobacteria/classification
8.
Science ; 372(6541)2021 04 30.
Article in English | MEDLINE | ID: mdl-33926924

ABSTRACT

CRISPR-Cas systems provide RNA-guided adaptive immunity in prokaryotes. We report that the multisubunit CRISPR effector Cascade transcriptionally regulates a toxin-antitoxin RNA pair, CreTA. CreT (Cascade-repressed toxin) is a bacteriostatic RNA that sequesters the rare arginine tRNAUCU (transfer RNA with anticodon UCU). CreA is a CRISPR RNA-resembling antitoxin RNA, which requires Cas6 for maturation. The partial complementarity between CreA and the creT promoter directs Cascade to repress toxin transcription. Thus, CreA becomes antitoxic only in the presence of Cascade. In CreTA-deleted cells, cascade genes become susceptible to disruption by transposable elements. We uncover several CreTA analogs associated with diverse archaeal and bacterial CRISPR-cas loci. Thus, toxin-antitoxin RNA pairs can safeguard CRISPR immunity by making cells addicted to CRISPR-Cas, which highlights the multifunctionality of Cas proteins and the intricate mechanisms of CRISPR-Cas regulation.


Subject(s)
CRISPR-Associated Proteins/physiology , CRISPR-Cas Systems/physiology , Haloarcula/physiology , RNA, Archaeal/physiology , Toxin-Antitoxin Systems/physiology , CRISPR-Associated Proteins/genetics , CRISPR-Cas Systems/genetics , DNA Mutational Analysis , Gene Expression Regulation, Archaeal , Haloarcula/genetics , Operon , RNA, Transfer, Arg/metabolism , Toxin-Antitoxin Systems/genetics
9.
Microbiome ; 9(1): 78, 2021 03 29.
Article in English | MEDLINE | ID: mdl-33781338

ABSTRACT

BACKGROUND: Double-stranded DNA bacteriophages (dsDNA phages) play pivotal roles in structuring human gut microbiomes; yet, the gut virome is far from being fully characterized, and additional groups of phages, including highly abundant ones, continue to be discovered by metagenome mining. A multilevel framework for taxonomic classification of viruses was recently adopted, facilitating the classification of phages into evolutionary informative taxonomic units based on hallmark genes. Together with advanced approaches for sequence assembly and powerful methods of sequence analysis, this revised framework offers the opportunity to discover and classify unknown phage taxa in the human gut. RESULTS: A search of human gut metagenomes for circular contigs encoding phage hallmark genes resulted in the identification of 3738 apparently complete phage genomes that represent 451 putative genera. Several of these phage genera are only distantly related to previously identified phages and are likely to found new families. Two of the candidate families, "Flandersviridae" and "Quimbyviridae", include some of the most common and abundant members of the human gut virome that infect Bacteroides, Parabacteroides, and Prevotella. The third proposed family, "Gratiaviridae," consists of less abundant phages that are distantly related to the families Autographiviridae, Drexlerviridae, and Chaseviridae. Analysis of CRISPR spacers indicates that phages of all three putative families infect bacteria of the phylum Bacteroidetes. Comparative genomic analysis of the three candidate phage families revealed features without precedent in phage genomes. Some "Quimbyviridae" phages possess Diversity-Generating Retroelements (DGRs) that generate hypervariable target genes nested within defense-related genes, whereas the previously known targets of phage-encoded DGRs are structural genes. Several "Flandersviridae" phages encode enzymes of the isoprenoid pathway, a lipid biosynthesis pathway that so far has not been known to be manipulated by phages. The "Gratiaviridae" phages encode a HipA-family protein kinase and glycosyltransferase, suggesting these phages modify the host cell wall, preventing superinfection by other phages. Hundreds of phages in these three and other families are shown to encode catalases and iron-sequestering enzymes that can be predicted to enhance cellular tolerance to reactive oxygen species. CONCLUSIONS: Analysis of phage genomes identified in whole-community human gut metagenomes resulted in the delineation of at least three new candidate families of Caudovirales and revealed diverse putative mechanisms underlying phage-host interactions in the human gut. Addition of these phylogenetically classified, diverse, and distinct phages to public databases will facilitate taxonomic decomposition and functional characterization of human gut viromes. Video abstract.


Subject(s)
Bacteriophages , Gastrointestinal Microbiome , Microbiota , Bacteria/genetics , Bacteriophages/genetics , Gastrointestinal Microbiome/genetics , Genome, Viral/genetics , Humans , Metagenome , Phylogeny
10.
Nucleic Acids Res ; 49(4): e20, 2021 02 26.
Article in English | MEDLINE | ID: mdl-33290505

ABSTRACT

CRISPR-Cas are adaptive immune systems that degrade foreign genetic elements in archaea and bacteria. In carrying out their immune functions, CRISPR-Cas systems heavily rely on RNA components. These CRISPR (cr) RNAs are repeat-spacer units that are produced by processing of pre-crRNA, the transcript of CRISPR arrays, and guide Cas protein(s) to the cognate invading nucleic acids, enabling their destruction. Several bioinformatics tools have been developed to detect CRISPR arrays based solely on DNA sequences, but all these tools employ the same strategy of looking for repetitive patterns, which might correspond to CRISPR array repeats. The identified patterns are evaluated using a fixed, built-in scoring function, and arrays exceeding a cut-off value are reported. Here, we instead introduce a data-driven approach that uses machine learning to detect and differentiate true CRISPR arrays from false ones based on several features. Our CRISPR detection tool, CRISPRidentify, performs three steps: detection, feature extraction and classification based on manually curated sets of positive and negative examples of CRISPR arrays. The identified CRISPR arrays are then reported to the user accompanied by detailed annotation. We demonstrate that our approach identifies not only previously detected CRISPR arrays, but also CRISPR array candidates not detected by other tools. Compared to other methods, our tool has a drastically reduced false positive rate. In contrast to the existing tools, our approach not only provides the user with the basic statistics on the identified CRISPR arrays but also produces a certainty score as a practical measure of the likelihood that a given genomic region is a CRISPR array.


Subject(s)
Clustered Regularly Interspaced Short Palindromic Repeats , Machine Learning , Software , Genome, Archaeal , Genome, Bacterial
11.
CRISPR J ; 3(6): 535-549, 2020 12.
Article in English | MEDLINE | ID: mdl-33346707

ABSTRACT

CRISPR-Cas systems typically consist of a CRISPR array and cas genes that are organized in one or more operons. However, a substantial fraction of CRISPR arrays are not adjacent to cas genes. Definitive identification of such isolated CRISPR arrays runs into the problem of false-positives, with unrelated types of repetitive sequences mimicking CRISPR. We developed a computational pipeline to eliminate false CRISPR predictions and found that up to 25% of the CRISPR arrays in complete bacterial and archaeal genomes are located away from cas genes. Most of the repeats in these isolated arrays are identical to repeats in cas-adjacent CRISPR arrays in the same or closely related genomes, indicating an evolutionary relationship between isolated arrays and arrays in typical CRISPR-cas loci. The spacers in isolated CRISPR arrays show nearly as many matches to viral genomes as spacers from complete CRISPR-cas loci, suggesting that the isolated arrays were either functionally active recently or continue to function. Reconstruction of evolutionary events in closely related bacterial genomes suggests three routes of evolution of isolated CRISPR arrays: (1) loss of cas genes in a CRISPR-cas locus, (2) de novo generation of arrays from off-target spacer integration into sequences resembling the corresponding repeats, and (3) transfer by mobile genetic elements. Both combination of de novo emerging arrays with cas genes and regain of cas genes by isolated arrays via recombination likely contribute to functional diversification in CRISPR-Cas evolution.


Subject(s)
Clustered Regularly Interspaced Short Palindromic Repeats/genetics , Computational Biology/methods , Gene Editing/methods , Bacteria/genetics , CRISPR-Associated Proteins/genetics , CRISPR-Cas Systems/genetics , Clustered Regularly Interspaced Short Palindromic Repeats/physiology , Genome, Archaeal/genetics , Genome, Bacterial/genetics , Genome, Viral/genetics , Genomics/methods , Phylogeny
12.
Nucleic Acids Res ; 48(21): 12297-12309, 2020 12 02.
Article in English | MEDLINE | ID: mdl-33152077

ABSTRACT

CRISPR-Cas defense systems opened up the field of genome editing due to the ease with which effector Cas nucleases can be programmed with guide RNAs to access desirable genomic sites. Type II-A SpCas9 from Streptococcus pyogenes was the first Cas9 nuclease used for genome editing and it remains the most popular enzyme of its class. Nevertheless, SpCas9 has some drawbacks including a relatively large size and restriction to targets flanked by an 'NGG' PAM sequence. The more compact Type II-C Cas9 orthologs can help to overcome the size limitation of SpCas9. Yet, only a few Type II-C nucleases were fully characterized to date. Here, we characterized two Cas9 II-C orthologs, DfCas9 from Defluviimonas sp.20V17 and PpCas9 from Pasteurella pneumotropica. Both DfCas9 and PpCas9 cleave DNA in vitro and have novel PAM requirements. Unlike DfCas9, the PpCas9 nuclease is active in human cells. This small nuclease requires an 'NNNNRTT' PAM orthogonal to that of SpCas9 and thus potentially can broaden the range of Cas9 applications in biomedicine and biotechnology.


Subject(s)
CRISPR-Associated Protein 9/genetics , CRISPR-Cas Systems , Clustered Regularly Interspaced Short Palindromic Repeats , Genome, Bacterial , Pasteurella pneumotropica/genetics , RNA, Guide, Kinetoplastida/genetics , Amino Acid Sequence , Base Sequence , CRISPR-Associated Protein 9/chemistry , CRISPR-Associated Protein 9/metabolism , Cloning, Molecular , Escherichia coli/genetics , Escherichia coli/metabolism , Gene Editing/methods , Gene Expression , Genetic Vectors/chemistry , Genetic Vectors/metabolism , HEK293 Cells , Humans , Nucleic Acid Conformation , Pasteurella pneumotropica/enzymology , RNA, Guide, Kinetoplastida/chemistry , RNA, Guide, Kinetoplastida/metabolism , Recombinant Proteins/chemistry , Recombinant Proteins/genetics , Recombinant Proteins/metabolism , Rhodobacteraceae/enzymology , Rhodobacteraceae/genetics , Sequence Alignment , Sequence Homology, Amino Acid
13.
Nat Commun ; 11(1): 3784, 2020 07 29.
Article in English | MEDLINE | ID: mdl-32728052

ABSTRACT

The CRISPR-Cas are adaptive bacterial and archaeal immunity systems that have been harnessed for the development of powerful genome editing and engineering tools. In the incessant host-parasite arms race, viruses evolved multiple anti-defense mechanisms including diverse anti-CRISPR proteins (Acrs) that specifically inhibit CRISPR-Cas and therefore have enormous potential for application as modulators of genome editing tools. Most Acrs are small and highly variable proteins which makes their bioinformatic prediction a formidable task. We present a machine-learning approach for comprehensive Acr prediction. The model shows high predictive power when tested against an unseen test set and was employed to predict 2,500 candidate Acr families. Experimental validation of top candidates revealed two unknown Acrs (AcrIC9, IC10) and three other top candidates were coincidentally identified and found to possess anti-CRISPR activity. These results substantially expand the repertoire of predicted Acrs and provide a resource for experimental Acr discovery.


Subject(s)
Bacteriophages/genetics , CRISPR-Associated Protein 9/antagonists & inhibitors , Machine Learning , Sequence Analysis, Protein/methods , Viral Proteins/genetics , Archaea/genetics , Archaea/virology , Bacteria/genetics , Bacteria/virology , CRISPR-Associated Protein 9/genetics , CRISPR-Cas Systems/genetics , Computational Biology/methods , Datasets as Topic , Gene Editing/methods , Host-Parasite Interactions/genetics , Sequence Homology, Amino Acid
14.
Commun Biol ; 3(1): 321, 2020 06 22.
Article in English | MEDLINE | ID: mdl-32572116

ABSTRACT

CRISPR arrays contain spacers, some of which are homologous to genome segments of viruses and other parasitic genetic elements and are employed as portion of guide RNAs to recognize and specifically inactivate the target genomes. However, the fraction of the spacers in sequenced CRISPR arrays that reliably match protospacer sequences in genomic databases is small, leaving the question of the origin(s) open for the great majority of the spacers. Here, we extend the spacer analysis by examining the distribution of partial matches (matching k-mers) between spacers and genomes of viruses infecting the given host as well as the host genomes themselves. The results indicate that most of the spacers originate from the host-specific viromes, whereas self-targeting is strongly selected against. However, we present evidence that the vast majority of the viruses comprising the viromes currently remain unknown although they are likely to be related to identified viruses.


Subject(s)
Clustered Regularly Interspaced Short Palindromic Repeats , Prokaryotic Cells/virology , Virome/genetics , Adaptation, Biological/genetics , Bacteria/genetics , Bacteria/virology , Escherichia coli/genetics , Escherichia coli/virology , Genome , Host-Pathogen Interactions/genetics , Proviruses/genetics
16.
Nucleic Acids Res ; 48(4): 2026-2034, 2020 02 28.
Article in English | MEDLINE | ID: mdl-31943070

ABSTRACT

Type II CRISPR-Cas9 RNA-guided nucleases are widely used for genome engineering. Type II-A SpCas9 protein from Streptococcus pyogenes is the most investigated and highly used enzyme of its class. Nevertheless, it has some drawbacks, including a relatively big size, imperfect specificity and restriction to DNA targets flanked by an NGG PAM sequence. Cas9 orthologs from other bacterial species may provide a rich and largely untapped source of biochemical diversity, which can help to overcome the limitations of SpCas9. Here, we characterize CcCas9, a Type II-C CRISPR nuclease from Clostridium cellulolyticum H10. We show that CcCas9 is an active endonuclease of comparatively small size that recognizes a novel two-nucleotide PAM sequence. The CcCas9 can potentially broaden the existing scope of biotechnological applications of Cas9 nucleases and may be particularly advantageous for genome editing of C. cellulolyticum H10, a bacterium considered to be a promising biofuel producer.


Subject(s)
CRISPR-Associated Protein 9/chemistry , CRISPR-Cas Systems/genetics , Clostridium cellulolyticum/enzymology , DNA/chemistry , CRISPR-Associated Protein 9/genetics , Crystallography, X-Ray , DNA/genetics , Gene Editing , Mutation , Nucleotide Motifs/genetics , RNA, Guide, Kinetoplastida/genetics , Streptococcus pyogenes/enzymology , Substrate Specificity
17.
CRISPR J ; 3(3): 156-163, 2020 06.
Article in English | MEDLINE | ID: mdl-33555973

ABSTRACT

The principal function of archaeal and bacterial CRISPR-Cas systems is antivirus adaptive immunity. However, recent genome analyses identified a variety of derived CRISPR-Cas variants at least some of which appear to perform different functions. Here, we describe a unique repertoire of CRISPR-Cas-related systems that we discovered by searching archaeal metagenome-assemble genomes of the Asgard superphylum. Several of these variants contain extremely diverged homologs of Cas1, the integrase involved in CRISPR adaptation as well as casposon transposition. Strikingly, the diversity of Cas1 in Asgard archaea alone is greater than that detected so far among the rest of archaea and bacteria. The Asgard CRISPR-Cas derivatives also encode distinct forms of Cas4, Cas5, and Cas7 proteins, and/or additional nucleases. Some of these systems are predicted to perform defense functions, but possibly not programmable ones, whereas others are likely to represent previously unknown mobile genetic elements.


Subject(s)
Archaea/genetics , CRISPR-Cas Systems , Clustered Regularly Interspaced Short Palindromic Repeats , Archaea/classification , Archaea/metabolism , Archaeal Proteins/genetics , Archaeal Proteins/metabolism , Endonucleases/genetics , Genome, Archaeal , Metagenome , Phylogeny
18.
Nat Rev Microbiol ; 18(2): 67-83, 2020 02.
Article in English | MEDLINE | ID: mdl-31857715

ABSTRACT

The number and diversity of known CRISPR-Cas systems have substantially increased in recent years. Here, we provide an updated evolutionary classification of CRISPR-Cas systems and cas genes, with an emphasis on the major developments that have occurred since the publication of the latest classification, in 2015. The new classification includes 2 classes, 6 types and 33 subtypes, compared with 5 types and 16 subtypes in 2015. A key development is the ongoing discovery of multiple, novel class 2 CRISPR-Cas systems, which now include 3 types and 17 subtypes. A second major novelty is the discovery of numerous derived CRISPR-Cas variants, often associated with mobile genetic elements that lack the nucleases required for interference. Some of these variants are involved in RNA-guided transposition, whereas others are predicted to perform functions distinct from adaptive immunity that remain to be characterized experimentally. The third highlight is the discovery of numerous families of ancillary CRISPR-linked genes, often implicated in signal transduction. Together, these findings substantially clarify the functional diversity and evolutionary history of CRISPR-Cas.


Subject(s)
Archaea/genetics , Bacteria/genetics , CRISPR-Cas Systems/genetics , Evolution, Molecular , Gene Expression Regulation, Archaeal/physiology , Gene Expression Regulation, Bacterial/physiology , CRISPR-Cas Systems/physiology
19.
Nat Protoc ; 14(10): 3013-3031, 2019 10.
Article in English | MEDLINE | ID: mdl-31520072

ABSTRACT

Functionally linked genes in bacterial and archaeal genomes are often organized into operons. However, the composition and architecture of operons are highly variable and frequently differ even among closely related genomes. Therefore, to efficiently extract reliable functional predictions for uncharacterized genes from comparative analyses of the rapidly growing genomic databases, dedicated computational approaches are required. We developed a protocol to systematically and automatically identify genes that are likely to be functionally associated with a 'bait' gene or locus by using relevance metrics. Given a set of bait loci and a genomic database defined by the user, this protocol compares the genomic neighborhoods of the baits to identify genes that are likely to be functionally linked to the baits by calculating the abundance of a given gene within and outside the bait neighborhoods and the distance to the bait. We exemplify the performance of the protocol with three test cases, namely, genes linked to CRISPR-Cas systems using the 'CRISPRicity' metric, genes associated with archaeal proviruses and genes linked to Argonaute genes in halobacteria. The protocol can be run by users with basic computational skills. The computational cost depends on the sizes of the genomic dataset and the list of reference loci and can vary from one CPU-hour to hundreds of hours on a supercomputer.


Subject(s)
Computational Biology/methods , Genes, Archaeal , Genes, Bacterial , Genomics/methods , CRISPR-Cas Systems , Genome, Archaeal , Genome, Bacterial , Molecular Sequence Annotation/methods , Open Reading Frames , Operon
20.
Nat Rev Microbiol ; 17(8): 513-525, 2019 08.
Article in English | MEDLINE | ID: mdl-31165781

ABSTRACT

The principal function of CRISPR-Cas systems in archaea and bacteria is defence against mobile genetic elements (MGEs), including viruses, plasmids and transposons. However, the relationships between CRISPR-Cas and MGEs are far more complex. Several classes of MGE contributed to the origin and evolution of CRISPR-Cas, and, conversely, CRISPR-Cas systems and their components were recruited by various MGEs for functions that remain largely uncharacterized. In this Analysis article, we investigate and substantially expand the range of CRISPR-Cas components carried by MGEs. Three groups of Tn7-like transposable elements encode 'minimal' type I CRISPR-Cas derivatives capable of target recognition but not cleavage, and another group encodes an inactivated type V variant. These partially inactivated CRISPR-Cas variants might mediate guide RNA-dependent integration of the respective transposons. Numerous plasmids and some prophages encode type IV systems, with similar predicted properties, that appear to contribute to competition among plasmids and between plasmids and viruses. Many prokaryotic viruses also carry CRISPR mini-arrays, some of which recognize other viruses and are implicated in inter-virus conflicts, and solitary repeat units, which could inhibit host CRISPR-Cas systems.


Subject(s)
CRISPR-Cas Systems , Evolution, Molecular , Gene Transfer, Horizontal , Interspersed Repetitive Sequences , Recombination, Genetic , Archaea/genetics , Bacteria/genetics , Bacteriophages/genetics , DNA Transposable Elements , Plasmids
SELECTION OF CITATIONS
SEARCH DETAIL
...