Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 45
Filter
1.
Nucleic Acids Res ; 52(D1): D586-D589, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-37904617

ABSTRACT

Many microorganisms produce natural products that are frequently used in the development of medicines and crop protection agents. Genome mining has evolved into a prominent method to access this potential. antiSMASH is the most popular tool for this task. Here we present version 4 of the antiSMASH database, providing biosynthetic gene clusters detected by antiSMASH 7.1 in publicly available, dereplicated, high-quality microbial genomes via an interactive graphical user interface. In version 4, the database contains 231 534 high quality BGC regions from 592 archaeal, 35 726 bacterial and 236 fungal genomes and is available at https://antismash-db.secondarymetabolites.org/.


Subject(s)
Biological Products , Biosynthetic Pathways , Databases, Genetic , Genome, Microbial , Biosynthetic Pathways/genetics , Multigene Family , Software
2.
Nucleic Acids Res ; 2024 Jun 22.
Article in English | MEDLINE | ID: mdl-38908028

ABSTRACT

Filamentous Actinobacteria, recently renamed Actinomycetia, are the most prolific source of microbial bioactive natural products. Studies on biosynthetic gene clusters benefit from or require chromosome-level assemblies. Here, we provide DNA sequences from >1000 isolates: 881 complete genomes and 153 near-complete genomes, representing 28 genera and 389 species, including 244 likely novel species. All genomes are from filamentous isolates of the class Actinomycetia from the NBC culture collection. The largest genus is Streptomyces with 886 genomes including 742 complete assemblies. We use this data to show that analysis of complete genomes can bring biological understanding not previously derived from more fragmented sequences or less systematic datasets. We document the central and structured location of core genes and distal location of specialized metabolite biosynthetic gene clusters and duplicate core genes on the linear Streptomyces chromosome, and analyze the content and length of the terminal inverted repeats which are characteristic for Streptomyces. We then analyze the diversity of trans-AT polyketide synthase biosynthetic gene clusters, which encodes the machinery of a biotechnologically highly interesting compound class. These insights have both ecological and biotechnological implications in understanding the importance of high quality genomic resources and the complex role synteny plays in Actinomycetia biology.

3.
Nucleic Acids Res ; 51(W1): W46-W50, 2023 07 05.
Article in English | MEDLINE | ID: mdl-37140036

ABSTRACT

Microorganisms produce small bioactive compounds as part of their secondary or specialised metabolism. Often, such metabolites have antimicrobial, anticancer, antifungal, antiviral or other bio-activities and thus play an important role for applications in medicine and agriculture. In the past decade, genome mining has become a widely-used method to explore, access, and analyse the available biodiversity of these compounds. Since 2011, the 'antibiotics and secondary metabolite analysis shell-antiSMASH' (https://antismash.secondarymetabolites.org/) has supported researchers in their microbial genome mining tasks, both as a free to use web server and as a standalone tool under an OSI-approved open source licence. It is currently the most widely used tool for detecting and characterising biosynthetic gene clusters (BGCs) in archaea, bacteria, and fungi. Here, we present the updated version 7 of antiSMASH. antiSMASH 7 increases the number of supported cluster types from 71 to 81, as well as containing improvements in the areas of chemical structure prediction, enzymatic assembly-line visualisation and gene cluster regulation.


Subject(s)
Computers , Software , Bacteria/genetics , Bacteria/metabolism , Archaea/genetics , Genome, Microbial , Multigene Family , Secondary Metabolism/genetics
4.
Nucleic Acids Res ; 51(D1): D603-D610, 2023 01 06.
Article in English | MEDLINE | ID: mdl-36399496

ABSTRACT

With an ever-increasing amount of (meta)genomic data being deposited in sequence databases, (meta)genome mining for natural product biosynthetic pathways occupies a critical role in the discovery of novel pharmaceutical drugs, crop protection agents and biomaterials. The genes that encode these pathways are often organised into biosynthetic gene clusters (BGCs). In 2015, we defined the Minimum Information about a Biosynthetic Gene cluster (MIBiG): a standardised data format that describes the minimally required information to uniquely characterise a BGC. We simultaneously constructed an accompanying online database of BGCs, which has since been widely used by the community as a reference dataset for BGCs and was expanded to 2021 entries in 2019 (MIBiG 2.0). Here, we describe MIBiG 3.0, a database update comprising large-scale validation and re-annotation of existing entries and 661 new entries. Particular attention was paid to the annotation of compound structures and biological activities, as well as protein domain selectivities. Together, these new features keep the database up-to-date, and will provide new opportunities for the scientific community to use its freely available data, e.g. for the training of new machine learning models to predict sequence-structure-function relationships for diverse natural products. MIBiG 3.0 is accessible online at https://mibig.secondarymetabolites.org/.


Subject(s)
Genome , Genomics , Multigene Family , Biosynthetic Pathways/genetics
5.
Nucleic Acids Res ; 50(D1): D736-D740, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34718689

ABSTRACT

As a result of the continuous evolution of drug resistant bacteria, new antibiotics are urgently needed. Encoded by biosynthetic gene clusters (BGCs), antibiotic compounds are mostly produced by bacteria. With the exponential increase in the number of publicly available, sequenced genomes and the advancements of BGC prediction tools, genome mining algorithms have uncovered millions of uncharacterized BGCs for further evaluation. Since compound identification and characterization remain bottlenecks, a major challenge is prioritizing promising BGCs. Recently, researchers adopted self-resistance based strategies allowing them to predict the biological activities of natural products encoded by uncharacterized BGCs. Since 2017, the Antibiotic Resistant Target Seeker (ARTS) facilitated this so-called target-directed genome mining (TDGM) approach for the prioritization of BGCs encoding potentially novel antibiotics. Here, we present the ARTS database, available at https://arts-db.ziemertlab.com/. The ARTS database provides pre-computed ARTS results for >70,000 genomes and metagenome assembled genomes in total. Advanced search queries allow users to rapidly explore the fundamental criteria of TDGM such as BGC proximity, duplication and horizontal gene transfers of essential housekeeping genes. Furthermore, the ARTS database provides results interconnected throughout the bacterial kingdom as well as links to known databases in natural product research.


Subject(s)
Databases, Factual , Drug Resistance, Bacterial/genetics , Metagenome/genetics , Software , Anti-Bacterial Agents , Bacteria/drug effects , Bacteria/genetics , Biosynthetic Pathways/drug effects , Biosynthetic Pathways/genetics , Gene Transfer, Horizontal/genetics , Genome, Bacterial
6.
Nucleic Acids Res ; 49(D1): D639-D643, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33152079

ABSTRACT

Microorganisms produce natural products that are frequently used in the development of antibacterial, antiviral, and anticancer drugs, pesticides, herbicides, or fungicides. In recent years, genome mining has evolved into a prominent method to access this potential. antiSMASH is one of the most popular tools for this task. Here, we present version 3 of the antiSMASH database, providing a means to access and query precomputed antiSMASH-5.2-detected biosynthetic gene clusters from representative, publicly available, high-quality microbial genomes via an interactive graphical user interface. In version 3, the database contains 147 517 high quality BGC regions from 388 archaeal, 25 236 bacterial and 177 fungal genomes and is available at https://antismash-db.secondarymetabolites.org/.


Subject(s)
Data Mining , Databases as Topic , Enzymes/classification , Biosynthetic Pathways/genetics , Multigene Family , Search Engine
7.
Nucleic Acids Res ; 49(D1): D490-D497, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33010170

ABSTRACT

Computational analysis of biosynthetic gene clusters (BGCs) has revolutionized natural product discovery by enabling the rapid investigation of secondary metabolic potential within microbial genome sequences. Grouping homologous BGCs into Gene Cluster Families (GCFs) facilitates mapping their architectural and taxonomic diversity and provides insights into the novelty of putative BGCs, through dereplication with BGCs of known function. While multiple databases exist for exploring BGCs from publicly available data, no public resources exist that focus on GCF relationships. Here, we present BiG-FAM, a database of 29,955 GCFs capturing the global diversity of 1,225,071 BGCs predicted from 209,206 publicly available microbial genomes and metagenome-assembled genomes (MAGs). The database offers rich functionalities, such as multi-criterion GCF searches, direct links to BGC databases such as antiSMASH-DB, and rapid GCF annotation of user-supplied BGCs from antiSMASH results. BiG-FAM can be accessed online at https://bigfam.bioinformatics.nl.


Subject(s)
Biosynthetic Pathways/genetics , Databases, Genetic , Multigene Family , Clostridium/genetics , Search Engine , Streptomyces/genetics
8.
Nucleic Acids Res ; 49(W1): W29-W35, 2021 07 02.
Article in English | MEDLINE | ID: mdl-33978755

ABSTRACT

Many microorganisms produce natural products that form the basis of antimicrobials, antivirals, and other drugs. Genome mining is routinely used to complement screening-based workflows to discover novel natural products. Since 2011, the "antibiotics and secondary metabolite analysis shell-antiSMASH" (https://antismash.secondarymetabolites.org/) has supported researchers in their microbial genome mining tasks, both as a free-to-use web server and as a standalone tool under an OSI-approved open-source license. It is currently the most widely used tool for detecting and characterising biosynthetic gene clusters (BGCs) in bacteria and fungi. Here, we present the updated version 6 of antiSMASH. antiSMASH 6 increases the number of supported cluster types from 58 to 71, displays the modular structure of multi-modular BGCs, adds a new BGC comparison algorithm, allows for the integration of results from other prediction tools, and more effectively detects tailoring enzymes in RiPP clusters.


Subject(s)
Biological Products/metabolism , Genome, Microbial , Software , Bacteria/genetics , Biosynthetic Pathways/genetics , Fungi/genetics , Secondary Metabolism/genetics
9.
Nucleic Acids Res ; 48(W1): W546-W552, 2020 07 02.
Article in English | MEDLINE | ID: mdl-32427317

ABSTRACT

Multi-drug resistant pathogens have become a major threat to human health and new antibiotics are urgently needed. Most antibiotics are derived from secondary metabolites produced by bacteria. In order to avoid suicide, these bacteria usually encode resistance genes, in some cases within the biosynthetic gene cluster (BGC) of the respective antibiotic compound. Modern genome mining tools enable researchers to computationally detect and predict BGCs that encode the biosynthesis of secondary metabolites. The major challenge now is the prioritization of the most promising BGCs encoding antibiotics with novel modes of action. A recently developed target-directed genome mining approach allows researchers to predict the mode of action of the encoded compound of an uncharacterized BGC based on the presence of resistant target genes. In 2017, we introduced the 'Antibiotic Resistant Target Seeker' (ARTS). ARTS allows for specific and efficient genome mining for antibiotics with interesting and novel targets by rapidly linking housekeeping and known resistance genes to BGC proximity, duplication and horizontal gene transfer (HGT) events. Here, we present ARTS 2.0 available at http://arts.ziemertlab.com. ARTS 2.0 now includes options for automated target directed genome mining in all bacterial taxa as well as metagenomic data. Furthermore, it enables comparison of similar BGCs from different genomes and their putative resistance genes.


Subject(s)
Drug Resistance, Bacterial/genetics , Genome, Bacterial , Software , Biosynthetic Pathways/genetics , Data Mining , Genes, Bacterial , Metagenomics
10.
Nucleic Acids Res ; 48(D1): D454-D458, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31612915

ABSTRACT

Fueled by the explosion of (meta)genomic data, genome mining of specialized metabolites has become a major technology for drug discovery and studying microbiome ecology. In these efforts, computational tools like antiSMASH have played a central role through the analysis of Biosynthetic Gene Clusters (BGCs). Thousands of candidate BGCs from microbial genomes have been identified and stored in public databases. Interpreting the function and novelty of these predicted BGCs requires comparison with a well-documented set of BGCs of known function. The MIBiG (Minimum Information about a Biosynthetic Gene Cluster) Data Standard and Repository was established in 2015 to enable curation and storage of known BGCs. Here, we present MIBiG 2.0, which encompasses major updates to the schema, the data, and the online repository itself. Over the past five years, 851 new BGCs have been added. Additionally, we performed extensive manual data curation of all entries to improve the annotation quality of our repository. We also redesigned the data schema to ensure the compliance of future annotations. Finally, we improved the user experience by adding new features such as query searches and a statistics page, and enabled direct link-outs to chemical structure databases. The repository is accessible online at https://mibig.secondarymetabolites.org/.


Subject(s)
Databases, Genetic , Genome, Bacterial , Genomics/methods , Multigene Family , Software , Biosynthetic Pathways/genetics , Molecular Sequence Annotation
11.
Proc Natl Acad Sci U S A ; 116(41): 20366-20375, 2019 10 08.
Article in English | MEDLINE | ID: mdl-31548381

ABSTRACT

Streptomycetes serve as major producers of various pharmacologically and industrially important natural products. Although CRISPR-Cas9 systems have been developed for more robust genetic manipulations, concerns of genome instability caused by the DNA double-strand breaks (DSBs) and the toxicity of Cas9 remain. To overcome these limitations, here we report development of the DSB-free, single-nucleotide-resolution genome editing system CRISPR-BEST (CRISPR-Base Editing SysTem), which comprises a cytidine (CRISPR-cBEST) and an adenosine (CRISPR-aBEST) deaminase-based base editor. Specifically targeted by an sgRNA, CRISPR-cBEST can efficiently convert a C:G base pair to a T:A base pair and CRISPR-aBEST can convert an A:T base pair to a G:C base pair within a window of approximately 7 and 6 nucleotides, respectively. CRISPR-BEST was validated and successfully used in different Streptomyces species. Particularly in nonmodel actinomycete Streptomyces collinus Tü365, CRISPR-cBEST efficiently inactivated the 2 copies of kirN gene that are in the duplicated kirromycin biosynthetic pathways simultaneously by STOP codon introduction. Generating such a knockout mutant repeatedly failed using the conventional DSB-based CRISPR-Cas9. An unbiased, genome-wide off-target evaluation indicates the high fidelity and applicability of CRISPR-BEST. Furthermore, the system supports multiplexed editing with a single plasmid by providing a Csy4-based sgRNA processing machinery. To simplify the protospacer identification process, we also updated the CRISPy-web (https://crispy.secondarymetabolites.org), and now it allows designing sgRNAs specifically for CRISPR-BEST applications.


Subject(s)
CRISPR-Cas Systems , Gene Editing , Streptomyces coelicolor/genetics , DNA, Bacterial/genetics , Gene Expression Regulation, Bacterial , Genome, Bacterial , Genome-Wide Association Study , Plasmids
12.
Brief Bioinform ; 20(4): 1103-1113, 2019 07 19.
Article in English | MEDLINE | ID: mdl-29112695

ABSTRACT

Many drugs are derived from small molecules produced by microorganisms and plants, so-called natural products. Natural products have diverse chemical structures, but the biosynthetic pathways producing those compounds are often organized as biosynthetic gene clusters (BGCs) and follow a highly conserved biosynthetic logic. This allows for the identification of core biosynthetic enzymes using genome mining strategies that are based on the sequence similarity of the involved enzymes/genes. However, mining for a variety of BGCs quickly approaches a complexity level where manual analyses are no longer possible and require the use of automated genome mining pipelines, such as the antiSMASH software. In this review, we discuss the principles underlying the predictions of antiSMASH and other tools and provide practical advice for their application. Furthermore, we discuss important caveats such as rule-based BGC detection, sequence and annotation quality and cluster boundary prediction, which all have to be considered while planning for, performing and analyzing the results of genome mining studies.


Subject(s)
Biosynthetic Pathways/genetics , Multigene Family , Software , Biological Products/metabolism , Computational Biology/methods , Data Mining/methods , Databases, Genetic , Genome, Microbial , Genome, Plant , Internet , Models, Biological
13.
Nucleic Acids Res ; 47(D1): D625-D630, 2019 01 08.
Article in English | MEDLINE | ID: mdl-30395294

ABSTRACT

Natural products originating from microorganisms are frequently used in antimicrobial and anticancer drugs, pesticides, herbicides or fungicides. In the last years, the increasing availability of microbial genome data has made it possible to access the wealth of biosynthetic clusters responsible for the production of these compounds by genome mining. antiSMASH is one of the most popular tools in this field. The antiSMASH database provides pre-computed antiSMASH results for many publicly available microbial genomes and allows for advanced cross-genome searches. The current version 2 of the antiSMASH database contains annotations for 6200 full bacterial genomes and 18,576 bacterial draft genomes and is available at https://antismash-db.secondarymetabolites.org/.


Subject(s)
Databases, Genetic , Genome, Bacterial , Molecular Sequence Annotation , Secondary Metabolism/genetics , Multigene Family , Software
14.
Nucleic Acids Res ; 47(W1): W81-W87, 2019 07 02.
Article in English | MEDLINE | ID: mdl-31032519

ABSTRACT

Secondary metabolites produced by bacteria and fungi are an important source of antimicrobials and other bioactive compounds. In recent years, genome mining has seen broad applications in identifying and characterizing new compounds as well as in metabolic engineering. Since 2011, the 'antibiotics and secondary metabolite analysis shell-antiSMASH' (https://antismash.secondarymetabolites.org) has assisted researchers in this, both as a web server and a standalone tool. It has established itself as the most widely used tool for identifying and analysing biosynthetic gene clusters (BGCs) in bacterial and fungal genome sequences. Here, we present an entirely redesigned and extended version 5 of antiSMASH. antiSMASH 5 adds detection rules for clusters encoding the biosynthesis of acyl-amino acids, ß-lactones, fungal RiPPs, RaS-RiPPs, polybrominated diphenyl ethers, C-nucleosides, PPY-like ketones and lipolanthines. For type II polyketide synthase-encoding gene clusters, antiSMASH 5 now offers more detailed predictions. The HTML output visualization has been redesigned to improve the navigation and visual representation of annotations. We have again improved the runtime of analysis steps, making it possible to deliver comprehensive annotations for bacterial genomes within a few minutes. A new output file in the standard JavaScript object notation (JSON) format is aimed at downstream tools that process antiSMASH results programmatically.


Subject(s)
Genome, Bacterial/genetics , Genome, Fungal/genetics , Genomics , Software , Bacteria/genetics , Biosynthetic Pathways/genetics , Computational Biology , Data Mining , Fungi/genetics , Internet
15.
Int J Mol Sci ; 22(14)2021 Jul 15.
Article in English | MEDLINE | ID: mdl-34299187

ABSTRACT

By culturing microorganisms under standard laboratory conditions, most biosynthetic gene clusters (BGCs) are not expressed, and thus, the products are not produced. To explore this biosynthetic potential, we developed a novel "semi-targeted" approach focusing on activating "silent" BGCs by concurrently introducing a group of regulator genes into streptomycetes of the Tübingen strain collection. We constructed integrative plasmids containing two classes of regulatory genes under the control of the constitutive promoter ermE*p (cluster situated regulators (CSR) and Streptomyces antibiotic regulatory proteins (SARPs)). These plasmids were introduced into Streptomyces sp. TÜ17, Streptomyces sp. TÜ10 and Streptomyces sp. TÜ102. Introduction of the CSRs-plasmid into strain S. sp. TÜ17 activated the production of mayamycin A. By using the individual regulator genes, we proved that Aur1P, was responsible for the activation. In strain S. sp. TÜ102, the introduction of the SARP-plasmid triggered the production of a chartreusin-like compound. Insertion of the CSRs-plasmid into strain S. sp. TÜ10 resulted in activating the warkmycin-BGC. In both recombinants, activation of the BGCs was only possible through the simultaneous expression of aur1PR3 and griR in S. sp. TÜ102 and aur1P and pntR in of S. sp. TÜ10.


Subject(s)
Bacterial Proteins/genetics , Benz(a)Anthracenes/metabolism , Multigene Family , Recombinant Proteins/genetics , Streptomyces/genetics , Bacterial Proteins/metabolism , Benzopyrans , Gene Expression Regulation, Bacterial , Glycosides/biosynthesis , Promoter Regions, Genetic , Recombinant Proteins/metabolism , Streptomyces/growth & development , Streptomyces/metabolism , Transcription Factors/metabolism , Trisaccharides/biosynthesis
16.
Nucleic Acids Res ; 46(W1): W205-W208, 2018 07 02.
Article in English | MEDLINE | ID: mdl-29722870

ABSTRACT

Patterns in biological sequences frequently signify interesting features in the underlying molecule. Many tools exist to search for well-known patterns. Less support is available for exploratory analysis, where no well-defined patterns are known yet. PatScanUI (https://patscan.secondarymetabolites.org/) provides a highly interactive web interface to the powerful generic pattern search tool PatScan. The complex PatScan-patterns are created in a drag-and-drop aware interface allowing researchers to do rapid prototyping of the often complicated patterns useful to identifying features of interest.


Subject(s)
Sequence Analysis, DNA/methods , Sequence Analysis, Protein/methods , Software , Bacteria/genetics , Internet , Iron , Response Elements , User-Computer Interface
17.
Nucleic Acids Res ; 45(W1): W55-W63, 2017 07 03.
Article in English | MEDLINE | ID: mdl-28453650

ABSTRACT

Plant specialized metabolites are chemically highly diverse, play key roles in host-microbe interactions, have important nutritional value in crops and are frequently applied as medicines. It has recently become clear that plant biosynthetic pathway-encoding genes are sometimes densely clustered in specific genomic loci: biosynthetic gene clusters (BGCs). Here, we introduce plantiSMASH, a versatile online analysis platform that automates the identification of candidate plant BGCs. Moreover, it allows integration of transcriptomic data to prioritize candidate BGCs based on the coexpression patterns of predicted biosynthetic enzyme-coding genes, and facilitates comparative genomic analysis to study the evolutionary conservation of each cluster. Applied on 48 high-quality plant genomes, plantiSMASH identifies a rich diversity of candidate plant BGCs. These results will guide further experimental exploration of the nature and dynamics of gene clustering in plant metabolism. Moreover, spurred by the continuing decrease in costs of plant genome sequencing, they will allow genome mining technologies to be applied to plant natural product discovery. The plantiSMASH web server, precalculated results and source code are freely available from http://plantismash.secondarymetabolites.org.


Subject(s)
Genes, Plant , Genome, Plant , Software , Biosynthetic Pathways/genetics , Enzymes/genetics , Gene Expression Profiling , Genomics , Internet , Molecular Sequence Annotation , Plants/genetics , Plants/metabolism , Transcriptome
18.
Nucleic Acids Res ; 45(D1): D555-D559, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27924032

ABSTRACT

Secondary metabolites produced by microorganisms are the main source of bioactive compounds that are in use as antimicrobial and anticancer drugs, fungicides, herbicides and pesticides. In the last decade, the increasing availability of microbial genomes has established genome mining as a very important method for the identification of their biosynthetic gene clusters (BGCs). One of the most popular tools for this task is antiSMASH. However, so far, antiSMASH is limited to de novo computing results for user-submitted genomes and only partially connects these with BGCs from other organisms. Therefore, we developed the antiSMASH database, a simple but highly useful new resource to browse antiSMASH-annotated BGCs in the currently 3907 bacterial genomes in the database and perform advanced search queries combining multiple search criteria. antiSMASH-DB is available at http://antismash-db.secondarymetabolites.org/.


Subject(s)
Biosynthetic Pathways , Databases, Factual , Microbiology , Secondary Metabolism , Biosynthetic Pathways/genetics , Computational Biology/methods , Gene Expression Regulation , Protein Processing, Post-Translational , Secondary Metabolism/genetics , Web Browser
19.
Nucleic Acids Res ; 45(W1): W42-W48, 2017 07 03.
Article in English | MEDLINE | ID: mdl-28472505

ABSTRACT

With the rise of multi-drug resistant pathogens and the decline in number of potential new antibiotics in development there is a fervent need to reinvigorate the natural products discovery pipeline. Most antibiotics are derived from secondary metabolites produced by microorganisms and plants. To avoid suicide, an antibiotic producer harbors resistance genes often found within the same biosynthetic gene cluster (BGC) responsible for manufacturing the antibiotic. Existing mining tools are excellent at detecting BGCs or resistant genes in general, but provide little help in prioritizing and identifying gene clusters for compounds active against specific and novel targets. Here we introduce the 'Antibiotic Resistant Target Seeker' (ARTS) available at https://arts.ziemertlab.com. ARTS allows for specific and efficient genome mining for antibiotics with interesting and novel targets. The aim of this web server is to automate the screening of large amounts of sequence data and to focus on the most promising strains that produce antibiotics with new modes of action. ARTS integrates target directed genome mining methods, antibiotic gene cluster predictions and 'essential gene screening' to provide an interactive page for rapid identification of known and putative targets in BGCs.


Subject(s)
Anti-Bacterial Agents/biosynthesis , Drug Resistance, Bacterial/genetics , Software , Actinobacteria/genetics , Biosynthetic Pathways/genetics , Data Mining , Drug Discovery , Genome, Bacterial , Internet
20.
Nucleic Acids Res ; 45(W1): W36-W41, 2017 07 03.
Article in English | MEDLINE | ID: mdl-28460038

ABSTRACT

Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding the production of such compounds. Since 2011, the 'antibiotics and secondary metabolite analysis shell-antiSMASH' has assisted researchers in efficiently performing this, both as a web server and a standalone tool. Here, we present the thoroughly updated antiSMASH version 4, which adds several novel features, including prediction of gene cluster boundaries using the ClusterFinder method or the newly integrated CASSIS algorithm, improved substrate specificity prediction for non-ribosomal peptide synthetase adenylation domains based on the new SANDPUMA algorithm, improved predictions for terpene and ribosomally synthesized and post-translationally modified peptides cluster products, reporting of sequence similarity to proteins encoded in experimentally characterized gene clusters on a per-protein basis and a domain-level alignment tool for comparative analysis of trans-AT polyketide synthase assembly line architectures. Additionally, several usability features have been updated and improved. Together, these improvements make antiSMASH up-to-date with the latest developments in natural product research and will further facilitate computational genome mining for the discovery of novel bioactive molecules.


Subject(s)
Secondary Metabolism/genetics , Software , Algorithms , Anti-Bacterial Agents/biosynthesis , Biological Products/metabolism , Biosynthetic Pathways/genetics , Codon , Genes , Internet , Peptide Synthases/metabolism , Peptides/chemistry , Peptides/metabolism , Polyketide Synthases/chemistry , Protein Domains , Protein Processing, Post-Translational , Terpenes/chemistry
SELECTION OF CITATIONS
SEARCH DETAIL