Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 145
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Cheminform ; 16(1): 58, 2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38783386

RESUMO

Effective visualization of small molecules is paramount in conveying concepts and results in cheminformatics. Scalable vector graphics (SVG) are preferred for creating such visualizations, as SVGs can be easily altered in post-production and exported to other formats. A wide spectrum of software applications already exist that can visualize molecules, and customize these visualizations, in many ways. However, software packages that can output projected 3D models onto a 2D canvas directly as SVG, while being programmatically accessible from Python, are lacking. Here, we introduce CineMol, which can draw vectorized approximations of three-dimensional small molecule models in seconds, without triangulation or ray tracing, resulting in files of around 50-300 kilobytes per molecule model for compounds with up to 45 heavy atoms. The SVGs outputted by CineMol can be readily modified in popular vector graphics editing software applications. CineMol is written in Python and can be incorporated into any existing Python cheminformatics workflow, as it only depends on native Python libraries. CineMol also provides programmatic access to all its internal states, allowing for per-atom and per-bond-based customization. CineMol's capacity to programmatically create molecular visualizations suitable for post-production offers researchers and scientists a powerful tool for enhancing the clarity and visual impact of their scientific presentations and publications in cheminformatics, metabolomics, and related scientific disciplines.Scientific contributionWe introduce CineMol, a Python-based tool that provides a valuable solution for cheminformatics researchers by enabling the direct generation of high-quality approximations of two-dimensional SVG visualizations from three-dimensional small molecule models, all within a programmable Python framework. CineMol offers a unique combination of speed, efficiency, and accessibility, making it an indispensable tool for researchers in cheminformatics, especially when working with SVG visualizations.

2.
ACS Chem Biol ; 19(5): 1106-1115, 2024 May 17.
Artigo em Inglês | MEDLINE | ID: mdl-38602492

RESUMO

The prevalence of multidrug-resistant (MDR) pathogens combined with a decline in antibiotic discovery presents a major challenge for health care. To refill the discovery pipeline, we need to find new ways to uncover new chemical entities. Here, we report the global genome mining-guided discovery of new lipopeptide antibiotics tridecaptin A5 and tridecaptin D, which exhibit unusual bioactivities within their class. The change in the antibacterial spectrum of Oct-TriA5 was explained solely by a Phe to Trp substitution as compared to Oct-TriA1, while Oct-TriD contained 6 substitutions. Metabolomic analysis of producer Paenibacillus sp. JJ-21 validated the predicted amino acid sequence of tridecaptin A5. Screening of tridecaptin analogues substituted at position 9 identified Oct-His9 as a potent congener with exceptional efficacy against Pseudomonas aeruginosa and reduced hemolytic and cytotoxic properties. Our work highlights the promise of tridecaptin analogues to combat MDR pathogens.


Assuntos
Antibacterianos , Testes de Sensibilidade Microbiana , Pseudomonas aeruginosa , Antibacterianos/farmacologia , Antibacterianos/química , Pseudomonas aeruginosa/efeitos dos fármacos , Humanos , Especificidade de Hospedeiro , Descoberta de Drogas , Lipopeptídeos/farmacologia , Lipopeptídeos/química , Peptídeos
3.
Artigo em Inglês | MEDLINE | ID: mdl-38569653

RESUMO

Microbes typically live in complex habitats where they need to rapidly adapt to continuously changing growth conditions. To do so, they produce an astonishing array of natural products with diverse structures and functions. Actinobacteria stand out for their prolific production of bioactive molecules, including antibiotics, anticancer agents, antifungals, and immunosuppressants. Attention has been directed especially towards the identification of the compounds they produce and the mining of the large diversity of biosynthetic gene clusters (BGCs) in their genomes. However, the current return on investment in random screening for bioactive compounds is low, while it is hard to predict which of the millions of BGCs should be prioritized. Moreover, many of the BGCs for yet undiscovered natural products are silent or cryptic under laboratory growth conditions. To identify ways to prioritize and activate these BGCs, knowledge regarding the way their expression is controlled is crucial. Intricate regulatory networks control global gene expression in Actinobacteria, governed by a staggering number of up to 1000 transcription factors per strain. This review highlights recent advances in experimental and computational methods for characterizing and predicting transcription factor binding sites and their applications to guide natural product discovery. We propose that regulation-guided genome mining approaches will open new avenues toward eliciting the expression of BGCs, as well as prioritizing subsets of BGCs for expression using synthetic biology approaches. ONE-SENTENCE SUMMARY: This review provides insights into advances in experimental and computational methods aimed at predicting transcription factor binding sites and their applications to guide natural product discovery.


Assuntos
Actinobacteria , Produtos Biológicos , Descoberta de Drogas , Redes Reguladoras de Genes , Actinobacteria/metabolismo , Actinobacteria/genética , Produtos Biológicos/metabolismo , Vias Biossintéticas , Biologia Computacional/métodos , Regulação Bacteriana da Expressão Gênica , Família Multigênica , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética
4.
ISME Commun ; 4(1): ycae034, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38559569

RESUMO

Plant-microbiome research plays a pivotal role in understanding the relationships between plants and their associated microbial communities, with implications for agriculture and ecosystem dynamics. Metabarcoding analysis on variable regions of the 16S ribosomal RNA (rRNA) gene remains the dominant technology to study microbiome diversity in this field. However, the choice of the targeted variable region might affect the outcome of the microbiome studies. In our in silico analysis, we have evaluated whether the targeted variable region has an impact on taxonomic resolution in 16 plant-related microbial genera. Through a comparison of 16S rRNA gene variable regions with whole-genome data, our findings suggest that the V1-V3 region is generally a more suitable option than the widely used V3-V4 region for targeting microbiome analysis in plant-related genera. However, sole reliance on one region could introduce detection biases for specific genera. Thus, we are suggesting that while transitioning to full-length 16S rRNA gene and whole-genome sequencing for plant-microbiome analysis, the usage of genus-specific variable regions can achieve more precise taxonomic assignments. More broadly, our approach provides a blueprint to identify the most discriminating variable regions of the 16S rRNA gene for genera of interest.

5.
J Mol Biol ; : 168558, 2024 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-38580076

RESUMO

Actinobacteria undergo a complex multicellular life cycle and produce a wide range of specialized metabolites, including the majority of the antibiotics. These biological processes are controlled by intricate regulatory pathways, and to better understand how they are controlled we need to augment our insights into the transcription factor binding sites. Here, we present LogoMotif (https://logomotif.bioinformatics.nl), an open-source database for characterized and predicted transcription factor binding sites in Actinobacteria, along with their cognate position weight matrices and hidden Markov models. Genome-wide predictions of binding site locations in Streptomyces model organisms are supplied and visualized in interactive regulatory networks. In the web interface, users can freely access, download and investigate the underlying data. With this curated collection of actinobacterial regulatory interactions, LogoMotif serves as a basis for binding site predictions, thus providing users with clues on how to elicit the expression of genes of interest and guide genome mining efforts.

6.
ISME J ; 18(1)2024 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-38537571

RESUMO

Natural ecosystems harbor a huge reservoir of taxonomically diverse microbes that are important for plant growth and health. The vast diversity of soil microorganisms and their complex interactions make it challenging to pinpoint the main players important for the life support functions microbes can provide to plants, including enhanced tolerance to (a)biotic stress factors. Designing simplified microbial synthetic communities (SynComs) helps reduce this complexity to unravel the molecular and chemical basis and interplay of specific microbiome functions. While SynComs have been successfully employed to dissect microbial interactions or reproduce microbiome-associated phenotypes, the assembly and reconstitution of these communities have often been based on generic abundance patterns or taxonomic identities and co-occurrences but have only rarely been informed by functional traits. Here, we review recent studies on designing functional SynComs to reveal common principles and discuss multidimensional approaches for community design. We propose a strategy for tailoring the design of functional SynComs based on integration of high-throughput experimental assays with microbial strains and computational genomic analyses of their functional capabilities.


Assuntos
Microbiota , Solo/química , Microbiologia do Solo , Interações Microbianas
7.
Nat Commun ; 15(1): 2072, 2024 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-38453959

RESUMO

Bacteria have an extensive adaptive ability to live in close association with eukaryotic hosts, exhibiting detrimental, neutral or beneficial effects on host growth and health. However, the genes involved in niche adaptation are mostly unknown and their functions poorly characterized. Here, we present bacLIFE ( https://github.com/Carrion-lab/bacLIFE ) a streamlined computational workflow for genome annotation, large-scale comparative genomics, and prediction of lifestyle-associated genes (LAGs). As a proof of concept, we analyzed 16,846 genomes from the Burkholderia/Paraburkholderia and Pseudomonas genera, which led to the identification of hundreds of genes potentially associated with a plant pathogenic lifestyle. Site-directed mutagenesis of 14 of these predicted LAGs of unknown function, followed by plant bioassays, showed that 6 predicted LAGs are indeed involved in the phytopathogenic lifestyle of Burkholderia plantarii and Pseudomonas syringae pv. phaseolicola. These 6 LAGs encompassed a glycosyltransferase, extracellular binding proteins, homoserine dehydrogenases and hypothetical proteins. Collectively, our results highlight bacLIFE as an effective computational tool for prediction of LAGs and the generation of hypotheses for a better understanding of bacteria-host interactions.


Assuntos
Genoma Bacteriano , Pseudomonas syringae , Genoma Bacteriano/genética , Pseudomonas syringae/genética , Fluxo de Trabalho , Genômica/métodos
8.
Environ Microbiol ; 26(2): e16589, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38356049

RESUMO

Ancient environmental samples, including permafrost soils and frozen animal remains, represent an archive with microbial communities that have barely been explored. This yet unexplored microbial world is a genetic resource that may provide us with new evolutionary insights into recent genomic changes, as well as novel metabolic pathways and chemistry. Here, we describe Actinomycetota Micromonospora, Oerskovia, Saccharopolyspora, Sanguibacter and Streptomyces species were successfully revived and their genome sequences resolved. Surprisingly, the genomes of these bacteria from an ancient source show a large phylogenetic distance to known strains and harbour many novel biosynthetic gene clusters that may well represent uncharacterised biosynthetic potential. Metabolic profiles of the strains display the production of known molecules like antimycin, conglobatin and macrotetrolides, but the majority of the mass features could not be dereplicated. Our work provides insights into Actinomycetota isolated from an ancient source, yielding unexplored genomic information that is not yet present in current databases.


Assuntos
Actinomycetales , Mamutes , Streptomyces , Animais , Filogenia , Genômica , Streptomyces/genética , Fezes
9.
Nucleic Acids Res ; 52(D1): D586-D589, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37904617

RESUMO

Many microorganisms produce natural products that are frequently used in the development of medicines and crop protection agents. Genome mining has evolved into a prominent method to access this potential. antiSMASH is the most popular tool for this task. Here we present version 4 of the antiSMASH database, providing biosynthetic gene clusters detected by antiSMASH 7.1 in publicly available, dereplicated, high-quality microbial genomes via an interactive graphical user interface. In version 4, the database contains 231 534 high quality BGC regions from 592 archaeal, 35 726 bacterial and 236 fungal genomes and is available at https://antismash-db.secondarymetabolites.org/.


Assuntos
Produtos Biológicos , Vias Biossintéticas , Bases de Dados Genéticas , Genoma Microbiano , Vias Biossintéticas/genética , Família Multigênica , Software
10.
Comput Struct Biotechnol J ; 23: 22-33, 2024 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38075396

RESUMO

The Rubiaceae plant family, comprising 3 subfamilies and over 13,000 species, is known for producing significant bioactive compounds such as caffeine and monoterpene indole alkaloids. Despite an increase in available genomes from the Rubiaceae family over the past decade, a systematic analysis of the metabolic gene clusters (MGCs) encoded by these genomes has been lacking. In this study, we aim to identify and analyze metabolic gene clusters within complete Rubiaceae genomes through a comparative analysis of eight species. Applying two bioinformatics pipelines, we identified 2372 candidate MGCs, organized into 549 gene cluster families (GCFs). To enhance the reliability of these findings, we developed coexpression networks and conducted orthology analyses. Using genomic data from Solanum lycopersicum (Solanaceae) for comparative purposes, we provided a detailed view of predicted metabolic enzymes, pathways, and coexpression networks. We bring some examples of MGCs and GCFs involved in biological pathways of terpenes, saccharides and alkaloids. Such insights lay the groundwork for discovering new compounds and associated MGCs within the Rubiaceae family, with potential implications in developing more robust crop species and expanding the understanding of plant metabolism. This large-scale exploration also provides a new perspective on the evolution and structure-function relationship of these clusters, offering opportunities for the highly efficient utilization of these unique metabolites. The outcome of this study contributes to a broader comprehension of the biosynthetic pathways, elucidating multiple aspects of specialized metabolism and offering innovative avenues for biotechnological applications.

12.
Cell ; 186(16): 3400-3413.e20, 2023 08 03.
Artigo em Inglês | MEDLINE | ID: mdl-37541197

RESUMO

Approximately 15% of US adults have circulating levels of uric acid above its solubility limit, which is causally linked to the disease gout. In most mammals, uric acid elimination is facilitated by the enzyme uricase. However, human uricase is a pseudogene, having been inactivated early in hominid evolution. Though it has long been known that uric acid is eliminated in the gut, the role of the gut microbiota in hyperuricemia has not been studied. Here, we identify a widely distributed bacterial gene cluster that encodes a pathway for uric acid degradation. Stable isotope tracing demonstrates that gut bacteria metabolize uric acid to xanthine or short chain fatty acids. Ablation of the microbiota in uricase-deficient mice causes severe hyperuricemia, and anaerobe-targeted antibiotics increase the risk of gout in humans. These data reveal a role for the gut microbiota in uric acid excretion and highlight the potential for microbiome-targeted therapeutics in hyperuricemia.


Assuntos
Gota , Hominidae , Hiperuricemia , Adulto , Animais , Humanos , Camundongos , Gota/genética , Gota/metabolismo , Hominidae/genética , Hiperuricemia/genética , Mamíferos/metabolismo , Urato Oxidase/genética , Ácido Úrico/metabolismo , Evolução Molecular
13.
Trends Pharmacol Sci ; 44(8): 532-541, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37391295

RESUMO

Ribosomally synthesized and post-translationally modified peptides (RiPPs) are a chemically diverse class of metabolites. Many RiPPs show potent biological activities that make them attractive starting points for drug development. A promising approach for the discovery of new classes of RiPPs is genome mining. However, the accuracy of genome mining is hampered by the lack of signature genes shared across different RiPP classes. One way to reduce false-positive predictions is by complementing genomic information with metabolomics data. In recent years, several new approaches addressing such integrative genomics and metabolomics analyses have been developed. In this review, we provide a detailed discussion of RiPP-compatible software tools that integrate paired genomics and metabolomics data. We highlight current challenges in data integration and identify opportunities for further developments targeting new classes of bioactive RiPPs.


Assuntos
Produtos Biológicos , Humanos , Ribossomos/genética , Ribossomos/metabolismo , Peptídeos , Genômica , Metaboloma , Processamento de Proteína Pós-Traducional
14.
BMC Bioinformatics ; 24(1): 181, 2023 May 03.
Artigo em Inglês | MEDLINE | ID: mdl-37131131

RESUMO

BACKGROUND: Co-localized sets of genes that encode specialized functions are common across microbial genomes and occur in genomes of larger eukaryotes as well. Important examples include Biosynthetic Gene Clusters (BGCs) that produce specialized metabolites with medicinal, agricultural, and industrial value (e.g. antimicrobials). Comparative analysis of BGCs can aid in the discovery of novel metabolites by highlighting distribution and identifying variants in public genomes. Unfortunately, gene-cluster-level homology detection remains inaccessible, time-consuming and difficult to interpret. RESULTS: The comparative gene cluster analysis toolbox (CAGECAT) is a rapid and user-friendly platform to mitigate difficulties in comparative analysis of whole gene clusters. The software provides homology searches and downstream analyses without the need for command-line or programming expertise. By leveraging remote BLAST databases, which always provide up-to-date results, CAGECAT can yield relevant matches that aid in the comparison, taxonomic distribution, or evolution of an unknown query. The service is extensible and interoperable and implements the cblaster and clinker pipelines to perform homology search, filtering, gene neighbourhood estimation, and dynamic visualisation of resulting variant BGCs. With the visualisation module, publication-quality figures can be customized directly from a web-browser, which greatly accelerates their interpretation via informative overlays to identify conserved genes in a BGC query. CONCLUSION: Overall, CAGECAT is an extensible software that can be interfaced via a standard web-browser for whole region homology searches and comparison on continually updated genomes from NCBI. The public web server and installable docker image are open source and freely available without registration at: https://cagecat.bioinformatics.nl .


Assuntos
Computadores , Software , Família Multigênica , Genoma , Análise por Conglomerados
15.
Nucleic Acids Res ; 51(W1): W46-W50, 2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37140036

RESUMO

Microorganisms produce small bioactive compounds as part of their secondary or specialised metabolism. Often, such metabolites have antimicrobial, anticancer, antifungal, antiviral or other bio-activities and thus play an important role for applications in medicine and agriculture. In the past decade, genome mining has become a widely-used method to explore, access, and analyse the available biodiversity of these compounds. Since 2011, the 'antibiotics and secondary metabolite analysis shell-antiSMASH' (https://antismash.secondarymetabolites.org/) has supported researchers in their microbial genome mining tasks, both as a free to use web server and as a standalone tool under an OSI-approved open source licence. It is currently the most widely used tool for detecting and characterising biosynthetic gene clusters (BGCs) in archaea, bacteria, and fungi. Here, we present the updated version 7 of antiSMASH. antiSMASH 7 increases the number of supported cluster types from 71 to 81, as well as containing improvements in the areas of chemical structure prediction, enzymatic assembly-line visualisation and gene cluster regulation.


Assuntos
Computadores , Software , Bactérias/genética , Bactérias/metabolismo , Archaea/genética , Genoma Microbiano , Família Multigênica , Metabolismo Secundário/genética
16.
ACS Infect Dis ; 9(4): 739-748, 2023 04 14.
Artigo em Inglês | MEDLINE | ID: mdl-37000899

RESUMO

The unabated rise of antibiotic resistance has raised the specter of a post-antibiotic era and underscored the importance of developing new classes of antibiotics. The relacidines are a recently discovered group of nonribosomal lipopeptide antibiotics that show promising activity against Gram-negative pathogens and share structural similarities with brevicidine and laterocidine. While the first reports of the relacidines indicated that they possess a C-terminal five-amino acid macrolactone, an N-terminal lipid tail, and an overall positive charge, no stereochemical configuration was assigned, thereby precluding a full structure determination. To address this issue, we here report a bioinformatics guided total synthesis of relacidine A and B and show that the authentic natural products match our predicted and synthesized structures. Following on this, we also synthesized an analogue of relacidine A wherein the ester linkage of the macrolactone was replaced by the corresponding amide. This analogue was found to possess enhanced hydrolytic stability while maintaining the antibacterial activity of the natural product in both in vitro and in vivo efficacy studies.


Assuntos
Antibacterianos , Lipopeptídeos , Antibacterianos/química , Lipopeptídeos/farmacologia , Lipopeptídeos/química , Amidas
17.
Nat Biotechnol ; 41(10): 1416-1423, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-36782070

RESUMO

The gut microbiota produce hundreds of small molecules, many of which modulate host physiology. Although efforts have been made to identify biosynthetic genes for secondary metabolites, the chemical output of the gut microbiome consists predominantly of primary metabolites. Here we introduce the gutSMASH algorithm for identification of primary metabolic gene clusters, and we used it to systematically profile gut microbiome metabolism, identifying 19,890 gene clusters in 4,240 high-quality microbial genomes. We found marked differences in pathway distribution among phyla, reflecting distinct strategies for energy capture. These data explain taxonomic differences in short-chain fatty acid production and suggest a characteristic metabolic niche for each taxon. Analysis of 1,135 individuals from a Dutch population-based cohort shows that the level of microbiome-derived metabolites in plasma and feces is almost completely uncorrelated with the metagenomic abundance of corresponding metabolic genes, indicating a crucial role for pathway-specific gene regulation and metabolite flux. This work is a starting point for understanding differences in how bacterial taxa contribute to the chemistry of the microbiome.


Assuntos
Microbioma Gastrointestinal , Microbiota , Humanos , Microbioma Gastrointestinal/genética , Fezes/microbiologia , Bactérias , Redes e Vias Metabólicas/genética
18.
PLoS Comput Biol ; 19(2): e1010462, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36758069

RESUMO

Microbial specialised metabolism is full of valuable natural products that are applied clinically, agriculturally, and industrially. The genes that encode their biosynthesis are often physically clustered on the genome in biosynthetic gene clusters (BGCs). Many BGCs consist of multiple groups of co-evolving genes called sub-clusters that are responsible for the biosynthesis of a specific chemical moiety in a natural product. Sub-clusters therefore provide an important link between the structures of a natural product and its BGC, which can be leveraged for predicting natural product structures from sequence, as well as for linking chemical structures and metabolomics-derived mass features to BGCs. While some initial computational methodologies have been devised for sub-cluster detection, current approaches are not scalable, have only been run on small and outdated datasets, or produce an impractically large number of possible sub-clusters to mine through. Here, we constructed a scalable method for unsupervised sub-cluster detection, called iPRESTO, based on topic modelling and statistical analysis of co-occurrence patterns of enzyme-coding protein families. iPRESTO was used to mine sub-clusters across 150,000 prokaryotic BGCs from antiSMASH-DB. After annotating a fraction of the resulting sub-cluster families, we could predict a substructure for 16% of the antiSMASH-DB BGCs. Additionally, our method was able to confirm 83% of the experimentally characterised sub-clusters in MIBiG reference BGCs. Based on iPRESTO-detected sub-clusters, we could correctly identify the BGCs for xenorhabdin and salbostatin biosynthesis (which had not yet been annotated in BGC databases), as well as propose a candidate BGC for akashin biosynthesis. Additionally, we show for a collection of 145 actinobacteria how substructures can aid in linking BGCs to molecules by correlating iPRESTO-detected sub-clusters to MS/MS-derived Mass2Motifs substructure patterns. This work paves the way for deeper functional and structural annotation of microbial BGCs by improved linking of orphan molecules to their cognate gene clusters, thus facilitating accelerated natural product discovery.


Assuntos
Produtos Biológicos , Espectrometria de Massas em Tandem , Metabolômica , Bactérias/genética , Família Multigênica
19.
Trends Genet ; 39(4): 237-239, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36822964

RESUMO

Convergent evolution has been described for several metabolic pathways across the kingdoms of life. However, there is hitherto no evidence for such an interkingdom process for antimicrobials. A new report suggests that marine animals have evolved the ability to biosynthesize antimicrobial polyketides, in parallel with bacteria.


Assuntos
Antibacterianos , Bactérias , Animais , Bactérias/genética
20.
Microbiome ; 11(1): 13, 2023 01 23.
Artigo em Inglês | MEDLINE | ID: mdl-36691088

RESUMO

BACKGROUND: It is well-known that the microbiome produces a myriad of specialised metabolites with diverse functions. To better characterise their structures and identify their producers in complex samples, integrative genome and metabolome mining is becoming increasingly popular. Metabologenomic co-occurrence-based correlation scoring methods facilitate the linking of metabolite mass fragmentation spectra (MS/MS) to their cognate biosynthetic gene clusters (BGCs) based on shared absence/presence patterns of metabolites and BGCs in paired omics datasets of multiple strains. Recently, these methods have been made more readily accessible through the NPLinker platform. However, co-occurrence-based approaches usually result in too many candidate links to manually validate. To address this issue, we introduce a generic feature-based correlation method that matches chemical compound classes between BGCs and MS/MS spectra. RESULTS: To automatically reduce the long lists of potential BGC-MS/MS spectrum links, we match natural product (NP) ontologies previously independently developed for genomics and metabolomics and developed NPClassScore: an empirical class matching score that we also implemented in the NPLinker platform. By applying NPClassScore on three paired omics datasets totalling 189 bacterial strains, we show that the number of links is reduced by on average 63% as compared to using a co-occurrence-based strategy alone. We further demonstrate that 96% of experimentally validated links in these datasets are retained and prioritised when using NPClassScore. CONCLUSION: The matching genome-metabolome class ontologies provide a starting point for selecting plausible candidates for BGCs and MS/MS spectra based on matching chemical compound class ontologies. NPClassScore expedites genome/metabolome data integration, as relevant BGC-metabolite links are prioritised, and researchers are faced with substantially fewer proposed BGC-MS/MS links to manually inspect. We anticipate that our addition to the NPLinker platform will aid integrative omics mining workflows in discovering novel NPs and understanding complex metabolic interactions in the microbiome. Video Abstract.


Assuntos
Vias Biossintéticas , Espectrometria de Massas em Tandem , Vias Biossintéticas/genética , Genômica , Metabolômica/métodos , Família Multigênica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...