Your browser doesn't support javascript.
loading
Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching.
Louwen, Joris J R; Medema, Marnix H; van der Hooft, Justin J J.
Afiliación
  • Louwen JJR; Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands.
  • Medema MH; Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands.
  • van der Hooft JJJ; Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, the Netherlands. justin.vanderhooft@wur.nl.
Microbiome ; 11(1): 13, 2023 01 23.
Article en En | MEDLINE | ID: mdl-36691088
ABSTRACT

BACKGROUND:

It is well-known that the microbiome produces a myriad of specialised metabolites with diverse functions. To better characterise their structures and identify their producers in complex samples, integrative genome and metabolome mining is becoming increasingly popular. Metabologenomic co-occurrence-based correlation scoring methods facilitate the linking of metabolite mass fragmentation spectra (MS/MS) to their cognate biosynthetic gene clusters (BGCs) based on shared absence/presence patterns of metabolites and BGCs in paired omics datasets of multiple strains. Recently, these methods have been made more readily accessible through the NPLinker platform. However, co-occurrence-based approaches usually result in too many candidate links to manually validate. To address this issue, we introduce a generic feature-based correlation method that matches chemical compound classes between BGCs and MS/MS spectra.

RESULTS:

To automatically reduce the long lists of potential BGC-MS/MS spectrum links, we match natural product (NP) ontologies previously independently developed for genomics and metabolomics and developed NPClassScore an empirical class matching score that we also implemented in the NPLinker platform. By applying NPClassScore on three paired omics datasets totalling 189 bacterial strains, we show that the number of links is reduced by on average 63% as compared to using a co-occurrence-based strategy alone. We further demonstrate that 96% of experimentally validated links in these datasets are retained and prioritised when using NPClassScore.

CONCLUSION:

The matching genome-metabolome class ontologies provide a starting point for selecting plausible candidates for BGCs and MS/MS spectra based on matching chemical compound class ontologies. NPClassScore expedites genome/metabolome data integration, as relevant BGC-metabolite links are prioritised, and researchers are faced with substantially fewer proposed BGC-MS/MS links to manually inspect. We anticipate that our addition to the NPLinker platform will aid integrative omics mining workflows in discovering novel NPs and understanding complex metabolic interactions in the microbiome. Video Abstract.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Espectrometría de Masas en Tándem / Vías Biosintéticas Tipo de estudio: Prognostic_studies Idioma: En Revista: Microbiome Año: 2023 Tipo del documento: Article País de afiliación: Países Bajos

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Espectrometría de Masas en Tándem / Vías Biosintéticas Tipo de estudio: Prognostic_studies Idioma: En Revista: Microbiome Año: 2023 Tipo del documento: Article País de afiliación: Países Bajos