Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 30
Filter
Add more filters










Publication year range
1.
Nat Commun ; 15(1): 5356, 2024 Jun 25.
Article in English | MEDLINE | ID: mdl-38918378

ABSTRACT

Type 1 polyketides are a major class of natural products used as antiviral, antibiotic, antifungal, antiparasitic, immunosuppressive, and antitumor drugs. Analysis of public microbial genomes leads to the discovery of over sixty thousand type 1 polyketide gene clusters. However, the molecular products of only about a hundred of these clusters are characterized, leaving most metabolites unknown. Characterizing polyketides relies on bioactivity-guided purification, which is expensive and time-consuming. To address this, we present Seq2PKS, a machine learning algorithm that predicts chemical structures derived from Type 1 polyketide synthases. Seq2PKS predicts numerous putative structures for each gene cluster to enhance accuracy. The correct structure is identified using a variable mass spectral database search. Benchmarks show that Seq2PKS outperforms existing methods. Applying Seq2PKS to Actinobacteria datasets, we discover biosynthetic gene clusters for monazomycin, oasomycin A, and 2-aminobenzamide-actiphenol.


Subject(s)
Mass Spectrometry , Multigene Family , Polyketide Synthases , Polyketides , Polyketides/metabolism , Polyketides/chemistry , Polyketide Synthases/genetics , Polyketide Synthases/metabolism , Mass Spectrometry/methods , Data Mining/methods , Machine Learning , Actinobacteria/genetics , Actinobacteria/metabolism , Genome, Bacterial , Algorithms , Biological Products/chemistry , Biological Products/metabolism
2.
Nat Prod Rep ; 41(7): 1020-1059, 2024 Jul 17.
Article in English | MEDLINE | ID: mdl-38411572

ABSTRACT

Covering 1965 to February 2024Plants are prolific peptide chemists and are known to make thousands of different peptidic molecules. These peptides vary dramatically in their size, chemistry, and bioactivity. Despite their differences, all plant peptides to date are biosynthesized as ribosomally synthesized and post-translationally modified peptides (RiPPs). Decades of research in plant RiPP biosynthesis have extended the definition and scope of RiPPs from microbial sources, establishing paradigms and discovering new families of biosynthetic enzymes. The discovery and elucidation of plant peptide pathways is challenging due to repurposing and evolution of housekeeping genes as both precursor peptides and biosynthetic enzymes and due to the low rates of gene clustering in plants. In this review, we highlight the chemistry, biosynthesis, and function of the known RiPP classes from plants and recommend a nomenclature for the recent addition of BURP-domain-derived RiPPs termed burpitides. Burpitides are an emerging family of cyclic plant RiPPs characterized by macrocyclic crosslinks between tyrosine or tryptophan side chains and other amino acid side chains or their peptide backbone that are formed by copper-dependent BURP-domain-containing proteins termed burpitide cyclases. Finally, we review the discovery of plant RiPPs through bioactivity-guided, structure-guided, and gene-guided approaches.


Subject(s)
Peptides , Plants , Protein Processing, Post-Translational , Ribosomes , Ribosomes/metabolism , Peptides/chemistry , Peptides/metabolism , Plants/metabolism , Plant Proteins/metabolism , Plant Proteins/genetics , Plant Proteins/chemistry , Molecular Structure
3.
Nat Chem Biol ; 20(4): 530-540, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38355722

ABSTRACT

The biosynthetic dogma of ribosomally synthesized and posttranslationally modified peptides (RiPP) involves enzymatic intermolecular modification of core peptide motifs in precursor peptides. The plant-specific BURP-domain protein family, named after their four founding members, includes autocatalytic peptide cyclases involved in the biosynthesis of side-chain-macrocyclic plant RiPPs. Here we show that AhyBURP, a representative of the founding Unknown Seed Protein-type BURP-domain subfamily, catalyzes intramolecular macrocyclizations of its core peptide during the sequential biosynthesis of monocyclic lyciumin I via glycine-tryptophan crosslinking and bicyclic legumenin via glutamine-tyrosine crosslinking. X-ray crystallography of AhyBURP reveals the BURP-domain fold with two type II copper centers derived from a conserved stapled-disulfide and His motif. We show the macrocyclization of lyciumin-C(sp3)-N-bond formation followed by legumenin-C(sp3)-O-bond formation requires dioxygen and radical involvement based on enzyme assays in anoxic conditions and isotopic labeling. Our study expands enzymatic intramolecular modifications beyond catalytic moiety and chromophore biogenesis to RiPP biosynthesis.


Subject(s)
Lignans , Protein Biosynthesis , Protein Processing, Post-Translational , Amino Acid Sequence , Peptides/chemistry , Plants/metabolism , Plant Proteins/genetics , Plant Proteins/metabolism
4.
Nat Commun ; 14(1): 4219, 2023 07 14.
Article in English | MEDLINE | ID: mdl-37452020

ABSTRACT

Recent analyses of public microbial genomes have found over a million biosynthetic gene clusters, the natural products of the majority of which remain unknown. Additionally, GNPS harbors billions of mass spectra of natural products without known structures and biosynthetic genes. We bridge the gap between large-scale genome mining and mass spectral datasets for natural product discovery by developing HypoRiPPAtlas, an Atlas of hypothetical natural product structures, which is ready-to-use for in silico database search of tandem mass spectra. HypoRiPPAtlas is constructed by mining genomes using seq2ripp, a machine-learning tool for the prediction of ribosomally synthesized and post-translationally modified peptides (RiPPs). In HypoRiPPAtlas, we identify RiPPs in microbes and plants. HypoRiPPAtlas could be extended to other natural product classes in the future by implementing corresponding biosynthetic logic. This study paves the way for large-scale explorations of biosynthetic pathways and chemical structures of microbial and plant RiPP classes.


Subject(s)
Biological Products , Ribosomes , Ribosomes/metabolism , Biological Products/chemistry , Peptides/chemistry , Databases, Factual , Tandem Mass Spectrometry , Protein Processing, Post-Translational
5.
Plants (Basel) ; 12(12)2023 Jun 14.
Article in English | MEDLINE | ID: mdl-37375938

ABSTRACT

Jujube is a nutritious fruit, and is high in vitamin C, fiber, phenolics, flavonoids, nucleotides, and organic acids. It is both an important food and a source of traditional medicine. Metabolomics can reveal metabolic differences between Ziziphus jujuba fruits from different jujube cultivars and growth sites. In the fall of 2022, mature fresh fruit of eleven cultivars from replicated trials at three sites in New Mexico-Leyendecker, Los Lunas, and Alcalde-were sampled from September to October for an untargeted metabolomics study. The 11 cultivars were Alcalde 1, Dongzao, Jinsi (JS), Jinkuiwang (JKW), Jixin, Kongfucui (KFC), Lang, Li, Maya, Shanxi Li, and Zaocuiwang (ZCW). Based on the LC-MS/MS analysis, there were 1315 compounds detected with amino acids and derivatives (20.15%) and flavonoids (15.44%) as dominant categories. The results reveal that the cultivar was the dominant factor in metabolite profiles, while the location was secondary. A pairwise comparison of cultivar metabolomes revealed that two pairs had fewer differential metabolites (i.e., Li/Shanxi Li and JS/JKW) than all the other pairs, highlighting that pairwise metabolic comparison can be applied for cultivar fingerprinting. Differential metabolite analysis also showed that half of drying cultivars have up-regulated lipid metabolites compared to fresh or multi-purpose fruit cultivars and that specialized metabolites vary significantly between cultivars from 35.3% (Dongzao/ZCW) to 56.7% (Jixin/KFC). An exemplary analyte matching sedative cyclopeptide alkaloid sanjoinine A was only detected in the Jinsi and Jinkuiwang cultivars. Overall, our metabolic analysis of the jujube cultivar's mature fruits provides the largest resource of jujube fruit metabolomes to date and will inform cultivar selection for nutritional and medicinal research and for fruit metabolic breeding.

6.
Appl Environ Microbiol ; 88(14): e0180321, 2022 07 26.
Article in English | MEDLINE | ID: mdl-35862730

ABSTRACT

Interactions between bacteria and phytoplankton can influence primary production, community composition, and algal bloom development. However, these interactions are poorly described for many consortia, particularly for freshwater bloom-forming cyanobacteria. Here, we assessed the gene content and expression of two uncultivated Acidobacteria from Lake Erie Microcystis blooms. These organisms were targeted because they were previously identified as important catalase producers in Microcystis blooms, suggesting that they protect Microcystis from H2O2. Metatranscriptomics revealed that both Acidobacteria transcribed genes for uptake of organic compounds that are known cyanobacterial products and exudates, including lactate, glycolate, amino acids, peptides, and cobalamins. Expressed genes for amino acid metabolism and peptide transport and degradation suggest that use of amino acids and peptides by Acidobacteria may regenerate nitrogen for cyanobacteria and other organisms. The Acidobacteria genomes lacked genes for biosynthesis of cobalamins but expressed genes for its transport and remodeling. This indicates that the Acidobacteria obtained cobalamins externally, potentially from Microcystis, which has a complete gene repertoire for pseudocobalamin biosynthesis; expressed them in field samples; and produced pseudocobalamin in axenic culture. Both Acidobacteria were detected in Microcystis blooms worldwide. Together, the data support the hypotheses that uncultured and previously unidentified Acidobacteria taxa exchange metabolites with phytoplankton during harmful cyanobacterial blooms and influence nitrogen available to phytoplankton. Thus, novel Acidobacteria may play a role in cyanobacterial physiology and bloom development. IMPORTANCE Interactions between heterotrophic bacteria and phytoplankton influence competition and successions between phytoplankton taxa, thereby influencing ecosystem-wide processes such as carbon cycling and algal bloom development. The cyanobacterium Microcystis forms harmful blooms in freshwaters worldwide and grows in buoyant colonies that harbor other bacteria in their phycospheres. Bacteria in the phycosphere and in the surrounding community likely influence Microcystis physiology and ecology and thus the development of freshwater harmful cyanobacterial blooms. However, the impacts and mechanisms of interaction between bacteria and Microcystis are not fully understood. This study explores the mechanisms of interaction between Microcystis and uncultured members of its phycosphere in situ with population genome resolution to investigate the cooccurrence of Microcystis and freshwater Acidobacteria in blooms worldwide.


Subject(s)
Cyanobacteria , Microcystis , Acidobacteria/metabolism , Amino Acids/metabolism , Carbon/metabolism , Cyanobacteria/genetics , Ecosystem , Hydrogen Peroxide/metabolism , Lakes/microbiology , Microcystis/genetics , Microcystis/metabolism , Nitrogen/metabolism , Phytoplankton/metabolism , Vitamin B 12/metabolism
7.
J Am Chem Soc ; 144(17): 7686-7692, 2022 05 04.
Article in English | MEDLINE | ID: mdl-35438481

ABSTRACT

Moroidin is a bicyclic plant octapeptide with tryptophan side-chain cross-links, originally isolated as a pain-causing agent from the Australian stinging tree Dendrocnide moroides. Moroidin and its analog celogentin C, derived from Celosia argentea, are inhibitors of tubulin polymerization and, thus, lead structures for cancer therapy. However, low isolation yields from source plants and challenging organic synthesis hinder moroidin-based drug development. Here, we present biosynthesis as an alternative route to moroidin-type bicyclic peptides and report that they are ribosomally synthesized and posttranslationally modified peptides (RiPPs) derived from BURP-domain peptide cyclases in plants. By mining 793 plant transcriptomes for moroidin core peptide motifs within BURP-domain precursor peptides, we identified a moroidin cyclase in Japanese kerria, which catalyzes the installation of the tryptophan-indole-centered macrocyclic bonds of the moroidin bicyclic motif in the presence of cupric ions. Based on the kerria moroidin cyclase, we demonstrate the feasibility of producing diverse moroidins including celogentin C in transgenic tobacco plants and report specific cytotoxicity of celogentin C against a lung adenocarcinoma cancer cell line. Our study sets the stage for future biosynthetic development of moroidin-based therapeutics and highlights that mining plant transcriptomes can reveal bioactive cyclic peptides and their underlying cyclases from new source plants.


Subject(s)
Peptides, Cyclic , Tryptophan , Australia , Peptides/metabolism , Peptides/pharmacology , Peptides, Cyclic/chemistry , Plants , Protein Processing, Post-Translational , Tryptophan/metabolism
8.
Nat Chem Biol ; 18(1): 18-28, 2022 01.
Article in English | MEDLINE | ID: mdl-34811516

ABSTRACT

Many bioactive plant cyclic peptides form side-chain-derived macrocycles. Lyciumins, cyclic plant peptides with tryptophan macrocyclizations, are ribosomal peptides (RiPPs) originating from repetitive core peptide motifs in precursor peptides with plant-specific BURP (BNM2, USP, RD22 and PG1beta) domains, but the biosynthetic mechanism for their formation has remained unknown. Here, we characterize precursor-peptide BURP domains as copper-dependent autocatalytic peptide cyclases and use a combination of tandem mass spectrometry-based metabolomics and plant genomics to systematically discover five BURP-domain-derived plant RiPP classes, with mono- and bicyclic structures formed via tryptophans and tyrosines, from botanical collections. As BURP-domain cyclases are scaffold-generating enzymes in plant specialized metabolism that are physically connected to their substrates in the same polypeptide, we introduce a bioinformatic method to mine plant genomes for precursor-peptide-encoding genes by detection of repetitive substrate domains and known core peptide features. Our study sets the stage for chemical, biosynthetic and biological exploration of plant RiPP natural products from BURP-domain cyclases.


Subject(s)
Peptides, Cyclic/biosynthesis , Peptides, Cyclic/chemistry , Plant Proteins/chemistry , Amino Acid Sequence , Catalysis , Cell Membrane Permeability , Cyclization , Genome, Plant , Tandem Mass Spectrometry
9.
Front Plant Sci ; 12: 692108, 2021.
Article in English | MEDLINE | ID: mdl-34925392

ABSTRACT

Copper is an important transition metal cofactor in plant metabolism, which enables diverse biocatalysis in aerobic environments. Multiple classes of plant metalloenzymes evolved and underwent genetic expansions during the evolution of terrestrial plants and, to date, several representatives of these copper enzyme classes have characterized mechanisms. In this review, we give an updated overview of chemistry, structure, mechanism, function and phylogenetic distribution of plant copper metalloenzymes with an emphasis on biosynthesis of aromatic compounds such as phenylpropanoids (lignin, lignan, flavonoids) and cyclic peptides with macrocyclizations via aromatic amino acids. We also review a recent addition to plant copper enzymology in a copper-dependent peptide cyclase called the BURP domain. Given growing plant genetic resources, a large pool of copper biocatalysts remains to be characterized from plants as plant genomes contain on average more than 70 copper enzyme genes. A major challenge in characterization of copper biocatalysts from plant genomes is the identification of endogenous substrates and catalyzed reactions. We highlight some recent and future trends in filling these knowledge gaps in plant metabolism and the potential for genomic discovery of copper-based enzymology from plants.

10.
Proc Natl Acad Sci U S A ; 115(46): E10961-E10969, 2018 11 13.
Article in English | MEDLINE | ID: mdl-30373830

ABSTRACT

The plant kingdom contains vastly untapped natural product chemistry, which has been traditionally explored through the activity-guided approach. Here, we describe a gene-guided approach to discover and engineer a class of plant ribosomal peptides, the branched cyclic lyciumins. Initially isolated from the Chinese wolfberry Lycium barbarum, lyciumins are protease-inhibiting peptides featuring an N-terminal pyroglutamate and a macrocyclic bond between a tryptophan-indole nitrogen and a glycine α-carbon. We report the identification of a lyciumin precursor gene from L. barbarum, which encodes a BURP domain and repetitive lyciumin precursor peptide motifs. Genome mining enabled by this initial finding revealed rich lyciumin genotypes and chemotypes widespread in flowering plants. We establish a biosynthetic framework of lyciumins and demonstrate the feasibility of producing diverse natural and unnatural lyciumins in transgenic tobacco. With rapidly expanding plant genome resources, our approach will complement bioactivity-guided approaches to unlock and engineer hidden plant peptide chemistry for pharmaceutical and agrochemical applications.


Subject(s)
Gene Expression Profiling/methods , Genes, Plant , Peptides, Cyclic/genetics , Plants/genetics , Amino Acid Sequence/genetics , Biological Products/chemistry , Genome , Genomics/methods , Lignans/biosynthesis , Peptides/chemistry , Peptides/genetics , Peptides, Cyclic/metabolism , Protein Processing, Post-Translational , Ribosomes/genetics , Ribosomes/metabolism
11.
Angew Chem Int Ed Engl ; 57(14): 3671-3675, 2018 03 26.
Article in English | MEDLINE | ID: mdl-29417714

ABSTRACT

Characterization of complex natural product mixtures to the absolute structural level of their components often requires significant amounts of starting materials and lengthy purification process, followed by arduous structure elucidation efforts. The crystalline sponge (CS) method has demonstrated utility in the absolute structure elucidation of isolated organic compounds at miniscule quantities compared to conventional methods. In this work, we developed a new CS-based workflow that greatly expedites the in-depth structural analysis of crude natural product extracts. Using a crude extract of the red alga Laurencia pacifica, we showed that CS affinity screening prior to compound isolation enables prioritization of analytes present in the extract, and we subsequently resolved the molecular structures of six sesquiterpenes with stereochemical clarity from around 10 mg crude extract. This study demonstrates a new chemotyping workflow that can greatly accelerate natural product discovery from complex samples.


Subject(s)
Biological Products/chemistry , Complex Mixtures/chemistry , Laurencia/chemistry , Sesquiterpenes/chemistry , Animals , Biological Products/isolation & purification , Complex Mixtures/isolation & purification , Crystallization/methods , Molecular Structure , Sesquiterpenes/isolation & purification
12.
J Am Chem Soc ; 139(46): 16838-16844, 2017 11 22.
Article in English | MEDLINE | ID: mdl-29083151

ABSTRACT

Sesquiterpene scaffolds are the core backbones of many medicinally and industrially important natural products. A plethora of sesquiterpene synthases, widely present in bacteria, fungi, and plants, catalyze the formation of these intricate structures often with multiple stereocenters starting from linear farnesyl diphosphate substrates. Recent advances in next-generation sequencing and metabolomics technologies have greatly facilitated gene discovery for sesquiterpene synthases. However, a major bottleneck limits biochemical characterization of recombinant sesquiterpene synthases: the absolute structural elucidation of the derived sesquiterpene products. Here, we report the identification and biochemical characterization of LphTPS-A, a sesquiterpene synthase from the red macroalga Laurencia pacifica. Using the combination of transcriptomics, sesquiterpene synthase expression in yeast, and microgram-scale nuclear magnetic resonance-coupled crystalline sponge X-ray diffraction analysis, we resolved the absolute stereochemistry of prespatane, the major sesquiterpene product of LphTPS-A, and thereby functionally define LphTPS-A as the first bourbonane-producing sesquiterpene synthase and the first biochemically characterized sesquiterpene synthase from red algae. Our study showcases a workflow integrating multiomics approaches, synthetic biology, and the crystalline sponge method, which is generally applicable for uncovering new terpene chemistry and biochemistry from source-limited living organisms.

13.
Nat Biotechnol ; 34(8): 828-837, 2016 08 09.
Article in English | MEDLINE | ID: mdl-27504778

ABSTRACT

The potential of the diverse chemistries present in natural products (NP) for biotechnology and medicine remains untapped because NP databases are not searchable with raw data and the NP community has no way to share data other than in published papers. Although mass spectrometry (MS) techniques are well-suited to high-throughput characterization of NP, there is a pressing need for an infrastructure to enable sharing and curation of data. We present Global Natural Products Social Molecular Networking (GNPS; http://gnps.ucsd.edu), an open-access knowledge base for community-wide organization and sharing of raw, processed or identified tandem mass (MS/MS) spectrometry data. In GNPS, crowdsourced curation of freely available community-wide reference MS libraries will underpin improved annotations. Data-driven social-networking should facilitate identification of spectra and foster collaborations. We also introduce the concept of 'living data' through continuous reanalysis of deposited data.


Subject(s)
Biological Products/chemistry , Biological Products/classification , Data Curation/methods , Databases, Chemical , Information Dissemination/methods , Mass Spectrometry/statistics & numerical data , Database Management Systems , Information Storage and Retrieval/methods , Internationality
14.
ACS Chem Biol ; 10(11): 2501-11, 2015 Nov 20.
Article in English | MEDLINE | ID: mdl-26378620

ABSTRACT

Terpenes are ubiquitous natural chemicals with diverse biological functions spanning all three domains of life. In specialized metabolism, the active sites of terpene synthases (TPSs) evolve in shape and reactivity to direct the biosynthesis of a myriad of chemotypes for organismal fitness. As most terpene biosynthesis mechanistically involves highly reactive carbocationic intermediates, the protein surfaces catalyzing these cascade reactions possess reactive regions possibly prone to premature carbocation capture and potentially enzyme inactivation. Here, we show using proteomic and X-ray crystallographic analyses that cationic intermediates undergo capture by conserved active site residues leading to inhibitory self-alkylation. Moreover, the level of cation-mediated inactivation increases with mutation of the active site, upon changes in the size and structure of isoprenoid diphosphate substrates, and alongside increases in reaction temperatures. TPSs that individually synthesize multiple products are less prone to self-alkylation then TPSs possessing relatively high product specificity. In total, the results presented suggest that mechanism-based alkylation represents an overlooked mechanistic pressure during the evolution of cation-derived terpene biosynthesis.


Subject(s)
Alkyl and Aryl Transferases/antagonists & inhibitors , Alkyl and Aryl Transferases/metabolism , Alkyl and Aryl Transferases/chemistry , Alkyl and Aryl Transferases/genetics , Alkylation , Catalytic Domain , Crystallography, X-Ray , Models, Molecular , Mutation , Protein Processing, Post-Translational , Proteomics , Terpenes/chemistry
15.
J Nat Prod ; 77(8): 1902-9, 2014 Aug 22.
Article in English | MEDLINE | ID: mdl-25116163

ABSTRACT

Nonribosomal peptides (NRPs) such as vancomycin and daptomycin are among the most effective antibiotics. While NRPs are biomedically important, the computational techniques for sequencing these peptides are still in their infancy. The recent emergence of mass spectrometry techniques for NRP analysis (capable of sequencing an NRP from small amounts of nonpurified material) revealed an enormous diversity of NRPs. However, as many NRPs have nonlinear structure (e.g., cyclic or branched-cyclic peptides), the standard de novo sequencing tools (developed for linear peptides) are not applicable to NRP analysis. Here, we introduce the first NRP identification algorithm, NRPquest, that performs mutation-tolerant and modification-tolerant searches of spectral data sets against a database of putative NRPs. In contrast to previous studies aimed at NRP discovery (that usually report very few NRPs), NRPquest revealed nearly a hundred NRPs (including unknown variants of previously known peptides) in a single study. This result indicates that NRPquest can potentially make MS-based NRP identification as robust as the identification of linear peptides in traditional proteomics.


Subject(s)
Anti-Bacterial Agents/pharmacology , Biological Products/pharmacology , Peptides/pharmacology , Algorithms , Anti-Bacterial Agents/chemistry , Bacillus/genetics , Bacillus/metabolism , Biological Products/chemistry , Daptomycin/pharmacology , Mass Spectrometry , Molecular Structure , Peptide Synthases/metabolism , Peptides/chemistry , Proteomics , Streptomyces/genetics , Streptomyces/metabolism , Vancomycin/pharmacology
16.
Nat Chem Biol ; 10(8): 640-7, 2014 Aug.
Article in English | MEDLINE | ID: mdl-24974229

ABSTRACT

Polybrominated diphenyl ethers (PBDEs) and polybrominated bipyrroles are natural products that bioaccumulate in the marine food chain. PBDEs have attracted widespread attention because of their persistence in the environment and potential toxicity to humans. However, the natural origins of PBDE biosynthesis are not known. Here we report marine bacteria as producers of PBDEs and establish a genetic and molecular foundation for their production that unifies paradigms for the elaboration of bromophenols and bromopyrroles abundant in marine biota. We provide biochemical evidence of marine brominases revealing decarboxylative-halogenation enzymology previously unknown among halogenating enzymes. Biosynthetic motifs discovered in our study were used to mine sequence databases to discover unrealized marine bacterial producers of organobromine compounds.


Subject(s)
Halogenated Diphenyl Ethers/metabolism , Pseudoalteromonas/genetics , Pseudoalteromonas/metabolism , Aquatic Organisms , Genome, Bacterial , Halogenation , Molecular Sequence Data , Multigene Family , Oxidoreductases/genetics , Oxidoreductases/metabolism , Pyrroles/metabolism
17.
ACS Chem Biol ; 9(7): 1545-51, 2014 Jul 18.
Article in English | MEDLINE | ID: mdl-24802639

ABSTRACT

Ribosomally synthesized and posttranslationally modified peptides (RiPPs), especially from microbial sources, are a large group of bioactive natural products that are a promising source of new (bio)chemistry and bioactivity.1 In light of exponentially increasing microbial genome databases and improved mass spectrometry (MS)-based metabolomic platforms, there is a need for computational tools that connect natural product genotypes predicted from microbial genome sequences with their corresponding chemotypes from metabolomic data sets. Here, we introduce RiPPquest, a tandem mass spectrometry database search tool for identification of microbial RiPPs, and apply it to lanthipeptide discovery. RiPPquest uses genomics to limit search space to the vicinity of RiPP biosynthetic genes and proteomics to analyze extensive peptide modifications and compute p-values of peptide-spectrum matches (PSMs). We highlight RiPPquest by connecting multiple RiPPs from extracts of Streptomyces to their gene clusters and by the discovery of a new class III lanthipeptide, informatipeptin, from Streptomyces viridochromogenes DSM 40736 to reflect that it is a natural product that was discovered by mass spectrometry based genome mining using algorithmic tools rather than manual inspection of mass spectrometry data and genetic information. The presented tool is available at cyclo.ucsd.edu.


Subject(s)
Databases, Genetic , Genome, Bacterial , Genomics/methods , Peptides/genetics , Ribosomes/genetics , Streptomyces/genetics , Amino Acid Sequence , Biological Products/metabolism , Drug Discovery/methods , Molecular Sequence Data , Multigene Family , Peptides/chemistry , Protein Processing, Post-Translational , Proteomics/methods , Ribosomes/chemistry , Streptomyces/chemistry , Tandem Mass Spectrometry/methods
18.
Proc Natl Acad Sci U S A ; 111(5): 1957-62, 2014 Feb 04.
Article in English | MEDLINE | ID: mdl-24449899

ABSTRACT

Recent developments in next-generation sequencing technologies have brought recognition of microbial genomes as a rich resource for novel natural product discovery. However, owing to the scarcity of efficient procedures to connect genes to molecules, only a small fraction of secondary metabolomes have been investigated to date. Transformation-associated recombination (TAR) cloning takes advantage of the natural in vivo homologous recombination of Saccharomyces cerevisiae to directly capture large genomic loci. Here we report a TAR-based genetic platform that allows us to directly clone, refactor, and heterologously express a silent biosynthetic pathway to yield a new antibiotic. With this method, which involves regulatory gene remodeling, we successfully expressed a 67-kb nonribosomal peptide synthetase biosynthetic gene cluster from the marine actinomycete Saccharomonospora sp. CNQ-490 and produced the dichlorinated lipopeptide antibiotic taromycin A in the model expression host Streptomyces coelicolor. The taromycin gene cluster (tar) is highly similar to the clinically approved antibiotic daptomycin from Streptomyces roseosporus, but has notable structural differences in three amino acid residues and the lipid side chain. With the activation of the tar gene cluster and production of taromycin A, this study highlights a unique "plug-and-play" approach to efficiently gaining access to orphan pathways that may open avenues for novel natural product discoveries and drug development.


Subject(s)
Biosynthetic Pathways/genetics , Daptomycin/analogs & derivatives , Lipopeptides/biosynthesis , Multigene Family/genetics , Chromatography, High Pressure Liquid , Cloning, Molecular , Daptomycin/biosynthesis , Daptomycin/chemistry , Gene Expression Regulation, Bacterial , Genes, Bacterial/genetics , Genes, Regulator/genetics , Genetic Complementation Test , Genetic Vectors/genetics , Lipopeptides/chemistry , Molecular Sequence Data , Physical Chromosome Mapping , Recombination, Genetic/genetics , Reproducibility of Results , Streptomyces/genetics
19.
Proc Natl Acad Sci U S A ; 110(47): E4407-16, 2013 Nov 19.
Article in English | MEDLINE | ID: mdl-24191063

ABSTRACT

Glycosyl groups are an essential mediator of molecular interactions in cells and on cellular surfaces. There are very few methods that directly relate sugar-containing molecules to their biosynthetic machineries. Here, we introduce glycogenomics as an experiment-guided genome-mining approach for fast characterization of glycosylated natural products (GNPs) and their biosynthetic pathways from genome-sequenced microbes by targeting glycosyl groups in microbial metabolomes. Microbial GNPs consist of aglycone and glycosyl structure groups in which the sugar unit(s) are often critical for the GNP's bioactivity, e.g., by promoting binding to a target biomolecule. GNPs are a structurally diverse class of molecules with important pharmaceutical and agrochemical applications. Herein, O- and N-glycosyl groups are characterized in their sugar monomers by tandem mass spectrometry (MS) and matched to corresponding glycosylation genes in secondary metabolic pathways by a MS-glycogenetic code. The associated aglycone biosynthetic genes of the GNP genotype then classify the natural product to further guide structure elucidation. We highlight the glycogenomic strategy by the characterization of several bioactive glycosylated molecules and their gene clusters, including the anticancer agent cinerubin B from Streptomyces sp. SPB74 and an antibiotic, arenimycin B, from Salinispora arenicola CNB-527.


Subject(s)
Biosynthetic Pathways/genetics , Data Mining/methods , Genes, Microbial/genetics , Genomics/methods , Metabolome , Tandem Mass Spectrometry/methods , Biological Products/chemistry , Chromatography, Gel , Chromatography, High Pressure Liquid , Glycosylation , Magnetic Resonance Spectroscopy , Molecular Structure
20.
Proc Natl Acad Sci U S A ; 110(28): E2611-20, 2013 Jul 09.
Article in English | MEDLINE | ID: mdl-23798442

ABSTRACT

The ability to correlate the production of specialized metabolites to the genetic capacity of the organism that produces such molecules has become an invaluable tool in aiding the discovery of biotechnologically applicable molecules. Here, we accomplish this task by matching molecular families with gene cluster families, making these correlations to 60 microbes at one time instead of connecting one molecule to one organism at a time, such as how it is traditionally done. We can correlate these families through the use of nanospray desorption electrospray ionization MS/MS, an ambient pressure MS technique, in conjunction with MS/MS networking and peptidogenomics. We matched the molecular families of peptide natural products produced by 42 bacilli and 18 pseudomonads through the generation of amino acid sequence tags from MS/MS data of specific clusters found in the MS/MS network. These sequence tags were then linked to biosynthetic gene clusters in publicly accessible genomes, providing us with the ability to link particular molecules with the genes that produced them. As an example of its use, this approach was applied to two unsequenced Pseudoalteromonas species, leading to the discovery of the gene cluster for a molecular family, the bromoalterochromides, in the previously sequenced strain P. piscicida JCM 20779(T). The approach itself is not limited to 60 related strains, because spectral networking can be readily adopted to look at molecular family-gene cluster families of hundreds or more diverse organisms in one single MS/MS network.


Subject(s)
Multigene Family , Tandem Mass Spectrometry/methods , Amino Acid Sequence , Bacillus/genetics , Bacterial Proteins/chemistry , Bacterial Proteins/genetics , Molecular Sequence Data , Peptides/chemistry , Peptides/genetics , Pseudomonas/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...