RESUMO
Although biosynthetic gene clusters (BGCs) have been discovered for hundreds of bacterial metabolites, our knowledge of their diversity remains limited. Here, we used a novel algorithm to systematically identify BGCs in the extensive extant microbial sequencing data. Network analysis of the predicted BGCs revealed large gene cluster families, the vast majority uncharacterized. We experimentally characterized the most prominent family, consisting of two subfamilies of hundreds of BGCs distributed throughout the Proteobacteria; their products are aryl polyenes, lipids with an aryl head group conjugated to a polyene tail. We identified a distant relationship to a third subfamily of aryl polyene BGCs, and together the three subfamilies represent the largest known family of biosynthetic gene clusters, with more than 1,000 members. Although these clusters are widely divergent in sequence, their small molecule products are remarkably conserved, indicating for the first time the important roles these compounds play in Gram-negative cell biology.
Assuntos
Algoritmos , Bactérias/genética , Bactérias/metabolismo , Bactérias/química , Bactérias/classificação , Mutação , Estresse Oxidativo , Filogenia , Metabolismo SecundárioRESUMO
Covering: Focus on 2015 to 2020Plant and soil microbiomes consist of diverse communities of organisms from across kingdoms and can profoundly affect plant growth and health. Natural product-based intercellular signals govern important interactions between microbiome members that ultimately regulate their beneficial or harmful impacts on the plant. Exploiting these evolved signalling circuits to engineer microbiomes towards beneficial interactions with crops is an attractive goal. There are few reports thus far of engineering the intercellular signalling of microbiomes, but this article argues that it represents a tremendous opportunity for advancing the field of microbiome engineering. This could be achieved through the selection of synergistic consortia in combination with genetic engineering of signal pathways to realise an optimised microbiome.
Assuntos
Microbiota , Solo , Bactérias/genética , Produtos Agrícolas , Raízes de Plantas , Microbiologia do SoloRESUMO
Polyketides are a class of specialised metabolites synthesised by both eukaryotes and prokaryotes. These chemically and structurally diverse molecules are heavily used in the clinic and include frontline antimicrobial and anticancer drugs such as erythromycin and doxorubicin. To replenish the clinicians' diminishing arsenal of bioactive molecules, a promising strategy aims at transferring polyketide biosynthetic pathways from their native producers into the biotechnologically desirable host Escherichia coli. This approach has been successful for type I modular polyketide synthases (PKSs); however, despite more than 3 decades of research, the large and important group of type II PKSs has until now been elusive in E. coli. Here, we report on a versatile polyketide biosynthesis pipeline, based on identification of E. coli-compatible type II PKSs. We successfully express 5 ketosynthase (KS) and chain length factor (CLF) pairs-e.g., from Photorhabdus luminescens TT01, Streptomyces resistomycificus, Streptoccocus sp. GMD2S, Pseudoalteromonas luteoviolacea, and Ktedonobacter racemifer-as soluble heterodimeric recombinant proteins in E. coli for the first time. We define the anthraquinone minimal PKS components and utilise this biosynthetic system to synthesise anthraquinones, dianthrones, and benzoisochromanequinones (BIQs). Furthermore, we demonstrate the tolerance and promiscuity of the anthraquinone heterologous biosynthetic pathway in E. coli to act as genetically applicable plug-and-play scaffold, showing it to function successfully when combined with enzymes from phylogenetically distant species, endophytic fungi and plants, which resulted in 2 new-to-nature compounds, neomedicamycin and neochaetomycin. This work enables plug-and-play combinatorial biosynthesis of aromatic polyketides using bacterial type II PKSs in E. coli, providing full access to its many advantages in terms of easy and fast genetic manipulation, accessibility for high-throughput robotics, and convenient biotechnological scale-up. Using the synthetic and systems biology toolbox, this plug-and-play biosynthetic platform can serve as an engine for the production of new and diversified bioactive polyketides in an automated, rapid, and versatile fashion.
Assuntos
Antraquinonas/metabolismo , Proteínas de Bactérias/metabolismo , Escherichia coli/metabolismo , Hidrocarbonetos Policíclicos Aromáticos/metabolismo , Policetídeo Sintases/metabolismo , Policetídeos/metabolismo , Proteínas Recombinantes/metabolismo , 3-Oxoacil-(Proteína de Transporte de Acila) Sintase/classificação , 3-Oxoacil-(Proteína de Transporte de Acila) Sintase/genética , 3-Oxoacil-(Proteína de Transporte de Acila) Sintase/metabolismo , Antraquinonas/química , Proteínas de Bactérias/química , Proteínas de Bactérias/genética , Vias Biossintéticas , Escherichia coli/genética , Modelos Químicos , Estrutura Molecular , Filogenia , Hidrocarbonetos Policíclicos Aromáticos/química , Policetídeo Sintases/química , Policetídeo Sintases/genética , Policetídeos/química , Proteínas Recombinantes/químicaRESUMO
Metabolic engineering technologies have been employed with increasing success over the last three decades for the engineering and optimization of industrial host strains to competitively produce high-value chemical targets. To this end, continued reductions in the time taken from concept, to development, to scale-up are essential. Design-Build-Test-Learn pipelines that are able to rapidly deliver diverse chemical targets through iterative optimization of microbial production strains have been established. Biofoundries are employing in silico tools for the design of genetic parts, alongside combinatorial design of experiments approaches to optimize selection from within the potential design space of biological circuits based on multi-criteria objectives. These genetic constructs can then be built and tested through automated laboratory workflows, with performance data analysed in the learn phase to inform further design. Successful examples of rapid prototyping processes for microbially produced compounds reveal the potential role of biofoundries in leading the sustainable production of next-generation bio-based chemicals.
Assuntos
Bactérias/genética , Produtos Biológicos/metabolismo , Microbiologia Industrial/métodos , Engenharia Metabólica/métodos , Redes e Vias Metabólicas/genética , Biologia Sintética/métodos , Bactérias/metabolismo , Biotecnologia/métodos , Escherichia coli/genética , Escherichia coli/metabolismo , Regulação da Expressão Gênica , Plasmídeos/genética , Plasmídeos/metabolismoRESUMO
Antibiotic production is coordinated in the Streptomyces coelicolor population through the use of diffusible signaling molecules of the γ-butyrolactone (GBL) family. The GBL regulatory system involves a small, and not completely defined two-gene network which governs a potentially bi-stable switch between the "on" and "off" states of antibiotic production. The use of this circuit as a tool for synthetic biology has been hampered by a lack of mechanistic understanding of its functionality. We here present the creation and analysis of a versatile and adaptable ensemble model of the Streptomyces GBL system (detailed information on all model mechanisms and parameters is documented in http://www.systemsbiology.ls.manchester.ac.uk/wiki/index.php/Main_Page). We use the model to explore a range of previously proposed mechanistic hypotheses, including transcriptional interference, antisense RNA interactions between the mRNAs of the two genes, and various alternative regulatory activities. Our results suggest that transcriptional interference alone is not sufficient to explain the system's behavior. Instead, antisense RNA interactions seem to be the system's driving force, combined with an aggressive scbR promoter. The computational model can be used to further challenge and refine our understanding of the system's activity and guide future experimentation.
Assuntos
4-Butirolactona/metabolismo , Streptomyces coelicolor/metabolismo , Antibacterianos , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Simulação por Computador , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Regulação Bacteriana da Expressão Gênica , Redes Reguladoras de Genes , Regiões Promotoras Genéticas , RNA Antissenso/metabolismo , RNA Mensageiro/metabolismo , Streptomyces coelicolor/genética , Biologia SintéticaRESUMO
Bio-based production of industrial chemicals using synthetic biology can provide alternative green routes from renewable resources, allowing for cleaner production processes. To efficiently produce chemicals on-demand through microbial strain engineering, biomanufacturing foundries have developed automated pipelines that are largely compound agnostic in their time to delivery. Here we benchmark the capabilities of a biomanufacturing pipeline to enable rapid prototyping of microbial cell factories for the production of chemically diverse industrially relevant material building blocks. Over 85 days the pipeline was able to produce 17 potential material monomers and key intermediates by combining 160 genetic parts into 115 unique biosynthetic pathways. To explore the scale-up potential of our prototype production strains, we optimized the enantioselective production of mandelic acid and hydroxymandelic acid, achieving gram-scale production in fed-batch fermenters. The high success rate in the rapid design and prototyping of microbially-produced material building blocks reveals the potential role of biofoundries in leading the transition to sustainable materials production.
Assuntos
Bactérias/metabolismo , Microbiologia Industrial/métodos , Engenharia Metabólica/métodos , Benchmarking , Vias Biossintéticas , Indústria Química , Simulação por Computador , Fermentação , Ácidos Mandélicos/metabolismo , EstereoisomerismoRESUMO
CRISPR-Cas9 has proven as a very powerful gene editing tool for Actinomyces, allowing scarless and precise genome editing in selected strains of these biotechnologically relevant microorganisms. However, its general application in actinomycetes has been limited due to its inefficacy when applying the system in an untested strain. Here, we provide evidence of how Cas9 levels are toxic for the model actinomycetes Streptomyces coelicolor M145 and Streptomyces lividans TK24, which show delayed or absence of growth. We overcame this toxicity by lowering Cas9 levels and have generated a set of plasmids in which Cas9 expression is either controlled by theophylline-inducible or constitutive promoters. We validated the targeting of these CRISPR-Cas9 system using the glycerol uptake operon and the actinorhodin biosynthesis gene cluster. Our results highlight the importance of adjusting Cas9 expression levels specifically in strains to gain optimum and efficient gene editing in Actinomyces.
Assuntos
Sistemas CRISPR-Cas , Recombinação Genética , Streptomyces/genética , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Família Multigênica , Plasmídeos/genética , Streptomyces/metabolismo , Streptomyces coelicolor/genética , Streptomyces coelicolor/metabolismoRESUMO
In a typical untargeted metabolomics experiment, the huge amount of complex data generated by mass spectrometry necessitates automated tools for the extraction of useful biological information. Each metabolite generates numerous mass spectrometry features. The association of these experimental features to the underlying metabolites still represents one of the major bottlenecks in metabolomics data processing. While certain identification (e.g., by comparison to authentic standards) is always desirable, it is usually achievable only for a limited number of compounds, and scientists often deal with a significant amount of putatively annotated metabolites. The confidence in a specific annotation is usually assessed by considering different sources of information (e.g., isotope patterns, adduct formation, chromatographic retention times, and fragmentation patterns). IPA (integrated probabilistic annotation) offers a rigorous and reproducible method to automatically annotate metabolite profiles and evaluate the resulting confidence of the putative annotations. It is able to provide a rigorous measure of our confidence in any putative annotation and is also able to update and refine our beliefs (i.e., background prior knowledge) by incorporating different sources of information in the annotation process, such as isotope patterns, adduct formation and biochemical relations. The IPA package is freely available on GitHub ( https://github.com/francescodc87/IPA ), together with the related extensive documentation.
Assuntos
Metaboloma , Metabolômica/métodos , Algoritmos , Teorema de Bayes , Cromatografia Líquida de Alta Pressão , Escherichia coli/metabolismo , Marcação por Isótopo , Espectrometria de Massas por Ionização por Electrospray , Tirosina/metabolismo , Interface Usuário-ComputadorRESUMO
Summary: Synthetic biology applies the principles of engineering to biology in order to create biological functionalities not seen before in nature. One of the most exciting applications of synthetic biology is the design of new organisms with the ability to produce valuable chemicals including pharmaceuticals and biomaterials in a greener; sustainable fashion. Selecting the right enzymes to catalyze each reaction step in order to produce a desired target compound is, however, not trivial. Here, we present Selenzyme, a free online enzyme selection tool for metabolic pathway design. The user is guided through several decision steps in order to shortlist the best candidates for a given pathway step. The tool graphically presents key information about enzymes based on existing databases and tools such as: similarity of sequences and of catalyzed reactions; phylogenetic distance between source organism and intended host species; multiple alignment highlighting conserved regions, predicted catalytic site, and active regions and relevant properties such as predicted solubility and transmembrane regions. Selenzyme provides bespoke sequence selection for automated workflows in biofoundries. Availability and implementation: The tool is integrated as part of the pathway design stage into the design-build-test-learn SYNBIOCHEM pipeline. The Selenzyme web server is available at http://selenzyme.synbiochem.co.uk. Supplementary information: Supplementary data are available at Bioinformatics online.
Assuntos
Redes e Vias Metabólicas , Software , Biologia Sintética/métodos , Bases de Dados Factuais , Enzimas/genética , Internet , FilogeniaRESUMO
Many antibiotics, chemotherapeutics, crop protection agents and food preservatives originate from molecules produced by bacteria, fungi or plants. In recent years, genome mining methodologies have been widely adopted to identify and characterize the biosynthetic gene clusters encoding the production of such compounds. Since 2011, the 'antibiotics and secondary metabolite analysis shell-antiSMASH' has assisted researchers in efficiently performing this, both as a web server and a standalone tool. Here, we present the thoroughly updated antiSMASH version 4, which adds several novel features, including prediction of gene cluster boundaries using the ClusterFinder method or the newly integrated CASSIS algorithm, improved substrate specificity prediction for non-ribosomal peptide synthetase adenylation domains based on the new SANDPUMA algorithm, improved predictions for terpene and ribosomally synthesized and post-translationally modified peptides cluster products, reporting of sequence similarity to proteins encoded in experimentally characterized gene clusters on a per-protein basis and a domain-level alignment tool for comparative analysis of trans-AT polyketide synthase assembly line architectures. Additionally, several usability features have been updated and improved. Together, these improvements make antiSMASH up-to-date with the latest developments in natural product research and will further facilitate computational genome mining for the discovery of novel bioactive molecules.
Assuntos
Metabolismo Secundário/genética , Software , Algoritmos , Antibacterianos/biossíntese , Produtos Biológicos/metabolismo , Vias Biossintéticas/genética , Códon , Genes , Internet , Peptídeo Sintases/metabolismo , Peptídeos/química , Peptídeos/metabolismo , Policetídeo Sintases/química , Domínios Proteicos , Processamento de Proteína Pós-Traducional , Terpenos/químicaRESUMO
BACKGROUND: Streptomyces species produce a vast diversity of secondary metabolites of clinical and biotechnological importance, in particular antibiotics. Recent developments in metabolic engineering, synthetic and systems biology have opened new opportunities to exploit Streptomyces secondary metabolism, but achieving industry-level production without time-consuming optimization has remained challenging. Genome-scale metabolic modelling has been shown to be a powerful tool to guide metabolic engineering strategies for accelerated strain optimization, and several generations of models of Streptomyces metabolism have been developed for this purpose. RESULTS: Here, we present the most recent update of a genome-scale stoichiometric constraint-based model of the metabolism of Streptomyces coelicolor, the major model organism for the production of antibiotics in the genus. We show that the updated model enables better metabolic flux and biomass predictions and facilitates the integrative analysis of multi-omics data such as transcriptomics, proteomics and metabolomics. CONCLUSIONS: The updated model presented here provides an enhanced basis for the next generation of metabolic engineering attempts in Streptomyces.
Assuntos
Modelos Biológicos , Streptomyces coelicolor/metabolismo , Antibacterianos/metabolismo , Proteínas de Bactérias/metabolismo , Biomassa , Regulação Bacteriana da Expressão Gênica , Engenharia Metabólica , Proteômica , Streptomyces coelicolor/genética , Streptomyces coelicolor/crescimento & desenvolvimentoRESUMO
The rapid increase of publicly available microbial genome sequences has highlighted the presence of hundreds of thousands of biosynthetic gene clusters (BGCs) encoding valuable secondary metabolites. The experimental characterization of new BGCs is extremely laborious and struggles to keep pace with the in silico identification of potential BGCs. Therefore, the prioritisation of promising candidates among computationally predicted BGCs represents a pressing need. Here, we propose an output ordering and prioritisation system (OOPS) which helps sorting identified BGCs by a wide variety of custom-weighted biological and biochemical criteria in a flexible and user-friendly interface. OOPS facilitates a judicious prioritisation of BGCs using G+C content, coding sequence length, gene number, cluster self-similarity and codon bias parameters, as well as enabling the user to rank BGCs based upon BGC type, novelty, and taxonomic distribution. Effective prioritisation of BGCs will help to reduce experimental attrition rates and improve the breadth of bioactive metabolites characterized.
Assuntos
Produtos Biológicos/metabolismo , Vias Biossintéticas/genética , Biologia Computacional/métodos , Família Multigênica , Perfilação da Expressão Gênica , Metabolismo Secundário/genéticaRESUMO
Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software.
Assuntos
Bactérias/genética , Fungos/genética , Metabolismo Secundário/genética , Software , Algoritmos , Bactérias/metabolismo , Vias Biossintéticas/genética , Domínio Catalítico , Mineração de Dados , Enzimas/química , Fungos/metabolismo , Genômica/métodos , Internet , PolicetídeosRESUMO
Covering: 2000 to 2016Progress in synthetic biology is enabled by powerful bioinformatics tools allowing the integration of the design, build and test stages of the biological engineering cycle. In this review we illustrate how this integration can be achieved, with a particular focus on natural products discovery and production. Bioinformatics tools for the DESIGN and BUILD stages include tools for the selection, synthesis, assembly and optimization of parts (enzymes and regulatory elements), devices (pathways) and systems (chassis). TEST tools include those for screening, identification and quantification of metabolites for rapid prototyping. The main advantages and limitations of these tools as well as their interoperability capabilities are highlighted.
Assuntos
Produtos Biológicos , Biologia Sintética , Biologia Computacional , Estrutura MolecularRESUMO
The Manchester Synthetic Biology Research Centre (SYNBIOCHEM) is a foundry for the biosynthesis and sustainable production of fine and speciality chemicals. The Centre's integrated technology platforms provide a unique capability to facilitate predictable engineering of microbial bio-factories for chemicals production. An overview of these capabilities is described.
Assuntos
Engenharia Metabólica , Biologia Sintética , Reino Unido , UniversidadesRESUMO
Bacterial secondary metabolites are widely used as antibiotics, anticancer drugs, insecticides and food additives. Attempts to engineer their biosynthetic gene clusters (BGCs) to produce unnatural metabolites with improved properties are often frustrated by the unpredictability and complexity of the enzymes that synthesize these molecules, suggesting that genetic changes within BGCs are limited by specific constraints. Here, by performing a systematic computational analysis of BGC evolution, we derive evidence for three findings that shed light on the ways in which, despite these constraints, nature successfully invents new molecules: 1) BGCs for complex molecules often evolve through the successive merger of smaller sub-clusters, which function as independent evolutionary entities. 2) An important subset of polyketide synthases and nonribosomal peptide synthetases evolve by concerted evolution, which generates sets of sequence-homogenized domains that may hold promise for engineering efforts since they exhibit a high degree of functional interoperability, 3) Individual BGC families evolve in distinct ways, suggesting that design strategies should take into account family-specific functional constraints. These findings suggest novel strategies for using synthetic biology to rationally engineer biosynthetic pathways.
Assuntos
Bioengenharia/métodos , Biologia Computacional/métodos , Evolução Molecular , Modelos Genéticos , Família Multigênica/genéticaRESUMO
Nonribosomally and ribosomally synthesized bioactive peptides constitute a source of molecules of great biomedical importance, including antibiotics such as penicillin, immunosuppressants such as cyclosporine, and cytostatics such as bleomycin. Recently, an innovative mass-spectrometry-based strategy, peptidogenomics, has been pioneered to effectively mine microbial strains for novel peptidic metabolites. Even though mass-spectrometric peptide detection can be performed quite fast, true high-throughput natural product discovery approaches have still been limited by the inability to rapidly match the identified tandem mass spectra to the gene clusters responsible for the biosynthesis of the corresponding compounds. With Pep2Path, we introduce a software package to fully automate the peptidogenomics approach through the rapid Bayesian probabilistic matching of mass spectra to their corresponding biosynthetic gene clusters. Detailed benchmarking of the method shows that the approach is powerful enough to correctly identify gene clusters even in data sets that consist of hundreds of genomes, which also makes it possible to match compounds from unsequenced organisms to closely related biosynthetic gene clusters in other genomes. Applying Pep2Path to a data set of compounds without known biosynthesis routes, we were able to identify candidate gene clusters for the biosynthesis of five important compounds. Notably, one of these clusters was detected in a genome from a different subphylum of Proteobacteria than that in which the molecule had first been identified. All in all, our approach paves the way towards high-throughput discovery of novel peptidic natural products. Pep2Path is freely available from http://pep2path.sourceforge.net/, implemented in Python, licensed under the GNU General Public License v3 and supported on MS Windows, Linux and Mac OS X.
Assuntos
Produtos Biológicos , Mineração de Dados/métodos , Genômica/métodos , Peptídeos/genética , Software , Espectrometria de Massas em Tandem/métodos , Algoritmos , Sequência de Aminoácidos , Bactérias/química , Bactérias/genética , Sequência de Bases , Teorema de Bayes , Bases de Dados Genéticas , Dados de Sequência Molecular , Peptídeos/químicaRESUMO
Microbial secondary metabolites are a potent source of antibiotics and other pharmaceuticals. Genome mining of their biosynthetic gene clusters has become a key method to accelerate their identification and characterization. In 2011, we developed antiSMASH, a web-based analysis platform that automates this process. Here, we present the highly improved antiSMASH 2.0 release, available at http://antismash.secondarymetabolites.org/. For the new version, antiSMASH was entirely re-designed using a plug-and-play concept that allows easy integration of novel predictor or output modules. antiSMASH 2.0 now supports input of multiple related sequences simultaneously (multi-FASTA/GenBank/EMBL), which allows the analysis of draft genomes comprising multiple contigs. Moreover, direct analysis of protein sequences is now possible. antiSMASH 2.0 has also been equipped with the capacity to detect additional classes of secondary metabolites, including oligosaccharide antibiotics, phenazines, thiopeptides, homo-serine lactones, phosphonates and furans. The algorithm for predicting the core structure of the cluster end product is now also covering lantipeptides, in addition to polyketides and non-ribosomal peptides. The antiSMASH ClusterBlast functionality has been extended to identify sub-clusters involved in the biosynthesis of specific chemical building blocks. The new features currently make antiSMASH 2.0 the most comprehensive resource for identifying and analyzing novel secondary metabolite biosynthetic pathways in microorganisms.
Assuntos
Vias Biossintéticas/genética , Software , Bactérias/genética , Bactérias/metabolismo , Mineração de Dados , Fungos/genética , Fungos/metabolismo , Genoma Bacteriano , Genoma Fúngico , Genômica , InternetRESUMO
The genes encoding many biomolecular systems and pathways are genomically organized in operons or gene clusters. With MultiGeneBlast, we provide a user-friendly and effective tool to perform homology searches with operons or gene clusters as basic units, instead of single genes. The contextualization offered by MultiGeneBlast allows users to get a better understanding of the function, evolutionary history, and practical applications of such genomic regions. The tool is fully equipped with applications to generate search databases from GenBank or from the user's own sequence data. Finally, an architecture search mode allows searching for gene clusters with novel configurations, by detecting genomic regions with any user-specified combination of genes. Sources, precompiled binaries, and a graphical tutorial of MultiGeneBlast are freely available from http://multigeneblast.sourceforge.net/.
Assuntos
Bases de Dados Genéticas , Genômica/métodos , Família Multigênica/genética , Óperon/genética , SoftwareRESUMO
Wild-type Streptomyces coelicolor A3(2) produces aminobacteriohopanetriol as the only elongated C35 hopanoid. The hopanoid phenotype of two mutants bearing a deletion of genes from a previously identified hopanoid biosynthesis gene cluster provides clues to the formation of C35 bacteriohopanepolyols. orf14 encodes a putative nucleosidase; its deletion induces the accumulation of adenosylhopane as it cannot be converted into ribosylhopane. orf18 encodes a putative transaminase; its deletion results in the accumulation of adenosylhopane, ribosylhopane, and bacteriohopanetetrol. Ribosylhopane was postulated twenty years ago as a precursor for bacterial hopanoids but was never identified in a bacterium. Absence of the transaminase encoded by orf18 prevents the reductive amination of ribosylhopane into aminobacteriohopanetriol and induces its accumulation. Its reduction by an aldose-reductase-like enzyme produces bacteriohopanetetrol, which is normally not present in S. coelicolor.