Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Nat Biotechnol ; 42(2): 243-246, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-37156916

RESUMO

As structure prediction methods are generating millions of publicly available protein structures, searching these databases is becoming a bottleneck. Foldseek aligns the structure of a query protein against a database by describing tertiary amino acid interactions within proteins as sequences over a structural alphabet. Foldseek decreases computation times by four to five orders of magnitude with 86%, 88% and 133% of the sensitivities of Dali, TM-align and CE, respectively.


Assuntos
Algoritmos , Proteínas , Bases de Dados de Proteínas , Proteínas/química , Aminoácidos , Software
2.
Nature ; 622(7983): 637-645, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37704730

RESUMO

Proteins are key to all cellular processes and their structure is important in understanding their function and evolution. Sequence-based predictions of protein structures have increased in accuracy1, and over 214 million predicted structures are available in the AlphaFold database2. However, studying protein structures at this scale requires highly efficient methods. Here, we developed a structural-alignment-based clustering algorithm-Foldseek cluster-that can cluster hundreds of millions of structures. Using this method, we have clustered all of the structures in the AlphaFold database, identifying 2.30 million non-singleton structural clusters, of which 31% lack annotations representing probable previously undescribed structures. Clusters without annotation tend to have few representatives covering only 4% of all proteins in the AlphaFold database. Evolutionary analysis suggests that most clusters are ancient in origin but 4% seem to be species specific, representing lower-quality predictions or examples of de novo gene birth. We also show how structural comparisons can be used to predict domain families and their relationships, identifying examples of remote structural similarity. On the basis of these analyses, we identify several examples of human immune-related proteins with putative remote homology in prokaryotic species, illustrating the value of this resource for studying protein function and evolution across the tree of life.


Assuntos
Algoritmos , Análise por Conglomerados , Proteínas , Homologia Estrutural de Proteína , Humanos , Bases de Dados de Proteínas , Proteínas/química , Proteínas/classificação , Proteínas/metabolismo , Alinhamento de Sequência , Anotação de Sequência Molecular , Células Procarióticas/química , Filogenia , Especificidade da Espécie , Evolução Molecular
3.
BMC Bioinformatics ; 24(1): 181, 2023 May 03.
Artigo em Inglês | MEDLINE | ID: mdl-37131131

RESUMO

BACKGROUND: Co-localized sets of genes that encode specialized functions are common across microbial genomes and occur in genomes of larger eukaryotes as well. Important examples include Biosynthetic Gene Clusters (BGCs) that produce specialized metabolites with medicinal, agricultural, and industrial value (e.g. antimicrobials). Comparative analysis of BGCs can aid in the discovery of novel metabolites by highlighting distribution and identifying variants in public genomes. Unfortunately, gene-cluster-level homology detection remains inaccessible, time-consuming and difficult to interpret. RESULTS: The comparative gene cluster analysis toolbox (CAGECAT) is a rapid and user-friendly platform to mitigate difficulties in comparative analysis of whole gene clusters. The software provides homology searches and downstream analyses without the need for command-line or programming expertise. By leveraging remote BLAST databases, which always provide up-to-date results, CAGECAT can yield relevant matches that aid in the comparison, taxonomic distribution, or evolution of an unknown query. The service is extensible and interoperable and implements the cblaster and clinker pipelines to perform homology search, filtering, gene neighbourhood estimation, and dynamic visualisation of resulting variant BGCs. With the visualisation module, publication-quality figures can be customized directly from a web-browser, which greatly accelerates their interpretation via informative overlays to identify conserved genes in a BGC query. CONCLUSION: Overall, CAGECAT is an extensible software that can be interfaced via a standard web-browser for whole region homology searches and comparison on continually updated genomes from NCBI. The public web server and installable docker image are open source and freely available without registration at: https://cagecat.bioinformatics.nl .


Assuntos
Computadores , Software , Família Multigênica , Genoma , Análise por Conglomerados
4.
J Nat Prod ; 86(3): 541-549, 2023 03 24.
Artigo em Inglês | MEDLINE | ID: mdl-36524608

RESUMO

Penicillium turbatum has previously been reported to produce A26771B, a 16-membered macrocyclic polyketide with activity against Gram-positive bacteria, mycoplasma, and fungi, as well as the structurally related compounds berkeleylactone E and berkeleylactones I-O. In this work, large-scale cultivation of P. turbatum NRRL 5630 on rice yielded seven new berkeleylactone analogues, berkeleylactone E methyl ester, 14-epi-berkeleylactone F, berkeleylactones P-R, 12-epi-berkeleylactone Q, and 13-epi-berkeleylactone R, and six previously reported analogues, A26771B and berkeleylactones E-G and J-K. The structures of the berkeleylactones were elucidated by detailed analysis of spectroscopic data, molecular modeling, and comparison with literature values. Interestingly, six of the berkeleylactone analogues were isolated as pairs of hydroxy epimers, highlighting how Nature can exploit stereodivergence in biosynthetic pathways to increase chemical diversity. The genome of P. turbatum was sequenced, and a putative gene cluster (bekl) responsible for the biosynthesis of the berkeleylactones was identified. The new berkeleylactone analogues exhibited no significant biological activity against a panel of bacteria, fungi, the parasite Giardia duodenalis, or NS-1 murine myeloma cells, suggesting a hitherto undiscovered biological role.


Assuntos
Penicillium , Camundongos , Animais , Estrutura Molecular , Hidroxilação , Penicillium/química
5.
Nucleic Acids Res ; 51(D1): D777-D784, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36271795

RESUMO

In phylogenomics the evolutionary relationship of organisms is studied by their genomic information. A common approach to phylogenomics is to extract related genes from each organism, build a multiple sequence alignment and then reconstruct evolution relations through a phylogenetic tree. Often a set of highly conserved genes occurring in single-copy, called core genes, are used for this analysis, as they allow efficient automation within a taxonomic clade. Here we introduce the Universal Fungal Core Genes (UFCG) database and pipeline for genome-wide phylogenetic analysis of fungi. The UFCG database consists of 61 curated fungal marker genes, including a novel set of 41 computationally derived core genes and 20 canonical genes derived from literature, as well as marker gene sequences extracted from publicly available fungal genomes. Furthermore, we provide an easy-to-use, fully automated and open-source pipeline for marker gene extraction, training and phylogenetic tree reconstruction. The UFCG pipeline can identify marker genes from genomic, proteomic and transcriptomic data, while producing phylogenies consistent with those previously reported, and is publicly available together with the UFCG database at https://ufcg.steineggerlab.com.


Assuntos
Bases de Dados Genéticas , Fungos , Fungos/classificação , Fungos/genética , Genes Fúngicos , Genoma Fúngico , Filogenia , Proteômica
6.
Fungal Biol Biotechnol ; 8(1): 13, 2021 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-34763725

RESUMO

BACKGROUND: Fungi are prolific producers of secondary metabolites (SMs), which are bioactive small molecules with important applications in medicine, agriculture and other industries. The backbones of a large proportion of fungal SMs are generated through the action of large, multi-domain megasynth(et)ases such as polyketide synthases (PKSs) and nonribosomal peptide synthetases (NRPSs). The structure of these backbones is determined by the domain architecture of the corresponding megasynth(et)ase, and thus accurate annotation and classification of these architectures is an important step in linking SMs to their biosynthetic origins in the genome. RESULTS: Here we report synthaser, a Python package leveraging the NCBI's conserved domain search tool for remote prediction and classification of fungal megasynth(et)ase domain architectures. Synthaser is capable of batch sequence analysis, and produces rich textual output and interactive visualisations which allow for quick assessment of the megasynth(et)ase diversity of a fungal genome. Synthaser uses a hierarchical rule-based classification system, which can be extensively customised by the user through a web application ( http://gamcil.github.io/synthaser ). We show that synthaser provides more accurate domain architecture predictions than comparable tools which rely on curated profile hidden Markov model (pHMM)-based approaches; the utilisation of the NCBI conserved domain database also allows for significantly greater flexibility compared to pHMM approaches. In addition, we demonstrate how synthaser can be applied to large scale genome mining pipelines through the construction of an Aspergillus PKS similarity network. CONCLUSIONS: Synthaser is an easy to use tool that represents a significant upgrade to previous domain architecture analysis tools. It is freely available under a MIT license from PyPI ( https://pypi.org/project/synthaser ) and GitHub ( https://github.com/gamcil/synthaser ).

7.
Bioinformatics ; 37(16): 2473-2475, 2021 Aug 25.
Artigo em Inglês | MEDLINE | ID: mdl-33459763

RESUMO

SUMMARY: Genes involved in biological pathways are often collocalised in gene clusters, the comparison of which can give valuable insights into their function and evolutionary history. However, comparison and visualization of gene cluster similarity is a tedious process, particularly when many clusters are being compared. Here, we present clinker, a Python based tool and clustermap.js, a companion JavaScript visualization library, which used together can automatically generate accurate, interactive, publication-quality gene cluster comparison figures directly from sequence files. AVAILABILITY AND IMPLEMENTATION: Source code and documentation for clinker and clustermap.js is available on GitHub (github.com/gamcil/clinker and github.com/gamcil/clustermap.js, respectively) under the MIT license. clinker can be installed directly from the Python Package Index via pip. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

8.
Bioinform Adv ; 1(1): vbab016, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-36700093

RESUMO

Motivation: Genes involved in coordinated biological pathways, including metabolism, drug resistance and virulence, are often collocalized as gene clusters. Identifying homologous gene clusters aids in the study of their function and evolution, however, existing tools are limited to searching local sequence databases. Tools for remotely searching public databases are necessary to keep pace with the rapid growth of online genomic data. Results: Here, we present cblaster, a Python-based tool to rapidly detect collocated genes in local and remote databases. cblaster is easy to use, offering both a command line and a user-friendly graphical user interface. It generates outputs that enable intuitive visualizations of large datasets and can be readily incorporated into larger bioinformatic pipelines. cblaster is a significant update to the comparative genomics toolbox. Availability and implementation: cblaster source code and documentation is freely available from GitHub under the MIT license (github.com/gamcil/cblaster). Supplementary information: Supplementary data are available at Bioinformatics Advances online.

9.
Org Biomol Chem ; 19(3): 587-595, 2021 01 28.
Artigo em Inglês | MEDLINE | ID: mdl-33242032

RESUMO

The hancockiamides are an unusual new family of N-cinnamoylated piperazines from the Australian soil fungus Aspergillus hancockii. Genomic analyses of A. hancockii identified a biosynthetic gene cluster (hkm) of 12 genes, including two single-module nonribosomal peptide synthetase (NRPS) genes. Heterologous expression of the hkm cluster in A. nidulans confirmed its role in the biosynthesis of the hancockiamides. We further demonstrated that a novel cytochrome P450, Hkm5, catalyses the methylenedioxy bridge formation, and that the PAL gene hkm12 is dispensable, but contributes to increased production of the cinnamoylated hancockiamides. In vitro enzymatic assays and substrate feeding studies demonstrated that NRPS Hkm11 activates and transfers trans-cinnamate to the piperazine scaffold and has flexibility to accept bioisosteric thienyl and furyl analogues. This is the first reported cinnamate-activating fungal NRPS. Expression of a truncated cluster lacking the acetyltransferase gene led to seven additional congeners, including an unexpected family of 2,5-dibenzylpiperazines. These pleiotropic effects highlight the plasticity of the pathway and the power of this approach for accessing novel natural product scaffolds.


Assuntos
Aspergillus/metabolismo , Peptídeo Sintases/metabolismo , Piperazinas/química , Piperazinas/metabolismo , Aspergillus/genética , Cinética , Família Multigênica/genética
10.
Proc Natl Acad Sci U S A ; 117(39): 24243-24250, 2020 09 29.
Artigo em Inglês | MEDLINE | ID: mdl-32929037

RESUMO

The necrotrophic fungal pathogen Cochliobolus victoriae produces victorin, a host-selective toxin (HST) essential for pathogenicity to certain oat cultivars with resistance against crown rust. Victorin is a mixture of highly modified heterodetic cyclic hexapeptides, previously assumed to be synthesized by a nonribosomal peptide synthetase. Herein, we demonstrate that victorin is a member of the ribosomally synthesized and posttranslationally modified peptide (RiPP) family of natural products. Analysis of a newly generated long-read assembly of the C. victoriae genome revealed three copies of precursor peptide genes (vicA1-3) with variable numbers of "GLKLAF" core peptide repeats corresponding to the victorin peptide backbone. vicA1-3 are located in repeat-rich gene-sparse regions of the genome and are loosely clustered with putative victorin biosynthetic genes, which are supported by the discovery of compact gene clusters harboring corresponding homologs in two distantly related plant-associated Sordariomycete fungi. Deletion of at least one copy of vicA resulted in strongly diminished victorin production. Deletion of a gene encoding a DUF3328 protein (VicYb) abolished the production altogether, supporting its predicted role in oxidative cyclization of the core peptide. In addition, we uncovered a copper amine oxidase (CAO) encoded by vicK, in which its deletion led to the accumulation of new glycine-containing victorin derivatives. The role of VicK in oxidative deamination of the N-terminal glycyl moiety of the hexapeptides to the active glyoxylate forms was confirmed in vitro. This study finally unraveled the genetic and molecular bases for biosynthesis of one of the first discovered HSTs and expanded our understanding of underexplored fungal RiPPs.


Assuntos
Ascomicetos/metabolismo , Proteínas Fúngicas/metabolismo , Micotoxinas/metabolismo , Ascomicetos/genética , Desaminação , Proteínas Fúngicas/genética , Proteínas Fúngicas/toxicidade , Deleção de Genes , Família Multigênica , Micotoxinas/genética , Micotoxinas/toxicidade , Biossíntese de Proteínas , Processamento de Proteína Pós-Traducional
11.
Fungal Genet Biol ; 143: 103435, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32702474

RESUMO

Aspergillus burnettii is a new species belonging to the A. alliaceus clade in Aspergillus subgenus Circumdati section Flavi isolated from peanut-growing properties in southern Queensland, Australia. A. burnettii is a fast-growing, floccose fungus with distinctive brown conidia and is a talented producer of biomass-degrading enzymes and secondary metabolites. Chemical profiling of A. burnettii revealed the metabolites ochratoxin A, kotanins, isokotanins, asperlicin E, anominine and paspalinine, which are common to subgenus Circumdati, together with burnettiene A, burnettramic acids, burnettides, and high levels of 14α-hydroxypaspalinine and hirsutide. The genome of A. burnettii was sequenced and an annotated draft genome is presented. A. burnettii is rich in secondary metabolite biosynthetic gene clusters, containing 51 polyketide synthases, 28 non-ribosomal peptide synthetases and 19 genes related to terpene biosynthesis. Functional annotation of digestive enzymes of A. burnettii and A. alliaceus revealed overlapping carbon utilisation profiles, consistent with a close phylogenetic relationship.


Assuntos
Aspergillus/genética , Vias Biossintéticas/genética , Peptídeo Sintases/genética , Filogenia , Aspergillus/classificação , Aspergillus/metabolismo , Classificação , Genômica , Família Multigênica/genética , Policetídeo Sintases/genética , Análise de Sequência de DNA
12.
ACS Synth Biol ; 9(7): 1843-1854, 2020 07 17.
Artigo em Inglês | MEDLINE | ID: mdl-32526136

RESUMO

Accessing the full biosynthetic potential encoded in the genomes of fungi is limited by the low expression of most biosynthetic gene clusters (BGCs) under common laboratory culture conditions. CRISPR-mediated transcriptional activation (CRISPRa) of fungal BGCs could accelerate genomics-driven bioactive secondary metabolite discovery. In this work, we established the first CRISPRa system for filamentous fungi. First, we constructed a CRISPR/dLbCas12a-VPR-based system and demonstrated the activation of a fluorescent reporter in Aspergillus nidulans. Then, we targeted the native nonribosomal peptide synthetase-like (NRPS-like) gene micA in both chromosomal and episomal contexts, achieving increased production of the compound microperfuranone. Finally, multigene CRISPRa led to the discovery of the mic cluster product as dehydromicroperfuranone. Additionally, we demonstrated the utility of the variant dLbCas12aD156R-VPR for CRISPRa at room temperature culture conditions. Different aspects that influence the efficiency of CRISPRa in fungi were investigated, providing a framework for the further development of fungal artificial transcription factors based on CRISPR/Cas.


Assuntos
Aspergillus nidulans/genética , Sistemas CRISPR-Cas , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas/genética , Descoberta de Drogas/métodos , Genes Fúngicos , Família Multigênica , Ativação Transcricional , Proteínas de Bactérias/genética , Proteína 9 Associada à CRISPR/genética , Proteínas Associadas a CRISPR/genética , Meios de Cultura , Endodesoxirribonucleases/genética , Firmicutes/enzimologia , Peptídeo Sintases/genética , Regiões Promotoras Genéticas , RNA Guia de Cinetoplastídeos/genética , Streptococcus pyogenes/enzimologia , Temperatura , Transcrição Gênica/genética
13.
J Am Chem Soc ; 142(15): 7145-7152, 2020 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-32182055

RESUMO

1-Benzazepine is a pharmaceutically important scaffold but is rare among natural products. Nanangelenin A (1), containing an unprecedented 3,4-dihydro-1-benzazepine-2,5-dione-N-prenyl-N-acetoxy-anthranilamide scaffold, was isolated from a novel species of Australian fungus, Aspergillus nanangensis. Genomic and retrobiosynthetic analyses identified a putative nonribosomal peptide synthetase (NRPS) gene cluster (nan). The detailed biosynthetic pathway to 1 was established by heterologous pathway reconstitution in A. nidulans, which led to biosynthesis of intermediates nanagelenin B-F (2-5 and 7). We demonstrated that the NRPS NanA incorporates anthranilic acid (Ant) and l-kynurenine (l-Kyn), which is supplied by a dedicated indoleamine-2,3-dioxygenase NanC encoded in the gene cluster. Using heterologous in vivo assays and mutagenesis, we demonstrated that the C-terminal condensation (CT) and thiolation (T3) domains of NanA are responsible for the regioselective cyclization of the tethered Ant-l-Kyn dipeptide to form the unusual benzazepine scaffold in 1. We also showed that NanA-CT catalyzes the regioselective cyclization of a surrogate synthetic substrate, Ant-l-Kyn-N-acetylcysteamine, to give the benzazepine scaffold, while spontaneous cyclization of the dipeptide yielded the alternative kinetically favored benzodiazepine scaffold. The discovery of 1 and the characterization of NanA have expanded the chemical and functional diversities of fungal NRPSs.


Assuntos
Alcaloides/metabolismo , Aspergillus/patogenicidade , Benzazepinas/síntese química , Cinurenina/metabolismo , Família Multigênica/genética , Benzazepinas/química , Catálise , Ciclização
14.
Beilstein J Org Chem ; 15: 2631-2643, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31807198

RESUMO

Chemical investigation of an undescribed Australian fungus, Aspergillus nanangensis, led to the identification of the nanangenines - a family of seven new and three previously reported drimane sesquiterpenoids. The structures of the nanangenines were elucidated by detailed spectroscopic analysis supported by single crystal X-ray diffraction studies. The compounds were assayed for in vitro activity against bacteria, fungi, mammalian cells and plants. Bioinformatics analysis, including comparative analysis with other acyl drimenol-producing Aspergilli, led to the identification of a putative nanangenine biosynthetic gene cluster that corresponds to the proposed biosynthetic pathway for nanangenines.

15.
Org Lett ; 21(5): 1287-1291, 2019 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-30735051

RESUMO

The burnettramic acids are a new class of antibiotics from an Australian fungus Aspergillus burnettii. The rare bolaamphiphilic scaffold consists of ß-d-mannose linked to a pyrrolizidinedione unit via a 26-carbon chain. The most abundant metabolite displayed potent in vitro antifungal activity. Comparative genomics identified the hybrid PKS-NRPS bua gene cluster, which was verified by heterologous pathway reconstitution in Aspergillus nidulans.


Assuntos
Antibacterianos/síntese química , Aspergillus/química , Compostos Heterocíclicos com 2 Anéis/química , Antibacterianos/biossíntese , Aspergillus/metabolismo , Austrália , Isomerismo , Manose/química , Estrutura Molecular , Família Multigênica , Oxirredução , Pirrolidinas/química , Metabolismo Secundário
16.
Org Biomol Chem ; 16(10): 1620-1626, 2018 03 07.
Artigo em Inglês | MEDLINE | ID: mdl-29393329

RESUMO

Fungi are a rich source of bioactive small molecules. However, the large number of biosynthetic gene clusters (BGCs) encoding these molecules in their genomes suggests their biosynthetic potential is far greater than we previously appreciated. The mining of fungal genomes therefore holds great promise for the discovery of new chemical entities for pharmaceutical and agricultural applications. As more and more fungal genomes become available, the accompanying number of BGCs is quickly becoming unmanageable. Along with improving molecular genetic tools to accelerate the translation of BGCs to small molecules, we must devise strategies to prioritise BGCs most likely to encode the biosynthesis of novel small molecules and molecules with new or improved bioactivities or functions. In this perspective, we discuss existing and emerging strategies for prioritisation of BGCs to increase the odds of fruitful genome mining in fungi.


Assuntos
Fungos/genética , Genoma Fúngico , Genômica/métodos , Família Multigênica , Descoberta de Drogas/métodos , Redes Reguladoras de Genes
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA