RESUMO
The extraordinary diversity of viruses infecting bacteria and archaea is now primarily studied through metagenomics. While metagenomes enable high-throughput exploration of the viral sequence space, metagenome-derived sequences lack key information compared to isolated viruses, in particular host association. Different computational approaches are available to predict the host(s) of uncultivated viruses based on their genome sequences, but thus far individual approaches are limited either in precision or in recall, i.e., for a number of viruses they yield erroneous predictions or no prediction at all. Here, we describe iPHoP, a two-step framework that integrates multiple methods to reliably predict host taxonomy at the genus rank for a broad range of viruses infecting bacteria and archaea, while retaining a low false discovery rate. Based on a large dataset of metagenome-derived virus genomes from the IMG/VR database, we illustrate how iPHoP can provide extensive host prediction and guide further characterization of uncultivated viruses.
Assuntos
Archaea , Vírus , Archaea/genética , Metagenoma/genética , Vírus/genética , Bactérias/genética , Metagenômica/métodos , Aprendizado de Máquina , Genoma Viral/genéticaRESUMO
Most of the described species in kingdom Fungi are contained in two phyla, the Ascomycota and the Basidiomycota (subkingdom Dikarya). As a result, our understanding of the biology of the kingdom is heavily influenced by traits observed in Dikarya, such as aerial spore dispersal and life cycles dominated by mitosis of haploid nuclei. We now appreciate that Fungi comprises numerous phylum-level lineages in addition to those of Dikarya, but the phylogeny and genetic characteristics of most of these lineages are poorly understood due to limited genome sampling. Here, we addressed major evolutionary trends in the non-Dikarya fungi by phylogenomic analysis of 69 newly generated draft genome sequences of the zoosporic (flagellated) lineages of true fungi. Our phylogeny indicated five lineages of zoosporic fungi and placed Blastocladiomycota, which has an alternation of haploid and diploid generations, as branching closer to the Dikarya than to the Chytridiomyceta. Our estimates of heterozygosity based on genome sequence data indicate that the zoosporic lineages plus the Zoopagomycota are frequently characterized by diploid-dominant life cycles. We mapped additional traits, such as ancestral cell-cycle regulators, cell-membrane- and cell-wall-associated genes, and the use of the amino acid selenocysteine on the phylogeny and found that these ancestral traits that are shared with Metazoa have been subject to extensive parallel loss across zoosporic lineages. Together, our results indicate a gradual transition in the genetics and cell biology of fungi from their ancestor and caution against assuming that traits measured in Dikarya are typical of other fungal lineages.
Assuntos
Fungos , Estágios do Ciclo de Vida , Filogenia , Diploide , Fungos/classificação , Fungos/genética , Genoma Fúngico/genéticaRESUMO
As actors of global carbon cycle, Agaricomycetes (Basidiomycota) have developed complex enzymatic machineries that allow them to decompose all plant polymers, including lignin. Among them, saprotrophic Agaricales are characterized by an unparalleled diversity of habitats and lifestyles. Comparative analysis of 52 Agaricomycetes genomes (14 of them sequenced de novo) reveals that Agaricales possess a large diversity of hydrolytic and oxidative enzymes for lignocellulose decay. Based on the gene families with the predicted highest evolutionary rates-namely cellulose-binding CBM1, glycoside hydrolase GH43, lytic polysaccharide monooxygenase AA9, class-II peroxidases, glucose-methanol-choline oxidase/dehydrogenases, laccases, and unspecific peroxygenases-we reconstructed the lifestyles of the ancestors that led to the extant lignocellulose-decomposing Agaricomycetes. The changes in the enzymatic toolkit of ancestral Agaricales are correlated with the evolution of their ability to grow not only on wood but also on leaf litter and decayed wood, with grass-litter decomposers as the most recent eco-physiological group. In this context, the above families were analyzed in detail in connection with lifestyle diversity. Peroxidases appear as a central component of the enzymatic toolkit of saprotrophic Agaricomycetes, consistent with their essential role in lignin degradation and high evolutionary rates. This includes not only expansions/losses in peroxidase genes common to other basidiomycetes but also the widespread presence in Agaricales (and Russulales) of new peroxidases types not found in wood-rotting Polyporales, and other Agaricomycetes orders. Therefore, we analyzed the peroxidase evolution in Agaricomycetes by ancestral-sequence reconstruction revealing several major evolutionary pathways and mapped the appearance of the different enzyme types in a time-calibrated species tree.
Assuntos
Agaricales/genética , Genoma Fúngico , Lignina/metabolismo , Peroxidases/genética , Filogenia , Agaricales/enzimologia , Ecossistema , Família Multigênica , Peroxidases/metabolismoRESUMO
The ectomycorrhizal (ECM) symbiosis has independently evolved from diverse types of saprotrophic ancestors. In this study, we seek to identify genomic signatures of the transition to the ECM habit within the hyperdiverse Russulaceae. We present comparative analyses of the genomic architecture and the total and secreted gene repertoires of 18 species across the order Russulales, of which 13 are newly sequenced, including a representative of a saprotrophic member of Russulaceae, Gloeopeniophorella convolvens. The genomes of ECM Russulaceae are characterized by a loss of genes for plant cell wall-degrading enzymes (PCWDEs), an expansion of genome size through increased transposable element (TE) content, a reduction in secondary metabolism clusters, and an association of small secreted proteins (SSPs) with TE 'nests', or dense aggregations of TEs. Some PCWDEs have been retained or even expanded, mostly in a species-specific manner. The genome of G. convolvens possesses some characteristics of ECM genomes (e.g. loss of some PCWDEs, TE expansion, reduction in secondary metabolism clusters). Functional specialization in ECM decomposition may drive diversification. Accelerated gene evolution predates the evolution of the ECM habit, indicating that changes in genome architecture and gene content may be necessary to prime the evolutionary switch.
Assuntos
Agaricales , Micorrizas , Agaricales/genética , Elementos de DNA Transponíveis/genética , Evolução Molecular , Hábitos , Micorrizas/genética , Filogenia , Simbiose/genéticaRESUMO
Evolution of lignocellulose decomposition was one of the most ecologically important innovations in fungi. White-rot fungi in the Agaricomycetes (mushrooms and relatives) are the most effective microorganisms in degrading both cellulose and lignin components of woody plant cell walls (PCW). However, the precise evolutionary origins of lignocellulose decomposition are poorly understood, largely because certain early-diverging clades of Agaricomycetes and its sister group, the Dacrymycetes, have yet to be sampled, or have been undersampled, in comparative genomic studies. Here, we present new genome sequences of ten saprotrophic fungi, including members of the Dacrymycetes and early-diverging clades of Agaricomycetes (Cantharellales, Sebacinales, Auriculariales, and Trechisporales), which we use to refine the origins and evolutionary history of the enzymatic toolkit of lignocellulose decomposition. We reconstructed the origin of ligninolytic enzymes, focusing on class II peroxidases (AA2), as well as enzymes that attack crystalline cellulose. Despite previous reports of white rot appearing as early as the Dacrymycetes, our results suggest that white-rot fungi evolved later in the Agaricomycetes, with the first class II peroxidases reconstructed in the ancestor of the Auriculariales and residual Agaricomycetes. The exemplars of the most ancient clades of Agaricomycetes that we sampled all lack class II peroxidases, and are thus concluded to use a combination of plesiomorphic and derived PCW degrading enzymes that predate the evolution of white rot.
Assuntos
Agaricales/genética , Genômica , Lignina/genética , Basidiomycota/genética , Evolução Molecular , Genoma Fúngico , Anotação de Sequência Molecular , Peroxidases/genética , FilogeniaRESUMO
Organisms across the tree of life use a variety of mechanisms to respond to stress-inducing fluctuations in osmotic conditions. Cellular response mechanisms and phenotypes associated with osmoadaptation also play important roles in bacterial virulence, human health, agricultural production and many other biological systems. To improve understanding of osmoadaptive strategies, we have generated 59 high-quality draft genomes for the haloarchaea (a euryarchaeal clade whose members thrive in hypersaline environments and routinely experience drastic changes in environmental salinity) and analyzed these new genomes in combination with those from 21 previously sequenced haloarchaeal isolates. We propose a generalized model for haloarchaeal management of cytoplasmic osmolarity in response to osmotic shifts, where potassium accumulation and sodium expulsion during osmotic upshock are accomplished via secondary transport using the proton gradient as an energy source, and potassium loss during downshock is via a combination of secondary transport and non-specific ion loss through mechanosensitive channels. We also propose new mechanisms for magnesium and chloride accumulation. We describe the expansion and differentiation of haloarchaeal general transcription factor families, including two novel expansions of the TATA-binding protein family, and discuss their potential for enabling rapid adaptation to environmental fluxes. We challenge a recent high-profile proposal regarding the evolutionary origins of the haloarchaea by showing that inclusion of additional genomes significantly reduces support for a proposed large-scale horizontal gene transfer into the ancestral haloarchaeon from the bacterial domain. The combination of broad (17 genera) and deep (≥5 species in four genera) sampling of a phenotypically unified clade has enabled us to uncover both highly conserved and specialized features of osmoadaptation. Finally, we demonstrate the broad utility of such datasets, for metagenomics, improvements to automated gene annotation and investigations of evolutionary processes.
Assuntos
Adaptação Fisiológica/genética , Archaea/genética , Metagenômica , Proteína de Ligação a TATA-Box/genética , Sequência de Bases , Evolução Molecular , Genoma Arqueal , Humanos , Anotação de Sequência Molecular , Concentração Osmolar , Filogenia , SalinidadeRESUMO
Wood decay mechanisms in Agaricomycotina have been traditionally separated in two categories termed white and brown rot. Recently the accuracy of such a dichotomy has been questioned. Here, we present the genome sequences of the white-rot fungus Cylindrobasidium torrendii and the brown-rot fungus Fistulina hepatica both members of Agaricales, combining comparative genomics and wood decay experiments. C. torrendii is closely related to the white-rot root pathogen Armillaria mellea, while F. hepatica is related to Schizophyllum commune, which has been reported to cause white rot. Our results suggest that C. torrendii and S. commune are intermediate between white-rot and brown-rot fungi, but at the same time they show characteristics of decay that resembles soft rot. Both species cause weak wood decay and degrade all wood components but leave the middle lamella intact. Their gene content related to lignin degradation is reduced, similar to brown-rot fungi, but both have maintained a rich array of genes related to carbohydrate degradation, similar to white-rot fungi. These characteristics appear to have evolved from white-rot ancestors with stronger ligninolytic ability. F. hepatica shows characteristics of brown rot both in terms of wood decay genes found in its genome and the decay that it causes. However, genes related to cellulose degradation are still present, which is a plesiomorphic characteristic shared with its white-rot ancestors. Four wood degradation-related genes, homologs of which are frequently lost in brown-rot fungi, show signs of pseudogenization in the genome of F. hepatica. These results suggest that transition toward a brown-rot lifestyle could be an ongoing process in F. hepatica. Our results reinforce the idea that wood decay mechanisms are more diverse than initially thought and that the dichotomous separation of wood decay mechanisms in Agaricomycotina into white rot and brown rot should be revisited.
Assuntos
Agaricales/genética , Evolução Molecular , Genoma Fúngico , Madeira/microbiologia , Agaricales/enzimologia , Agaricales/patogenicidade , Lignina/metabolismo , Filogenia , Análise de Sequência de DNARESUMO
Copepods are among the most abundant organisms on the planet and play critical functions in aquatic ecosystems. Among copepods, populations of the Eurytemora affinis species complex are numerically dominant in many coastal habitats and are food sources for major fisheries. Intriguingly, certain populations possess the unusual capacity to invade novel salinities on rapid time scales. Despite their ecological importance, high-quality genomic resources have been absent for calanoid copepods, limiting our ability to comprehensively dissect the genome architecture underlying the highly invasive and adaptive capacity of certain populations. Here, we presented the first chromosome-level genome of a calanoid copepod, from the Atlantic clade (Eurytemora carolleeae) of the E. affinis species complex. This genome was assembled using high-coverage long-read and high-throughput chromosome conformation capture sequences of an inbred line, generated through 30 generations of full-sib mating. This genome, consisting of 529.3 megabase (Mb) (contig N50 = 4.2 Mb, scaffold N50 = 140.6 Mb), was anchored onto four chromosomes. Genome annotation predicted 20,262 protein-coding genes, of which ion transporter gene families were substantially expanded based on comparative analyses of 12 additional arthropod genomes. Also, we found genome-wide signatures of historical gene body methylation of the ion transporter genes and the significant clustering of these genes on each chromosome. This genome represents one of the most contiguous copepod genomes to date and among the highest quality marine invertebrate genomes. As such, this genome provides an invaluable resource to help yield fundamental insights into the ability of this copepod to adapt to rapidly changing environments.
RESUMO
Lipid nanoparticles (LNPs) are being intensively researched and developed to leverage their ability to safely and effectively deliver therapeutics. To achieve optimal therapeutic delivery, a comprehensive understanding of the relationship between formulation, structure, and efficacy is critical. However, the vast chemical space involved in the production of LNPs and the resulting structural complexity make the structure to function relationship challenging to assess and predict. New components and formulation procedures, which provide new opportunities for the use of LNPs, would be best identified and optimized using high-throughput characterization methods. Recently, a high-throughput workflow, consisting of automated mixing, small-angle X-ray scattering (SAXS), and cellular assays, demonstrated a link between formulation, internal structure, and efficacy for a library of LNPs. As SAXS data can be rapidly collected, the stage is set for the collection of thousands of SAXS profiles from a myriad of LNP formulations. In addition, correlated LNP small-angle neutron scattering (SANS) datasets, where components are systematically deuterated for additional contrast inside, provide complementary structural information. The centralization of SAXS and SANS datasets from LNPs, with appropriate, standardized metadata describing formulation parameters, into a data repository will provide valuable guidance for the formulation of LNPs with desired properties. To this end, we introduce Simple Scattering, an easy-to-use, open data repository for storing and sharing groups of correlated scattering profiles obtained from LNP screening experiments. Here, we discuss the current state of the repository, including limitations and upcoming changes, and our vision towards future usage in developing our collective knowledge base of LNPs.
RESUMO
Scientific progress depends on reliable and reproducible results. Progress can also be accelerated when data are shared and re-analyzed to address new questions. Current approaches to storing and analyzing neural data typically involve bespoke formats and software that make replication, as well as the subsequent reuse of data, difficult if not impossible. To address these challenges, we created Spyglass, an open-source software framework that enables reproducible analyses and sharing of data and both intermediate and final results within and across labs. Spyglass uses the Neurodata Without Borders (NWB) standard and includes pipelines for several core analyses in neuroscience, including spectral filtering, spike sorting, pose tracking, and neural decoding. It can be easily extended to apply both existing and newly developed pipelines to datasets from multiple sources. We demonstrate these features in the context of a cross-laboratory replication by applying advanced state space decoding algorithms to publicly available data. New users can try out Spyglass on a Jupyter Hub hosted by HHMI and 2i2c: https://spyglass.hhmi.2i2c.cloud/.
RESUMO
Metagenomics is a technique for genome-wide profiling of microbiomes; this technique generates billions of DNA sequences called reads. Given the multiplication of metagenomic projects, computational tools are necessary to enable the efficient and accurate classification of metagenomic reads without needing to construct a reference database. The program DL-TODA presented here aims to classify metagenomic reads using a deep learning model trained on over 3000 bacterial species. A convolutional neural network architecture originally designed for computer vision was applied for the modeling of species-specific features. Using synthetic testing data simulated with 2454 genomes from 639 species, DL-TODA was shown to classify nearly 75% of the reads with high confidence. The classification accuracy of DL-TODA was over 0.98 at taxonomic ranks above the genus level, making it comparable with Kraken2 and Centrifuge, two state-of-the-art taxonomic classification tools. DL-TODA also achieved an accuracy of 0.97 at the species level, which is higher than 0.93 by Kraken2 and 0.85 by Centrifuge on the same test set. Application of DL-TODA to the human oral and cropland soil metagenomes further demonstrated its use in analyzing microbiomes from diverse environments. Compared to Centrifuge and Kraken2, DL-TODA predicted distinct relative abundance rankings and is less biased toward a single taxon.
Assuntos
Aprendizado Profundo , Microbiota , Humanos , Redes Neurais de Computação , Bactérias/genética , Metagenoma , Microbiota/genética , AlgoritmosRESUMO
Traumatic brain injury (TBI) affects how the brain functions in the short and long term. Resulting patient outcomes across physical, cognitive, and psychological domains are complex and often difficult to predict. Major challenges to developing personalized treatment for TBI include distilling large quantities of complex data and increasing the precision with which patient outcome prediction (prognoses) can be rendered. We developed and applied interpretable machine learning methods to TBI patient data. We show that complex data describing TBI patients' intake characteristics and outcome phenotypes can be distilled to smaller sets of clinically interpretable latent factors. We demonstrate that 19 clusters of TBI outcomes can be predicted from intake data, a ~ 6× improvement in precision over clinical standards. Finally, we show that 36% of the outcome variance across patients can be predicted. These results demonstrate the importance of interpretable machine learning applied to deeply characterized patients for data-driven distillation and precision prognosis.
Assuntos
Lesões Encefálicas Traumáticas , Destilação , Humanos , Lesões Encefálicas Traumáticas/diagnóstico , Lesões Encefálicas Traumáticas/terapia , Prognóstico , Aprendizado de Máquina , FenótipoRESUMO
Fungi have evolved over millions of years and their species diversity is predicted to be the second largest on the earth. Fungi have cross-kingdom interactions with many organisms that have mutually shaped their evolutionary trajectories. Zygomycete fungi hold a pivotal position in the fungal tree of life and provide important perspectives on the early evolution of fungi from aquatic to terrestrial environments. Phylogenomic analyses have found that zygomycete fungi diversified into two separate clades, the Mucoromycota which are frequently associated with plants and Zoopagomycota that are commonly animal-associated fungi. Genetic elements that contributed to the fitness and divergence of these lineages may have been shaped by the varied interactions these fungi have had with plants, animals, bacteria, and other microbes. To investigate this, we performed comparative genomic analyses of the two clades of zygomycetes in the context of Kingdom Fungi, benefiting from our generation of a new collection of zygomycete genomes, including nine produced for this study. We identified lineage-specific genomic content that may contribute to the disparate biology observed in these zygomycetes. Our findings include the discovery of undescribed diversity in CotH, a Mucormycosis pathogenicity factor, which was found in a broad set of zygomycetes. Reconciliation analysis identified multiple duplication events and an expansion of CotH copies throughout the Mucoromycotina, Mortierellomycotina, Neocallimastigomycota, and Basidiobolus lineages. A kingdom-level phylogenomic analysis also identified new evolutionary relationships within the subphyla of Mucoromycota and Zoopagomycota, including supporting the sister-clade relationship between Glomeromycotina and Mortierellomycotina and the placement of Basidiobolus as sister to other Zoopagomycota lineages.
Assuntos
Glomeromycota , Mucormicose , Animais , Mucormicose/genética , Fungos/genética , Filogenia , Glomeromycota/genética , Plantas/genética , Genoma Fúngico , Evolução MolecularRESUMO
SUMMARY: High-throughput DNA sequencing technologies have spurred the development of numerous novel methods for genome assembly. With few exceptions, these algorithms are heuristic and require one or more parameters to be manually set by the user. One approach to parameter tuning involves assembling data from an organism with an available high-quality reference genome, and measuring assembly accuracy using some metrics. We developed a system to measure assembly quality under several scoring metrics, and to compare assembly quality across a variety of assemblers, sequence data types, and parameter choices. When used in conjunction with training data such as a high-quality reference genome and sequence reads from the same organism, our program can be used to manually identify an optimal sequencing and assembly strategy for de novo sequencing of related organisms. AVAILABILITY: GPL source code and a usage tutorial is at http://ngopt.googlecode.com CONTACT: aarondarling@ucdavis.edu SUPPLEMENTARY INFORMATION: Supplementary data is available at Bioinformatics online.
Assuntos
Algoritmos , Genoma , Haloferax volcanii/genética , Sequenciamento de Nucleotídeos em Larga Escala/normas , Alinhamento de Sequência/normas , Sequência de Bases , Análise de Sequência de DNA/normasRESUMO
chipD is a web server that facilitates design of DNA oligonucleotide probes for high-density tiling arrays, which can be used in a number of genomic applications such as ChIP-chip or gene-expression profiling. The server implements a probe selection algorithm that takes as an input, in addition to the target sequences, a set of parameters that allow probe design to be tailored to specific applications, protocols or the array manufacturer's requirements. The algorithm optimizes probes to meet three objectives: (i) probes should be specific; (ii) probes should have similar thermodynamic properties; and (iii) the target sequence coverage should be homogeneous and avoid significant gaps. The output provides in a text format, the list of probe sequences with their genomic locations, targeted strands and hybridization characteristics. chipD has been used successfully to design tiling arrays for bacteria and yeast. chipD is available at http://chipd.uwbacter.org/.
Assuntos
Análise de Sequência com Séries de Oligonucleotídeos , Sondas de Oligonucleotídeos/química , Software , Algoritmos , Perfilação da Expressão Gênica , Internet , Rhodobacter sphaeroides/genética , Interface Usuário-ComputadorRESUMO
The neurophysiology of cells and tissues are monitored electrophysiologically and optically in diverse experiments and species, ranging from flies to humans. Understanding the brain requires integration of data across this diversity, and thus these data must be findable, accessible, interoperable, and reusable (FAIR). This requires a standard language for data and metadata that can coevolve with neuroscience. We describe design and implementation principles for a language for neurophysiology data. Our open-source software (Neurodata Without Borders, NWB) defines and modularizes the interdependent, yet separable, components of a data language. We demonstrate NWB's impact through unified description of neurophysiology data across diverse modalities and species. NWB exists in an ecosystem, which includes data management, analysis, visualization, and archive tools. Thus, the NWB data language enables reproduction, interchange, and reuse of diverse neurophysiology data. More broadly, the design principles of NWB are generally applicable to enhance discovery across biology through data FAIRness.
The brain is an immensely complex organ which regulates many of the behaviors that animals need to survive. To understand how the brain works, scientists monitor and record brain activity under different conditions using a variety of experimental techniques. These neurophysiological studies are often conducted on multiple types of cells in the brain as well as a variety of species, ranging from mice to flies, or even frogs and worms. Such a range of approaches provides us with highly informative, complementary 'views' of the brain. However, to form a complete, coherent picture of how the brain works, scientists need to be able to integrate all the data from these different experiments. For this to happen effectively, neurophysiology data need to meet certain criteria: namely, they must be findable, accessible, interoperable, and re-usable (or FAIR for short). However, the sheer diversity of neurophysiology experiments impedes the 'FAIR'-ness of the information obtained from them. To overcome this problem, researchers need a standardized way to communicate their experiments and share their results in other words, a 'standard language' to describe neurophysiology data. Rübel, Tritt, Ly, Dichter, Ghosh et al. therefore set out to create such a language that was not only FAIR, but could also co-evolve with neurophysiology research. First, they produced a computer software program (called Neurodata Without Borders, or NWB for short) which generated and defined the different components of the new standard language. Then, other tools for data management were created to expand the NWB platform using the standardized language. This included data analysis and visualization methods, as well as an 'archive' to store and access data. Testing the new language and associated tools showed that they indeed allowed researchers to access, analyze, and share information from many different types of experiments, in organisms ranging from flies to humans. The NWB software is open-source, meaning that anyone can obtain a copy and make changes to it. Thus, NWB and its associated resources provide the basis for a collaborative, community-based system for sharing neurophysiology data. Rübel et al. hope that NWB will inspire similar developments across other fields of biology that share similar levels of complexity with neurophysiology.
Assuntos
Ciência de Dados , Ecossistema , Humanos , Metadados , Neurofisiologia , SoftwareRESUMO
The Alpine goat (Capra aegagrus hircus) is parasitized by the barber pole worm (Haemonchus contortus). Hematological parameters from transcript and metagenome analysis in the host are reflective of infestation. We explored comparisons between blood samples of control, infected, infected zoledronic acid-treated, and infected antibody (anti-γδ T cells) treated wethers under controlled conditions. Seven days post-inoculation (dpi), we identified 7,627 transcripts associated with the different treatment types. Microbiome measurements at 7 dpi revealed fewer raw read counts across all treatments and a less diverse microbial flora than at 21 dpi. This study identifies treatment specific transcripts and an increase in microflora abundance and diversity as wethers age. Further, F/B ratio reflect health, based on depression or elevation above thresholds defined by the baseline of non-infected controls. Forty Alpine wethers were studied where blood samples were collected from five goats in four treatment groups on 7 dpi and 21 dpi. Transcript and microbiome profiles were obtained using the Partek Flow (St. Louis, Missouri, USA) software suites pipelines. Inflammation comparisons were based on the Firmicutes/Bacteriodetes ratios that are calculated as well as the reduction of microbial diversity.
Assuntos
Hemoncose , Haemonchus , Microbiota , Animais , Cabras , Hemoncose/veterinária , Haemonchus/genética , Testes Hematológicos , Masculino , Microbiota/genéticaRESUMO
Wood-decaying fungi of the class Agaricomycetes (phylum Basidiomycota) are saprotrophs that break down lignocellulose and play an important role in nutrient recycling. They secrete a wide range of extracellular plant cell wall degrading enzymes that break down cellulose, hemicellulose, and lignin, the main building blocks of plant biomass. Although the production of these enzymes is regulated mainly at the transcriptional level, no activating regulators have been identified in any wood-decaying fungus in the class Agaricomycetes. We studied the regulation of cellulase expression in the wood-decaying fungus Schizophyllum commune. Comparative genomics and transcriptomics on two wild isolates revealed a Zn2Cys6-type transcription factor gene (roc1) that was highly upregulated during growth on cellulose, compared to glucose. It is only conserved in the class Agaricomycetes. A roc1 knockout strain showed an inability to grow on medium with cellulose as sole carbon source, and growth on cellobiose and xylan (other components of wood) was inhibited. Growth on non-wood-related carbon sources was not inhibited. Cellulase gene expression and enzyme activity were reduced in the Δroc1 strain. ChIP-Seq identified 1474 binding sites of the Roc1 transcription factor. Promoters of genes involved in lignocellulose degradation were enriched with these binding sites, especially those of LPMO (lytic polysaccharide monooxygenase) CAZymes, indicating that Roc1 directly regulates these genes. A conserved motif was identified as the binding site of Roc1, which was confirmed by a functional promoter analysis. Together, Roc1 is a key regulator of cellulose degradation and the first identified in wood-decaying fungi in the phylum Basidiomycota. IMPORTANCE Wood-degrading fungi in the phylum Basidiomycota play a crucial role in nutrient recycling by breaking down all components of wood. Fungi have evolved transcriptional networks that regulate expression of wood-degrading enzymes, allowing them to prioritize one nutrient source over another. However, to date all these transcription factors have been identified in the phylum Ascomycota, which is only distantly related to the phylum Basidiomycota. Here, we identified the transcription factor Roc1 as a key regulator of cellulose degradation in the mushroom-forming and wood-degrading fungus Schizophyllum commune. Roc1 is highly conserved in the phylum Basidiomycota. Using comparative genomics, transcriptomics, ChIP-Seq and promoter analysis we have identified direct targets of Roc1, as well as other aspects of the transcriptional response to cellulose.
Assuntos
Agaricales , Basidiomycota , Celulase , Schizophyllum , Agaricales/genética , Agaricales/metabolismo , Basidiomycota/genética , Carbono/metabolismo , Celulase/metabolismo , Celulose/metabolismo , Proteínas Fúngicas/genética , Proteínas Fúngicas/metabolismo , Lignina/metabolismo , Schizophyllum/genética , Schizophyllum/metabolismo , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismoRESUMO
A ubiquitous problem in aggregating data across different experimental and observational data sources is a lack of software infrastructure that enables flexible and extensible standardization of data and metadata. To address this challenge, we developed HDMF, a hierarchical data modeling framework for modern science data standards. With HDMF, we separate the process of data standardization into three main components: (1) data modeling and specification, (2) data I/O and storage, and (3) data interaction and data APIs. To enable standards to support the complex requirements and varying use cases throughout the data life cycle, HDMF provides object mapping infrastructure to insulate and integrate these various components. This approach supports the flexible development of data standards and extensions, optimized storage backends, and data APIs, while allowing the other components of the data standards ecosystem to remain stable. To meet the demands of modern, large-scale science data, HDMF provides advanced data I/O functionality for iterative data write, lazy data load, and parallel I/O. It also supports optimization of data storage via support for chunking, compression, linking, and modular data storage. We demonstrate the application of HDMF in practice to design NWB 2.0 [13], a modern data standard for collaborative science across the neurophysiology community.
RESUMO
Divergence of breeding system plays an important role in fungal speciation. Ectomycorrhizal fungi, however, pose a challenge for the study of reproductive biology because most cannot be mated under laboratory conditions. To overcome this barrier, we sequenced the draft genomes of the ectomycorrhizal sister species Rhizopogon vinicolor Smith and Zeller and R. vesiculosus Smith and Zeller (Basidiomycota, Boletales)-the first genomes available for Basidiomycota truffles-and characterized gene content and organization surrounding their mating type loci. Both species possess a pair of homeodomain transcription factor homologs at the mating type A-locus as well as pheromone receptor and pheromone precursor homologs at the mating type B-locus. Comparison of Rhizopogon genomes with genomes from Boletales, Agaricales, and Polyporales revealed synteny of the A-locus region within Boletales, but several genomic rearrangements across orders. Our findings suggest correlation between gene content at the B-locus region and breeding system in Boletales with tetrapolar species possessing more diverse gene content than bipolar species. Rhizopogon vinicolor possesses a greater number of B-locus pheromone receptor and precursor genes than R. vesiculosus, as well as a pair of isoprenyl cysteine methyltransferase genes flanking the B-locus compared to a single copy in R. vesiculosus Examination of dikaryotic single nucleotide polymorphisms within genomes revealed greater heterozygosity in R. vinicolor, consistent with increased rates of outcrossing. Both species possess the components of a heterothallic breeding system with R. vinicolor possessing a B-locus region structure consistent with tetrapolar Boletales and R. vesiculosus possessing a B-locus region structure intermediate between bipolar and tetrapolar Boletales.