RESUMO
BACKGROUND: Brown algae belong to the Stramenopiles phylum and are phylogenetically distant from plants and other multicellular organisms. This independent evolutionary history has shaped brown algae with numerous metabolic characteristics specific to this group, including the synthesis of peculiar polysaccharides contained in their extracellular matrix (ECM). Alginates and fucose-containing sulphated polysaccharides (FCSPs), the latter including fucans, are the main components of ECMs. However, the metabolic pathways of these polysaccharides remain poorly described due to a lack of genomic data. RESULTS: An extensive genomic dataset has been recently released for brown algae and their close sister species, for which we previously performed an expert annotation of key genes involved in ECM-carbohydrate metabolisms. Here we provide a deeper analysis of this set of genes using comparative genomics, phylogenetics analyses, and protein modelling. Two key gene families involved in both the synthesis and degradation of alginate were suggested to have been acquired by the common ancestor of brown algae and their closest sister species Schizocladia ischiensis. Our analysis indicates that this assumption can be extended to additional metabolic steps, and thus to the whole alginate metabolic pathway. The pathway for the biosynthesis of fucans still remains biochemically unresolved and we also investigate putative fucosyltransferase genes that may harbour a fucan synthase activity in brown algae. CONCLUSIONS: Our analysis is the first extensive survey of carbohydrate-related enzymes in brown algae, and provides a valuable resource for future research into the glycome and ECM of brown algae. The expansion of specific families related to alginate metabolism may have represented an important prerequisite for the evolution of developmental complexity in brown algae. Our analysis questions the possible occurrence of FCSPs outside brown algae, notably within their closest sister taxon and in other Stramenopiles such as diatoms. Filling this knowledge gap in the future will help determine the origin and evolutionary history of fucan synthesis in eukaryotes.
Assuntos
Evolução Molecular , Matriz Extracelular , Phaeophyceae , Filogenia , Polissacarídeos , Phaeophyceae/genética , Phaeophyceae/metabolismo , Polissacarídeos/biossíntese , Polissacarídeos/metabolismo , Matriz Extracelular/metabolismo , Alginatos/metabolismo , Genômica/métodosRESUMO
Macroalgal (seaweed) genomic resources are generally lacking as compared with other eukaryotic taxa, and this is particularly true in the red algae (Rhodophyta). Understanding red algal genomes is critical to understanding eukaryotic evolution given that red algal genes are spread across eukaryotic lineages from secondary endosymbiosis and red algae diverged early in the Archaeplastids. The Gracilariales is a highly diverse and widely distributed order including species that can serve as ecosystem engineers in intertidal habitats and several notorious introduced species. The genus Gracilaria is cultivated worldwide, in part for its production of agar and other bioactive compounds with downstream pharmaceutical and industrial applications. This genus is also emerging as a model for algal evolutionary ecology. Here, we report new whole-genome assemblies for two species (Gracilaria chilensis and Gracilaria gracilis), a draft genome assembly of Gracilaria caudata, and genome annotation of the previously published Gracilaria vermiculophylla genome. To facilitate accessibility and comparative analysis, we integrated these data in a newly created web-based portal dedicated to red algal genomics (https://rhodoexplorer.sb-roscoff.fr). These genomes will provide a resource for understanding algal biology and, more broadly, eukaryotic evolution.
Assuntos
Gracilaria , Rodófitas , Gracilaria/genética , Ecossistema , Rodófitas/genética , Genômica , GenomaRESUMO
The ever-increasing number of available microbial genomes and metagenomes provides new opportunities to investigate the links between niche partitioning and genome evolution in the ocean, especially for the abundant and ubiquitous marine picocyanobacteria Prochlorococcus and Synechococcus. Here, by combining metagenome analyses of the Tara Oceans dataset with comparative genomics, including phyletic patterns and genomic context of individual genes from 256 reference genomes, we show that picocyanobacterial communities thriving in different niches possess distinct gene repertoires. We also identify clusters of adjacent genes that display specific distribution patterns in the field (eCAGs) and are thus potentially involved in the same metabolic pathway and may have a key role in niche adaptation. Several eCAGs are likely involved in the uptake or incorporation of complex organic forms of nutrients, such as guanidine, cyanate, cyanide, pyrimidine, or phosphonates, which might be either directly used by cells, for example for the biosynthesis of proteins or DNA, or degraded to inorganic nitrogen and/or phosphorus forms. We also highlight the enrichment of eCAGs involved in polysaccharide capsule biosynthesis in Synechococcus populations thriving in both nitrogen- and phosphorus-depleted areas vs. low-iron (Fe) regions, suggesting that the complexes they encode may be too energy-consuming for picocyanobacteria thriving in the latter areas. In contrast, Prochlorococcus populations thriving in Fe-depleted areas specifically possess an alternative respiratory terminal oxidase, potentially involved in the reduction of Fe(III) to Fe(II). Altogether, this study provides insights into how phytoplankton communities populate oceanic ecosystems, which is relevant to understanding their capacity to respond to ongoing climate change.
Assuntos
Prochlorococcus , Synechococcus , Água do Mar/microbiologia , Ecossistema , Compostos Férricos/metabolismo , Oceanos e Mares , Synechococcus/genética , Synechococcus/metabolismo , Metagenoma , Família Multigênica , Nitrogênio/metabolismo , Fósforo/metabolismo , Prochlorococcus/genética , FilogeniaRESUMO
Cyanorak v2.1 (http://www.sb-roscoff.fr/cyanorak) is an information system dedicated to visualizing, comparing and curating the genomes of Prochlorococcus, Synechococcus and Cyanobium, the most abundant photosynthetic microorganisms on Earth. The database encompasses sequences from 97 genomes, covering most of the wide genetic diversity known so far within these groups, and which were split into 25,834 clusters of likely orthologous groups (CLOGs). The user interface gives access to genomic characteristics, accession numbers as well as an interactive map showing strain isolation sites. The main entry to the database is through search for a term (gene name, product, etc.), resulting in a list of CLOGs and individual genes. Each CLOG benefits from a rich functional annotation including EggNOG, EC/K numbers, GO terms, TIGR Roles, custom-designed Cyanorak Roles as well as several protein motif predictions. Cyanorak also displays a phyletic profile, indicating the genotype and pigment type for each CLOG, and a genome viewer (Jbrowse) to visualize additional data on each genome such as predicted operons, genomic islands or transcriptomic data, when available. This information system also includes a BLAST search tool, comparative genomic context as well as various data export options. Altogether, Cyanorak v2.1 constitutes an invaluable, scalable tool for comparative genomics of ecologically relevant marine microorganisms.
Assuntos
Organismos Aquáticos/genética , Cianobactérias/genética , Curadoria de Dados , Bases de Dados Genéticas , Genoma Bacteriano , Sistemas de Informação , Proteínas de Bactérias/genética , Geografia , Funções Verossimilhança , Filogenia , Interface Usuário-ComputadorRESUMO
Marine picocyanobacteria of the genera Prochlorococcus and Synechococcus are the most abundant photosynthetic organisms on Earth, an ecological success thought to be linked to the differential partitioning of distinct ecotypes into specific ecological niches. However, the underlying processes that governed the diversification of these microorganisms and the appearance of niche-related phenotypic traits are just starting to be elucidated. Here, by comparing 81 genomes, including 34 new Synechococcus, we explored the evolutionary processes that shaped the genomic diversity of picocyanobacteria. Time-calibration of a core-protein tree showed that gene gain/loss occurred at an unexpectedly low rate between the different lineages, with for instance 5.6 genes gained per million years (My) for the major Synechococcus lineage (sub-cluster 5.1), among which only 0.71/My have been fixed in the long term. Gene content comparisons revealed a number of candidates involved in nutrient adaptation, a large proportion of which are located in genomic islands shared between either closely or more distantly related strains, as identified using an original network construction approach. Interestingly, strains representative of the different ecotypes co-occurring in phosphorus-depleted waters (Synechococcus clades III, WPC1, and sub-cluster 5.3) were shown to display different adaptation strategies to this limitation. In contrast, we found few genes potentially involved in adaptation to temperature when comparing cold and warm thermotypes. Indeed, comparison of core protein sequences highlighted variants specific to cold thermotypes, notably involved in carotenoid biosynthesis and the oxidative stress response, revealing that long-term adaptation to thermal niches relies on amino acid substitutions rather than on gene content variation. Altogether, this study not only deciphers the respective roles of gene gains/losses and sequence variation but also uncovers numerous gene candidates likely involved in niche partitioning of two key members of the marine phytoplankton.
RESUMO
Understanding how microorganisms adjust their metabolism to maintain their ability to cope with short-term environmental variations constitutes one of the major current challenges in microbial ecology. Here, the best physiologically characterized marine Synechococcus strain, WH7803, was exposed to modulated light/dark cycles or acclimated to continuous high-light (HL) or low-light (LL), then shifted to various stress conditions, including low (LT) or high temperature (HT), HL and ultraviolet (UV) radiations. Physiological responses were analyzed by measuring time courses of photosystem (PS) II quantum yield, PSII repair rate, pigment ratios and global changes in gene expression. Previously published membrane lipid composition were also used for correlation analyses. These data revealed that cells previously acclimated to HL are better prepared than LL-acclimated cells to sustain an additional light or UV stress, but not a LT stress. Indeed, LT seems to induce a synergic effect with the HL treatment, as previously observed with oxidative stress. While all tested shift conditions induced the downregulation of many photosynthetic genes, notably those encoding PSI, cytochrome b6/f and phycobilisomes, UV stress proved to be more deleterious for PSII than the other treatments, and full recovery of damaged PSII from UV stress seemed to involve the neo-synthesis of a fairly large number of PSII subunits and not just the reassembly of pre-existing subunits after D1 replacement. In contrast, genes involved in glycogen degradation and carotenoid biosynthesis pathways were more particularly upregulated in response to LT. Altogether, these experiments allowed us to identify responses common to all stresses and those more specific to a given stress, thus highlighting genes potentially involved in niche acclimation of a key member of marine ecosystems. Our data also revealed important specific features of the stress responses compared to model freshwater cyanobacteria.
RESUMO
BACKGROUND: Despite a growing number of investigations on early diverging fungi, the corresponding lineages have not been as extensively characterized as Ascomycota or Basidiomycota ones. The Mucor genus, pertaining to one of these lineages is not an exception. To this date, a restricted number of Mucor annotated genomes is publicly available and mainly correspond to the reference species, Mucor circinelloides, and to medically relevant species. However, the Mucor genus is composed of a large number of ubiquitous species as well as few species that have been reported to specifically occur in certain habitats. The present study aimed to expand the range of Mucor genomes available and identify potential genomic imprints of adaptation to different environments and lifestyles in the Mucor genus. RESULTS: In this study, we report four newly sequenced genomes of Mucor isolates collected from non-clinical environments pertaining to species with contrasted lifestyles, namely Mucor fuscus and Mucor lanceolatus, two species used in cheese production (during ripening), Mucor racemosus, a recurrent cheese spoiler sometimes described as an opportunistic animal and human pathogen, and Mucor endophyticus, a plant endophyte. Comparison of these new genomes with those previously available for six Mucor and two Rhizopus (formerly identified as M. racemosus) isolates allowed global structural and functional description such as their TE content, core and species-specific genes and specialized genes. We proposed gene candidates involved in iron metabolism; some of these genes being known to be involved in pathogenicity; and described patterns such as a reduced number of CAZymes in the species used for cheese ripening as well as in the endophytic isolate that might be related to adaptation to different environments and lifestyles within the Mucor genus. CONCLUSIONS: This study extended the descriptive data set for Mucor genomes, pointed out the complexity of obtaining a robust phylogeny even with multiple genes families and allowed identifying contrasting potentially lifestyle-associated gene repertoires. The obtained data will allow investigating further the link between genetic and its biological data, especially in terms of adaptation to a given habitat.
Assuntos
Adaptação Fisiológica/genética , Genômica , Estilo de Vida , Mucor/genética , Sequência de Bases/genética , Proteínas Fúngicas/genética , Genoma Fúngico , Filogenia , Especificidade da EspécieRESUMO
Brown algae are multicellular photosynthetic stramenopiles that colonize marine rocky shores worldwide. Ectocarpus sp. Ec32 has been established as a genomic model for brown algae. Here we present the genome and metabolic network of the closely related species, Ectocarpus subulatus Kützing, which is characterized by high abiotic stress tolerance. Since their separation, both strains show new traces of viral sequences and the activity of large retrotransposons, which may also be related to the expansion of a family of chlorophyll-binding proteins. Further features suspected to contribute to stress tolerance include an expanded family of heat shock proteins, the reduction of genes involved in the production of halogenated defence compounds, and the presence of fewer cell wall polysaccharide-modifying enzymes. Overall, E. subulatus has mainly lost members of gene families down-regulated in low salinities, and conserved those that were up-regulated in the same condition. However, 96% of genes that differed between the two examined Ectocarpus species, as well as all genes under positive selection, were found to encode proteins of unknown function. This underlines the uniqueness of brown algal stress tolerance mechanisms as well as the significance of establishing E. subulatus as a comparative model for future functional studies.
Assuntos
Genoma/genética , Phaeophyceae/genética , Estresse Fisiológico/genética , Proteínas de Algas/genética , Redes e Vias Metabólicas/genética , Família Multigênica/genética , VitóriaRESUMO
Understanding growth mechanisms in brown algae is a current scientific and economic challenge that can benefit from the modeling of their metabolic networks. The sequencing of the genomes of Saccharina japonica and Cladosiphon okamuranus has provided the necessary data for the reconstruction of Genome-Scale Metabolic Networks (GSMNs). The same in silico method deployed for the GSMN reconstruction of Ectocarpus siliculosus to investigate the metabolic capabilities of these two algae, was used. Integrating metabolic profiling data from the literature, we provided functional GSMNs composed of an average of 2230 metabolites and 3370 reactions. Based on these GSMNs and previously published work, we propose a model for the biosynthetic pathways of the main carotenoids in these two algae. We highlight, on the one hand, the reactions and enzymes that have been preserved through evolution and, on the other hand, the specificities related to brown algae. Our data further indicate that, if abscisic acid is produced by Saccharina japonica, its biosynthesis pathway seems to be different in its final steps from that described in land plants. Thus, our work illustrates the potential of GSMNs reconstructions for formalizing hypotheses that can be further tested using targeted biochemical approaches.
RESUMO
Dinitroanilines are chemical compounds with high selectivity for plant cell α-tubulin in which they promote microtubule depolymerization. They target α-tubulin regions that have diverged over evolution and show no effect on non-photosynthetic eukaryotes. Hence, they have been used as herbicides over decades. Interestingly, dinitroanilines proved active on microtubules of eukaryotes deriving from photosynthetic ancestors such as Toxoplasma gondii and Plasmodium falciparum, which are responsible for toxoplasmosis and malaria, respectively. By combining differential in silico screening of virtual chemical libraries on Arabidopsis thaliana and mammal tubulin structural models together with cell-based screening of chemical libraries, we have identified dinitroaniline related and non-related compounds. They inhibit plant, but not mammalian tubulin assembly in vitro, and accordingly arrest A. thaliana development. In addition, these compounds exhibit a moderate cytotoxic activity towards T. gondii and P. falciparum. These results highlight the potential of novel herbicidal scaffolds in the design of urgently needed anti-parasitic drugs.
Assuntos
Apicomplexa/fisiologia , Plantas/metabolismo , Plantas/parasitologia , Tubulina (Proteína)/metabolismo , Animais , Células HeLa , Humanos , Microtúbulos/metabolismo , Modelos Moleculares , Fotossíntese , Células Vegetais/metabolismo , Plasmodium falciparum , Conformação Proteica , Tubulina (Proteína)/química , Tubulina (Proteína)/genéticaRESUMO
The primary problem with the explosion of biomedical datasets is not the data, not computational resources, and not the required storage space, but the general lack of trained and skilled researchers to manipulate and analyze these data. Eliminating this problem requires development of comprehensive educational resources. Here we present a community-driven framework that enables modern, interactive teaching of data analytics in life sciences and facilitates the development of training materials. The key feature of our system is that it is not a static but a continuously improved collection of tutorials. By coupling tutorials with a web-based analysis framework, biomedical researchers can learn by performing computation themselves through a web browser without the need to install software or search for example datasets. Our ultimate goal is to expand the breadth of training materials to include fundamental statistical and data science topics and to precipitate a complete re-engineering of undergraduate and graduate curricula in life sciences. This project is accessible at https://training.galaxyproject.org.
Assuntos
Biologia Computacional/educação , Biologia Computacional/métodos , Pesquisadores/educação , Currículo , Análise de Dados , Educação a Distância/métodos , Educação a Distância/tendências , Humanos , SoftwareRESUMO
Plastids are supported by a wide range of proteins encoded within the nucleus and imported from the cytoplasm. These plastid-targeted proteins may originate from the endosymbiont, the host, or other sources entirely. Here, we identify and characterise 770 plastid-targeted proteins that are conserved across the ochrophytes, a major group of algae including diatoms, pelagophytes and kelps, that possess plastids derived from red algae. We show that the ancestral ochrophyte plastid proteome was an evolutionary chimera, with 25% of its phylogenetically tractable nucleus-encoded proteins deriving from green algae. We additionally show that functional mixing of host and plastid proteomes, such as through dual-targeting, is an ancestral feature of plastid evolution. Finally, we detect a clear phylogenetic signal from one ochrophyte subgroup, the lineage containing pelagophytes and dictyochophytes, in plastid-targeted proteins from another major algal lineage, the haptophytes. This may represent a possible serial endosymbiosis event deep in eukaryotic evolutionary history.
Assuntos
Proteínas de Cloroplastos/genética , Evolução Molecular , Haptófitas/classificação , Haptófitas/genética , Estramenópilas/classificação , Estramenópilas/genéticaRESUMO
Kelps are founding species of temperate marine ecosystems, living in intertidal coastal areas where they are often challenged by generalist and specialist herbivores. As most sessile organisms, kelps develop defensive strategies to restrain grazing damage and preserve their own fitness during interactions with herbivores. To decipher some inducible defense and signaling mechanisms, we carried out metabolome and transcriptome analyses in two emblematic kelp species, Lessonia spicata from South Pacific coasts and Laminaria digitata from North Atlantic, when challenged with their main specialist herbivores. Mass spectrometry based metabolomics revealed large metabolic changes induced in these two brown algae following challenges with their own specialist herbivores. Targeted metabolic profiling of L. spicata further showed that free fatty acid (FFA) and amino acid (AA) metabolisms were particularly regulated under grazing. An early stress response was illustrated by the accumulation of Sulphur containing amino acids in the first twelve hours of herbivory pressure. At latter time periods (after 24 hours), we observed FFA liberation and eicosanoid oxylipins synthesis likely representing metabolites related to stress. Global transcriptomic analysis identified sets of candidate genes specifically induced by grazing in both kelps. qPCR analysis of the top candidate genes during a 48-hours time course validated the results. Most of these genes were particularly activated by herbivore challenge after 24 hours, suggesting that transcriptional reprogramming could be operated at this time period. We demonstrated the potential utility of these genes as molecular markers for herbivory by measuring their inductions in grazed individuals of field harvested L. digitata and L. spicata. By unravelling the regulation of some metabolites and genes following grazing pressure in two kelps representative of the two hemispheres, this work contributes to provide a set of herbivore-induced chemical and molecular responses in kelp species, showing similar inducible responses upon specialist herbivores in their respective ecosystems.
Assuntos
Herbivoria , Phaeophyceae/fisiologia , Aminoácidos/metabolismo , Etiquetas de Sequências Expressas , Ácidos Graxos não Esterificados/metabolismo , Metabolômica , Phaeophyceae/genética , Phaeophyceae/metabolismo , Reação em Cadeia da Polimerase em Tempo Real , TranscriptomaRESUMO
Sulfatases cleave sulfate groups from various molecules and constitute a biologically and industrially important group of enzymes. However, the number of sulfatases whose substrate has been characterized is limited in comparison to the huge diversity of sulfated compounds, yielding functional annotations of sulfatases particularly prone to flaws and misinterpretations. In the context of the explosion of genomic data, a classification system allowing a better prediction of substrate specificity and for setting the limit of functional annotations is urgently needed for sulfatases. Here, after an overview on the diversity of sulfated compounds and on the known sulfatases, we propose a classification database, SulfAtlas (http://abims.sb-roscoff.fr/sulfatlas/), based on sequence homology and composed of four families of sulfatases. The formylglycine-dependent sulfatases, which constitute the largest family, are also divided by phylogenetic approach into 73 subfamilies, each subfamily corresponding to either a known specificity or to an uncharacterized substrate. SulfAtlas summarizes information about the different families of sulfatases. Within a family a web page displays the list of its subfamilies (when they exist) and the list of EC numbers. The family or subfamily page shows some descriptors and a table with all the UniProt accession numbers linked to the databases UniProt, ExplorEnz, and PDB.
Assuntos
Sulfatases/metabolismo , Sulfatos/metabolismo , Animais , Bactérias/enzimologia , Biocatálise , Domínio Catalítico , Bases de Dados de Proteínas , Humanos , Internet , Filogenia , Especificidade por Substrato , Sulfatases/química , Sulfatases/classificação , Sulfatos/química , Interface Usuário-ComputadorRESUMO
BACKGROUND: Several R packages exist for the detection of differentially expressed genes from RNA-Seq data. The analysis process includes three main steps, namely normalization, dispersion estimation and test for differential expression. Quality control steps along this process are recommended but not mandatory, and failing to check the characteristics of the dataset may lead to spurious results. In addition, normalization methods and statistical models are not exchangeable across the packages without adequate transformations the users are often not aware of. Thus, dedicated analysis pipelines are needed to include systematic quality control steps and prevent errors from misusing the proposed methods. RESULTS: SARTools is an R pipeline for differential analysis of RNA-Seq count data. It can handle designs involving two or more conditions of a single biological factor with or without a blocking factor (such as a batch effect or a sample pairing). It is based on DESeq2 and edgeR and is composed of an R package and two R script templates (for DESeq2 and edgeR respectively). Tuning a small number of parameters and executing one of the R scripts, users have access to the full results of the analysis, including lists of differentially expressed genes and a HTML report that (i) displays diagnostic plots for quality control and model hypotheses checking and (ii) keeps track of the whole analysis process, parameter values and versions of the R packages used. CONCLUSIONS: SARTools provides systematic quality controls of the dataset as well as diagnostic plots that help to tune the model parameters. It gives access to the main parameters of DESeq2 and edgeR and prevents untrained users from misusing some functionalities of both packages. By keeping track of all the parameters of the analysis process it fits the requirements of reproducible research.
Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA/análise , Análise de Sequência de RNA/métodos , Software , HumanosRESUMO
Microbial rhodopsins are a diverse group of photoactive transmembrane proteins found in all three domains of life and in viruses. Today, microbial rhodopsin research is a flourishing research field in which new understandings of rhodopsin diversity, function and evolution are contributing to broader microbiological and molecular knowledge. Here, we describe MicRhoDE, a comprehensive, high-quality and freely accessible database that facilitates analysis of the diversity and evolution of microbial rhodopsins. Rhodopsin sequences isolated from a vast array of marine and terrestrial environments were manually collected and curated. To each rhodopsin sequence are associated related metadata, including predicted spectral tuning of the protein, putative activity and function, taxonomy for sequences that can be linked to a 16S rRNA gene, sampling date and location, and supporting literature. The database currently covers 7857 aligned sequences from more than 450 environmental samples or organisms. Based on a robust phylogenetic analysis, we introduce an operational classification system with multiple phylogenetic levels ranging from superclusters to species-level operational taxonomic units. An integrated pipeline for online sequence alignment and phylogenetic tree construction is also provided. With a user-friendly interface and integrated online bioinformatics tools, this unique resource should be highly valuable for upcoming studies of the biogeography, diversity, distribution and evolution of microbial rhodopsins. Database URL: http://micrhode.sb-roscoff.fr.