RESUMO
Animals and fungi have radically distinct morphologies, yet both evolved within the same eukaryotic supergroup: Opisthokonta1,2. Here we reconstructed the trajectory of genetic changes that accompanied the origin of Metazoa and Fungi since the divergence of Opisthokonta with a dataset that includes four novel genomes from crucial positions in the Opisthokonta phylogeny. We show that animals arose only after the accumulation of genes functionally important for their multicellularity, a tendency that began in the pre-metazoan ancestors and later accelerated in the metazoan root. By contrast, the pre-fungal ancestors experienced net losses of most functional categories, including those gained in the path to Metazoa. On a broad-scale functional level, fungal genomes contain a higher proportion of metabolic genes and diverged less from the last common ancestor of Opisthokonta than did the gene repertoires of Metazoa. Metazoa and Fungi also show differences regarding gene gain mechanisms. Gene fusions are more prevalent in Metazoa, whereas a larger fraction of gene gains were detected as horizontal gene transfers in Fungi and protists, in agreement with the long-standing idea that transfers would be less relevant in Metazoa due to germline isolation3-5. Together, our results indicate that animals and fungi evolved under two contrasting trajectories of genetic change that predated the origin of both groups. The gradual establishment of two clearly differentiated genomic contexts thus set the stage for the emergence of Metazoa and Fungi.
Assuntos
Evolução Molecular , Fungos , Genoma , Genômica , Filogenia , Animais , Fungos/genética , Transferência Genética Horizontal , Genes , Genoma/genética , Genoma Fúngico/genética , Metabolismo/genéticaRESUMO
BACKGROUND: Whole-genome shotgun sequencing, which stitches together millions of short sequencing reads into a single genome, ushered in the era of modern genomics and led to a rapid expansion of the number of genome sequences available. Nevertheless, assembly of short reads remains difficult, resulting in fragmented genome sequences. Ultimately, only a sequencing technology capable of capturing complete chromosomes in a single run could resolve all ambiguities. Even "third generation" sequencing technologies produce reads far shorter than most eukaryotic chromosomes. However, the ciliate Oxytricha trifallax has a somatic genome with thousands of chromosomes averaging only 3.2 kbp, making it an ideal candidate for exploring the benefits of sequencing whole chromosomes without assembly. RESULTS: We used single-molecule real-time sequencing to capture thousands of complete chromosomes in single reads and to update the published Oxytricha trifallax JRB310 genome assembly. In this version, over 50% of the completed chromosomes with two telomeres derive from single reads. The improved assembly includes over 12,000 new chromosome isoforms, and demonstrates that somatic chromosomes derive from variable rearrangements between somatic segments encoded up to 191,000 base pairs away. However, while long reads reduce the need for assembly, a hybrid approach that supplements long-read sequencing with short reads for error correction produced the most complete and accurate assembly, overall. CONCLUSIONS: This assembly provides the first example of complete eukaryotic chromosomes captured by single sequencing reads and demonstrates that traditional approaches to genome assembly can mask considerable structural variation.
Assuntos
Cromossomos , Cilióforos/genética , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Biologia Computacional/métodos , Genoma , Genômica/métodos , Hibridização GenéticaRESUMO
Genes evolve by point mutations, but also by shuffling, fusion, and fission of genetic fragments. Therefore, similarity between two sequences can be due to common ancestry producing homology, and/or partial sharing of component fragments. Disentangling these processes is especially challenging in large molecular data sets, because of computational time. In this article, we present CompositeSearch, a memory-efficient, fast, and scalable method to detect composite gene families in large data sets (typically in the range of several million sequences). CompositeSearch generalizes the use of similarity networks to detect composite and component gene families with a greater recall, accuracy, and precision than recent programs (FusedTriplets and MosaicFinder). Moreover, CompositeSearch provides user-friendly quality descriptions regarding the distribution and primary sequence conservation of these gene families allowing critical biological analyses of these data.
Assuntos
Biologia Computacional/métodos , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Algoritmos , Sequência Conservada/genética , Evolução Molecular , Filogenia , Análise de Sequência de DNA/estatística & dados numéricos , SoftwareRESUMO
BACKGROUND: Eukaryotes evolved from the symbiotic association of at least two prokaryotic partners, and a good deal is known about the timings, mechanisms, and dynamics of these evolutionary steps. Recently, it was shown that a new class of nuclear genes, symbiogenetic genes (S-genes), was formed concomitant with endosymbiosis and the subsequent evolution of eukaryotic photosynthetic lineages. Understanding their origins and contributions to eukaryogenesis would provide insights into the ways in which cellular complexity has evolved. RESULTS: Here, we show that chimeric nuclear genes (S-genes), built from prokaryotic domains, are critical for explaining the leap forward in cellular complexity achieved during eukaryogenesis. A total of 282 S-gene families contributed solutions to many of the challenges faced by early eukaryotes, including enhancing the informational machinery, processing spliceosomal introns, tackling genotoxicity within the cell, and ensuring functional protein interactions in a larger, more compartmentalized cell. For hundreds of S-genes, we confirmed the origins of their components (bacterial, archaeal, or generally prokaryotic) by maximum likelihood phylogenies. Remarkably, Bacteria contributed nine-fold more S-genes than Archaea, including a two-fold greater contribution to informational functions. Therefore, there is an additional, large bacterial contribution to the evolution of eukaryotes, implying that fundamental eukaryotic properties do not strictly follow the traditional informational/operational divide for archaeal/bacterial contributions to eukaryogenesis. CONCLUSION: This study demonstrates the extent and process through which prokaryotic fragments from bacterial and archaeal genes inherited during eukaryogenesis underly the creation of novel chimeric genes with important functions.
Assuntos
Quimera/genética , Quimera/metabolismo , Bases de Dados Genéticas , Células Eucarióticas/fisiologia , Evolução Molecular , FilogeniaRESUMO
Based on their small size and genomic properties, ultrasmall prokaryotic groups like the Candidate Phyla Radiation have been proposed as possible symbionts dependent on other bacteria or archaea. In this study, we use a bipartite graph analysis to examine patterns of sequence similarity between draft and complete genomes from ultrasmall bacteria and other complete prokaryotic genomes, assessing whether the former group might engage in significant gene transfer (or even endosymbioses) with other community members. Our results provide preliminary evidence for many lateral gene transfers with other prokaryotes, including members of the archaea, and report the presence of divergent, membrane-associated proteins among these ultrasmall taxa. In particular, these divergent genes were found in TM6 relatives of the intracellular parasite Babela massiliensis.
Assuntos
Archaea/genética , Bactérias/genética , Proteínas de Bactérias/genética , Transferência Genética Horizontal , Proteínas de Membrana/genética , Archaea/classificação , Archaea/isolamento & purificação , Bactérias/classificação , Bactérias/isolamento & purificação , Proteínas de Bactérias/metabolismo , Genoma Arqueal , Genoma Bacteriano , Proteínas de Membrana/metabolismo , Filogenia , SimbioseRESUMO
The role of the skin microbiome in resistance and susceptibility of wildlife to fungal pathogens has been examined from a taxonomic perspective but skin microbial function, in the context of fungal infection, has yet to be studied. Our objective was to understand effects of a bat fungal pathogen site infection status and course of invasion on skin microbial function. We sampled seven hibernating colonies of Myotis lucifugus covering three-time points over the course of Pseudogymnoascus destructans (Pd) invasion and white nose syndrome (pre-invasion, epidemic, and established). Our results support three new hypotheses about Pd and skin functional microbiome: (1) there is an important effect of Pd invasion stage, especially at the epidemic stage; (2) disruption by the fungus at the epidemic stage could decrease anti-fungal functions with potential negative effects on the microbiome and bat health; (3) the collection site might have a larger influence on microbiomes at the pre-invasion stage rather than at epidemic and established stages. Future studies with larger sample sizes and using meta-omics approaches will help confirm these hypotheses, and determine the influence of the microbiome on wildlife survival to fungal disease.
Assuntos
Ascomicetos , Quirópteros , Microbiota , Pele , Animais , Quirópteros/microbiologia , Pele/microbiologia , Ascomicetos/genética , Ascomicetos/patogenicidade , Hibernação , Micoses/microbiologia , Micoses/veterináriaRESUMO
Understanding the response of the coral holobiont to environmental change is crucial to inform conservation efforts. The most pressing problem is "coral bleaching," usually precipitated by prolonged thermal stress. We used untargeted, polar metabolite profiling to investigate the physiological response of the coral species Montipora capitata and Pocillopora acuta to heat stress. Our goal was to identify diagnostic markers present early in the bleaching response. From the untargeted UHPLC-MS data, a variety of co-regulated dipeptides were found that have the highest differential accumulation in both species. The structures of four dipeptides were determined and showed differential accumulation in symbiotic and aposymbiotic (alga-free) populations of the sea anemone Aiptasia (Exaiptasia pallida), suggesting the deep evolutionary origins of these dipeptides and their involvement in symbiosis. These and other metabolites may be used as diagnostic markers for thermal stress in wild coral.
Assuntos
Antozoários , Dinoflagellida , Anêmonas-do-Mar , Animais , Antozoários/fisiologia , Recifes de Corais , Dinoflagellida/fisiologia , Dipeptídeos , Resposta ao Choque Térmico , SimbioseRESUMO
BACKGROUND: Corals, which form the foundation of biodiverse reef ecosystems, are under threat from warming oceans. Reefs provide essential ecological services, including food, income from tourism, nutrient cycling, waste removal, and the absorption of wave energy to mitigate erosion. Here, we studied the coral thermal stress response using network methods to analyze transcriptomic and polar metabolomic data generated from the Hawaiian rice coral Montipora capitata. Coral nubbins were exposed to ambient or thermal stress conditions over a 5-week period, coinciding with a mass spawning event of this species. The major goal of our study was to expand the inventory of thermal stress-related genes and metabolites present in M. capitata and to study gene-metabolite interactions. These interactions provide the foundation for functional or genetic analysis of key coral genes as well as provide potentially diagnostic markers of pre-bleaching stress. A secondary goal of our study was to analyze the accumulation of sex hormones prior to and during mass spawning to understand how thermal stress may impact reproductive success in M. capitata. METHODS: M. capitata was exposed to thermal stress during its spawning cycle over the course of 5 weeks, during which time transcriptomic and polar metabolomic data were collected. We analyzed these data streams individually, and then integrated both data sets using MAGI (Metabolite Annotation and Gene Integration) to investigate molecular transitions and biochemical reactions. RESULTS: Our results reveal the complexity of the thermal stress phenome in M. capitata, which includes many genes involved in redox regulation, biomineralization, and reproduction. The size and number of modules in the gene co-expression networks expanded from the initial stress response to the onset of bleaching. The later stages involved the suppression of metabolite transport by the coral host, including a variety of sodium-coupled transporters and a putative ammonium transporter, possibly as a response to reduction in algal productivity. The gene-metabolite integration data suggest that thermal treatment results in the activation of animal redox stress pathways involved in quenching molecular oxygen to prevent an overabundance of reactive oxygen species. Lastly, evidence that thermal stress affects reproductive activity was provided by the downregulation of CYP-like genes and the irregular production of sex hormones during the mass spawning cycle. Overall, redox regulation and metabolite transport are key components of the coral animal thermal stress phenome. Mass spawning was highly attenuated under thermal stress, suggesting that global climate change may negatively impact reproductive behavior in this species.
RESUMO
The germline-soma divide is a fundamental distinction in developmental biology, and different genes are expressed in germline and somatic cells throughout metazoan life cycles. Ciliates, a group of microbial eukaryotes, exhibit germline-somatic nuclear dimorphism within a single cell with two different genomes. The ciliate Oxytricha trifallax undergoes massive RNA-guided DNA elimination and genome rearrangement to produce a new somatic macronucleus (MAC) from a copy of the germline micronucleus (MIC). This process eliminates noncoding DNA sequences that interrupt genes and also deletes hundreds of germline-limited open reading frames (ORFs) that are transcribed during genome rearrangement. Here, we update the set of transcribed germline-limited ORFs (TGLOs) in O. trifallax. We show that TGLOs tend to be expressed during nuclear development and then are absent from the somatic MAC. We also demonstrate that exposure to synthetic RNA can reprogram TGLO retention in the somatic MAC and that TGLO retention leads to transcription outside the normal developmental program. These data suggest that TGLOs represent a group of developmentally regulated protein-coding sequences whose gene expression is terminated by DNA elimination.
Assuntos
Oxytricha , Animais , Oxytricha/genética , Rearranjo Gênico , Células Germinativas , DNA/metabolismo , RNA/metabolismoRESUMO
In the post genomic era, large and complex molecular datasets from genome and metagenome sequencing projects expand the limits of what is possible for bioinformatic analyses. Network-based methods are increasingly used to complement phylogenetic analysis in studies in molecular evolution, including comparative genomics, classification, and ecological studies. Using network methods, the vertical and horizontal relationships between all genes or genomes, whether they are from cellular chromosomes or mobile genetic elements, can be explored in a single expandable graph. In recent years, development of new methods for the construction and analysis of networks has helped to broaden the availability of these approaches from programmers to a diversity of users. This chapter introduces the different kinds of networks based on sequence similarity that are already available to tackle a wide range of biological questions, including sequence similarity networks, gene-sharing networks and bipartite graphs, and a guide for their construction and analyses.
Assuntos
Metagenoma , Metagenômica , Biodiversidade , Evolução Biológica , Biologia Computacional/métodos , Ecossistema , Evolução Molecular , Ontologia Genética , Redes Reguladoras de Genes , Sequenciamento de Nucleotídeos em Larga Escala , Metagenômica/métodos , Microbiota , Anotação de Sequência Molecular , Família MultigênicaRESUMO
Explaining the evolution of animals requires ecological, developmental, paleontological, and phylogenetic considerations because organismal traits are affected by complex evolutionary processes. Modeling a plurality of processes, operating at distinct time-scales on potentially interdependent traits, can benefit from approaches that are complementary treatments to phylogenetics. Here, we developed an inclusive network approach, implemented in the command line software ComponentGrapher, and analyzed trait co-occurrence of rhinocerotoid mammals. We identified stable, unstable, and pivotal traits, as well as traits contributing to complexes, that may follow to a common developmental regulation, that point to an early implementation of the postcranial Bauplan among rhinocerotoids. Strikingly, most identified traits are highly dissociable, used repeatedly in distinct combinations and in different taxa, which usually do not form clades. Therefore, the genes encoding these traits are likely recruited into novel gene regulation networks during the course of evolution. Our evo-systemic framework, generalizable to other evolved organizations, supports a pluralistic modeling of organismal evolution, including trees and networks.
Assuntos
Evolução Biológica , Mamíferos/anatomia & histologia , Mamíferos/genética , Animais , Osso e Ossos/anatomia & histologia , Mamíferos/classificação , Filogenia , Software , Dente/anatomia & histologiaRESUMO
Microbes are the oldest and most widespread, phylogenetically and metabolically diverse life forms on Earth. However, they have been discovered only 334 years ago, and their diversity started to become seriously investigated even later. For these reasons, microbial studies that unveil novel microbial lineages and processes affecting or involving microbes deeply (and repeatedly) transform knowledge in biology. Considering the quantitative prevalence of taxonomically and functionally unassigned sequences in environmental genomics data sets, and that of uncultured microbes on the planet, we propose that unraveling the microbial dark matter should be identified as a central priority for biologists. Based on former empirical findings of microbial studies, we sketch a logic of discovery with the potential to further highlight the microbial unknowns.
Assuntos
Bactérias/genética , Variação Genética , Metagenômica , FilogeniaRESUMO
The inclusion of introgressive processes in evolutionary studies induces a less constrained view of evolution. Network-based methods (like large-scale similarity networks) allow to include in comparative genomics all extrachromosomic carriers (like viruses, the most abundant biological entities on the planet) with their cellular hosts. The integration of several levels of biological organization (genes, genomes, communities, environments) enables more comprehensive analyses of gene sharing and improved sequence-based classifications. However, the algorithmic tools for the analysis of such networks are usually restricted to people with high programming skills. We present an integrated suite of software tools named MultiTwin, aimed at the construction, structuring, and analysis of multipartite graphs for evolutionary biology. Typically, this kind of graph is useful for the comparative analysis of the gene content of genomes in microbial communities from the environment and for exploring patterns of gene sharing, for example between distantly related cellular genomes, pangenomes, or between cellular genomes and their mobile genetic elements. We illustrate the use of this tool with an application of the bipartite approach (using gene family-genome graphs) for the analysis of pathogenicity traits in prokaryotes.
Assuntos
Evolução Biológica , Técnicas Genéticas , SoftwareRESUMO
Polycyclic aromatic hydrocarbons are distributed ubiquitously in the environment and form metabolites toxic to most organisms. Organic amendment of PAH contaminated soil with compost and farmyard manure has proven to be efficient for PAH bioremediation mediated by native microorganisms, even though information on the identity of PAH degraders in organic-amended soil is still scarce. Here we provide molecular insight into the bacterial communities in soil amended with compost or farmyard manure for which the degradation mass balances of 13C-labeled pyrene have been recently published and assess the relevant bacterial genera capable of degrading pyrene as a model PAH. We performed statistical analyses of bacterial genera abundance data based on total DNA and RNA (for comparison) extracted from the soil samples. The results revealed complex pyrene degrading communities with low abundance of individual degraders instead of a limited number of abundant key players. The bacterial degrader communities of the soil-compost mixture and soil fertilized with farmyard manure differed considerably in composition albeit showing similar degradation kinetics. Additional analyses were carried out on enrichment cultures and enabled the reconstruction of several nearly complete genomes, thus allowing to link microcosm and enrichment experiments. However, pyrene mineralizing bacteria enriched from the compost or unfertilized soil-compost samples did not dominate pyrene degradation in the soils. Based on the present findings, evaluations of PAH degrading microorganisms in complex soil mixtures with high organic matter content should not target abundant key degrading species, since the specific degraders may be highly diverse, of low abundance, and masked by high bacterial background.
RESUMO
BACKGROUND: For a long time biologists and linguists have been noticing surprising similarities between the evolution of life forms and languages. Most of the proposed analogies have been rejected. Some, however, have persisted, and some even turned out to be fruitful, inspiring the transfer of methods and models between biology and linguistics up to today. Most proposed analogies were based on a comparison of the research objects rather than the processes that shaped their evolution. Focusing on process-based analogies, however, has the advantage of minimizing the risk of overstating similarities, while at the same time reflecting the common strategy to use processes to explain the evolution of complexity in both fields. RESULTS: We compared important evolutionary processes in biology and linguistics and identified processes specific to only one of the two disciplines as well as processes which seem to be analogous, potentially reflecting core evolutionary processes. These new process-based analogies support novel methodological transfer, expanding the application range of biological methods to the field of historical linguistics. We illustrate this by showing (i) how methods dealing with incomplete lineage sorting offer an introgression-free framework to analyze highly mosaic word distributions across languages; (ii) how sequence similarity networks can be used to identify composite and borrowed words across different languages; (iii) how research on partial homology can inspire new methods and models in both fields; and (iv) how constructive neutral evolution provides an original framework for analyzing convergent evolution in languages resulting from common descent (Sapir's drift). CONCLUSIONS: Apart from new analogies between evolutionary processes, we also identified processes which are specific to either biology or linguistics. This shows that general evolution cannot be studied from within one discipline alone. In order to get a full picture of evolution, biologists and linguists need to complement their studies, trying to identify cross-disciplinary and discipline-specific evolutionary processes. The fact that we found many process-based analogies favoring transfer from biology to linguistics further shows that certain biological methods and models have a broader scope than previously recognized. This opens fruitful paths for collaboration between the two disciplines. REVIEWERS: This article was reviewed by W. Ford Doolittle and Eugene V. Koonin.