RESUMEN
Genome sequences of diverse free-living protists are essential for understanding eukaryotic evolution and molecular and cell biology. The free-living amoeboflagellate Naegleria gruberi belongs to a varied and ubiquitous protist clade (Heterolobosea) that diverged from other eukaryotic lineages over a billion years ago. Analysis of the 15,727 protein-coding genes encoded by Naegleria's 41 Mb nuclear genome indicates a capacity for both aerobic respiration and anaerobic metabolism with concomitant hydrogen production, with fundamental implications for the evolution of organelle metabolism. The Naegleria genome facilitates substantially broader phylogenomic comparisons of free-living eukaryotes than previously possible, allowing us to identify thousands of genes likely present in the pan-eukaryotic ancestor, with 40% likely eukaryotic inventions. Moreover, we construct a comprehensive catalog of amoeboid-motility genes. The Naegleria genome, analyzed in the context of other protists, reveals a remarkably complex ancestral eukaryote with a rich repertoire of cytoskeletal, sexual, signaling, and metabolic modules.
Asunto(s)
Evolución Biológica , Naegleria/genética , Eucariontes/clasificación , Eucariontes/genética , Flagelos/metabolismo , Datos de Secuencia Molecular , Naegleria/metabolismo , Filogenia , Proteínas Protozoarias/análisis , Proteínas Protozoarias/genéticaRESUMEN
Porphyra umbilicalis (laver) belongs to an ancient group of red algae (Bangiophyceae), is harvested for human food, and thrives in the harsh conditions of the upper intertidal zone. Here we present the 87.7-Mbp haploid Porphyra genome (65.8% G + C content, 13,125 gene loci) and elucidate traits that inform our understanding of the biology of red algae as one of the few multicellular eukaryotic lineages. Novel features of the Porphyra genome shared by other red algae relate to the cytoskeleton, calcium signaling, the cell cycle, and stress-tolerance mechanisms including photoprotection. Cytoskeletal motor proteins in Porphyra are restricted to a small set of kinesins that appear to be the only universal cytoskeletal motors within the red algae. Dynein motors are absent, and most red algae, including Porphyra, lack myosin. This surprisingly minimal cytoskeleton offers a potential explanation for why red algal cells and multicellular structures are more limited in size than in most multicellular lineages. Additional discoveries further relating to the stress tolerance of bangiophytes include ancestral enzymes for sulfation of the hydrophilic galactan-rich cell wall, evidence for mannan synthesis that originated before the divergence of green and red algae, and a high capacity for nutrient uptake. Our analyses provide a comprehensive understanding of the red algae, which are both commercially important and have played a major role in the evolution of other algal groups through secondary endosymbioses.
Asunto(s)
Citoesqueleto/genética , Evolución Molecular , Genoma de Planta/genética , Porphyra/citología , Porphyra/genética , Actinas/genética , Señalización del Calcio/genética , Ciclo Celular/genética , Pared Celular/genética , Pared Celular/metabolismo , Cromatina/genética , Cinesinas/genética , FilogeniaRESUMEN
The freshwater cnidarian Hydra was first described in 1702 and has been the object of study for 300 years. Experimental studies of Hydra between 1736 and 1744 culminated in the discovery of asexual reproduction of an animal by budding, the first description of regeneration in an animal, and successful transplantation of tissue between animals. Today, Hydra is an important model for studies of axial patterning, stem cell biology and regeneration. Here we report the genome of Hydra magnipapillata and compare it to the genomes of the anthozoan Nematostella vectensis and other animals. The Hydra genome has been shaped by bursts of transposable element expansion, horizontal gene transfer, trans-splicing, and simplification of gene structure and gene content that parallel simplification of the Hydra life cycle. We also report the sequence of the genome of a novel bacterium stably associated with H. magnipapillata. Comparisons of the Hydra genome to the genomes of other animals shed light on the evolution of epithelia, contractile tissues, developmentally regulated transcription factors, the Spemann-Mangold organizer, pluripotency genes and the neuromuscular junction.
Asunto(s)
Genoma/genética , Hydra/genética , Animales , Antozoos/genética , Comamonadaceae/genética , Elementos Transponibles de ADN/genética , Transferencia de Gen Horizontal/genética , Genoma Bacteriano/genética , Hydra/microbiología , Hydra/ultraestructura , Datos de Secuencia Molecular , Unión Neuromuscular/ultraestructuraRESUMEN
Membrane transporters play a central role in many cellular processes that rely on the movement of ions and organic molecules between the environment and the cell, and between cellular compartments. Transporters have been well characterized in plants and green algae, but little is known about transporters or their evolutionary histories in the red algae. Here we examined 482 expressed sequence tag contigs that encode putative membrane transporters in the economically important red seaweed Porphyra (Bangiophyceae, Rhodophyta). These contigs are part of a comprehensive transcriptome dataset from Porphyra umbilicalis and Porphyra purpurea. Using phylogenomics, we identified 30 trees that support the expected monophyly of red and green algae/plants (i.e. the Plantae hypothesis) and 19 expressed sequence tag contigs that show evidence of endosymbiotic/horizontal gene transfer involving stramenopiles. The majority (77%) of analyzed contigs encode transporters with unresolved phylogenies, demonstrating the difficulty in resolving the evolutionary history of genes. We observed molecular features of many sodium-coupled transport systems in marine algae, and the potential for coregulation of Porphyra transporter genes that are associated with fatty acid biosynthesis and intracellular lipid trafficking. Although both the tissue-specific and subcellular locations of the encoded proteins require further investigation, our study provides red algal gene candidates associated with transport functions and novel insights into the biology and evolution of these transporters.
Asunto(s)
Eucariontes/genética , Transferencia de Gen Horizontal/genética , Proteínas de Transporte de Membrana/genética , Fotosíntesis/genética , Porphyra/genética , Sodio/metabolismo , Acuaporinas/metabolismo , Transporte Biológico/genética , Señalización del Calcio/genética , Evolución Molecular , Etiquetas de Secuencia Expresada , Agua Dulce , Genes , Transporte Iónico/genética , Metabolismo de los Lípidos/genética , Proteínas de Transporte de Membrana/metabolismo , Datos de Secuencia Molecular , Nitratos/metabolismo , Filogenia , Compuestos de Amonio Cuaternario/metabolismo , Agua de Mar , Transcriptoma/genéticaRESUMEN
The plastid is a defining structure of photosynthetic eukaryotes and houses many plant-specific processes, including the light reactions, carbon fixation, pigment synthesis, and other primary metabolic processes. Identifying proteins associated with catalytic, structural, and regulatory functions that are unique to plastid-containing organisms is necessary to fully define the scope of plant biochemistry. Here, we performed phylogenomics on 20 genomes to compile a new inventory of 597 nucleus-encoded proteins conserved in plants and green algae but not in non-photosynthetic organisms. 286 of these proteins are of known function, whereas 311 are not characterized. This inventory was validated as applicable and relevant to diverse photosynthetic eukaryotes using an additional eight genomes from distantly related plants (including Micromonas, Selaginella, and soybean). Manual curation of the known proteins in the inventory established its importance to plastid biochemistry. To predict functions for the 52% of proteins of unknown function, we used sequence motifs, subcellular localization, co-expression analysis, and RNA abundance data. We demonstrate that 18% of the proteins in the inventory have functions outside the plastid and/or beyond green tissues. Although 32% of proteins in the inventory have homologs in all cyanobacteria, unexpectedly, 30% are eukaryote-specific. Finally, 8% of the proteins of unknown function share no similarity to any characterized protein and are plant lineage-specific. We present this annotated inventory of 597 proteins as a resource for functional analyses of plant-specific biochemistry.
Asunto(s)
Bases de Datos de Proteínas , Proteínas de Plantas/química , Algoritmos , Arabidopsis/genética , Bioquímica/métodos , Linaje de la Célula , Núcleo Celular/metabolismo , Biología Computacional/métodos , Cianobacterias/genética , Cianobacterias/metabolismo , Genoma de Planta , Genómica , Análisis de Secuencia por Matrices de Oligonucleótidos , Filogenia , ARN/metabolismo , ARN de Planta/genéticaRESUMEN
Cassava (Manihot esculenta) provides calories and nutrition for more than half a billion people. It was domesticated by native Amazonian peoples through cultivation of the wild progenitor M. esculenta ssp. flabellifolia and is now grown in tropical regions worldwide. Here we provide a high-quality genome assembly for cassava with improved contiguity, linkage, and completeness; almost 97% of genes are anchored to chromosomes. We find that paleotetraploidy in cassava is shared with the related rubber tree Hevea, providing a resource for comparative studies. We also sequence a global collection of 58 Manihot accessions, including cultivated and wild cassava accessions and related species such as Ceará or India rubber (M. glaziovii), and genotype 268 African cassava varieties. We find widespread interspecific admixture, and detect the genetic signature of past cassava breeding programs. As a clonally propagated crop, cassava is especially vulnerable to pathogens and abiotic stresses. This genomic resource will inform future genome-enabled breeding efforts to improve this staple crop.
Asunto(s)
ADN de Plantas/genética , Hibridación Genética/genética , Manihot/clasificación , Manihot/genética , Fitomejoramiento/métodos , Análisis de Secuencia de ADN/métodos , Mapeo Cromosómico/métodos , Secuencia Conservada/genética , Variación Genética , Genoma de Planta/genética , Especificidad de la EspecieRESUMEN
The red seaweed Porphyra (Bangiophyceae) and related Bangiales have global economic importance. Here, we report the analysis of a comprehensive transcriptome comprising ca. 4.7 million expressed sequence tag (EST) reads from P. umbilicalis (L.) J. Agardh and P. purpurea (Roth) C. Agardh (ca. 980 Mbp of data generated using 454 FLX pyrosequencing). These ESTs were isolated from the haploid gametophyte (blades from both species) and diploid conchocelis stage (from P. purpurea). In a bioinformatic analysis, only 20% of the contigs were found to encode proteins of known biological function. Comparative analysis of predicted protein functions in mesophilic (including Porphyra) and extremophilic red algae suggest that the former has more putative functions related to signaling, membrane transport processes, and establishment of protein complexes. These enhanced functions may reflect general mesophilic adaptations. A near-complete repertoire of genes encoding histones and ribosomal proteins was identified, with some differentially regulated between the blade and conchocelis stage in P. purpurea. This finding may reflect specific regulatory processes associated with these distinct phases of the life history. Fatty acid desaturation patterns, in combination with gene expression profiles, demonstrate differences from seed plants with respect to the transport of fatty acid/lipid among subcellular compartments and the molecular machinery of lipid assembly. We also recovered a near-complete gene repertoire for enzymes involved in the formation of sterols and carotenoids, including candidate genes for the biosynthesis of lutein. Our findings provide key insights into the evolution, development, and biology of Porphyra, an important lineage of red algae.
RESUMEN
The multicellular green alga Volvox carteri and its morphologically diverse close relatives (the volvocine algae) are well suited for the investigation of the evolution of multicellularity and development. We sequenced the 138-mega-base pair genome of V. carteri and compared its approximately 14,500 predicted proteins to those of its unicellular relative Chlamydomonas reinhardtii. Despite fundamental differences in organismal complexity and life history, the two species have similar protein-coding potentials and few species-specific protein-coding gene predictions. Volvox is enriched in volvocine-algal-specific proteins, including those associated with an expanded and highly compartmentalized extracellular matrix. Our analysis shows that increases in organismal complexity can be associated with modifications of lineage-specific proteins rather than large-scale invention of protein-coding capacity.
Asunto(s)
Proteínas Algáceas/química , Proteínas Algáceas/genética , Chlamydomonas reinhardtii/genética , Genoma , Volvox/genética , Proteínas Algáceas/metabolismo , Evolución Biológica , Chlamydomonas reinhardtii/citología , Chlamydomonas reinhardtii/crecimiento & desarrollo , Chlamydomonas reinhardtii/fisiología , ADN de Algas/genética , Evolución Molecular , Proteínas de la Matriz Extracelular/química , Proteínas de la Matriz Extracelular/genética , Genes , Datos de Secuencia Molecular , Estructura Terciaria de Proteína , Secuencias Repetitivas de Ácidos Nucleicos , Análisis de Secuencia de ADN , Especificidad de la Especie , Sintenía , Volvox/citología , Volvox/crecimiento & desarrollo , Volvox/fisiologíaRESUMEN
Understanding how animal complexity has arisen and identifying the key genetic components of this process is a central goal of evolutionary developmental biology. The discovery of microRNAs (miRNAs) as key regulators of development has identified a new set of candidates for this role. microRNAs are small noncoding RNAs that regulate tissue-specific or temporal gene expression through base pairing with target mRNAs. The full extent of the evolutionary distribution of miRNAs is being revealed as more genomes are scrutinized. To explore the evolutionary origins of metazoan miRNAs, we searched the genomes of diverse animals occupying key phylogenetic positions for homologs of experimentally verified human, fly, and worm miRNAs. We identify 30 miRNAs conserved across bilaterians, almost double the previous estimate. We hypothesize that this larger than previously realized core set of miRNAs was already present in the ancestor of all Bilateria and likely had key roles in allowing the evolution of diverse specialist cell types, tissues, and complex morphology. In agreement with this hypothesis, we found only three, conserved miRNA families in the genome of the sea anemone Nematostella vectensis and no convincing family members in the genome of the demosponge Reniera sp. The dramatic expansion of the miRNA repertoire in bilaterians relative to sponges and cnidarians suggests that increased miRNA-mediated gene regulation accompanied the emergence of triploblastic organ-containing body plans.
Asunto(s)
Cordados/genética , Evolución Molecular , Amplificación de Genes , Invertebrados/genética , MicroARNs/genética , Algoritmos , Animales , Análisis por Conglomerados , Biología Computacional , Bases de Datos Genéticas , HumanosRESUMEN
Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the approximately 120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella.
Asunto(s)
Proteínas Algáceas/genética , Proteínas Algáceas/fisiología , Evolución Biológica , Chlamydomonas reinhardtii/genética , Genoma , Animales , Chlamydomonas reinhardtii/fisiología , Cloroplastos/metabolismo , Biología Computacional , ADN de Algas/genética , Flagelos/metabolismo , Genes , Genómica , Proteínas de Transporte de Membrana/genética , Proteínas de Transporte de Membrana/fisiología , Datos de Secuencia Molecular , Familia de Multigenes , Fotosíntesis/genética , Filogenia , Plantas/genética , Proteoma , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: The recent completion of the Drosophila melanogaster genomic sequence to high quality and the availability of a greatly expanded set of Drosophila cDNA sequences, aligning to 78% of the predicted euchromatic genes, afforded FlyBase the opportunity to significantly improve genomic annotations. We made the annotation process more rigorous by inspecting each gene visually, utilizing a comprehensive set of curation rules, requiring traceable evidence for each gene model, and comparing each predicted peptide to SWISS-PROT and TrEMBL sequences. RESULTS: Although the number of predicted protein-coding genes in Drosophila remains essentially unchanged, the revised annotation significantly improves gene models, resulting in structural changes to 85% of the transcripts and 45% of the predicted proteins. We annotated transposable elements and non-protein-coding RNAs as new features, and extended the annotation of untranslated (UTR) sequences and alternative transcripts to include more than 70% and 20% of genes, respectively. Finally, cDNA sequence provided evidence for dicistronic transcripts, neighboring genes with overlapping UTRs on the same DNA sequence strand, alternatively spliced genes that encode distinct, non-overlapping peptides, and numerous nested genes. CONCLUSIONS: Identification of so many unusual gene models not only suggests that some mechanisms for gene regulation are more prevalent than previously believed, but also underscores the complex challenges of eukaryotic gene prediction. At present, experimental data and human curation remain essential to generate high-quality genome annotations.