RESUMO
Current genomic perspectives on animal diversity neglect two prominent phyla, the molluscs and annelids, that together account for nearly one-third of known marine species and are important both ecologically and as experimental systems in classical embryology. Here we describe the draft genomes of the owl limpet (Lottia gigantea), a marine polychaete (Capitella teleta) and a freshwater leech (Helobdella robusta), and compare them with other animal genomes to investigate the origin and diversification of bilaterians from a genomic perspective. We find that the genome organization, gene structure and functional content of these species are more similar to those of some invertebrate deuterostome genomes (for example, amphioxus and sea urchin) than those of other protostomes that have been sequenced to date (flies, nematodes and flatworms). The conservation of these genomic features enables us to expand the inventory of genes present in the last common bilaterian ancestor, establish the tripartite diversification of bilaterians using multiple genomic characteristics and identify ancient conserved long- and short-range genetic linkages across metazoans. Superimposed on this broadly conserved pan-bilaterian background we find examples of lineage-specific genome evolution, including varying rates of rearrangement, intron gain and loss, expansions and contractions of gene families, and the evolution of clade-specific genes that produce the unique content of each genome.
Assuntos
Padronização Corporal/genética , Evolução Molecular , Genoma/genética , Sanguessugas/genética , Moluscos/genética , Filogenia , Poliquetos/genética , Animais , Sequência Conservada/genética , Genes Homeobox/genética , Ligação Genética , Especiação Genética , Humanos , Mutação INDEL/genética , Íntrons/genética , Sanguessugas/anatomia & histologia , Moluscos/anatomia & histologia , Família Multigênica/genética , Poliquetos/anatomia & histologia , Sintenia/genéticaRESUMO
Harmful algal blooms (HABs) cause significant economic and ecological damage worldwide. Despite considerable efforts, a comprehensive understanding of the factors that promote these blooms has been lacking, because the biochemical pathways that facilitate their dominance relative to other phytoplankton within specific environments have not been identified. Here, biogeochemical measurements showed that the harmful alga Aureococcus anophagefferens outcompeted co-occurring phytoplankton in estuaries with elevated levels of dissolved organic matter and turbidity and low levels of dissolved inorganic nitrogen. We subsequently sequenced the genome of A. anophagefferens and compared its gene complement with those of six competing phytoplankton species identified through metaproteomics. Using an ecogenomic approach, we specifically focused on gene sets that may facilitate dominance within the environmental conditions present during blooms. A. anophagefferens possesses a larger genome (56 Mbp) and has more genes involved in light harvesting, organic carbon and nitrogen use, and encoding selenium- and metal-requiring enzymes than competing phytoplankton. Genes for the synthesis of microbial deterrents likely permit the proliferation of this species, with reduced mortality losses during blooms. Collectively, these findings suggest that anthropogenic activities resulting in elevated levels of turbidity, organic matter, and metals have opened a niche within coastal ecosystems that ideally suits the unique genetic capacity of A. anophagefferens and thus, has facilitated the proliferation of this and potentially other HABs.
Assuntos
Ecossistema , Eucariotos/genética , Genômica/métodos , Sequência de Aminoácidos , Bactérias/metabolismo , Bactérias/efeitos da radiação , Biodegradação Ambiental/efeitos da radiação , Enzimas/metabolismo , Eucariotos/enzimologia , Genoma/genética , Luz , Filogenia , Fitoplâncton/genética , Fitoplâncton/efeitos da radiação , Proteínas/química , Especificidade da EspécieRESUMO
Chlorella variabilis NC64A, a unicellular photosynthetic green alga (Trebouxiophyceae), is an intracellular photobiont of Paramecium bursaria and a model system for studying virus/algal interactions. We sequenced its 46-Mb nuclear genome, revealing an expansion of protein families that could have participated in adaptation to symbiosis. NC64A exhibits variations in GC content across its genome that correlate with global expression level, average intron size, and codon usage bias. Although Chlorella species have been assumed to be asexual and nonmotile, the NC64A genome encodes all the known meiosis-specific proteins and a subset of proteins found in flagella. We hypothesize that Chlorella might have retained a flagella-derived structure that could be involved in sexual reproduction. Furthermore, a survey of phytohormone pathways in chlorophyte algae identified algal orthologs of Arabidopsis thaliana genes involved in hormone biosynthesis and signaling, suggesting that these functions were established prior to the evolution of land plants. We show that the ability of Chlorella to produce chitinous cell walls likely resulted from the capture of metabolic genes by horizontal gene transfer from algal viruses, prokaryotes, or fungi. Analysis of the NC64A genome substantially advances our understanding of the green lineage evolution, including the genomic interplay with viruses and symbiosis between eukaryotes.
Assuntos
Chlorella/genética , Evolução Molecular , Genoma de Planta , Simbiose , Composição de Bases , Parede Celular/metabolismo , Chlorella/virologia , DNA de Plantas/genética , Etiquetas de Sequências Expressas , Flagelos/genética , Dados de Sequência Molecular , Família Multigênica , Reguladores de Crescimento de Plantas/genética , Sequências Repetitivas de Ácido Nucleico , Reprodução , Análise de Sequência de DNARESUMO
The multicellular green alga Volvox carteri and its morphologically diverse close relatives (the volvocine algae) are well suited for the investigation of the evolution of multicellularity and development. We sequenced the 138-mega-base pair genome of V. carteri and compared its approximately 14,500 predicted proteins to those of its unicellular relative Chlamydomonas reinhardtii. Despite fundamental differences in organismal complexity and life history, the two species have similar protein-coding potentials and few species-specific protein-coding gene predictions. Volvox is enriched in volvocine-algal-specific proteins, including those associated with an expanded and highly compartmentalized extracellular matrix. Our analysis shows that increases in organismal complexity can be associated with modifications of lineage-specific proteins rather than large-scale invention of protein-coding capacity.
Assuntos
Proteínas de Algas/química , Proteínas de Algas/genética , Chlamydomonas reinhardtii/genética , Genoma , Volvox/genética , Proteínas de Algas/metabolismo , Evolução Biológica , Chlamydomonas reinhardtii/citologia , Chlamydomonas reinhardtii/crescimento & desenvolvimento , Chlamydomonas reinhardtii/fisiologia , DNA de Algas/genética , Evolução Molecular , Proteínas da Matriz Extracelular/química , Proteínas da Matriz Extracelular/genética , Genes , Dados de Sequência Molecular , Estrutura Terciária de Proteína , Sequências Repetitivas de Ácido Nucleico , Análise de Sequência de DNA , Especificidade da Espécie , Sintenia , Volvox/citologia , Volvox/crescimento & desenvolvimento , Volvox/fisiologiaRESUMO
The western clawed frog Xenopus tropicalis is an important model for vertebrate development that combines experimental advantages of the African clawed frog Xenopus laevis with more tractable genetics. Here we present a draft genome sequence assembly of X. tropicalis. This genome encodes more than 20,000 protein-coding genes, including orthologs of at least 1700 human disease genes. Over 1 million expressed sequence tags validated the annotation. More than one-third of the genome consists of transposable elements, with unusually prevalent DNA transposons. Like that of other tetrapods, the genome of X. tropicalis contains gene deserts enriched for conserved noncoding elements. The genome exhibits substantial shared synteny with human and chicken over major parts of large chromosomes, broken by lineage-specific chromosome fusions and fissions, mainly in the mammalian lineage.
Assuntos
Genoma , Análise de Sequência de DNA , Xenopus/genética , Animais , Galinhas/genética , Mapeamento Cromossômico , Cromossomos/genética , Biologia Computacional , Sequência Conservada , Elementos de DNA Transponíveis , DNA Complementar , Embrião não Mamífero/metabolismo , Evolução Molecular , Etiquetas de Sequências Expressas , Duplicação Gênica , Genes , Humanos , Filogenia , Sintenia , Vertebrados/genética , Xenopus/embriologia , Proteínas de Xenopus/genéticaRESUMO
Lancelets ('amphioxus') are the modern survivors of an ancient chordate lineage, with a fossil record dating back to the Cambrian period. Here we describe the structure and gene content of the highly polymorphic approximately 520-megabase genome of the Florida lancelet Branchiostoma floridae, and analyse it in the context of chordate evolution. Whole-genome comparisons illuminate the murky relationships among the three chordate groups (tunicates, lancelets and vertebrates), and allow not only reconstruction of the gene complement of the last common chordate ancestor but also partial reconstruction of its genomic organization, as well as a description of two genome-wide duplications and subsequent reorganizations in the vertebrate lineage. These genome-scale events shaped the vertebrate genome and provided additional genetic variation for exploitation during vertebrate evolution.
Assuntos
Cordados/genética , Evolução Molecular , Genoma/genética , Animais , Cordados/classificação , Sequência Conservada , Elementos de DNA Transponíveis/genética , Duplicação Gênica , Genes/genética , Ligação Genética , Humanos , Íntrons/genética , Cariotipagem , Família Multigênica , Filogenia , Polimorfismo Genético/genética , Proteínas/genética , Sintenia , Fatores de Tempo , Vertebrados/classificação , Vertebrados/genéticaRESUMO
Trichoderma reesei is the main industrial source of cellulases and hemicellulases used to depolymerize biomass to simple sugars that are converted to chemical intermediates and biofuels, such as ethanol. We assembled 89 scaffolds (sets of ordered and oriented contigs) to generate 34 Mbp of nearly contiguous T. reesei genome sequence comprising 9,129 predicted gene models. Unexpectedly, considering the industrial utility and effectiveness of the carbohydrate-active enzymes of T. reesei, its genome encodes fewer cellulases and hemicellulases than any other sequenced fungus able to hydrolyze plant cell wall polysaccharides. Many T. reesei genes encoding carbohydrate-active enzymes are distributed nonrandomly in clusters that lie between regions of synteny with other Sordariomycetes. Numerous genes encoding biosynthetic pathways for secondary metabolites may promote survival of T. reesei in its competitive soil habitat, but genome analysis provided little mechanistic insight into its extraordinary capacity for protein secretion. Our analysis, coupled with the genome sequence data, provides a roadmap for constructing enhanced T. reesei strains for industrial applications such as biofuel production.
Assuntos
Mapeamento Cromossômico/métodos , DNA Fúngico/genética , Genoma Fúngico/genética , Análise de Sequência de DNA/métodos , Trichoderma/genética , Sequência de Bases , Dados de Sequência Molecular , Trichoderma/classificaçãoRESUMO
BACKGROUND: Scleractinian corals are the foundation of reef ecosystems in tropical marine environments. Their great success is due to interactions with endosymbiotic dinoflagellates (Symbiodinium spp.), with which they are obligately symbiotic. To develop a foundation for studying coral biology and coral symbiosis, we have constructed a set of cDNA libraries and generated and annotated ESTs from two species of corals, Acropora palmata and Montastraea faveolata. RESULTS: We generated 14,588 (Ap) and 3,854 (Mf) high quality ESTs from five life history/symbiosis stages (spawned eggs, early-stage planula larvae, late-stage planula larvae either infected with symbionts or uninfected, and adult coral). The ESTs assembled into a set of primarily stage-specific clusters, producing 4,980 (Ap), and 1,732 (Mf) unigenes. The egg stage library, relative to the other developmental stages, was enriched in genes functioning in cell division and proliferation, transcription, signal transduction, and regulation of protein function. Fifteen unigenes were identified as candidate symbiosis-related genes as they were expressed in all libraries constructed from the symbiotic stages and were absent from all of the non symbiotic stages. These include several DNA interacting proteins, and one highly expressed unigene (containing 17 cDNAs) with no significant protein-coding region. A significant number of unigenes (25) encode potential pattern recognition receptors (lectins, scavenger receptors, and others), as well as genes that may function in signaling pathways involved in innate immune responses (toll-like signaling, NFkB p105, and MAP kinases). Comparison between the A. palmata and an A. millepora EST dataset identified ferritin as a highly expressed gene in both datasets that appears to be undergoing adaptive evolution. Five unigenes appear to be restricted to the Scleractinia, as they had no homology to any sequences in the nr databases nor to the non-scleractinian cnidarians Nematostella vectensis and Hydra magnipapillata. CONCLUSION: Partial sequencing of 5 cDNA libraries each for A. palmata and M. faveolata has produced a rich set of candidate genes (4,980 genes from A. palmata, and 1,732 genes from M. faveolata) that we can use as a starting point for examining the life history and symbiosis of these two species, as well as to further expand the dataset of cnidarian genes for comparative genomics and evolutionary studies.
Assuntos
Antozoários/genética , Etiquetas de Sequências Expressas , Genômica/métodos , Simbiose , Sequência de Aminoácidos , Animais , Antozoários/crescimento & desenvolvimento , Antozoários/parasitologia , DNA Complementar/química , DNA Complementar/genética , Dinoflagellida/crescimento & desenvolvimento , Ecossistema , Evolução Molecular , Ferritinas/genética , Interações Hospedeiro-Parasita , Dados de Sequência Molecular , Análise de Sequência de DNA , Homologia de Sequência de Aminoácidos , Especificidade da Espécie , Transcrição GênicaRESUMO
We report the draft genome sequence of the model moss Physcomitrella patens and compare its features with those of flowering plants, from which it is separated by more than 400 million years, and unicellular aquatic algae. This comparison reveals genomic changes concomitant with the evolutionary movement to land, including a general increase in gene family complexity; loss of genes associated with aquatic environments (e.g., flagellar arms); acquisition of genes for tolerating terrestrial stresses (e.g., variation in temperature and water availability); and the development of the auxin and abscisic acid signaling pathways for coordinating multicellular growth and dehydration response. The Physcomitrella genome provides a resource for phylogenetic inferences about gene function and for experimental analysis of plant processes through this plant's unique facility for reverse genetics.
Assuntos
Evolução Biológica , Bryopsida/genética , Genoma de Planta , Adaptação Fisiológica , Animais , Arabidopsis/genética , Arabidopsis/fisiologia , Bryopsida/fisiologia , Chlamydomonas reinhardtii/genética , Chlamydomonas reinhardtii/fisiologia , Biologia Computacional , Reparo do DNA , Desidratação , Duplicação Gênica , Genes de Plantas , Magnoliopsida/genética , Magnoliopsida/fisiologia , Redes e Vias Metabólicas/genética , Família Multigênica , Oryza/genética , Oryza/fisiologia , Filogenia , Proteínas de Plantas/genética , Proteínas de Plantas/fisiologia , Sequências Repetitivas de Ácido Nucleico , Retroelementos , Análise de Sequência de DNA , Transdução de Sinais/genéticaRESUMO
Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the approximately 120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella.
Assuntos
Proteínas de Algas/genética , Proteínas de Algas/fisiologia , Evolução Biológica , Chlamydomonas reinhardtii/genética , Genoma , Animais , Chlamydomonas reinhardtii/fisiologia , Cloroplastos/metabolismo , Biologia Computacional , DNA de Algas/genética , Flagelos/metabolismo , Genes , Genômica , Proteínas de Membrana Transportadoras/genética , Proteínas de Membrana Transportadoras/fisiologia , Dados de Sequência Molecular , Família Multigênica , Fotossíntese/genética , Filogenia , Plantas/genética , Proteoma , Análise de Sequência de DNARESUMO
Sea anemones are seemingly primitive animals that, along with corals, jellyfish, and hydras, constitute the oldest eumetazoan phylum, the Cnidaria. Here, we report a comparative analysis of the draft genome of an emerging cnidarian model, the starlet sea anemone Nematostella vectensis. The sea anemone genome is complex, with a gene repertoire, exon-intron structure, and large-scale gene linkage more similar to vertebrates than to flies or nematodes, implying that the genome of the eumetazoan ancestor was similarly complex. Nearly one-fifth of the inferred genes of the ancestor are eumetazoan novelties, which are enriched for animal functions like cell signaling, adhesion, and synaptic transmission. Analysis of diverse pathways suggests that these gene "inventions" along the lineage leading to animals were likely already well integrated with preexisting eukaryotic genes in the eumetazoan progenitor.
Assuntos
Evolução Biológica , Genoma , Anêmonas-do-Mar/genética , Animais , Adesão Celular , Evolução Molecular , Genes , Ligação Genética , Genoma Humano , Genômica , Humanos , Íntrons , Redes e Vias Metabólicas , Família Multigênica , Músculos/fisiologia , Fenômenos Fisiológicos do Sistema Nervoso , Filogenia , Anêmonas-do-Mar/metabolismo , Análise de Sequência de DNA , Transdução de Sinais , SinteniaRESUMO
Draft genome sequences have been determined for the soybean pathogen Phytophthora sojae and the sudden oak death pathogen Phytophthora ramorum. Oömycetes such as these Phytophthora species share the kingdom Stramenopila with photosynthetic algae such as diatoms, and the presence of many Phytophthora genes of probable phototroph origin supports a photosynthetic ancestry for the stramenopiles. Comparison of the two species' genomes reveals a rapid expansion and diversification of many protein families associated with plant infection such as hydrolases, ABC transporters, protein toxins, proteinase inhibitors, and, in particular, a superfamily of 700 proteins with similarity to known oömycete avirulence genes.
Assuntos
Evolução Biológica , DNA de Algas/genética , Genoma , Phytophthora/genética , Phytophthora/patogenicidade , Proteínas de Algas/genética , Proteínas de Algas/fisiologia , Genes , Hidrolases/genética , Hidrolases/metabolismo , Fotossíntese/genética , Filogenia , Mapeamento Físico do Cromossomo , Phytophthora/classificação , Phytophthora/fisiologia , Doenças das Plantas/microbiologia , Polimorfismo de Nucleotídeo Único , Sequências Repetitivas de Ácido Nucleico , Análise de Sequência de DNA , Simbiose , Toxinas Biológicas/genéticaRESUMO
Human chromosome 16 features one of the highest levels of segmentally duplicated sequence among the human autosomes. We report here the 78,884,754 base pairs of finished chromosome 16 sequence, representing over 99.9% of its euchromatin. Manual annotation revealed 880 protein-coding genes confirmed by 1,670 aligned transcripts, 19 transfer RNA genes, 341 pseudogenes and three RNA pseudogenes. These genes include metallothionein, cadherin and iroquois gene families, as well as the disease genes for polycystic kidney disease and acute myelomonocytic leukaemia. Several large-scale structural polymorphisms spanning hundreds of kilobase pairs were identified and result in gene content differences among humans. Whereas the segmental duplications of chromosome 16 are enriched in the relatively gene-poor pericentromere of the p arm, some are involved in recent gene duplication and conversion events that are likely to have had an impact on the evolution of primates and human disease susceptibility.
Assuntos
Cromossomos Humanos Par 16/genética , Duplicação Gênica , Mapeamento Físico do Cromossomo , Animais , Genes/genética , Genômica , Heterocromatina/genética , Humanos , Dados de Sequência Molecular , Polimorfismo Genético/genética , Análise de Sequência de DNA , Sintenia/genéticaRESUMO
Chromosome 5 is one of the largest human chromosomes and contains numerous intrachromosomal duplications, yet it has one of the lowest gene densities. This is partially explained by numerous gene-poor regions that display a remarkable degree of noncoding conservation with non-mammalian vertebrates, suggesting that they are functionally constrained. In total, we compiled 177.7 million base pairs of highly accurate finished sequence containing 923 manually curated protein-coding genes including the protocadherin and interleukin gene families. We also completely sequenced versions of the large chromosome-5-specific internal duplications. These duplications are very recent evolutionary events and probably have a mechanistic role in human physiological variation, as deletions in these regions are the cause of debilitating disorders including spinal muscular atrophy.
Assuntos
Cromossomos Humanos Par 5/genética , Análise de Sequência de DNA , Animais , Composição de Bases , Caderinas/genética , Sequência Conservada/genética , Duplicação Gênica , Genes/genética , Doenças Genéticas Inatas/genética , Genômica , Humanos , Interleucinas/genética , Dados de Sequência Molecular , Atrofia Muscular Espinal/genética , Pan troglodytes/genética , Mapeamento Físico do Cromossomo , Pseudogenes/genética , Sintenia/genética , Vertebrados/genéticaRESUMO
Chromosome 19 has the highest gene density of all human chromosomes, more than double the genome-wide average. The large clustered gene families, corresponding high G + C content, CpG islands and density of repetitive DNA indicate a chromosome rich in biological and evolutionary significance. Here we describe 55.8 million base pairs of highly accurate finished sequence representing 99.9% of the euchromatin portion of the chromosome. Manual curation of gene loci reveals 1,461 protein-coding genes and 321 pseudogenes. Among these are genes directly implicated in mendelian disorders, including familial hypercholesterolaemia and insulin-resistant diabetes. Nearly one-quarter of these genes belong to tandemly arranged families, encompassing more than 25% of the chromosome. Comparative analyses show a fascinating picture of conservation and divergence, revealing large blocks of gene orthology with rodents, scattered regions with more recent gene family expansions and deletions, and segments of coding and non-coding conservation with the distant fish species Takifugu.
Assuntos
Cromossomos Humanos Par 19/genética , Genes/genética , Mapeamento Físico do Cromossomo , Processamento Alternativo/genética , Animais , Composição de Bases , Sequência Conservada/genética , Ilhas de CpG/genética , Evolução Molecular , Duplicação Gênica , Genética Médica , Humanos , Camundongos , Dados de Sequência Molecular , Família Multigênica/genética , Pseudogenes/genética , Análise de Sequência de DNARESUMO
The first chordates appear in the fossil record at the time of the Cambrian explosion, nearly 550 million years ago. The modern ascidian tadpole represents a plausible approximation to these ancestral chordates. To illuminate the origins of chordate and vertebrates, we generated a draft of the protein-coding portion of the genome of the most studied ascidian, Ciona intestinalis. The Ciona genome contains approximately 16,000 protein-coding genes, similar to the number in other invertebrates, but only half that found in vertebrates. Vertebrate gene families are typically found in simplified form in Ciona, suggesting that ascidians contain the basic ancestral complement of genes involved in cell signaling and development. The ascidian genome has also acquired a number of lineage-specific innovations, including a group of genes engaged in cellulose metabolism that are related to those in bacteria and fungi.