RESUMEN
Water lilies belong to the angiosperm order Nymphaeales. Amborellales, Nymphaeales and Austrobaileyales together form the so-called ANA-grade of angiosperms, which are extant representatives of lineages that diverged the earliest from the lineage leading to the extant mesangiosperms1-3. Here we report the 409-megabase genome sequence of the blue-petal water lily (Nymphaea colorata). Our phylogenomic analyses support Amborellales and Nymphaeales as successive sister lineages to all other extant angiosperms. The N. colorata genome and 19 other water lily transcriptomes reveal a Nymphaealean whole-genome duplication event, which is shared by Nymphaeaceae and possibly Cabombaceae. Among the genes retained from this whole-genome duplication are homologues of genes that regulate flowering transition and flower development. The broad expression of homologues of floral ABCE genes in N. colorata might support a similarly broadly active ancestral ABCE model of floral organ determination in early angiosperms. Water lilies have evolved attractive floral scents and colours, which are features shared with mesangiosperms, and we identified their putative biosynthetic genes in N. colorata. The chemical compounds and biosynthetic genes behind floral scents suggest that they have evolved in parallel to those in mesangiosperms. Because of its unique phylogenetic position, the N. colorata genome sheds light on the early evolution of angiosperms.
Asunto(s)
Genoma de Planta , Nymphaea/genética , Filogenia , Flores/genética , Flores/metabolismo , Nymphaea/metabolismo , Odorantes/análisisRESUMEN
Common purslane (Portulaca oleracea) integrates both C4 and crassulacean acid metabolism (CAM) photosynthesis pathways and is a promising model plant to explore C4-CAM plasticity. Here, we report a high-quality chromosome-level genome of nicotinamide adenine dinucleotide (NAD)-malic enzyme (ME) subtype common purslane that provides evidence for 2 rounds of whole-genome duplication (WGD) with an ancient WGD (P-ß) in the common ancestor to Portulacaceae and Cactaceae around 66.30 million years ago (Mya) and another (Po-α) specific to common purslane lineage around 7.74 Mya. A larger number of gene copies encoding key enzymes/transporters involved in C4 and CAM pathways were detected in common purslane than in related species. Phylogeny, conserved functional site, and collinearity analyses revealed that the Po-α WGD produced the phosphoenolpyruvate carboxylase-encoded gene copies used for photosynthesis in common purslane, while the P-ß WGD event produced 2 ancestral genes of functionally differentiated (C4- and CAM-specific) beta carbonic anhydrases involved in the C4 + CAM pathways. Additionally, cis-element enrichment analysis in the promoters showed that CAM-specific genes have recruited both evening and midnight circadian elements as well as the Abscisic acid (ABA)-independent regulatory module mediated by ethylene-response factor cis-elements. Overall, this study provides insights into the origin and evolutionary process of C4 and CAM pathways in common purslane, as well as potential targets for engineering crops by integrating C4 or CAM metabolism.
Asunto(s)
Portulaca , Portulaca/genética , Portulaca/metabolismo , Duplicación de Gen , Metabolismo Ácido de las Crasuláceas , Evolución Biológica , Filogenia , Fotosíntesis/genéticaRESUMEN
Intracellular gene transfers (IGTs) between the nucleus and organelles, including plastids and mitochondria, constantly reshape the nuclear genome during evolution. Despite the substantial contribution of IGTs to genome variation, the dynamic trajectories of IGTs at the pangenomic level remain elusive. Here, we developed an approach, IGTminer, that maps the evolutionary trajectories of IGTs using collinearity and gene reannotation across multiple genome assemblies. We applied IGTminer to create a nuclear organellar gene (NOG) map across 67 genomes covering 15 Poaceae species, including important crops. The resulting NOGs were verified by experiments and sequencing data sets. Our analysis revealed that most NOGs were recently transferred and lineage specific and that Triticeae species tended to have more NOGs than other Poaceae species. Wheat (Triticum aestivum) had a higher retention rate of NOGs than maize (Zea mays) and rice (Oryza sativa), and the retained NOGs were likely involved in photosynthesis and translation pathways. Large numbers of NOG clusters were aggregated in hexaploid wheat during 2 rounds of polyploidization, contributing to the genetic diversity among modern wheat accessions. We implemented an interactive web server to facilitate the exploration of NOGs in Poaceae. In summary, this study provides resources and insights into the roles of IGTs in shaping interspecies and intraspecies genome variation and driving plant genome evolution.
Asunto(s)
Oryza , Poaceae , Poaceae/genética , Triticum/genética , Genoma de Planta/genética , Oryza/genética , Zea mays/genética , Evolución MolecularRESUMEN
All flowering plants are now recognized as diploidized paleopolyploids (Jiao et al., 2011; One Thousand Plant Transcriptomes Initiative, 2019), and polyploid species comprise approximately 30% of contemporary plant species (Wood et al., 2009; Barker et al., 2016a). A major implication of these discoveries is that, to appreciate the evolution of plant diversity, we need to understand the fundamental biology of polyploids and diploidization. This need is broadly recognized by our community as there is a continued, growing interest in polyploidy as a research topic. Over the past 25 years, the sequencing and analysis of plant genomes has revolutionized our understanding of the importance of polyploid speciation to the evolution of land plants.
Asunto(s)
Genoma de Planta , Genómica , Poliploidía , Evolución Biológica , Magnoliopsida/genéticaRESUMEN
BACKGROUND: Sapria himalayana (Rafflesiaceae) is an endoparasitic plant characterized by a greatly reduced vegetative body and giant flowers; however, the mechanisms underlying its special lifestyle and greatly altered plant form remain unknown. To illustrate the evolution and adaptation of S. himalayasna, we report its de novo assembled genome and key insights into the molecular basis of its floral development, flowering time, fatty acid biosynthesis, and defense responses. RESULTS: The genome of S. himalayana is ~ 1.92 Gb with 13,670 protein-coding genes, indicating remarkable gene loss (~ 54%), especially genes involved in photosynthesis, plant body, nutrients, and defense response. Genes specifying floral organ identity and controlling organ size were identified in S. himalayana and Rafflesia cantleyi, and showed analogous spatiotemporal expression patterns in both plant species. Although the plastid genome had been lost, plastids likely biosynthesize essential fatty acids and amino acids (aromatic amino acids and lysine). A set of credible and functional horizontal gene transfer (HGT) events (involving genes and mRNAs) were identified in the nuclear and mitochondrial genomes of S. himalayana, most of which were under purifying selection. Convergent HGTs in Cuscuta, Orobanchaceae, and S. himalayana were mainly expressed at the parasite-host interface. Together, these results suggest that HGTs act as a bridge between the parasite and host, assisting the parasite in acquiring nutrients from the host. CONCLUSIONS: Our results provide new insights into the flower development process and endoparasitic lifestyle of Rafflesiaceae plants. The amount of gene loss in S. himalayana is consistent with the degree of reduction in its body plan. HGT events are common among endoparasites and play an important role in their lifestyle adaptation.
Asunto(s)
Genoma Mitocondrial , Transferencia de Gen Horizontal , Plantas/genética , Flores/genética , FilogeniaRESUMEN
Gene innovation plays an essential role in trait evolution. Rhizobial symbioses, the most important N2-fixing agent in agricultural systems that exists mainly in Leguminosae, is one of the most attractive evolution events. However, the gene innovations underlying Leguminosae root nodule symbiosis (RNS) remain largely unknown. Here, we investigated the gene gain event in Leguminosae RNS evolution through comprehensive phylogenomic analyses. We revealed that Leguminosae-gain genes were acquired by gene duplication and underwent a strong purifying selection. Kyoto Encyclopedia of Genes and Genomes analyses showed that the innovated genes were enriched in flavonoid biosynthesis pathways, particular downstream of chalcone synthase (CHS). Among them, Leguminosae-gain type â ¡ chalcone isomerase (CHI) could be further divided into CHI1A and CHI1B clades, which resulted from the products of tandem duplication. Furthermore, the duplicated CHI genes exhibited exon-intron structural divergences evolved through exon/intron gain/loss and insertion/deletion. Knocking down CHI1B significantly reduced nodulation in Glycine max (soybean) and Medicago truncatula; whereas, knocking down its duplication gene CHI1A had no effect on nodulation. Therefore, Leguminosae-gain type â ¡ CHI participated in RNS and the duplicated CHI1A and CHI1B genes exhibited RNS functional divergence. This study provides functional insights into Leguminosae-gain genetic innovation and sub-functionalization after gene duplication that contribute to the evolution and adaptation of RNS in Leguminosae.
Asunto(s)
Flavonoides , Duplicación de Gen , Nódulos de las Raíces de las Plantas , Simbiosis , Simbiosis/genética , Simbiosis/fisiología , Nódulos de las Raíces de las Plantas/genética , Nódulos de las Raíces de las Plantas/microbiología , Flavonoides/biosíntesis , Flavonoides/metabolismo , Fabaceae/genética , Filogenia , Medicago truncatula/genética , Medicago truncatula/microbiología , Evolución Molecular , Genes de Plantas , Glycine max/genética , Glycine max/microbiología , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Nodulación de la Raíz de la Planta/genética , Regulación de la Expresión Génica de las Plantas , Liasas IntramolecularesRESUMEN
Organisms continuously require genetic variation to adapt to fluctuating environments, yet major evolutionary events are episodic, making the relationship between genome evolution and organismal adaptation of considerable interest. Here, by genome-wide comparison of sorghum, maize, and rice SNPs, we investigated reservoirs of genetic variations with high precision. For sorghum and rice, which have not experienced whole-genome duplication in 96 million years or more, tandem duplicates accumulate relatively more SNPs than paralogous genes retained from genome duplication. However, maize, which experienced lineage-specific genome duplication and has a relatively larger supply of paralogous duplicates, shows SNP enrichment in paralogous genes. The proportion of genes showing signatures of recent positive selection is higher in small-scale (tandem and transposed) than genome-scale duplicates in sorghum, but the opposite is true in maize. A large proportion of recent duplications in rice are species-specific; however, most recent duplications in sorghum are derived from ancestral gene families. A new retrotransposon family was also a source of many recent sorghum duplications, illustrating a role in providing variation for genetic innovations. This study shows that diverse evolutionary mechanisms provide the raw genetic material for adaptation in taxa with divergent histories of genome evolution.
Asunto(s)
Grano Comestible/genética , Evolución Molecular , Duplicación de Gen , Genoma de Planta , Genes de Plantas , Familia de Multigenes , Oryza/genética , Polimorfismo de Nucleótido Simple , Retroelementos , Selección Genética , Sorghum/genética , Sintenía , Zea mays/genéticaRESUMEN
BACKGROUND: To illustrate the molecular mechanism of mycoheterotrophic interactions between orchids and fungi, we assembled chromosome-level reference genome of Gastrodia menghaiensis (Orchidaceae) and analyzed the genomes of two species of Gastrodia. RESULTS: Our analyses indicated that the genomes of Gastrodia are globally diminished in comparison to autotrophic orchids, even compared to Cuscuta (a plant parasite). Genes involved in arbuscular mycorrhizae colonization were found in genomes of Gastrodia, and many of the genes involved biological interaction between Gatrodia and symbiotic microbionts are more numerous than in photosynthetic orchids. The highly expressed genes for fatty acid and ammonium root transporters suggest that fungi receive material from orchids, although most raw materials flow from the fungi. Many nuclear genes (e.g. biosynthesis of aromatic amino acid L-tryptophan) supporting plastid functions are expanded compared to photosynthetic orchids, an indication of the importance of plastids even in totally mycoheterotrophic species. CONCLUSION: Gastrodia menghaiensis has the smallest proteome thus far among angiosperms. Many of the genes involved biological interaction between Gatrodia and symbiotic microbionts are more numerous than in photosynthetic orchids.
Asunto(s)
Gastrodia , Micorrizas , Orchidaceae , Gastrodia/genética , Micorrizas/genética , Orchidaceae/genética , Orchidaceae/microbiología , Filogenia , Simbiosis/genéticaRESUMEN
Clarifying the evolutionary processes underlying species diversification and adaptation is a key focus of evolutionary biology. Begonia (Begoniaceae) is one of the most species-rich angiosperm genera with c. 2000 species, most of which are shade-adapted. Here, we present chromosome-scale genome assemblies for four species of Begonia (B. loranthoides, B. masoniana, B. darthvaderiana and B. peltatifolia), and whole genome shotgun data for an additional 74 Begonia representatives to investigate lineage evolution and shade adaptation of the genus. The four genome assemblies range in size from 331.75 Mb (B. peltatifolia) to 799.83 Mb (B. masoniana), and harbor 22 059-23 444 protein-coding genes. Synteny analysis revealed a lineage-specific whole-genome duplication (WGD) that occurred just before the diversification of Begonia. Functional enrichment of gene families retained after WGD highlights the significance of modified carbohydrate metabolism and photosynthesis possibly linked to shade adaptation in the genus, which is further supported by expansions of gene families involved in light perception and harvesting. Phylogenomic reconstructions and genomics studies indicate that genomic introgression has also played a role in the evolution of Begonia. Overall, this study provides valuable genomic resources for Begonia and suggests potential drivers underlying the diversity and adaptive evolution of this mega-diverse clade.
Asunto(s)
Begoniaceae , Begoniaceae/genética , Evolución Molecular , Genoma , Filogenia , Sintenía/genéticaRESUMEN
Bryophytes including mosses, liverworts, and hornworts are among the earliest land plants, and occupy a crucial phylogenetic position to aid in the understanding of plant terrestrialization. Despite their small size and simple structure, bryophytes are the second largest group of extant land plants. They live ubiquitously in various habitats and are highly diversified, with adaptive strategies to modern ecosystems on Earth. More and more genomes and transcriptomes have been assembled to address fundamental questions in plant biology. Here, we review recent advances in bryophytes associated with diversity, phylogeny, and ecological adaptation. Phylogenomic studies have provided increasing supports for the monophyly of bryophytes, with hornworts sister to the Setaphyta clade including liverworts and mosses. Further comparative genomic analyses revealed that multiple whole-genome duplications might have contributed to the species richness and morphological diversity in mosses. We highlight that the biological changes through gene gain or neofunctionalization that primarily evolved in bryophytes have facilitated the adaptation to early land environments; among the strategies to adapt to modern ecosystems in bryophytes, desiccation tolerance is the most remarkable. More genomic information for bryophytes would shed light on key mechanisms for the ecological success of these 'dwarfs' in the plant kingdom.
Asunto(s)
Briófitas , Embryophyta , Briófitas/genética , Ecosistema , Embryophyta/genética , Genómica , Filogenia , Plantas/genética , TranscriptomaRESUMEN
Genes encoding interacting proteins tend to be co-retained after whole-genome duplication (WGD). The preferential retention after WGD has been explained by the gene balance hypothesis (GBH). However, small-scale duplications could independently occur in the connected gene families. Certain evolutionary strategies might keep the dosage balanced. Here, we examined the gene duplication, interaction and expression patterns of calcineurin B-like (CBL) and CBL-interacting protein kinase (CIPK) gene families to understand the underlying principles. The ratio of the CBL and CIPK gene numbers evolved from 5 : 7 in Physcomitrella to 10 : 26 in Arabidopsis, and retrotransposition, tandem duplication, and WGDs contributed to the expansion. Two pairs of CBLs and six pairs of CIPKs were retained after the α WGD in Arabidopsis, in which specific interaction patterns were identified. In some cases, two retained CBLs (CIPKs) might compete to interact with a sole CIPK (CBL). Results of gene expression analyses indicated that the relatively over-retained duplicates tend to show asymmetric expression, thus avoiding competition. In conclusion, our results suggested that the highly specific interaction, together with the differential gene expression pattern, jointly maintained the balanced dosage for the interacting CBL and CIPK proteins.
Asunto(s)
Arabidopsis , Proteínas de Plantas , Arabidopsis/genética , Arabidopsis/metabolismo , Proteínas de Unión al Calcio/genética , Duplicación de Gen , Proteínas de Plantas/metabolismo , Proteínas Serina-Treonina Quinasas/genética , Proteínas Serina-Treonina Quinasas/metabolismoRESUMEN
Flowering plants, or angiosperms, consist of more than 300,000 species, far more than any other land plant lineages. The accumulated evidence indicates that multiple ancient polyploidy events occurred around 100 to 120 million years ago during the Cretaceous and drove the early diversification of four major clades of angiosperms: gamma whole-genome triplication in the common ancestor of core eudicots, tau whole-genome duplication during the early diversification of monocots, lambda whole-genome duplication during the early diversification of magnoliids, and pi whole-genome duplication in the Nymphaeales lineage. These four polyploidy events have played essential roles in the adaptive evolution and diversification of major clades of flowering plants. Here, we specifically review the current understanding of this wave of ancient whole-genome duplications and their evolutionary significance. Notably, although these ancient whole-genome duplications occurred independently, they have contributed to the expansion of many stress-related genes (e.g., heat shock transcription factors and Arabidopsis response regulators)ï¼and these genes could have been selected for by global environmental changes in the Cretaceous. Therefore, this ancient wave of paleopolyploidy events could have significantly contributed to the adaptation of angiosperms to environmental changes, and potentially promoted the wide diversification of flowering plants.
Asunto(s)
Adaptación Fisiológica/genética , Magnoliopsida/genética , Fenómenos Fisiológicos de las Plantas/genética , Poliploidía , Estrés Fisiológico/genética , Evolución Biológica , Genoma de Planta/genética , Genoma de Planta/fisiología , Magnoliopsida/fisiología , Filogenia , Estrés Fisiológico/fisiologíaRESUMEN
The emergence of human infection with a novel H7N9 influenza virus in China raises a pandemic concern. Chicken H9N2 viruses provided all six of the novel reassortant's internal genes. However, it is not fully understood how the prevalence and evolution of these H9N2 chicken viruses facilitated the genesis of the novel H7N9 viruses. Here we show that over more than 10 y of cocirculation of multiple H9N2 genotypes, a genotype (G57) emerged that had changed antigenicity and improved adaptability in chickens. It became predominant in vaccinated farm chickens in China, caused widespread outbreaks in 2010-2013 before the H7N9 viruses emerged in humans, and finally provided all of their internal genes to the novel H7N9 viruses. The prevalence and variation of H9N2 influenza virus in farmed poultry could provide an important early warning of the emergence of novel reassortants with pandemic potential.
Asunto(s)
Pollos/virología , Evolución Molecular , Subtipo H7N9 del Virus de la Influenza A/genética , Subtipo H9N2 del Virus de la Influenza A/genética , Animales , Variación Antigénica/genética , Antígenos Virales/genética , China/epidemiología , Genes Virales , Flujo Genético , Genotipo , Glicoproteínas Hemaglutininas del Virus de la Influenza/genética , Glicoproteínas Hemaglutininas del Virus de la Influenza/inmunología , Humanos , Subtipo H7N9 del Virus de la Influenza A/inmunología , Subtipo H7N9 del Virus de la Influenza A/patogenicidad , Subtipo H9N2 del Virus de la Influenza A/inmunología , Subtipo H9N2 del Virus de la Influenza A/patogenicidad , Gripe Aviar/epidemiología , Gripe Aviar/virología , Gripe Humana/epidemiología , Gripe Humana/virología , Pandemias , Filogenia , Virus Reordenados/genética , Virus Reordenados/inmunología , Virus Reordenados/patogenicidad , Estudios RetrospectivosRESUMEN
Seagrasses are marine angiosperms that evolved from land plants but returned to the sea around 140 million years ago during the early evolution of monocotyledonous plants. They successfully adapted to abiotic stresses associated with growth in the marine environment, and today, seagrasses are distributed in coastal waters worldwide. Seagrass meadows are an important oceanic carbon sink and provide food and breeding grounds for diverse marine species. Here, we report the assembly and characterization of the Zostera muelleri genome, a southern hemisphere temperate species. Multiple genes were lost or modified in Z. muelleri compared with terrestrial or floating aquatic plants that are associated with their adaptation to life in the ocean. These include genes for hormone biosynthesis and signaling and cell wall catabolism. There is evidence of whole-genome duplication in Z. muelleri; however, an ancient pan-commelinid duplication event is absent, highlighting the early divergence of this species from the main monocot lineages.
Asunto(s)
Adaptación Fisiológica/genética , Ecosistema , Genoma de Planta/genética , Zosteraceae/genética , Organismos Acuáticos/genética , Duplicación de Gen , Ontología de Genes , Genes de Plantas/genética , Anotación de Secuencia Molecular , Océanos y Mares , Proteínas de Plantas/genética , Análisis de Secuencia de ARNRESUMEN
Unraveling widespread polyploidy events throughout plant evolution is a necessity for inferring the impacts of whole-genome duplication (WGD) on speciation, functional innovations, and to guide identification of true orthologs in divergent taxa. Here, we employed an integrated syntenic and phylogenomic analyses to reveal an ancient WGD that shaped the genomes of all commelinid monocots, including grasses, bromeliads, bananas (Musa acuminata), ginger, palms, and other plants of fundamental, agricultural, and/or horticultural interest. First, comprehensive phylogenomic analyses revealed 1421 putative gene families that retained ancient duplication shared by Musa (Zingiberales) and grass (Poales) genomes, indicating an ancient WGD in monocots. Intergenomic synteny blocks of Musa and Oryza were investigated, and 30 blocks were shown to be duplicated before Musa-Oryza divergence an estimated 120 to 150 million years ago. Synteny comparisons of four monocot (rice [Oryza sativa], sorghum [Sorghum bicolor], banana, and oil palm [Elaeis guineensis]) and two eudicot (grape [Vitis vinifera] and sacred lotus [Nelumbo nucifera]) genomes also support this additional WGD in monocots, herein called Tau (τ). Integrating synteny and phylogenomic comparisons achieves better resolution of ancient polyploidy events than either approach individually, a principle that is exemplified in the disambiguation of a WGD series of rho (ρ)-sigma (σ)-tau (τ) in the grass lineages that echoes the alpha (α)-beta (ß)-gamma (γ) series previously revealed in the Arabidopsis thaliana lineage.
Asunto(s)
Evolución Molecular , Genoma de Planta/genética , Magnoliopsida/genética , Duplicación de Gen , Genómica , Filogenia , Poliploidía , Alineación de Secuencia , Análisis de Secuencia de ADN , SinteníaRESUMEN
Whole-genome duplication (WGD), or polyploidy, followed by gene loss and diploidization has long been recognized as an important evolutionary force in animals, fungi and other organisms, especially plants. The success of angiosperms has been attributed, in part, to innovations associated with gene or whole-genome duplications, but evidence for proposed ancient genome duplications pre-dating the divergence of monocots and eudicots remains equivocal in analyses of conserved gene order. Here we use comprehensive phylogenomic analyses of sequenced plant genomes and more than 12.6 million new expressed-sequence-tag sequences from phylogenetically pivotal lineages to elucidate two groups of ancient gene duplications-one in the common ancestor of extant seed plants and the other in the common ancestor of extant angiosperms. Gene duplication events were intensely concentrated around 319 and 192 million years ago, implicating two WGDs in ancestral lineages shortly before the diversification of extant seed plants and extant angiosperms, respectively. Significantly, these ancestral WGDs resulted in the diversification of regulatory genes important to seed and flower development, suggesting that they were involved in major innovations that ultimately contributed to the rise and eventual dominance of seed plants and angiosperms.
Asunto(s)
Evolución Molecular , Genoma de Planta/genética , Magnoliopsida/clasificación , Magnoliopsida/genética , Poliploidía , Genómica , FilogeniaRESUMEN
BACKGROUND: Root nodule symbiosis (RNS) is a fascinating evolutionary event. Given that limited genes conferring the evolution of RNS in Leguminosae have been functionally validated, the genetic basis of the evolution of RNS remains largely unknown. Identifying the genes involved in the evolution of RNS will help to reveal the mystery. RESULTS: Here, we investigate the gene loss event during the evolution of RNS in Leguminosae through phylogenomic and synteny analyses in 48 species including 16 Leguminosae species. We reveal that loss of the Lateral suppressor gene, a member of the GRAS-domain protein family, is associated with the evolution of RNS in Leguminosae. Ectopic expression of the Lateral suppressor (Ls) gene from tomato and its homolog MONOCULM 1 (MOC1) and Os7 from rice in soybean and Medicago truncatula result in almost completely lost nodulation capability. Further investigation shows that Lateral suppressor protein, Ls, MOC1, and Os7 might function through an interaction with NODULATION SIGNALING PATHWAY 2 (NSP2) and CYCLOPS to repress the transcription of NODULE INCEPTION (NIN) to inhibit the nodulation in Leguminosae. Additionally, we find that the cathepsin H (CTSH), a conserved protein, could interact with Lateral suppressor protein, Ls, MOC1, and Os7 and affect the nodulation. CONCLUSIONS: This study sheds light on uncovering the genetic basis of the evolution of RNS in Leguminosae and suggests that gene loss plays an essential role.
Asunto(s)
Evolución Molecular , Fabaceae , Filogenia , Proteínas de Plantas , Nódulos de las Raíces de las Plantas , Simbiosis , Simbiosis/genética , Nódulos de las Raíces de las Plantas/microbiología , Nódulos de las Raíces de las Plantas/genética , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Fabaceae/genética , Fabaceae/microbiología , Regulación de la Expresión Génica de las Plantas , Nodulación de la Raíz de la Planta/genética , Medicago truncatula/genética , Medicago truncatula/microbiología , Genes de Plantas , Glycine max/genética , Glycine max/microbiologíaRESUMEN
BACKGROUND: Parasitic plants, represented by several thousand species of angiosperms, use modified structures known as haustoria to tap into photosynthetic host plants and extract nutrients and water. As a result of their direct plant-plant connections with their host plant, parasitic plants have special opportunities for horizontal gene transfer, the nonsexual transmission of genetic material across species boundaries. There is increasing evidence that parasitic plants have served as recipients and donors of horizontal gene transfer (HGT), but the long-term impacts of eukaryotic HGT in parasitic plants are largely unknown. RESULTS: Here we show that a gene encoding albumin 1 KNOTTIN-like protein, closely related to the albumin 1 genes only known from papilionoid legumes, where they serve dual roles as food storage and insect toxin, was found in Phelipanche aegyptiaca and related parasitic species of family Orobanchaceae, and was likely acquired by a Phelipanche ancestor via HGT from a legume host based on phylogenetic analyses. The KNOTTINs are well known for their unique "disulfide through disulfide knot" structure and have been extensively studied in various contexts, including drug design. Genomic sequences from nine related parasite species were obtained, and 3D protein structure simulation tests and evolutionary constraint analyses were performed. The parasite gene we identified here retains the intron structure, six highly conserved cysteine residues necessary to form a KNOTTIN protein, and displays levels of purifying selection like those seen in legumes. The albumin 1 xenogene has evolved through >150 speciation events over ca. 16 million years, forming a small family of differentially expressed genes that may confer novel functions in the parasites. Moreover, further data show that a distantly related parasitic plant, Cuscuta, obtained two copies of albumin 1 KNOTTIN-like genes from legumes through a separate HGT event, suggesting that legume KNOTTIN structures have been repeatedly co-opted by parasitic plants. CONCLUSIONS: The HGT-derived albumins in Phelipanche represent a novel example of how plants can acquire genes from other plants via HGT that then go on to duplicate, evolve, and retain the specialized features required to perform a unique host-derived function.
Asunto(s)
Miniproteínas Nodales de Cistina/genética , Evolución Molecular , Transferencia de Gen Horizontal , Genes de Plantas , Orobanchaceae/genética , Secuencia de Aminoácidos , Teorema de Bayes , ADN de Plantas/genética , Fabaceae/genética , Duplicación de Gen , Funciones de Verosimilitud , Datos de Secuencia Molecular , Filogenia , Estructura Terciaria de Proteína , Alineación de Secuencia , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: Previous studies in basal angiosperms have provided insight into the diversity within the angiosperm lineage and helped to polarize analyses of flowering plant evolution. However, there is still not an experimental system for genetic studies among basal angiosperms to facilitate comparative studies and functional investigation. It would be desirable to identify a basal angiosperm experimental system that possesses many of the features found in existing plant model systems (e.g., Arabidopsis and Oryza). RESULTS: We have considered all basal angiosperm families for general characteristics important for experimental systems, including availability to the scientific community, growth habit, and membership in a large basal angiosperm group that displays a wide spectrum of phenotypic diversity. Most basal angiosperms are woody or aquatic, thus are not well-suited for large scale cultivation, and were excluded. We further investigated members of Aristolochiaceae for ease of culture, life cycle, genome size, and chromosome number. We demonstrated self-compatibility for Aristolochia elegans and A. fimbriata, and transformation with a GFP reporter construct for Saruma henryi and A. fimbriata. Furthermore, A. fimbriata was easily cultivated with a life cycle of just three months, could be regenerated in a tissue culture system, and had one of the smallest genomes among basal angiosperms. An extensive multi-tissue EST dataset was produced for A. fimbriata that includes over 3.8 million 454 sequence reads. CONCLUSIONS: Aristolochia fimbriata has numerous features that facilitate genetic studies and is suggested as a potential model system for use with a wide variety of technologies. Emerging genetic and genomic tools for A. fimbriata and closely related species can aid the investigation of floral biology, developmental genetics, biochemical pathways important in plant-insect interactions as well as human health, and various other features present in early angiosperms.
Asunto(s)
Aristolochia/genética , Aristolochia/fisiología , Genoma de Planta/genéticaRESUMEN
Assembly of a high-quality genome is important for downstream comparative and functional genomic studies. However, most tools for genome assembly assessment only give qualitative reports, which do not pinpoint assembly errors at specific regions. Here, we develop a new reference-free tool, Clipping information for Revealing Assembly Quality (CRAQ), which maps raw reads back to assembled sequences to identify regional and structural assembly errors based on effective clipped alignment information. Error counts are transformed into corresponding assembly evaluation indexes to reflect the assembly quality at single-nucleotide resolution. Notably, CRAQ distinguishes assembly errors from heterozygous sites or structural differences between haplotypes. This tool can clearly indicate low-quality regions and potential structural error breakpoints; thus, it can identify misjoined regions that should be split for further scaffold building and improvement of the assembly. We have benchmarked CRAQ on multiple genomes assembled using different strategies, and demonstrated the misjoin correction for improving the constructed pseudomolecules.