RESUMO
Malvaceae comprise some 4,225 species in 243 genera and nine subfamilies and include economically important species, such as cacao, cotton, durian, and jute, with cotton an important model system for studying the domestication of polyploids. Here, we use chromosome-level genome assemblies from representatives of five or six subfamilies (depending on the placement of Ochroma) to differentiate coexisting subgenomes and their evolution during the family's deep history. The results reveal that the allohexaploid Helicteroideae partially derive from an allotetraploid Sterculioideae and also form a component of the allodecaploid Bombacoideae and Malvoideae. The ancestral Malvaceae karyotype consists of 11 protochromosomes. Four subfamilies share a unique reciprocal chromosome translocation, and two other subfamilies share a chromosome fusion. DNA alignments of single-copy nuclear genes do not yield the same relationships as inferred from chromosome structural traits, probably because of genes originating from different ancestral subgenomes. These results illustrate how chromosome-structural data can unravel the evolutionary history of groups with ancient hybrid genomes.
Assuntos
Genoma de Planta , Gossypium , Genoma de Planta/genética , Gossypium/genética , Genômica/métodos , Poliploidia , Cariótipo , Evolução MolecularRESUMO
The assembly of W and Y chromosomes poses significant challenges in vertebrate genome sequencing and assembly. Here, we successfully assembled the W chromosome of Verasper variegatus with a length of 20.48 Mb by combining population and PacBio HiFi sequencing data. It was identified as a young sex chromosome and showed signs of expansion in repetitive sequences. The major component of the expansion was Ty3/Gypsy. The ancestral Osteichthyes karyotype consists of 24 protochromosomes. The sex chromosomes in four Pleuronectiformes species derived from a pair of homologous protochromosomes resulting from a whole-genome duplication event in teleost fish, yet with different sex-determination systems. V. variegatus and Cynoglossus semilaevis adhere to the ZZ/ZW system, while Hippoglossus stenolepis and H. hippoglossus follow the XX/XY system. Interestingly, V. variegatus and H. hippoglossus derived from one protochromosome, while C. semilaevis and H. stenolepis derived from another protochromosome. Our study provides valuable insights into the evolution of sex chromosomes in flatfish and sheds light on the important role of whole-genome duplication in shaping the evolution of sex chromosomes.
Assuntos
Linguados , Linguado , Animais , Mapeamento Cromossômico , Evolução Molecular , Linguados/genética , Linguado/genética , Cromossomos Sexuais , Cromossomo YRESUMO
BACKGROUND: Two widely cultivated annual buckwheat crops, Fagopyrum esculentum and F. tataricum, differ from each other in both rutin concentration and reproductive system. However, the underlying genetic mechanisms remain poorly elucidated. RESULTS: Here, we report the first haplotype-resolved chromosome-level genome assemblies of the two species. Two haplotype genomes of F. esculentum were assembled as 1.23 and 1.19 Gb with N50 = 9.8 and 12.4 Mb, respectively; the two haplotype genomes of F. tataricum were 453.7 and 446.2 Mb with N50 = 50 and 30 Mb, respectively. We further annotated protein-coding genes of each haplotype genome based on available gene sets and 48 newly sequenced transcriptomes. We found that more repetitive sequences, especially expansion of long terminal repeat retrotransposons (LTR-RTs), contributed to the large genome size of F. esculentum. Based on the well-annotated sequences, gene expressions, and luciferase experiments, we identified the sequence mutations of the promoter regions of two key genes that are likely to have greatly contributed to the high rutin concentration and selfing reproduction in F. tartaricum. CONCLUSIONS: Our results highlight the importance of high-quality genomes to identify genetic mutations underlying phenotypic differences between closely related species. F. tataricum may have been experienced stronger selection than F. esculentum through choosing these two non-coding alleles for the desired cultivation traits. These findings further suggest that genetic manipulation of the non-coding promoter regions could be widely employed for breeding buckwheat and other crops.
Assuntos
Fagopyrum , Rutina , Rutina/genética , Rutina/metabolismo , Fagopyrum/genética , Fagopyrum/metabolismo , Haplótipos , Melhoramento Vegetal , Genitália/metabolismoRESUMO
Fragaria vesca, commonly known as wild or woodland strawberry, is the most widely distributed diploid Fragaria species and is native to Europe and Asia. Because of its small plant size, low heterozygosity, and relative ease of genetic transformation, F. vesca has been a model plant for fruit research since the publication of its Illumina-based genome in 2011. However, its genomic contribution to octoploid cultivated strawberry remains a long-standing question. Here, we de novo assembled and annotated a telomere-to-telomere, gap-free genome of F. vesca 'Hawaii 4', with all seven chromosomes assembled into single contigs, providing the highest completeness and assembly quality to date. The gap-free genome is 220 785 082 bp in length and encodes 36 173 protein-coding gene models, including 1153 newly annotated genes. All 14 telomeres and seven centromeres were annotated within the seven chromosomes. Among the three previously recognized wild diploid strawberry ancestors, F. vesca, F. iinumae, and F. viridis, phylogenomic analysis showed that F. vesca and F. viridis are the ancestors of the cultivated octoploid strawberry F. × ananassa, and F. vesca is its closest relative. Three subgenomes of F. × ananassa belong to the F. vesca group, and one is sister to F. viridis. We anticipate that this high-quality, telomere-to-telomere, gap-free F. vesca genome, combined with our phylogenomic inference of the origin of cultivated strawberry, will provide insight into the genomic evolution of Fragaria and facilitate strawberry genetics and molecular breeding.
RESUMO
Angiosperms dominate the Earth's ecosystems and provide most of the basic necessities for human life. The major angiosperm clades comprise 64 orders, as recognized by the APG IV classification. However, the phylogenetic relationships of angiosperms remain unclear, as phylogenetic trees with different topologies have been reconstructed depending on the sequence datasets utilized, from targeted genes to transcriptomes. Here, we used currently available de novo genome data to reconstruct the phylogenies of 366 angiosperm species from 241 genera belonging to 97 families across 43 of the 64 orders based on orthologous genes from the nuclear, plastid, and mitochondrial genomes of the same species with compatible datasets. The phylogenetic relationships were largely consistent with previously constructed phylogenies based on sequence variations in each genome type. However, there were major inconsistencies in the phylogenetic relationships of the five Mesangiospermae lineages when different genomes were examined. We discuss ways to address these inconsistencies, which could ultimately lead to the reconstruction of a comprehensive angiosperm tree of life. The angiosperm phylogenies presented here provide a basic framework for further updates and comparisons. These phylogenies can also be used as guides to examine the evolutionary trajectories among the three genome types during lineage radiation.
Assuntos
Magnoliopsida , Humanos , Filogenia , Magnoliopsida/genética , Ecossistema , Evolução Biológica , Plastídeos , Evolução MolecularRESUMO
BACKGROUND: Eudicots are the most diverse group of flowering plants that compromise five well-defined lineages: core eudicots, Ranunculales, Proteales, Trochodendrales, and Buxales. However, the phylogenetic relationships between these five lineages and their chromosomal evolutions remain unclear, and a lack of high-quality genome analyses for Buxales has hindered many efforts to address this knowledge gap. RESULTS: Here, we present a high-quality chromosome-level genome of Buxus austro-yunnanensis (Buxales). Our phylogenomic analyses revealed that Buxales and Trochodendrales are genetically similar and classified as sisters. Additionally, both are sisters to the core eudicots, while Ranunculales was found to be the first lineage to diverge from these groups. Incomplete lineage sorting and hybridization were identified as the main contributors to phylogenetic discordance (34.33%) between the lineages. In fact, B. austro-yunnanensis underwent only one whole-genome duplication event, and collinear gene phylogeny analyses suggested that separate independent polyploidizations occurred in the five eudicot lineages. Using representative genomes from these five lineages, we reconstructed the ancestral eudicot karyotype (AEK) and generated a nearly gapless karyotype projection for each eudicot species. Within core eudicots, we recovered one common chromosome fusion event in asterids and malvids, respectively. Further, we also found that the previously reported fused AEKs in Aquilegia (Ranunculales) and Vitis (core eudicots) have different fusion positions, which indicates that these two species have different karyotype evolution histories. CONCLUSIONS: Based on our phylogenomic and karyotype evolution analyses, we revealed the likely relationships and evolutionary histories of early eudicots. Ultimately, our study expands genomic resources for early-diverging eudicots.
Assuntos
Buxus , Magnoliopsida , Buxus/genética , Evolução Molecular , Genoma de Planta , Cariótipo , Magnoliopsida/genética , FilogeniaRESUMO
Evidence of whole-genome duplications (WGDs) and subsequent karyotype changes has been detected in most major lineages of living organisms on Earth. To clarify the complex resulting multi-layered patterns of gene collinearity in genome analyses, there is a need for convenient and accurate toolkits. To meet this need, we developed WGDI (Whole-Genome Duplication Integrated analysis), a Python-based command-line tool that facilitates comprehensive analysis of recursive polyploidization events and cross-species genome alignments. WGDI supports three main workflows (polyploid inference, hierarchical inference of genomic homology, and ancestral chromosome karyotyping) that can improve the detection of WGD and characterization of WGD-related events based on high-quality chromosome-level genomes. Significantly, it can extract complete synteny blocks and facilitate reconstruction of detailed karyotype evolution. This toolkit is freely available at GitHub (https://github.com/SunPengChuan/wgdi). As an example of its application, WGDI convincingly clarified karyotype evolution in Aquilegia coerulea and Vitis vinifera following WGDs and rejected the hypothesis that Aquilegia contributed as a parental lineage to the allopolyploid origin of core dicots.
Assuntos
Duplicação Gênica , GenômicaRESUMO
Ice plant (Mesembryanthemum crystallinum), a member of the Aizoaceae family, is a typical halophyte crop and a model plant for studying the mechanism of transition from C3 photosynthesis to crassulacean acid metabolism (CAM). Here, we report a high-quality chromosome-level ice plant genome sequence. This 98.05% genome sequence is anchored to nine chromosomes, with a total length of 377.97 Mb and an N50 scaffold of 40.45 Mb. Almost half of the genome (48.04%) is composed of repetitive sequences, and 24 234 genes have been annotated. Subsequent to the ancient whole-genome triplication (WGT) that occurred in eudicots, there has been no recent whole-genome duplication (WGD) or WGT in ice plants. However, we detected a novel WGT event that occurred in the same order in Simmondsia chinensis, which was previously overlooked. Our findings revealed that ice plants have undergone chromosome rearrangements and gene removal during evolution. Combined with transcriptome and comparative genomic data and expression verification, we identified several key genes involved in the CAM pathway and constructed a comprehensive network. As the first genome of the Aizoaceae family to be released, this report will provide a rich data resource for comparative and functional genomic studies of Aizoaceae, especially for studies on salt tolerance and C3-to-CAM transitions to improve crop yield and resistance.
Assuntos
Mesembryanthemum , Mesembryanthemum/genética , Metabolismo Ácido das Crassuláceas , Fotossíntese , Genoma de Planta/genética , Plantas Tolerantes a Sal/genética , Evolução MolecularRESUMO
Populations of widespread species are usually geographically distributed through contrasting stresses, but underlying genetic mechanisms controlling this adaptation remain largely unknown. Here, we show that in Arabidopsis thaliana, allelic changes in the cis-regulatory elements, WT box and W box, in the promoter of a key transcription factor associated with oxygen sensing, RELATED TO AP 2.12 (RAP2.12), are responsible for differentially regulating tolerance to drought and flooding. These two cis-elements are regulated by different transcription factors that downstream of RAP2.12 results in differential accumulation of hypoxia-responsive transcripts. The evolution from one cis-element haplotype to the other is associated with the colonization of humid environments from arid habitats. This gene thus promotes both drought and flooding adaptation via an adaptive mechanism that diversifies its regulation through noncoding alleles.
Assuntos
Proteínas de Arabidopsis , Arabidopsis , Alelos , Arabidopsis/metabolismo , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Proteínas de Ligação a DNA/genética , Secas , Regulação da Expressão Gênica de Plantas , Umidade , Estresse Fisiológico , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismoRESUMO
Most extant angiosperms belong to Mesangiospermae, which comprises eudicots, monocots, magnoliids, Chloranthales and Ceratophyllales. However, phylogenetic relationships between these five lineages remain unclear. Here, we report the high-quality genome of a member of the Chloranthales lineage (Chloranthus sessilifolius). We detect only one whole genome duplication within this species and find that polyploidization events in different Mesangiospermae lineage are mutually independent. We also find that the members of all floral development-related gene lineages are present in C. sessilifolius despite its extremely simplified flower. The AP1 and PI genes, however, show a weak floral tissue-specialized expression. Our phylogenomic analyses suggest that Chloranthales and magnoliids are sister groups, and both are together sister to the clade comprising Ceratophyllales and eudicots, while the monocot lineage is sister to all other Mesangiospermae. Our findings suggest that in addition to hybridization, incomplete lineage sorting may largely account for phylogenetic inconsistencies between the observed gene trees.
Assuntos
Especiação Genética , Genoma de Planta , Magnoliopsida/genética , Evolução Molecular , Hibridização Genética , FilogeniaRESUMO
BACKGROUND: Coconut is an important tropical oil and fruit crop whose evolutionary position renders it a fantastic species for the investigation of the evolution of monocot chromosomes and the subsequent differentiation of ancient plants. RESULTS: Here, we report the assembly and annotation of reference-grade genomes of Cn. tall and Cn. dwarf, whose genome sizes are 2.40 Gb and 2.39 Gb, respectively. The comparative analysis reveals that the two coconut subspecies diverge about 2-8 Mya while the conserved Arecaceae-specific whole-genome duplication (ω WGD) occurs approximately 47-53 Mya. It additionally allows us to reconstruct the ancestral karyotypes of the ten ancient monocot chromosomes and the evolutionary trajectories of the 16 modern coconut chromosomes. Fiber synthesis genes in Cn. tall, related to lignin and cellulose synthesis, are found at a higher copy number and expression level than dwarf coconuts. Integrated multi-omics analysis reveals that the difference in coconut plant height is the result of altered gibberellin metabolism, with both the GA20ox copy number and a single-nucleotide change in the promoter together leading to the difference in plant height between Cn. tall and Cn. dwarf. CONCLUSION: We provide high-quality coconut genomes and reveal the genetic basis of trait differences between two coconuts through multi-omics analysis. We also reveal that the selection of plant height has been targeted for the same gene for millions of years, not only in natural selection of ancient plant as illustrated in coconut, but also for artificial selection in cultivated crops such as rice and maize.
Assuntos
Cromossomos de Plantas , Cocos/genética , Evolução Molecular , Genoma de Planta , Vias Biossintéticas , Cocos/anatomia & histologia , Cocos/metabolismo , Genômica , CariótipoRESUMO
Deserts exert strong selection pressures on plants, but the underlying genomic drivers of ecological adaptation and subsequent speciation remain largely unknown. Here, we generated de novo genome assemblies and conducted population genomic analyses of the psammophytic genus Pugionium (Brassicaceae). Our results indicated that this bispecific genus had undergone an allopolyploid event, and the two parental genomes were derived from two ancestral lineages with different chromosome numbers and structures. The postpolyploid expansion of gene families related to abiotic stress responses and lignin biosynthesis facilitated environmental adaptations of the genus to desert habitats. Population genomic analyses of both species further revealed their recent divergence with continuous gene flow, and the most divergent regions were found to be centered on three highly structurally reshuffled chromosomes. Genes under selection in these regions, which were mainly located in one of the two subgenomes, contributed greatly to the interspecific divergence in microhabitat adaptation.
Assuntos
Adaptação Fisiológica/genética , Brassicaceae/genética , Ecossistema , Especiação Genética , Genoma de Planta , Brassicaceae/classificação , Brassicaceae/fisiologia , Filogenia , PoliploidiaRESUMO
Tea, coffee, and cocoa are the three most popular nonalcoholic beverages in the world and have extremely high economic and cultural value. The genomes of four tea plant varieties have recently been sequenced, but there is some debate regarding the characterization of a whole-genome duplication (WGD) event in tea plants. Whether the WGD in the tea plant is shared with other plants in order Ericales and how it contributed to tea plant evolution remained unanswered. Here we re-analyzed the tea plant genome and provided evidence that tea experienced only WGD event after the core-eudicot whole-genome triplication (WGT) event. This WGD was shared by the Polemonioids-Primuloids-Core Ericales (PPC) sections, encompassing at least 17 families in the order Ericales. In addition, our study identified eight pairs of duplicated genes in the catechins biosynthesis pathway, four pairs of duplicated genes in the theanine biosynthesis pathway, and one pair of genes in the caffeine biosynthesis pathway, which were expanded and retained following this WGD. Nearly all these gene pairs were expressed in tea plants, implying the contribution of the WGD. This study shows that in addition to the role of the recent tandem gene duplication in the accumulation of tea flavor-related genes, the WGD may have been another main factor driving the evolution of tea flavor.
RESUMO
Hazelnut is popular for its flavor, and it has also been suggested that hazelnut is beneficial to cardiovascular health because it is rich in oleic acid. Here, we report the first high-quality chromosome-scale genome for the hazelnut species Corylus mandshurica (2n = 22), which has a high concentration of oleic acid in its nuts. The assembled genome is 367.67 Mb in length, and the contig N50 is 14.85 Mb. All contigs were assembled into 11 chromosomes, and 28,409 protein-coding genes were annotated. We reconstructed the evolutionary trajectories of the genomes of Betulaceae species and revealed that the 11 chromosomes of the hazelnut genus were derived from the most ancestral karyotype in Betula pendula, which has 14 protochromosomes, by inferring homology among five Betulaceae genomes. We identified 96 candidate genes involved in oleic acid biosynthesis, and 10 showed rapid evolution or positive selection. These findings will help us to understand the mechanisms of lipid synthesis and storage in hazelnuts. Several gene families related to salicylic acid metabolism and stress responses experienced rapid expansion in this hazelnut species, which may have increased its stress tolerance. The reference genome presented here constitutes a valuable resource for molecular breeding and genetic improvement of the important agronomic properties of hazelnut.
RESUMO
Ethiopian mustard (Brassica carinata) in the Brassicaceae family possesses many excellent agronomic traits. Here, the high-quality genome sequence of B. carinata is reported. Characterization revealed a genome anchored to 17 chromosomes with a total length of 1.087 Gb and an N50 scaffold length of 60 Mb. Repetitive sequences account for approximately 634 Mb or 58.34% of the B. carinata genome. Notably, 51.91% of 97,149 genes are confined to the terminal 20% of chromosomes as a result of the expansion of repeats in pericentromeric regions. Brassica carinata shares one whole-genome triplication event with the five other species in U's triangle, a classic model of evolution and polyploidy in Brassica. Brassica carinata was deduced to have formed â¼0.047 Mya, which is slightly earlier than B. napus but later than B. juncea. Our analysis indicated that the relationship between the two subgenomes (BcaB and BcaC) is greater than that between other two tetraploid subgenomes (BjuB and BnaC) and their respective diploid parents. RNA-seq datasets and comparative genomic analysis were used to identify several key genes in pathways regulating disease resistance and glucosinolate metabolism. Further analyses revealed that genome triplication and tandem duplication played important roles in the expansion of those genes in Brassica species. With the genome sequencing of B. carinata completed, the genomes of all six Brassica species in U's triangle are now resolved. The data obtained from genome sequencing, transcriptome analysis, and comparative genomic efforts in this study provide valuable insights into the genome evolution of the six Brassica species in U's triangle.
Assuntos
Evolução Biológica , Brassica/genética , Genoma de Planta , Poliploidia , Modelos BiológicosRESUMO
Celery (Apium graveolens L. 2n = 2x = 22), a member of the Apiaceae family, is among the most important and globally grown vegetables. Here, we report a high-quality genome sequence assembly, anchored to 11 chromosomes, with total length of 3.33 Gb and N50 scaffold length of 289.78 Mb. Most (92.91%) of the genome is composed of repetitive sequences, with 62.12% of 31 326 annotated genes confined to the terminal 20% of chromosomes. Simultaneous bursts of shared long-terminal repeats (LTRs) in different Apiaceae plants suggest inter-specific exchanges. Two ancestral polyploidizations were inferred, one shared by Apiales taxa and the other confined to Apiaceae. We reconstructed 8 Apiales proto-chromosomes, inferring their evolutionary trajectories from the eudicot common ancestor to extant plants. Transcriptome sequencing in three tissues (roots, leaves and petioles), and varieties with different-coloured petioles, revealed 4 and 2 key genes in pathways regulating anthocyanin and coumarin biosynthesis, respectively. A remarkable paucity of NBS disease-resistant genes in celery (62) and other Apiales was explained by extensive loss and limited production of these genes during the last ~10 million years, raising questions about their biotic defence mechanisms and motivating research into effects of chemicals, for example coumarins, that give off distinctive odours. Celery genome sequencing and annotation facilitates further research into important gene functions and breeding, and comparative genomic analyses in Apiales.
Assuntos
Apium , Apium/genética , Genes de Plantas , Cariótipo , Melhoramento Vegetal , VerdurasRESUMO
Lycophytes and seed plants constitute the typical vascular plants. Lycophytes have been thought to have no paleo-polyploidization although the event is known to be critical for the fast expansion of seed plants. Here, genomic analyses including the homologous gene dot plot analysis detected multiple paleo-polyploidization events, with one occurring approximately 13-15 million years ago (MYA) and another about 125-142 MYA, during the evolution of the genome of Selaginella moellendorffii, a model lycophyte. In addition, comparative analysis of reconstructed ancestral genomes of lycophytes and angiosperms suggested that lycophytes were affected by more paleo-polyploidization events than seed plants. Results from the present genomic analyses indicate that paleo-polyploidization has contributed to the successful establishment of both lineages-lycophytes and seed plants-of vascular plants.
Assuntos
Evolução Molecular , Genoma de Planta , Poliploidia , Selaginellaceae/genética , Genômica , FilogeniaRESUMO
BACKGROUND: Belonging to lineage I of Brassicaceae, Camelina sativa is formed by two hybridizations of three species (three sub-genomes). The three sub-genomes were diverged from a common ancestor, likely derived from lineage I (Ancestral Crucifer karyotype, ACK). The karyotype evolutionary trajectories of the C. sativa chromosomes are currently unknown. Here, we managed to adopt a telomere-centric theory proposed previously to explain the karyotype evolution in C. sativa. RESULTS: By characterizing the homology between A. lyrata and C. sativa chromosomes, we inferred ancestral diploid karyotype of C. sativa (ADK), including 7 ancestral chromosomes, and reconstructed the evolutionary trajectories leading to the formation of extant C. sativa genome. The process involved 2 chromosome fusions. We found that sub-genomes Cs-G1 and Cs-G2 may share a closer common ancestor than Cs-G3. Together with other lines of evidence from Arabidopsis, we propose that the Brassicaceae plants, even the eudicots, follow a chromosome fusion mechanism favoring end-end joining of different chromosomes, rather than a mechanism favoring the formation circular chromosomes and nested chromosome fusion preferred by the monocots. CONCLUSIONS: The present work will contribute to understanding the formation of C. sativa chromosomes, providing insight into Brassicaceae karyotype evolution.
Assuntos
Brassicaceae , Cromossomos de Plantas , Evolução Molecular , Cariótipo , Arabidopsis/genética , Brassicaceae/classificação , Brassicaceae/genética , Cromossomos de Plantas/genética , Diploide , CariotipagemRESUMO
Polyploidies produce a large number of duplicated regions and genes in genomes, which have a long-term impact and stimulate genetic innovation. The high similarity between homeologous chromosomes, forming different subgenomes, or homologous regions after genome repatterning, may permit illegitimate DNA recombination. Here, based on gene colinearity, we aligned the (sub)genomes of common wheat (Triticum aestivum, AABBDD genotype) and its relatives, including Triticum urartu (AA), Aegilops tauschii (DD), and T. turgidum ssp. dicoccoides (AABB) to detect the homeologous (paralogous or orthologous) colinear genes within and between (sub)genomes. Besides, we inferred more ancient paralogous regions produced by a much ancient grass-common tetraploidization. By comparing the sequence similarity between paralogous and orthologous genes, we assumed abnormality in the topology of constructed gene trees, which could be explained by gene conversion as a result of illegitimate recombination. We found large numbers of inferred converted genes (>2,000 gene pairs) suggested long-lasting genome instability of the hexaploid plant, and preferential donor roles by DD genes. Though illegitimate recombination was much restricted, duplicated genes produced by an ancient whole-genome duplication, which occurred millions of years ago, also showed evidence of likely gene conversion. As to biological function, we found that ~40% catalytic genes in colinearity, including those involved in starch biosynthesis, were likely affected by gene conversion. The present study will contribute to understanding the functional and structural innovation of the common wheat genome.