RESUMO
BACKGROUND: Panax species are important herbal medicinal plants in the Araliaceae family. Recently, we reported the complete chloroplast genomes and 45S nuclear ribosomal DNA sequences from seven Panax species, two (P . quinqu e folius and P . trifolius) from North America and five (P . ginseng, P . notoginseng, P . japonicus, P . vietnamensis, and P . stipuleanatus) from Asia. METHODS: We conducted phylogenetic analysis of these chloroplast sequences with 12 other Araliaceae species and comprehensive comparative analysis among the seven Panax whole chloroplast genomes. RESULTS: We identified 1,128 single nucleotide polymorphisms (SNP) in coding gene sequences, distributed among 72 of the 79 protein-coding genes in the chloroplast genomes of the seven Panax species. The other seven genes (including psaJ, psbN, rpl23, psbF, psbL, rps18, and rps7) were identical among the Panax species. We also discovered that 12 large chloroplast genome fragments were transferred into the mitochondrial genome based on sharing of more than 90% sequence similarity. The total size of transferred fragments was 60,331 bp, corresponding to approximately 38.6% of chloroplast genome. We developed 18 SNP markers from the chloroplast genic coding sequence regions that were not similar to regions in the mitochondrial genome. These markers included two or three species-specific markers for each species and can be used to authenticate all the seven Panax species from the others. CONCLUSION: The comparative analysis of chloroplast genomes from seven Panax species elucidated their genetic diversity and evolutionary relationships, and 18 species-specific markers were able to discriminate among these species, thereby furthering efforts to protect the ginseng industry from economically motivated adulteration.
RESUMO
Panax ginseng C. A. Meyer, reputed as the king of medicinal herbs, has slow growth, long generation time, low seed production and complicated genome structure that hamper its study. Here, we unveil the genomic architecture of tetraploid P. ginseng by de novo genome assembly, representing 2.98 Gbp with 59 352 annotated genes. Resequencing data indicated that diploid Panax species diverged in association with global warming in Southern Asia, and two North American species evolved via two intercontinental migrations. Two whole genome duplications (WGD) occurred in the family Araliaceae (including Panax) after divergence with the Apiaceae, the more recent one contributing to the ability of P. ginseng to overwinter, enabling it to spread broadly through the Northern Hemisphere. Functional and evolutionary analyses suggest that production of pharmacologically important dammarane-type ginsenosides originated in Panax and are produced largely in shoot tissues and transported to roots; that newly evolved P. ginseng fatty acid desaturases increase freezing tolerance; and that unprecedented retention of chlorophyll a/b binding protein genes enables efficient photosynthesis under low light. A genome-scale metabolic network provides a holistic view of Panax ginsenoside biosynthesis. This study provides valuable resources for improving medicinal values of ginseng either through genomics-assisted breeding or metabolic engineering.
Assuntos
Genoma de Planta/genética , Panax/genética , Adaptação Biológica/genética , Evolução Biológica , Diploide , Genes de Cloroplastos/genética , Genes de Plantas/genética , Ginsenosídeos/biossíntese , Panax/metabolismo , TetraploidiaRESUMO
The complete chloroplast genome sequence of Codonopsis lanceolata was determined by next generation sequencing. The total length of chloroplast genome of C. lanceolata was 169,447 bp long, including a large single-copy (LSC) region of 85,253 bp, a small single-copy (SSC) region of 8060 bp, and a pair of identical inverted repeat regions (IRs) of 38,067 bp. A total of 110 genes was annotated, resulting in 79 protein-coding genes, 27 tRNA genes, and 4 rRNA genes. The phylogenetic analysis of C. lanceolata with related chloroplast genome sequences in this study provided the taxonomical relationship of C. lanceolata in the genus Campanula.
RESUMO
Genome duplication and repeat multiplication contribute to genome evolution in plants. Our previous work identified a recent allotetraploidization event and five high-copy LTR retrotransposon (LTR-RT) families PgDel, PgTat, PgAthila, PgTork, and PgOryco in Panax ginseng. Here, using whole-genome sequences, we quantified major repeats in five Panax species and investigated their role in genome evolution. The diploids P. japonicus, P. vietnamensis, and P. notoginseng and the tetraploids P. ginseng and P. quinquefolius were analyzed alongside their relative Aralia elata. These species possess 0.8-4.9 Gb haploid genomes. The PgDel, PgTat, PgAthila, and PgTork LTR-RT superfamilies accounted for 39-52% of the Panax species genomes and 17% of the A. elata genome. PgDel included six subfamily members, each with a distinct genome distribution. In particular, the PgDel1 subfamily occupied 23-35% of the Panax genomes and accounted for much of their genome size variation. PgDel1 occupied 22.6% (0.8 Gb of 3.6 Gb) and 34.5% (1.7 Gb of 4.9 Gb) of the P. ginseng and P. quinquefolius genomes, respectively. Our findings indicate that the P. quinquefolius genome may have expanded due to rapid PgDel1 amplification over the last million years as a result of environmental adaptation following migration from Asia to North America.
Assuntos
Tamanho do Genoma , Genoma de Planta , Genômica , Panax/classificação , Panax/genética , Retroelementos , Mapeamento Cromossômico , Variação Genética , Genômica/métodos , Família Multigênica , Sequências Repetitivas de Ácido Nucleico , Sequenciamento Completo do GenomaRESUMO
Ginseng represents a set of high-value medicinal plants of different species: Panax ginseng (Asian ginseng), Panax quinquefolius (American ginseng), Panax notoginseng (Chinese ginseng), Panax japonicus (Bamboo ginseng), and Panax vietnamensis (Vietnamese ginseng). Each species is pharmacologically and economically important, with differences in efficacy and price. Accordingly, an authentication system is needed to combat economically motivated adulteration of Panax products. We conducted comparative analysis of the chloroplast genome sequences of these five species, identifying 34-124 InDels and 141-560 SNPs. Fourteen InDel markers were developed to authenticate the Panax species. Among these, eight were species-unique markers that successfully differentiated one species from the others. We generated at least one species-unique marker for each of the five species, and any of the species can be authenticated by selection among these markers. The markers are reliable, easily detectable, and valuable for applications in the ginseng industry as well as in related research.
Assuntos
Genoma de Cloroplastos , Panax/genética , Marcadores Genéticos , Panax/classificação , Proteínas de Plantas/genética , Raízes de Plantas/classificação , Raízes de Plantas/genética , Polimorfismo de Nucleotídeo Único , Especificidade da EspécieRESUMO
The complete chloroplast genome sequence of Panax vietnamensis, a medicinal herb belonging to Araliaceae family, was generated by de novo assembly using whole genome next-generation sequences. The chloroplast genome was a circular form of 155 992 bp long and showed typical chloroplast genome structure consisting of a large single-copy region of 86 177 bp, a small single copy region of 17 935 bp and a pair of inverted repeats of 25 940 bp. The chloroplast genome had 79 protein-coding genes, 29 tRNA genes and 4 rRNA genes. The phylogenetic analysis with the reported chloroplast genomes revealed that four Panax species were grouped in the same clade and P. vietnamensis is more closely related to P. notoginseng than P. ginseng and P. quinquefolius.
Assuntos
Genes de Cloroplastos , Genoma de Cloroplastos , Panax/genética , Filogenia , Sequência de Bases , DNA de Cloroplastos , Tamanho do Genoma , Genoma de Planta , Genômica , Análise de Sequência de DNARESUMO
Cynanchum auriculatum is a climbing vine belonging to the Apocynaceae family and shows very similar morphology to Cynanchum wilfordii, a medicinal plant. The complete chloroplast genome of C. auriculatum was generated by de novo assembly using the small amount of whole genome sequencing data. The chloroplast genome of C. auriculatum was 160 840 bp in length and consisted of four distinct regions, such as large single copy region (91 973 bp), small single copy region (19 667 bp), and a pair of inverted repeat regions (24 600 bp). The overall GC contents of the chloroplast genome were 37.8%. A total of 114 genes were predicted and included 80 protein-coding genes, 30 tRNA genes, and four rRNA genes. Phylogenetic analysis with the reported chloroplast genomes revealed that C. auriculatum is most closely related to Cynanchum wilfordii, a medicinal plant.
Assuntos
Apocynaceae/genética , Cloroplastos/genética , Genoma de Cloroplastos , Apocynaceae/classificação , Composição de Bases , DNA de Cloroplastos/química , DNA de Cloroplastos/isolamento & purificação , DNA de Cloroplastos/metabolismo , Sequências Repetidas Invertidas/genética , Fases de Leitura Aberta/genética , Filogenia , Proteínas de Plantas/química , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , RNA Ribossômico/química , RNA Ribossômico/genética , RNA de Transferência/química , RNA de Transferência/genética , Análise de Sequência de DNARESUMO
Ledebouriella seseloides (Hoffm.) H.Wolff is a traditional medicinal herb belonging to Apiaceae family, whose dried roots and rhizomes have been used as traditional medicine in East Asian countries. The complete chloroplast genome of L. seseloides was obtained by de novo assembly using the small amount of whole genome sequencing data. The chloroplast genome of L. seseloides was 147 880 bp in length, which consisted of large single copy region (93 222 bp), small single copy region (17 324 bp), and a pair of inverted repeat regions (18 667 bp). The overall GC contents of the chloroplast genome were 37.5%. A total of 113 genes were annotated, which included 79 protein-coding genes, 30 tRNA genes, and four rRNA genes. Phylogenetic analysis with the reported chloroplast genomes revealed that L. seseloides is most closely related to Petroselinum crispum (parsley), an herb widely used in cooking.
Assuntos
Apiaceae/genética , Cloroplastos/genética , Genoma de Cloroplastos , Apiaceae/citologia , Composição de Bases , DNA Ribossômico/genética , Ordem dos Genes , Tamanho do Genoma , Filogenia , RNA de Transferência/genética , Análise de Sequência de DNA/métodosRESUMO
The complete chloroplast genome sequence of Panax quinquefolius, an important medicinal herb, was generated by de novo assembly with low-coverage whole-genome sequence data and manual correction. A circular 156 088-bp chloroplast genome showed typical chloroplast genome structure comprising a large single copy region of 86 095 bp, a small single copy region of 17 993 bp, and a pair of inverted repeats of 26 000 bp. The chloroplast genome had 87 protein-coding genes, 37 tRNA genes, and eight rRNA genes. Phylogenetic analysis with the chloroplast genome revealed that P. quinquefolius is much closer to P. ginseng than P. notoginseng.
Assuntos
Genoma de Cloroplastos , Genômica , Panax/classificação , Panax/genética , Genes de Plantas , Genômica/métodos , Fases de Leitura Aberta , Filogenia , Sequenciamento Completo do GenomaRESUMO
Cynanchum wilfordii (Maxim.) Hemsl. is a traditional medicinal herb belonging to the Asclepiadoideae subfamily, whose dried roots have been used as traditional medicine in Asia. The complete chloroplast genome of C. wilfordii was generated by de novo assembly using the small amount of whole genome sequencing data. The chloroplast genome of C. wilfordii was 161 241 bp long, composed of large single copy region (91 995 bp), small single copy region (19 930 bp) and a pair of inverted repeat regions (24 658 bp). The overall GC contents of the chloroplast genome was 37.8%. A total of 114 genes were annotated, which included 80 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Phylogenetic analysis with the reported chloroplast genomes revealed that C. wilfordii is most closely related to Asclepias nivea (Caribbean milkweed) and Asclepias syriaca (common milkweed) within the Asclepiadoideae subfamily.
Assuntos
Cynanchum/genética , Genoma de Cloroplastos , Composição de Bases , Evolução Molecular , Medicina Tradicional Coreana , Filogenia , Plantas Medicinais/genética , Sequenciamento Completo do GenomaRESUMO
Eleutherococcus gracilistylus is a plant species that is close to E. senticosus, a famous medicinal plant called Siberian ginseng. The complete chloroplast genome sequence of the E. gracilistylus was determined by de novo assembly using whole genome next generation sequences. The chloroplast genome of E. gracilistylus was 156 770 bp long and showed distinct four partite structures such as a large single copy region of 86 729 bp, a small single copy region of 18 175 bp, and a pair of inverted repeat regions of 25 933 bp. The overall GC contents of the genome sequence were 36.8%. The chloroplast genome of E. gracilistylus contains 79 protein-coding sequences, 30 tRNA genes, and four rRNA genes. The phylogenetic analysis with the reported chloroplast genomes confirmed close taxonomical relationship of E. gracilistylus with E. senticosus.
Assuntos
Eleutherococcus/genética , Genoma de Cloroplastos , Composição de Bases , Cloroplastos/genética , DNA de Cloroplastos/genética , Genes de Plantas , Sequências Repetidas Invertidas , Filogenia , Sequenciamento Completo do GenomaRESUMO
We report complete sequences of chloroplast (cp) genome and 45S nuclear ribosomal DNA (45S nrDNA) for 11 Panax ginseng cultivars. We have obtained complete sequences of cp and 45S nrDNA, the representative barcoding target sequences for cytoplasm and nuclear genome, respectively, based on low coverage NGS sequence of each cultivar. The cp genomes sizes ranged from 156,241 to 156,425 bp and the major size variation was derived from differences in copy number of tandem repeats in the ycf1 gene and in the intergenic regions of rps16-trnUUG and rpl32-trnUAG. The complete 45S nrDNA unit sequences were 11,091 bp, representing a consensus single transcriptional unit with an intergenic spacer region. Comparative analysis of these sequences as well as those previously reported for three Chinese accessions identified very rare but unique polymorphism in the cp genome within P. ginseng cultivars. There were 12 intra-species polymorphisms (six SNPs and six InDels) among 14 cultivars. We also identified five SNPs from 45S nrDNA of 11 Korean ginseng cultivars. From the 17 unique informative polymorphic sites, we developed six reliable markers for analysis of ginseng diversity and cultivar authentication.