Your browser doesn't support javascript.
loading
Full-length SMRT transcriptome sequencing and microsatellite characterization in Paulownia catalpifolia.
Feng, Yanzhi; Zhao, Yang; Zhang, Jiajia; Wang, Baoping; Yang, Chaowei; Zhou, Haijiang; Qiao, Jie.
Afiliação
  • Feng Y; Paulownia Research and Development Center of State Administration of Forestry and Grassland, Zhengzhou, 450003, China.
  • Zhao Y; Non-Timber Forestry Research and Development Center, Chinese Academy of Forestry, Zhengzhou, 450003, China.
  • Zhang J; Key Laboratory of Non-Timber Forest Germplasm Enhancement and Utilization of State Forestry Administration, Zhengzhou, 450003, China.
  • Wang B; National Innovation Alliance of Paulownia, Zhengzhou, 450003, China.
  • Yang C; Paulownia Research and Development Center of State Administration of Forestry and Grassland, Zhengzhou, 450003, China.
  • Zhou H; Non-Timber Forestry Research and Development Center, Chinese Academy of Forestry, Zhengzhou, 450003, China.
  • Qiao J; Key Laboratory of Non-Timber Forest Germplasm Enhancement and Utilization of State Forestry Administration, Zhengzhou, 450003, China.
Sci Rep ; 11(1): 8734, 2021 04 22.
Article em En | MEDLINE | ID: mdl-33888729
ABSTRACT
Paulownia catalpifolia is an important, fast-growing timber species known for its high density, color and texture. However, few transcriptomic and genetic studies have been conducted in P. catalpifolia. In this study, single-molecule real-time sequencing technology was applied to obtain the full-length transcriptome of P. catalpifolia leaves treated with varying degrees of drought stress. The sequencing data were then used to search for microsatellites, or simple sequence repeats (SSRs). A total of 28.83 Gb data were generated, 25,969 high-quality (HQ) transcripts with an average length of 1624 bp were acquired after removing the redundant reads, and 25,602 HQ transcripts (98.59%) were annotated using public databases. Among the HQ transcripts, 16,722 intact coding sequences, 149 long non-coding RNAs and 179 alternative splicing events were predicted, respectively. A total of 7367 SSR loci were distributed throughout 6293 HQ transcripts, of which 763 complex SSRs and 6604 complete SSRs. The SSR appearance frequency was 28.37%, and the average distribution distance was 5.59 kb. Among the 6604 complete SSR loci, 1-3 nucleotide repeats were dominant, occupying 97.85% of the total SSR loci, of which mono-, di- and tri-nucleotide repeats were 44.68%, 33.86% and 19.31%, respectively. We detected 112 repeat motifs, of which A/T (42.64%), AG/CT (12.22%), GA/TC (9.63%), GAA/TTC (1.57%) and CCA/TGG (1.54%) were most common in mono-, di- and tri-nucleotide repeats, respectively. The length of the repeat SSR motifs was 10-88 bp, and 4997 (75.67%) were ≤ 20 bp. This study provides a novel full-length transcriptome reference for P. catalpifolia and will facilitate the identification of germplasm resources and breeding of new drought-resistant P. catalpifolia varieties.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Análise de Sequência de RNA / Repetições de Microssatélites / Transcriptoma / Lamiales / Imagem Individual de Molécula Idioma: En Revista: Sci Rep Ano de publicação: 2021 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Análise de Sequência de RNA / Repetições de Microssatélites / Transcriptoma / Lamiales / Imagem Individual de Molécula Idioma: En Revista: Sci Rep Ano de publicação: 2021 Tipo de documento: Article País de afiliação: China