Your browser doesn't support javascript.
loading
Machine learning, transcriptome, and genotyping chip analyses provide insights into SNP markers identifying flower color in Platycodon grandiflorus.
Yu, Go-Eun; Shin, Younhee; Subramaniyam, Sathiyamoorthy; Kang, Sang-Ho; Lee, Si-Myung; Cho, Chuloh; Lee, Seung-Sik; Kim, Chang-Kug.
Afiliação
  • Yu GE; Genomics Division, National Institute of Agricultural Sciences, Jeonju, 54874, Korea.
  • Shin Y; Research and Development Center, Insilicogen Inc., Yongin-si 16954, Gyeonggi-do, Republic of Korea.
  • Subramaniyam S; Research and Development Center, Insilicogen Inc., Yongin-si 16954, Gyeonggi-do, Republic of Korea.
  • Kang SH; Genomics Division, National Institute of Agricultural Sciences, Jeonju, 54874, Korea.
  • Lee SM; Genomics Division, National Institute of Agricultural Sciences, Jeonju, 54874, Korea.
  • Cho C; Crop Foundation Research Division, National Institute of Crop Science, RDA, Wanju, 55365, Korea.
  • Lee SS; Advanced Radiation Technology Institute, Korea Atomic Energy Research Institute, 29 Geumgu-gil, Jeongeup, 56212, Korea.
  • Kim CK; Department of Radiation Science and Technology, University of Science and Technology, Daejeon, 34113, Korea.
Sci Rep ; 11(1): 8019, 2021 04 13.
Article em En | MEDLINE | ID: mdl-33850210
ABSTRACT
Bellflower is an edible ornamental gardening plant in Asia. For predicting the flower color in bellflower plants, a transcriptome-wide approach based on machine learning, transcriptome, and genotyping chip analyses was used to identify SNP markers. Six machine learning methods were deployed to explore the classification potential of the selected SNPs as features in two datasets, namely training (60 RNA-Seq samples) and validation (480 Fluidigm chip samples). SNP selection was performed in sequential order. Firstly, 96 SNPs were selected from the transcriptome-wide SNPs using the principal compound analysis (PCA). Then, 9 among 96 SNPs were later identified using the Random forest based feature selection method from the Fluidigm chip dataset. Among six machines, the random forest (RF) model produced higher classification performance than the other models. The 9 SNP marker candidates selected for classifying the flower color classification were verified using the genomic DNA PCR with Sanger sequencing. Our results suggest that this methodology could be used for future selection of breeding traits even though the plant accessions are highly heterogeneous.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Polimorfismo de Nucleotídeo Único / Platycodon / Aprendizado de Máquina Tipo de estudo: Prognostic_studies Idioma: En Revista: Sci Rep Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Polimorfismo de Nucleotídeo Único / Platycodon / Aprendizado de Máquina Tipo de estudo: Prognostic_studies Idioma: En Revista: Sci Rep Ano de publicação: 2021 Tipo de documento: Article