Your browser doesn't support javascript.
loading
Relationship between gene regulation network structure and prediction accuracy in high dimensional regression.
Okinaga, Yuichi; Kyogoku, Daisuke; Kondo, Satoshi; Nagano, Atsushi J; Hirose, Kei.
Afiliação
  • Okinaga Y; Graduate School of Mathematics, Kyushu University, 744 Motooka, Fukuoka, 819-0395, Japan.
  • Kyogoku D; The Museum of Nature and Human Activities, 6 Yayoigaoka, Sanda, Hyogo, 669-1546, Japan.
  • Kondo S; Agriculture and Biotechnology Business Division, Toyota Motor Corporation, Miyoshi, Aichi, 470-0201, Japan.
  • Nagano AJ; Faculty of Agriculture, Ryukoku University, Otsu, Shiga, 520-2194, Japan. anagano1234@gmail.com.
  • Hirose K; Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata, 997-0017, Japan. anagano1234@gmail.com.
Sci Rep ; 11(1): 11483, 2021 06 01.
Article em En | MEDLINE | ID: mdl-34075095
ABSTRACT
The least absolute shrinkage and selection operator (lasso) and principal component regression (PCR) are popular methods of estimating traits from high-dimensional omics data, such as transcriptomes. The prediction accuracy of these estimation methods is highly dependent on the covariance structure, which is characterized by gene regulation networks. However, the manner in which the structure of a gene regulation network together with the sample size affects prediction accuracy has not yet been sufficiently investigated. In this study, Monte Carlo simulations are conducted to investigate the prediction accuracy for several network structures under various sample sizes. When the gene regulation network is a random graph, a sufficiently large number of observations are required to ensure good prediction accuracy with the lasso. The PCR provided poor prediction accuracy regardless of the sample size. However, a real gene regulation network is likely to exhibit a scale-free structure. In such cases, the simulation indicates that a relatively small number of observations, such as [Formula see text], is sufficient to allow the accurate prediction of traits from a transcriptome with the lasso.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genômica / Locos de Características Quantitativas / Redes Reguladoras de Genes / Modelos Genéticos Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Sci Rep Ano de publicação: 2021 Tipo de documento: Article País de afiliação: Japão

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genômica / Locos de Características Quantitativas / Redes Reguladoras de Genes / Modelos Genéticos Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Sci Rep Ano de publicação: 2021 Tipo de documento: Article País de afiliação: Japão
...