Your browser doesn't support javascript.
loading
Prioritizing candidate eQTL causal genes in Arabidopsis using RANDOM FORESTS.
Hartanto, Margi; Sami, Asif Ahmed; de Ridder, Dick; Nijveen, Harm.
Afiliação
  • Hartanto M; Bioinformatics Group, Wageningen University and Research, 6708 PB Wageningen, The Netherlands.
  • Sami AA; Bioinformatics Group, Wageningen University and Research, 6708 PB Wageningen, The Netherlands.
  • de Ridder D; Bioinformatics Group, Wageningen University and Research, 6708 PB Wageningen, The Netherlands.
  • Nijveen H; Bioinformatics Group, Wageningen University and Research, 6708 PB Wageningen, The Netherlands.
G3 (Bethesda) ; 12(11)2022 11 04.
Article em En | MEDLINE | ID: mdl-36149290
ABSTRACT
Expression quantitative trait locus mapping has been widely used to study the genetic regulation of gene expression in Arabidopsis thaliana. As a result, a large amount of expression quantitative trait locus data has been generated for this model plant; however, only a few causal expression quantitative trait locus genes have been identified, and experimental validation is costly and laborious. A prioritization method could help speed up the identification of causal expression quantitative trait locus genes. This study extends the machine-learning-based QTG-Finder2 method for prioritizing candidate causal genes in phenotype quantitative trait loci to be used for expression quantitative trait loci by adding gene structure, protein interaction, and gene expression. Independent validation shows that the new algorithm can prioritize 16 out of 25 potential expression quantitative trait locus causal genes within the top 20% rank. Several new features are important in prioritizing causal expression quantitative trait locus genes, including the number of protein-protein interactions, unique domains, and introns. Overall, this study provides a foundation for developing computational methods to prioritize candidate expression quantitative trait locus causal genes. The prediction of all genes is available in the AraQTL workbench (https//www.bioinformatics.nl/AraQTL/) to support the identification of gene expression regulators in Arabidopsis.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Arabidopsis Tipo de estudo: Clinical_trials / Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Arabidopsis Tipo de estudo: Clinical_trials / Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article