Evaluating the performance of random forest and iterative random forest based methods when applied to gene expression data.

Walker, Angelica M; Cliff, Ashley; Romero, Jonathon; Shah, Manesh B; Jones, Piet; Felipe Machado Gazolla, Joao Gabriel; Jacobson, Daniel A; Kainer, David

Walker, Angelica M; Cliff, Ashley; Romero, Jonathon; Shah, Manesh B; Jones, Piet; Felipe Machado Gazolla, Joao Gabriel; Jacobson, Daniel A; Kainer, David.

Afiliación

Walker AM; The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee Knoxville, 821 Volunteer Blvd, Knoxville 37996, TN, USA.
Cliff A; The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee Knoxville, 821 Volunteer Blvd, Knoxville 37996, TN, USA.
Romero J; The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee Knoxville, 821 Volunteer Blvd, Knoxville 37996, TN, USA.
Shah MB; Computational and Predictive Biology, Oak Ridge National Laboratory, 1 Bethel Valley Rd, Oak Ridge 37830, TN, USA.
Jones P; The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee Knoxville, 821 Volunteer Blvd, Knoxville 37996, TN, USA.
Felipe Machado Gazolla JG; Computational and Predictive Biology, Oak Ridge National Laboratory, 1 Bethel Valley Rd, Oak Ridge 37830, TN, USA.
Jacobson DA; Computational and Predictive Biology, Oak Ridge National Laboratory, 1 Bethel Valley Rd, Oak Ridge 37830, TN, USA.
Kainer D; Computational and Predictive Biology, Oak Ridge National Laboratory, 1 Bethel Valley Rd, Oak Ridge 37830, TN, USA.

Comput Struct Biotechnol J ; 20: 3372-3386, 2022.

Article en En | MEDLINE | ID: mdl-35832622

ABSTRACT

ABSTRACT

Gene-to-gene networks, such as Gene Regulatory Networks (GRN) and Predictive Expression Networks (PEN) capture relationships between genes and are beneficial for use in downstream biological analyses. There exists multiple network inference tools to produce these gene-to-gene networks from matrices of gene expression data. Random Forest-Leave One Out Prediction (RF-LOOP) is a method that has been shown to be efficient at producing these gene-to-gene networks, frequently known as GEne Network Inference with Ensemble of trees (GENIE3). Random Forest can be replaced in this process by iterative Random Forest (iRF), which performs variable selection and boosting. Here we validate that iterative Random Forest-Leave One Out Prediction (iRF-LOOP) produces higher quality networks than GENIE3 (RF-LOOP). We use both synthetic and empirical networks from the Dialogue for Reverse Engineering Assessment and Methods (DREAM) Challenges by Sage Bionetworks, as well as two additional empirical networks created from Arabidopsis thaliana and Populus trichocarpa expression data.

Palabras clave

GENIE3, GEne Network Inference with Ensemble of trees; GRN, Gene Regulatory Network; Gene expression networks; Iterative random forest; Network biology; PEN, Predictive Expression Network; RF, Random Forest; RF-LOOP, Random Forest Leave One Out Prediction; Random forest; iRF, iterative Random Forest; iRF-LOOP, iterativeRandom Forest Leave One Out Prediction

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Clinical_trials / Prognostic_studies Idioma: En Revista: Comput Struct Biotechnol J Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google