RESUMO
Gene co-expression networks (GCN) present undirected relations between genes to understand molecular structures behind the diseases, including cancer. The utilization of various biological datasets and gene network inference (GNI) algorithms can reveal meaningful gene-gene interactions of GCNs. This study applies three GNI algorithms on mRNA gene expression, RNA-Seq, and miRNA-target genes datasets to infer GCNs of breast and prostate cancers. To evaluate the performance of the GCNs, we utilize overlap analysis via literature data, topological assessment, and Gene Ontology-based biological assessment. The results emphasize how the selection of biological datasets and GNI algorithms affect the performance results on different evaluation criteria. GCNs on microarray gene expression data slightly outperform in overlap analysis. Also, GCNs on RNA-Seq and gene expression datasets follow scale-free topology. The biological assessment results are close to each other on all biological datasets. C3NET algorithm-based GCNs did not contain any biological assessment modules; therefore, it is not optimal for biological assessment. GNI algorithms' selection did not change the overlap analysis and topological assessment results. Our primary objective is to compare the performance results of biological datasets and GNI algorithms based on different evaluation criteria. For this purpose, we developed the GNIAP R package that enables users to select different GNI algorithms to infer GCNs. The GNIAP R package also provides literature-based overlap analysis, and topological and biological analyses on GCNs. Users can access the GNIAP R package via https://github.com/ozgurcingiz/GNIAP .
Assuntos
Redes Reguladoras de Genes , Neoplasias da Próstata , Algoritmos , Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Masculino , Seleção de Pacientes , Neoplasias da Próstata/genéticaRESUMO
Advances in DNA sequencing technologies enable researchers to integrate various biological datasets in order to reveal hidden relations at the molecular level. In this study, we present a two-tiered combinatorial structure (TTCS) to integrate gene co-expression networks (GCNs) that are inferred from microarray gene expression, RNA-Seq and miRNA-target gene data. In the initial phase of TTCS, we derive GCNs by using gene network inference (GNI) algorithms for each dataset. In the first and second integration phases, we use straightforward methods: intersection, union and simple majority voting to combine GCNs. We use overlap, topological and biological analyses in performance evaluation and investigate the integration effects of GCNs separately for all phases. Our results prove that the first integration phase has limited contribution on performance. However, combining the biological datasets in the second phase significantly enhances the overlap and topological performance analyses.