Your browser doesn't support javascript.
SEMtree: tree-based structure learning methods with structural equation models.
Grassi, Mario; Tarantino, Barbara.
  • Grassi M; Department of Brain and Behavioral Sciences, University of Pavia, Pavia 27100, Italy.
  • Tarantino B; Department of Brain and Behavioral Sciences, University of Pavia, Pavia 27100, Italy.
Bioinformatics ; 39(6)2023 Jun 01.
Статья в английский | MEDLINE | ID: covidwho-20236221
ABSTRACT
MOTIVATION With the exponential growth of expression and protein-protein interaction (PPI) data, the identification of functional modules in PPI networks that show striking changes in molecular activity or phenotypic signatures becomes of particular interest to reveal process-specific information that is correlated with cellular or disease states. This requires both the identification of network nodes with reliability scores and the availability of an efficient technique to locate the network regions with the highest scores. In the literature, a number of heuristic methods have been suggested. We propose SEMtree(), a set of tree-based structure discovery algorithms, combining graph and statistically interpretable parameters together with a user-friendly R package based on structural equation models framework.

RESULTS:

Condition-specific changes from differential expression and gene-gene co-expression are recovered with statistical testing of node, directed edge, and directed path difference between groups. In the end, from a list of seed (i.e. disease) genes or gene P-values, the perturbed modules with undirected edges are generated with five state-of-the-art active subnetwork detection methods. The latter are supplied to causal additive trees based on Chu-Liu-Edmonds' algorithm (Chow and Liu, Approximating discrete probability distributions with dependence trees. IEEE Trans Inform Theory 1968;14462-7) in SEMtree() to be converted in directed trees. This conversion allows to compare the methods in terms of directed active subnetworks. We applied SEMtree() to both Coronavirus disease (COVID-19) RNA-seq dataset (GEO accession GSE172114) and simulated datasets with various differential expression patterns. Compared to existing methods, SEMtree() is able to capture biologically relevant subnetworks with simple visualization of directed paths, good perturbation extraction, and classifier performance. AVAILABILITY AND IMPLEMENTATION SEMtree() function is implemented in the R package SEMgraph, easily available at https//CRAN.R-project.org/package=SEMgraph.
Тема - темы

Полный текст: Имеется в наличии Коллекция: Международные базы данных база данных: MEDLINE Основная тема: Gene Regulatory Networks / COVID-19 Тип исследования: Экспериментальные исследования / Прогностическое исследование / Рандомизированные контролируемые испытания Пределы темы: Люди Язык: английский Тематика журнала: Медицинская информатика Год: 2023 Тип: Статья Аффилированная страна: Bioinformatics

Документы, близкие по теме

MEDLINE

...
LILACS

LIS


Полный текст: Имеется в наличии Коллекция: Международные базы данных база данных: MEDLINE Основная тема: Gene Regulatory Networks / COVID-19 Тип исследования: Экспериментальные исследования / Прогностическое исследование / Рандомизированные контролируемые испытания Пределы темы: Люди Язык: английский Тематика журнала: Медицинская информатика Год: 2023 Тип: Статья Аффилированная страна: Bioinformatics