Gaining Biological Insights through Supervised Data Visualization.

Rhodes, Jake S; Aumon, Adrien; Morin, Sacha; Girard, Marc; Larochelle, Catherine; Brunet-Ratnasingham, Elsa; Pagliuzza, Amélie; Marchitto, Lorie; Zhang, Wei; Cutler, Adele; Grand'Maison, Francois; Zhou, Anhong; Finzi, Andrés; Chomont, Nicolas; Kaufmann, Daniel E; Zandee, Stephanie; Prat, Alexandre; Wolf, Guy; Moon, Kevin R

Rhodes, Jake S; Aumon, Adrien; Morin, Sacha; Girard, Marc; Larochelle, Catherine; Brunet-Ratnasingham, Elsa; Pagliuzza, Amélie; Marchitto, Lorie; Zhang, Wei; Cutler, Adele; Grand'Maison, Francois; Zhou, Anhong; Finzi, Andrés; Chomont, Nicolas; Kaufmann, Daniel E; Zandee, Stephanie; Prat, Alexandre; Wolf, Guy; Moon, Kevin R.

bioRxiv ; 2024 Jan 21.

Article em En | MEDLINE | ID: mdl-38293135

ABSTRACT

ABSTRACT

Dimensionality reduction-based data visualization is pivotal in comprehending complex biological data. The most common methods, such as PHATE, t-SNE, and UMAP, are unsupervised and therefore reflect the dominant structure in the data, which may be independent of expert-provided labels. Here we introduce a supervised data visualization method called RF-PHATE, which integrates expert knowledge for further exploration of the data. RF-PHATE leverages random forests to capture intricate featurelabel relationships. Extracting information from the forest, RF-PHATE generates low-dimensional visualizations that highlight relevant data relationships while disregarding extraneous features. This approach scales to large datasets and applies to classification and regression. We illustrate RF-PHATE's prowess through three case studies. In a multiple sclerosis study using longitudinal clinical and imaging data, RF-PHATE unveils a sub-group of patients with non-benign relapsingremitting Multiple Sclerosis, demonstrating its aptitude for time-series data. In the context of Raman spectral data, RF-PHATE effectively showcases the impact of antioxidants on diesel exhaust-exposed lung cells, highlighting its proficiency in noisy environments. Furthermore, RF-PHATE aligns established geometric structures with COVID-19 patient outcomes, enriching interpretability in a hierarchical manner. RF-PHATE bridges expert insights and visualizations, promising knowledge generation. Its adaptability, scalability, and noise tolerance underscore its potential for widespread adoption.

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article