Your browser doesn't support javascript.
loading
Datascape: exploring heterogeneous dataspace.
Rolland, Jakez; Boutin, Ronan; Eveillard, Damien; Delahaye, Benoit.
Afiliação
  • Rolland J; Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, 44322, Nantes, France. jakez.rolland@univ-nantes.fr.
  • Boutin R; Bio Logbook, 44200, Nantes, France. jakez.rolland@univ-nantes.fr.
  • Eveillard D; Bio Logbook, 44200, Nantes, France.
  • Delahaye B; Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, 44322, Nantes, France.
Sci Rep ; 14(1): 7041, 2024 Apr 05.
Article em En | MEDLINE | ID: mdl-38580694
ABSTRACT
Data science is a powerful field for gaining insights, comparing, and predicting behaviors from datasets. However, the diversity of methods and hypotheses needed to abstract a dataset exhibits a lack of genericity. Moreover, the shape of a dataset, which structures its contained information and uncertainties, is rarely considered. Inspired by state-of-the-art manifold learning and hull estimations algorithms, we propose a novel framework, the datascape, that leverages topology and graph theory to abstract heterogeneous datasets. Built upon the combination of a nearest neighbor graph, a set of convex hulls, and a metric distance that respects the shape of the data, the datascape allows exploration of the dataset's underlying space. We show that the datascape can uncover underlying functions from simulated datasets, build predictive algorithms with performance close to state-of-the-art algorithms, and reveal insightful geodesic paths between points. It demonstrates versatility through ecological, medical, and simulated data use cases.

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article