Búsqueda | Portal de Búsqueda de la BVS Colombia

Learning clinical networks from medical records based on information estimates in mixed-type data.

Cabeli, Vincent; Verny, Louis; Sella, Nadir; Uguzzoni, Guido; Verny, Marc; Isambert, Hervé.

PLoS Comput Biol ; 16(5): e1007866, 2020 05.

Artículo en Inglés | MEDLINE | ID: mdl-32421707

RESUMEN

The precise diagnostics of complex diseases require to integrate a large amount of information from heterogeneous clinical and biomedical data, whose direct and indirect interdependences are notoriously difficult to assess. To this end, we propose an efficient computational approach to simultaneously compute and assess the significance of multivariate information between any combination of mixed-type (continuous/categorical) variables. The method is then used to uncover direct, indirect and possibly causal relationships between mixed-type data from medical records, by extending a recent machine learning method to reconstruct graphical models beyond simple categorical datasets. The method is shown to outperform existing tools on benchmark mixed-type datasets, before being applied to analyze the medical records of eldery patients with cognitive disorders from La Pitié-Salpêtrière Hospital, Paris. The resulting clinical network visually captures the global interdependences in these medical records and some facets of clinical diagnosis practice, without specific hypothesis nor prior knowledge on any clinically relevant information. In particular, it provides some physiological insights linking the consequence of cerebrovascular accidents to the atrophy of important brain structures associated to cognitive impairment.

Asunto(s)

Aprendizaje , Registros Médicos , Algoritmos , Conjuntos de Datos como Asunto , Humanos , Aprendizaje Automático , Paris

MIIC online: a web server to reconstruct causal or non-causal networks from non-perturbative data.

Sella, Nadir; Verny, Louis; Uguzzoni, Guido; Affeldt, Séverine; Isambert, Hervé.

Bioinformatics ; 34(13): 2311-2313, 2018 07 01.

Artículo en Inglés | MEDLINE | ID: mdl-29300827

RESUMEN

Summary: We present a web server running the MIIC algorithm, a network learning method combining constraint-based and information-theoretic frameworks to reconstruct causal, non-causal or mixed networks from non-perturbative data, without the need for an a priori choice on the class of reconstructed network. Starting from a fully connected network, the algorithm first removes dispensable edges by iteratively subtracting the most significant information contributions from indirect paths between each pair of variables. The remaining edges are then filtered based on their confidence assessment or oriented based on the signature of causality in observational data. MIIC online server can be used for a broad range of biological data, including possible unobserved (latent) variables, from single-cell gene expression data to protein sequence evolution and outperforms or matches state-of-the-art methods for either causal or non-causal network reconstruction. Availability and implementation: MIIC online can be freely accessed at https://miic.curie.fr. Supplementary information: Supplementary data are available at Bioinformatics online.

Asunto(s)

Redes Neurales de la Computación , Algoritmos , Computadores , Programas Informáticos

Learning causal networks with latent variables from multivariate information in genomic data.

Verny, Louis; Sella, Nadir; Affeldt, Séverine; Singh, Param Priya; Isambert, Hervé.

PLoS Comput Biol ; 13(10): e1005662, 2017 Oct.

Artículo en Inglés | MEDLINE | ID: mdl-28968390

RESUMEN

Learning causal networks from large-scale genomic data remains challenging in absence of time series or controlled perturbation experiments. We report an information- theoretic method which learns a large class of causal or non-causal graphical models from purely observational data, while including the effects of unobserved latent variables, commonly found in many genomic datasets. Starting from a complete graph, the method iteratively removes dispensable edges, by uncovering significant information contributions from indirect paths, and assesses edge-specific confidences from randomization of available data. The remaining edges are then oriented based on the signature of causality in observational data. The approach and associated algorithm, miic, outperform earlier methods on a broad range of benchmark networks. Causal network reconstructions are presented at different biological size and time scales, from gene regulation in single cells to whole genome duplication in tumor development as well as long term evolution of vertebrates. Miic is publicly available at https://github.com/miicTeam/MIIC.

Asunto(s)

Biomarcadores/análisis , Biología Computacional/métodos , Redes Reguladoras de Genes , Genómica/métodos , Animales , Neoplasias de la Mama/genética , Células Cultivadas , Embrión de Mamíferos/citología , Embrión de Mamíferos/metabolismo , Femenino , Perfilación de la Expresión Génica , Hematopoyesis , Humanos , Ratones , Modelos Genéticos , Análisis de la Célula Individual

3off2: A network reconstruction algorithm based on 2-point and 3-point information statistics.

Affeldt, Séverine; Verny, Louis; Isambert, Hervé.

BMC Bioinformatics ; 17 Suppl 2: 12, 2016 Jan 20.

Artículo en Inglés | MEDLINE | ID: mdl-26823190

RESUMEN

BACKGROUND: The reconstruction of reliable graphical models from observational data is important in bioinformatics and other computational fields applying network reconstruction methods to large, yet finite datasets. The main network reconstruction approaches are either based on Bayesian scores, which enable the ranking of alternative Bayesian networks, or rely on the identification of structural independencies, which correspond to missing edges in the underlying network. Bayesian inference methods typically require heuristic search strategies, such as hill-climbing algorithms, to sample the super-exponential space of possible networks. By contrast, constraint-based methods, such as the PC and IC algorithms, are expected to run in polynomial time on sparse underlying graphs, provided that a correct list of conditional independencies is available. Yet, in practice, conditional independencies need to be ascertained from the available observational data, based on adjustable statistical significance levels, and are not robust to sampling noise from finite datasets. RESULTS: We propose a more robust approach to reconstruct graphical models from finite datasets. It combines constraint-based and Bayesian approaches to infer structural independencies based on the ranking of their most likely contributing nodes. In a nutshell, this local optimization scheme and corresponding 3off2 algorithm iteratively "take off" the most likely conditional 3-point information from the 2-point (mutual) information between each pair of nodes. Conditional independencies are thus derived by progressively collecting the most significant indirect contributions to all pairwise mutual information. The resulting network skeleton is then partially directed by orienting and propagating edge directions, based on the sign and magnitude of the conditional 3-point information of unshielded triples. The approach is shown to outperform both constraint-based and Bayesian inference methods on a range of benchmark networks. The 3off2 approach is then applied to the reconstruction of the hematopoiesis regulation network based on recent single cell expression data and is found to retrieve more experimentally ascertained regulations between transcription factors than with other available methods. CONCLUSIONS: The novel information-theoretic approach and corresponding 3off2 algorithm combine constraint-based and Bayesian inference methods to reliably reconstruct graphical models, despite inherent sampling noise in finite datasets. In particular, experimentally verified interactions as well as novel predicted regulations are established on the hematopoiesis regulatory networks based on single cell expression data.

Asunto(s)

Algoritmos , Teorema de Bayes , Redes Reguladoras de Genes , Hematopoyesis , Modelos Genéticos , Animales , Biología Computacional/métodos , Perfilación de la Expresión Génica , Ratones , Análisis de la Célula Individual

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA