Your browser doesn't support javascript.
loading
Learning genetic and environmental graphical models from family data.
Ribeiro, Adèle H; Maria Pavan Soler, Júlia.
Afiliación
  • Ribeiro AH; Department of Computer Science, Institute of Mathematics and Statistics, University of São Paulo (IME-USP), São Paulo, Brazil.
  • Maria Pavan Soler J; Department of Statistics, Institute of Mathematics and Statistics, University of São Paulo (IME-USP), São Paulo, Brazil.
Stat Med ; 39(18): 2403-2422, 2020 08 15.
Article en En | MEDLINE | ID: mdl-32346898
Many challenging problems in biomedical research rely on understanding how variables are associated with each other and influenced by genetic and environmental factors. Probabilistic graphical models (PGMs) are widely acknowledged as a very natural and formal language to describe relationships among variables and have been extensively used for studying complex diseases and traits. In this work, we propose methods that leverage observational Gaussian family data for learning a decomposition of undirected and directed acyclic PGMs according to the influence of genetic and environmental factors. Many structure learning algorithms are strongly based on a conditional independence test. For independent measurements of normally distributed variables, conditional independence can be tested through standard tests for zero partial correlation. In family data, the assumption of independent measurements does not hold since related individuals are correlated due to mainly genetic factors. Based on univariate polygenic linear mixed models, we propose tests that account for the familial dependence structure and allow us to assess the significance of the partial correlation due to genetic (between-family) factors and due to other factors, denoted here as environmental (within-family) factors, separately. Then, we extend standard structure learning algorithms, including the IC/PC and the really fast causal inference (RFCI) algorithms, to Gaussian family data. The algorithms learn the most likely PGM and its decomposition into two components, one explained by genetic factors and the other by environmental factors. The proposed methods are evaluated by simulation studies and applied to the Genetic Analysis Workshop 13 simulated dataset, which captures significant features of the Framingham Heart Study.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Algoritmos / Modelos Estadísticos Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Stat Med Año: 2020 Tipo del documento: Article País de afiliación: Brasil

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Algoritmos / Modelos Estadísticos Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: Stat Med Año: 2020 Tipo del documento: Article País de afiliación: Brasil