Your browser doesn't support javascript.
loading
Discovering Condition-Specific Gene Co-Expression Patterns Using Gaussian Mixture Models: A Cancer Case Study.
Ficklin, Stephen P; Dunwoodie, Leland J; Poehlman, William L; Watson, Christopher; Roche, Kimberly E; Feltus, F Alex.
  • Ficklin SP; Department of Horticulture, Washington State University, Pullman, WA, 99164, USA. stephen.ficklin@wsu.edu.
  • Dunwoodie LJ; Department of Genetics & Biochemistry, Clemson University, Clemson, SC, 29631, USA.
  • Poehlman WL; Department of Genetics & Biochemistry, Clemson University, Clemson, SC, 29631, USA.
  • Watson C; Molecular Plant Sciences Program, Washington State University, Pullman, WA, 99164, USA.
  • Roche KE; Department of Genetics & Biochemistry, Clemson University, Clemson, SC, 29631, USA.
  • Feltus FA; Department of Genetics & Biochemistry, Clemson University, Clemson, SC, 29631, USA. ffeltus@clemson.edu.
Sci Rep ; 7(1): 8617, 2017 08 17.
Article en En | MEDLINE | ID: mdl-28819158
A gene co-expression network (GCN) describes associations between genes and points to genetic coordination of biochemical pathways. However, genetic correlations in a GCN are only detectable if they are present in the sampled conditions. With the increasing quantity of gene expression samples available in public repositories, there is greater potential for discovery of genetic correlations from a variety of biologically interesting conditions. However, even if gene correlations are present, their discovery can be masked by noise. Noise is introduced from natural variation (intrinsic and extrinsic), systematic variation (caused by sample measurement protocols and instruments), and algorithmic and statistical variation created by selection of data processing tools. A variety of published studies, approaches and methods attempt to address each of these contributions of variation to reduce noise. Here we describe an approach using Gaussian Mixture Models (GMMs) to address natural extrinsic (condition-specific) variation during network construction from mixed input conditions. To demonstrate utility, we build and analyze a condition-annotated GCN from a compendium of 2,016 mixed gene expression data sets from five tumor subtypes obtained from The Cancer Genome Atlas. Our results show that GMMs help discover tumor subtype specific gene co-expression patterns (modules) that are significantly enriched for clinical attributes.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Regulación Neoplásica de la Expresión Génica / Perfilación de la Expresión Génica / Redes Reguladoras de Genes / Neoplasias Tipo de estudio: Diagnostic_studies / Guideline / Prognostic_studies Límite: Humans Idioma: En Año: 2017 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Regulación Neoplásica de la Expresión Génica / Perfilación de la Expresión Génica / Redes Reguladoras de Genes / Neoplasias Tipo de estudio: Diagnostic_studies / Guideline / Prognostic_studies Límite: Humans Idioma: En Año: 2017 Tipo del documento: Article