Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
BMC Bioinformatics ; 21(1): 3, 2020 Jan 02.
Artículo en Inglés | MEDLINE | ID: mdl-31898480

RESUMEN

BACKGROUND: Observed levels of gene expression strongly depend on both activity of DNA binding transcription factors (TFs) and chromatin state through different histone modifications (HMs). In order to recover the functional relationship between local chromatin state, TF binding and observed levels of gene expression, regression methods have proven to be useful tools. They have been successfully applied to predict mRNA levels from genome-wide experimental data and they provide insight into context-dependent gene regulatory mechanisms. However, heterogeneity arising from gene-set specific regulatory interactions is often overlooked. RESULTS: We show that regression models that predict gene expression by using experimentally derived ChIP-seq profiles of TFs can be significantly improved by mixture modelling. In order to find biologically relevant gene clusters, we employ a Bayesian allocation procedure which allows us to integrate additional biological information such as three-dimensional nuclear organization of chromosomes and gene function. The data integration procedure involves transforming the additional data into gene similarity values. We propose a generic similarity measure that is especially suitable for situations where the additional data are of both continuous and discrete type, and compare its performance with similar measures in the context of mixture modelling. CONCLUSIONS: We applied the proposed method on a data from mouse embryonic stem cells (ESC). We find that including additional data results in mixture components that exhibit biologically meaningful gene clusters, and provides valuable insight into the heterogeneity of the regulatory interactions.


Asunto(s)
Células Madre Embrionarias/metabolismo , Regulación de la Expresión Génica , Células Madre Pluripotentes/metabolismo , Animales , Teorema de Bayes , Cromatina/genética , Cromatina/metabolismo , Inmunoprecipitación de Cromatina , Genoma , Ratones , Análisis de Regresión , Factores de Transcripción/genética , Factores de Transcripción/metabolismo
2.
Biom J ; 60(3): 547-563, 2018 05.
Artículo en Inglés | MEDLINE | ID: mdl-29320604

RESUMEN

Cross-sectional studies may shed light on the evolution of a disease like cancer through the comparison of patient traits among disease stages. This problem is especially challenging when a gene-gene interaction network needs to be reconstructed from omics data, and, in addition, the patients of each stage need not form a homogeneous group. Here, the problem is operationalized as the estimation of stage-wise mixtures of Gaussian graphical models (GGMs) from high-dimensional data. These mixtures are fitted by a (fused) ridge penalized EM algorithm. The fused ridge penalty shrinks GGMs of contiguous stages. The (fused) ridge penalty parameters are chosen through cross-validation. The proposed estimation procedures are shown to be consistent and their performance in other respects is studied in simulation. The down-stream exploitation of the fitted GGMs is outlined. In a data illustration the methodology is employed to identify gene-gene interaction network changes in the transition from normal to cancer prostate tissue.


Asunto(s)
Biología Computacional , Estudios Transversales , Redes Reguladoras de Genes , Humanos , Modelos Estadísticos , Distribución Normal
3.
Ann Appl Stat ; 11(1): 41-68, 2017 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-28408966

RESUMEN

Reconstructing a gene network from high-throughput molecular data is an important but challenging task, as the number of parameters to estimate easily is much larger than the sample size. A conventional remedy is to regularize or penalize the model likelihood. In network models, this is often done locally in the neighbourhood of each node or gene. However, estimation of the many regularization parameters is often difficult and can result in large statistical uncertainties. In this paper we propose to combine local regularization with global shrinkage of the regularization parameters to borrow strength between genes and improve inference. We employ a simple Bayesian model with non-sparse, conjugate priors to facilitate the use of fast variational approximations to posteriors. We discuss empirical Bayes estimation of hyper-parameters of the priors, and propose a novel approach to rank-based posterior thresholding. Using extensive model- and data-based simulations, we demonstrate that the proposed inference strategy outperforms popular (sparse) methods, yields more stable edges, and is more reproducible. The proposed method, termed ShrinkNet, is then applied to Glioblastoma to investigate the interactions between genes associated with patient survival.

4.
Neuroimage ; 119: 305-15, 2015 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-26072253

RESUMEN

In this paper we introduce a covariance framework for the analysis of single subject EEG and MEG data that takes into account observed temporal stationarity on small time scales and trial-to-trial variations. We formulate a model for the covariance matrix, which is a Kronecker product of three components that correspond to space, time and epochs/trials, and consider maximum likelihood estimation of the unknown parameter values. An iterative algorithm that finds approximations of the maximum likelihood estimates is proposed. Our covariance model is applicable in a variety of cases where spontaneous EEG or MEG acts as source of noise and realistic noise covariance estimates are needed, such as in evoked activity studies, or where the properties of spontaneous EEG or MEG are themselves the topic of interest, like in combined EEG-fMRI experiments in which the correlation between EEG and fMRI signals is investigated. We use a simulation study to assess the performance of the estimator and investigate the influence of different assumptions about the covariance factors on the estimated covariance matrix and on its components. We apply our method to real EEG and MEG data sets.


Asunto(s)
Mapeo Encefálico/métodos , Encéfalo/fisiología , Electroencefalografía/métodos , Imagen por Resonancia Magnética/métodos , Magnetoencefalografía/métodos , Procesamiento de Señales Asistido por Computador , Adulto , Algoritmos , Ondas Encefálicas , Simulación por Computador , Femenino , Humanos , Funciones de Verosimilitud , Masculino , Reproducibilidad de los Resultados , Adulto Joven
5.
Stat Appl Genet Mol Biol ; 11(5): Article 2, 2012 Sep 25.
Artículo en Inglés | MEDLINE | ID: mdl-23023699

RESUMEN

Gene regulatory networks, in which edges between nodes describe interactions between transcription factors (TFs) and their target genes, model regulatory interactions that determine the cell-type and condition-specific expression of genes. Regression methods can be used to identify TF-target gene interactions from gene expression and DNA sequence data. The response variable, i.e. observed gene expression, is modeled as a function of many predictor variables simultaneously. In practice, it is generally not possible to select a single model that clearly achieves the best fit to the observed experimental data and the selected models typically contain overlapping sets of predictor variables. Moreover, parameters that represent the marginal effect of the individual predictors are not always present. In this paper, we use the statistical framework of estimation of variable importance to define variable importance as a parameter of interest and study two different estimators of this parameter in the context of gene regulatory networks. On yeast data we show that the resulting parameter has a biologically appealing interpretation. We apply the proposed methodology on mammalian gene expression data to gain insight into the temporal activity of TFs that underly gene expression changes in F11 cells in response to Forskolin stimulation.


Asunto(s)
Redes Reguladoras de Genes , Funciones de Verosimilitud , Perfilación de la Expresión Génica/estadística & datos numéricos , Modelos Genéticos , Probabilidad , Análisis de Regresión , Factores de Transcripción/genética , Factores de Transcripción/metabolismo
6.
Bioinformatics ; 28(2): 214-21, 2012 Jan 15.
Artículo en Inglés | MEDLINE | ID: mdl-22106333

RESUMEN

MOTIVATION: Gene regulatory networks, in which edges between nodes describe interactions between transcriptional regulators and their target genes, determine the coordinated spatiotemporal expression of genes. Especially in higher organisms, context-specific combinatorial regulation by transcription factors (TFs) is believed to determine cellular states and fates. TF-target gene interactions can be studied using high-throughput techniques such as ChIP-chip or ChIP-Seq. These experiments are time and cost intensive, and further limited by, for instance, availability of high affinity TF antibodies. Hence, there is a practical need for methods that can predict TF-TF and TF-target gene interactions in silico, i.e. from gene expression and DNA sequence data alone. We propose GEMULA, a novel approach based on linear models to predict TF-gene expression associations and TF-TF interactions from experimental data. GEMULA is based on linear models, fast and considers a wide range of biologically plausible models that describe gene expression data as a function of predicted TF binding to gene promoters. RESULTS: We show that models inferred with GEMULA are able to explain roughly 70% of the observed variation in gene expression in the yeast heat shock response. The functional relevance of the inferred TF-TF interactions in these models are validated by different sources of independent experimental evidence. We also have applied GEMULA to an in vitro model of neuronal outgrowth. Our findings confirm existing knowledge on gene regulatory interactions underlying neuronal outgrowth, but importantly also generate new insights into the temporal dynamics of this gene regulatory network that can now be addressed experimentally. AVAILABILITY: The GEMULA R-package is available from http://www.few.vu.nl/~degunst/gemula_1.0.tar.gz.


Asunto(s)
Redes Reguladoras de Genes , Modelos Genéticos , Programas Informáticos , Animales , Regulación de la Expresión Génica , Humanos , Modelos Lineales , Análisis de Secuencia por Matrices de Oligonucleótidos , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Factores de Transcripción/metabolismo
7.
Nucleic Acids Res ; 39(13): 5313-27, 2011 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-21422075

RESUMEN

All cellular processes are regulated by condition-specific and time-dependent interactions between transcription factors and their target genes. While in simple organisms, e.g. bacteria and yeast, a large amount of experimental data is available to support functional transcription regulatory interactions, in mammalian systems reconstruction of gene regulatory networks still heavily depends on the accurate prediction of transcription factor binding sites. Here, we present a new method, log-linear modeling of 3D contingency tables (LLM3D), to predict functional transcription factor binding sites. LLM3D combines gene expression data, gene ontology annotation and computationally predicted transcription factor binding sites in a single statistical analysis, and offers a methodological improvement over existing enrichment-based methods. We show that LLM3D successfully identifies novel transcriptional regulators of the yeast metabolic cycle, and correctly predicts key regulators of mouse embryonic stem cell self-renewal more accurately than existing enrichment-based methods. Moreover, in a clinically relevant in vivo injury model of mammalian neurons, LLM3D identified peroxisome proliferator-activated receptor γ (PPARγ) as a neuron-intrinsic transcriptional regulator of regenerative axon growth. In conclusion, LLM3D provides a significant improvement over existing methods in predicting functional transcription regulatory interactions in the absence of experimental transcription factor binding data.


Asunto(s)
Perfilación de la Expresión Génica , Redes Reguladoras de Genes , Factores de Transcripción/metabolismo , Animales , Sitios de Unión , Línea Celular , Células Madre Embrionarias/metabolismo , Genoma , Modelos Lineales , Ratones , Regeneración Nerviosa/genética , Neuronas/metabolismo , PPAR gamma/metabolismo , Ratas , Ratas Wistar , Levaduras/genética , Levaduras/metabolismo
8.
Eur J Neurosci ; 25(12): 3629-37, 2007 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-17610582

RESUMEN

Successful regeneration of injured neurons requires a complex molecular response that involves the expression, modification and transport of a large number of proteins. The identity of neuronal proteins responsible for the initiation of regenerative neurite outgrowth is largely unknown. Dorsal root ganglion (DRG) neurons display robust and successful regeneration following lesion of their peripheral neurite, whereas outgrowth of central neurites is weak and does not lead to functional recovery. We have utilized this differential response to gain insight in the early transcriptional events associated with successful regeneration. Surprisingly, our study shows that peripheral and central nerve crushes elicit very distinct transcriptional activation, revealing a large set of novel genes that are differentially regulated within the first 24 h after the lesion. Here we show that Ankrd1, a gene known to act as a transcriptional modulator, is involved in neurite outgrowth of a DRG neuron-derived cell line as well as in cultured adult DRG neurons. This gene, and others identified in this study, may be part of the transcriptional regulatory module that orchestrates the onset of successful regeneration.


Asunto(s)
Regulación de la Expresión Génica/fisiología , Regeneración Nerviosa/fisiología , Neuropatía Ciática/fisiopatología , Traumatismos de la Médula Espinal/fisiopatología , Factores de Transcripción/metabolismo , Animales , Células Cultivadas , Femenino , Ganglios Espinales/patología , Perfilación de la Expresión Génica/métodos , Hibridación in Situ/métodos , Masculino , Proteínas Musculares , Neuronas/metabolismo , Proteínas Nucleares , Ratas , Ratas Sprague-Dawley , Ratas Wistar , Proteínas Represoras , Neuropatía Ciática/patología , Traumatismos de la Médula Espinal/patología , Transfección
9.
Twin Res ; 7(6): 607-16, 2004 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-15607012

RESUMEN

Longitudinal height and weight data from 4649 Dutch twin pairs between birth and 2.5 years of age were analyzed. The data were first summarized into parameters of a polynomial of degree 4 by a mixed-effects procedure. Next, the variation and covariation in the parameters of the growth curve (size at one year of age, growth velocity, deceleration of growth, rate of change in deceleration [i.e., jerk] and rate of change in jerk [i.e., snap]) were decomposed into genetic and nongenetic sources. Additionally, the variation in the estimated size at birth and at 2 years of age interpolated from the polynomial was decomposed into genetic and nongenetic components. Variation in growth was best characterized by a genetic model which included additive genetic, common environmental and specific environmental influences, plus effects of gestational age. The effect of gestational age was largest for size at birth, explaining 39% of the variance. The differences between monozygotic and dizygotic twin correlations were largest for size at 1 and 2 years of age and growth velocity of weight, which suggests that these parameters are more influenced by heritability than size at birth, deceleration and jerk. The percentage of variance explained by additive genetic influences for height at 2 years of age was 52% for females and 58% for males. For weight at 2 years of age, heritability was approximately 58% for both sexes. Variation in snap height for males was also mainly influenced by additive genetic factors, while snap for females was influenced by both additive genetic and common environmental factors. The correlations for the additive genetic and common environmental factors for deceleration and snap are large, indicating that these parameters are almost entirely under control of the same additive genetic and common environmental factors. Female jerk and snap, and also female height at birth and height at 2 years of age, are mostly under control of the same additive genetic factor.


Asunto(s)
Estatura/fisiología , Peso Corporal/fisiología , Gemelos/fisiología , Adolescente , Adulto , Factores de Edad , Algoritmos , Estatura/genética , Peso Corporal/genética , Edad Gestacional , Humanos , Lactante , Recién Nacido , Estudios Longitudinales , Modelos Genéticos , Países Bajos , Gemelos/genética , Gemelos Dicigóticos/genética , Gemelos Dicigóticos/fisiología , Gemelos Monocigóticos/genética , Gemelos Monocigóticos/fisiología
10.
FASEB J ; 18(7): 848-50, 2004 May.
Artículo en Inglés | MEDLINE | ID: mdl-15033927

RESUMEN

Intermittent exposure to addictive drugs causes long-lasting changes in responsiveness to these substances due to persistent molecular and cellular alterations within the meso-corticolimbic system. In this report, we studied the expression profiles of 159 genes in the rat nucleus accumbens during morphine exposure (14 days, 10 mg/kg s.c.) and drug-abstinence (3 weeks). We used real-time quantitative PCR to monitor gene expression after establishing its sensitivity and resolution to resolve small changes in expression for genes in various abundance classes. Morphine-exposure (5 time points) and subsequent abstinence (6 time points) induced phase-specific temporal gene expression of distinct functional groups of genes, for example, short-term homeostatic responses. Opiate withdrawal appeared to be a new stimulus in terms of gene expression and mediates a marked wave of gene repression. Prolonged abstinence resulted in persistently changed expression levels of genes involved in neuronal outgrowth and re-wiring. Our findings substantiate the hypothesis that this new gene program, initiated upon morphine-withdrawal, may subserve long-term neuronal plasticity involved in the persistent behavioral consequences of repeated drug-exposure.


Asunto(s)
Regulación de la Expresión Génica/efectos de los fármacos , Dependencia de Morfina/genética , Morfina/farmacología , Proteínas del Tejido Nervioso/biosíntesis , Plasticidad Neuronal/efectos de los fármacos , Núcleo Accumbens/efectos de los fármacos , Síndrome de Abstinencia a Sustancias/genética , Adaptación Fisiológica/genética , Animales , Conducta Animal/efectos de los fármacos , Sistemas de Computación , Perfilación de la Expresión Génica , Genes Inmediatos-Precoces/efectos de los fármacos , Proteínas Inmediatas-Precoces/biosíntesis , Masculino , Dependencia de Morfina/metabolismo , Proteínas del Tejido Nervioso/genética , Neurotransmisores/biosíntesis , Neurotransmisores/genética , Núcleo Accumbens/metabolismo , Péptidos Opioides/biosíntesis , Reacción en Cadena de la Polimerasa , Ratas , Ratas Wistar , Reproducibilidad de los Resultados , Síndrome de Abstinencia a Sustancias/metabolismo , Transmisión Sináptica/efectos de los fármacos , Factores de Transcripción/biosíntesis , Factores de Transcripción/genética
11.
Stat Med ; 22(10): 1691-707, 2003 May 30.
Artículo en Inglés | MEDLINE | ID: mdl-12720305

RESUMEN

We describe a Bayesian approach to incorporate between-individual heterogeneity associated with parameters of complicated biological models. We emphasize the use of the Markov chain Monte Carlo (MCMC) method in this context and demonstrate the implementation and use of MCMC by analysis of simulated overdispersed Poisson counts and by analysis of an experimental data set on preneoplastic liver lesions (their number and sizes) in the presence of heterogeneity. These examples show that MCMC-based estimates, derived from the posterior distribution with uniform priors, may agree well with maximum likelihood estimates (if available). However, with heterogeneous parameters, maximum likelihood estimates can be difficult to obtain, involving many integrations. In this case, the MCMC method offers substantial computational advantages.


Asunto(s)
Neoplasias Hepáticas/patología , Cadenas de Markov , Modelos Biológicos , Método de Montecarlo , Animales , Teorema de Bayes , Nitrosaminas , Distribución de Poisson , Lesiones Precancerosas , Ratas , Procesos Estocásticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...