Your browser doesn't support javascript.
loading
A hidden markov model for identifying differentially methylated sites in bisulfite sequencing data.
Shokoohi, Farhad; Stephens, David A; Bourque, Guillaume; Pastinen, Tomi; Greenwood, Celia M T; Labbe, Aurélie.
Afiliação
  • Shokoohi F; Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Quebec, Canada.
  • Stephens DA; Department of Mathematics and Statistics, McGill University, Montreal, Quebec, Canada.
  • Bourque G; Lady Davis Institute for Medical Research, Montreal, Quebec, Canada.
  • Pastinen T; Department of Mathematics and Statistics, McGill University, Montreal, Quebec, Canada.
  • Greenwood CMT; Department of Human Genetics, McGill University, Montreal, Quebec, Canada.
  • Labbe A; Department of Human Genetics, McGill University, Montreal, Quebec, Canada.
Biometrics ; 75(1): 210-221, 2019 03.
Article em En | MEDLINE | ID: mdl-30168593
DNA methylation studies have enabled researchers to understand methylation patterns and their regulatory roles in biological processes and disease. However, only a limited number of statistical approaches have been developed to provide formal quantitative analysis. Specifically, a few available methods do identify differentially methylated CpG (DMC) sites or regions (DMR), but they suffer from limitations that arise mostly due to challenges inherent in bisulfite sequencing data. These challenges include: (1) that read-depths vary considerably among genomic positions and are often low; (2) both methylation and autocorrelation patterns change as regions change; and (3) CpG sites are distributed unevenly. Furthermore, there are several methodological limitations: almost none of these tools is capable of comparing multiple groups and/or working with missing values, and only a few allow continuous or multiple covariates. The last of these is of great interest among researchers, as the goal is often to find which regions of the genome are associated with several exposures and traits. To tackle these issues, we have developed an efficient DMC identification method based on Hidden Markov Models (HMMs) called "DMCHMM" which is a three-step approach (model selection, prediction, testing) aiming to address the aforementioned drawbacks. Our proposed method is different from other HMM methods since it profiles methylation of each sample separately, hence exploiting inter-CpG autocorrelation within samples, and it is more flexible than previous approaches by allowing multiple hidden states. Using simulations, we show that DMCHMM has the best performance among several competing methods. An analysis of cell-separated blood methylation profiles is also provided.
Assuntos
Palavras-chave

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Sulfitos / Cadeias de Markov / Ilhas de CpG / Metilação de DNA Tipo de estudo: Health_economic_evaluation / Prognostic_studies Limite: Animals / Humans Idioma: En Revista: Biometrics Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Canadá

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Sulfitos / Cadeias de Markov / Ilhas de CpG / Metilação de DNA Tipo de estudo: Health_economic_evaluation / Prognostic_studies Limite: Animals / Humans Idioma: En Revista: Biometrics Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Canadá