Bigmelon: tools for analysing large DNA methylation datasets.
Gorrie-Stone, Tyler J; Smart, Melissa C; Saffari, Ayden; Malki, Karim; Hannon, Eilis; Burrage, Joe; Mill, Jonathan; Kumari, Meena; Schalkwyk, Leonard C.
Affiliation
  • Gorrie-Stone TJ; School of Biological Sciences, University of Essex, Colchester, UK.
  • Smart MC; Institute for Social and Economic Research, University of Essex, Colchester, UK.
  • Saffari A; Department of Psychological Sciences, Birkbeck, University of London, London, UK.
  • Malki K; Department of Non-Communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK.
  • Hannon E; MRC Unit, The Gambia and MRC International Nutrition Group, London School of Hygiene and Tropical Medicine, London, UK.
  • Burrage J; Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
  • Mill J; University of Exeter Medical School, University of Exeter, Exeter, UK.
  • Kumari M; University of Exeter Medical School, University of Exeter, Exeter, UK.
  • Schalkwyk LC; University of Exeter Medical School, University of Exeter, Exeter, UK.
Bioinformatics; 35(6): 981-986, 2019 03 15.
Article in En | MEDLINE | ID: mdl-30875430
ABSTRACT
MOTIVATION: The datasets generated by DNA methylation analyses are getting bigger. With the release of the HumanMethylationEPIC micro-array and datasets containing thousands of samples, analyses of these large datasets using R are becoming impractical due to large memory requirements. As a result, there is an increasing need for computationally efficient methodologies to perform meaningful analysis on high-dimensional data.

RESULTS: Here we introduce the bigmelon R package, which provides a memory-efficient workflow that enables users to perform the complex, large-scale analyses required in epigenome-wide association studies (EWAS) without the need for large RAM. Building on top of the CoreArray Genomic Data Structure file format and libraries packaged in the gdsfmt package, we provide a practical workflow that facilitates the reading-in, preprocessing, quality control and statistical analysis of DNA methylation data. We demonstrate the capabilities of the bigmelon package using a large dataset consisting of 1193 human blood samples from the Understanding Society UK Household Longitudinal Study, assayed on the EPIC micro-array platform.

AVAILABILITY AND IMPLEMENTATION: The bigmelon package is available on Bioconductor (http://bioconductor.org/packages/bigmelon/). The Understanding Society dataset is available at https://www.understandingsociety.ac.uk/about/health/data upon request.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Software / DNA Methylation Type of study: Observational_studies / Risk_factors_studies Limits: Humans Language: En Journal: Bioinformatics Journal subject: MEDICAL INFORMATICS Year: 2019 Type: Article Affiliation country: United Kingdom
