Your browser doesn't support javascript.
loading
Integration of datasets for individual prediction of DNA methylation-based biomarkers.
Merzbacher, Charlotte; Ryan, Barry; Goldsborough, Thibaut; Hillary, Robert F; Campbell, Archie; Murphy, Lee; McIntosh, Andrew M; Liewald, David; Harris, Sarah E; McRae, Allan F; Cox, Simon R; Cannings, Timothy I; Vallejos, Catalina A; McCartney, Daniel L; Marioni, Riccardo E.
Affiliation
  • Merzbacher C; School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, UK.
  • Ryan B; School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, UK.
  • Goldsborough T; School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, UK.
  • Hillary RF; Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU, UK.
  • Campbell A; Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU, UK.
  • Murphy L; Edinburgh Clinical Research Facility, University of Edinburgh, Edinburgh, EH4 2XU, UK.
  • McIntosh AM; Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU, UK.
  • Liewald D; Division of Psychiatry, Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, UK.
  • Harris SE; Department of Psychology, Lothian Birth Cohorts, University of Edinburgh, Edinburgh, EH8 9JZ, UK.
  • McRae AF; Department of Psychology, Lothian Birth Cohorts, University of Edinburgh, Edinburgh, EH8 9JZ, UK.
  • Cox SR; Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia.
  • Cannings TI; Department of Psychology, Lothian Birth Cohorts, University of Edinburgh, Edinburgh, EH8 9JZ, UK.
  • Vallejos CA; Maxwell Institute for Mathematical Sciences, School of Mathematics, University of Edinburgh, Edinburgh, EH9 3FD, UK.
  • McCartney DL; MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU, UK.
  • Marioni RE; The Alan Turing Institute, London, UK.
Genome Biol ; 24(1): 278, 2023 Dec 05.
Article in En | MEDLINE | ID: mdl-38053194
BACKGROUND: Epigenetic scores (EpiScores) can provide biomarkers of lifestyle and disease risk. Projecting new datasets onto a reference panel is challenging due to separation of technical and biological variation with array data. Normalisation can standardise data distributions but may also remove population-level biological variation. RESULTS: We compare two birth cohorts (Lothian Birth Cohorts of 1921 and 1936 - nLBC1921 = 387 and nLBC1936 = 498) with blood-based DNA methylation assessed at the same chronological age (79 years) and processed in the same lab but in different years and experimental batches. We examine the effect of 16 normalisation methods on a novel BMI EpiScore (trained in an external cohort, n = 18,413), and Horvath's pan-tissue DNA methylation age, when the cohorts are normalised separately and together. The BMI EpiScore explains a maximum variance of R2=24.5% in BMI in LBC1936 (SWAN normalisation). Although there are cross-cohort R2 differences, the normalisation method makes a minimal difference to within-cohort estimates. Conversely, a range of absolute differences are seen for individual-level EpiScore estimates for BMI and age when cohorts are normalised separately versus together. While within-array methods result in identical EpiScores whether a cohort is normalised on its own or together with the second dataset, a range of differences is observed for between-array methods. CONCLUSIONS: Normalisation methods returning similar EpiScores, whether cohorts are analysed separately or together, will minimise technical variation when projecting new data onto a reference panel. These methods are important for cases where raw data is unavailable and joint normalisation of cohorts is computationally expensive.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: DNA Methylation / Epigenomics Limits: Aged / Humans Language: En Journal: Genome Biol Journal subject: BIOLOGIA MOLECULAR / GENETICA Year: 2023 Document type: Article Country of publication: Reino Unido

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: DNA Methylation / Epigenomics Limits: Aged / Humans Language: En Journal: Genome Biol Journal subject: BIOLOGIA MOLECULAR / GENETICA Year: 2023 Document type: Article Country of publication: Reino Unido