RESUMEN
BACKGROUND: Environmental metal exposures have been associated with multiple deleterious health endpoints. DNA methylation (DNAm) may provide insight into the mechanisms underlying these relationships. Toenail metals are non-invasive biomarkers, reflecting a medium-term time exposure window. OBJECTIVES: This study examined variation in leukocyte DNAm and toenail arsenic (As), cadmium (Cd), lead (Pb), manganese (Mn), and mercury (Hg) among elderly men in the Normative Aging Study, a longitudinal cohort. METHODS: We repeatedly collected samples of blood and toenail clippings. We measured DNAm in leukocytes with the Illumina HumanMethylation450 K BeadChip. We first performed median regression to evaluate the effects of each individual toenail metal on DNAm at three levels: individual cytosine-phosphate-guanine (CpG) sites, regions, and pathways. Then, we applied a Bayesian kernel machine regression (BKMR) to assess the joint and individual effects of metal mixtures on DNAm. Significant CpGs were identified using a multiple testing correction based on the independent degrees of freedom approach for correlated outcomes. The approach considers the effective degrees of freedom in the DNAm data using the principal components that explain >95% variation of the data. RESULTS: We included 564 subjects (754 visits) between 1999 and 2013. The numbers of significantly differentially methylated CpG sites, regions, and pathways varied by metals. For example, we found six significant pathways for As, three for Cd, and one for Mn. The As-associated pathways were associated with cancer (e.g., skin cancer) and cardiovascular disease, whereas the Cd-associated pathways were related to lung cancer. Metal mixtures were also associated with 47 significant CpG sites, as well as pathways, mainly related to cancer and cardiovascular disease. CONCLUSIONS: This study provides an approach to understanding the potential epigenetic mechanisms underlying observed relations between toenail metals and adverse health endpoints.
Asunto(s)
Arsénico , Enfermedades Cardiovasculares , Mercurio , Masculino , Humanos , Anciano , Metilación de ADN , Cadmio , Epigenoma , Uñas , Teorema de Bayes , Metales/toxicidad , Envejecimiento , Arsénico/toxicidad , Leucocitos , ManganesoRESUMEN
BACKGROUND: Epigenome-wide association studies of ambient fine particulate matter (PM2.5) have been reported. However, few have examined PM2.5 components (PMCs) and sources or included repeated measures. The lack of high-resolution exposure measurements is the key limitation. We hypothesized that significant changes in DNA methylation might vary by PMCs and the sources. METHODS: We predicted the annual average of 14 PMCs using novel high-resolution exposure models across the contiguous U.S., between 2000-2018. The resolution was 50 m × 50 m in the Greater Boston Area. We also identified PM2.5 sources using positive matrix factorization. We repeatedly collected blood samples and measured leukocyte DNAm with the Illumina HumanMethylation450K BeadChip in the Normative Aging Study. We then used median regression with subject-specific intercepts to estimate the associations between long-term (one-year) exposure to PMCs / PM2.5 sources and DNA methylation at individual cytosine-phosphate-guanine CpG sites. Significant probes were identified by the number of independent degrees of freedom approach, using the number of principal components explaining > 95% of the variation of the DNA methylation data. We also performed regional and pathway analyses to identify significant regions and pathways. RESULTS: We included 669 men with 1,178 visits between 2000-2013. The subjects had a mean age of 75 years. The identified probes, regions, and pathways varied by PMCs and their sources. For example, iron was associated with 6 probes and 6 regions, whereas nitrate was associated with 15 probes and 3 regions. The identified pathways from biomass burning, coal burning, and heavy fuel oil combustion sources were associated with cancer, inflammation, and cardiovascular diseases, whereas there were no pathways associated with all traffic. CONCLUSIONS: Our findings showed that the effects of PM2.5 on DNAm varied by its PMCs and sources.
Asunto(s)
Contaminantes Atmosféricos , Contaminación del Aire , Masculino , Humanos , Anciano , Metilación de ADN , Contaminantes Atmosféricos/efectos adversos , Contaminantes Atmosféricos/análisis , Epigenoma , Material Particulado/efectos adversos , Material Particulado/análisis , Polvo/análisis , Envejecimiento/genética , Carbón Mineral , Contaminación del Aire/efectos adversos , Contaminación del Aire/análisisRESUMEN
Maternal anxiety during pregnancy is associated with adverse foetal, neonatal, and child outcomes, but biological mechanisms remain unclear. Altered foetal DNA methylation (DNAm) has been proposed as a potential underlying mechanism. In the current study, we performed a meta-analysis to examine the associations between maternal anxiety, measured prospectively during pregnancy, and genome-wide DNAm from umbilical cord blood. Sixteen non-overlapping cohorts from 12 independent longitudinal studies of the Pregnancy And Childhood Epigenetics Consortium participated, resulting in a combined dataset of 7243 mother-child dyads. We examined prenatal anxiety in relation to genome-wide DNAm and differentially methylated regions. We observed no association between the general symptoms of anxiety during pregnancy or pregnancy-related anxiety, and DNAm at any of the CpG sites, after multiple-testing correction. Furthermore, we identify no differentially methylated regions associated with maternal anxiety. At the cohort-level, of the 21 associations observed in individual cohorts, none replicated consistently in the other cohorts. In conclusion, contrary to some previous studies proposing cord blood DNAm as a promising potential mechanism explaining the link between maternal anxiety during pregnancy and adverse outcomes in offspring, we found no consistent evidence for any robust associations between maternal anxiety and DNAm in cord blood. Larger studies and analysis of DNAm in other tissues may be needed to establish subtle or subgroup-specific associations between maternal anxiety and the foetal epigenome.
Asunto(s)
Metilación de ADN , Epigenoma , Ansiedad/genética , Metilación de ADN/genética , Epigénesis Genética/genética , Epigenómica , Femenino , Humanos , EmbarazoRESUMEN
AIMS/HYPOTHESIS: Development of type 2 diabetes depends on environmental and genetic factors. We investigated the epigenome-wide association of prevalent diabetes with DNA methylation (DNAm) in peripheral blood. METHODS: DNAm was measured in whole blood with the Illumina Infinium HumanMethylation450 BeadChip in two subsamples of participants from the ESTHER cohort study. Cohort 1 included 988 participants, who were consecutively recruited between July and October 2000 and cohort 2 included 527 randomly selected participants. The association of DNAm with prevalent type 2 diabetes at recruitment was estimated using median regression analysis adjusting for sex, age, BMI, smoking behaviour, cell composition and batch at 361,922 CpG sites. RESULTS: Type 2 diabetes was prevalent in 16% of the participants, and diabetes was poorly controlled in 45% of the diabetic patients. In cohort 1 (discovery) DNAm at 39 CpGs was significantly associated with prevalent diabetes after correction for multiple testing. In cohort 2 (replication) at one of these CpGs, DNAm was still significantly associated. Decreasing methylation levels at cg19693031 with increasing fasting glucose and HbA1c concentrations were observed using restricted cubic spline analysis. In diabetic patients with poorly controlled diabetes, the decrease in estimated DNAm levels was approximately 5% in comparison with participants free of diagnosed diabetes. CONCLUSIONS/INTERPRETATION: Cg19693031, which is located within the 3'-untranslated region of TXNIP, might play a role in the pathophysiology of type 2 diabetes. This result appears biologically plausible given that thioredoxin-interacting protein is overexpressed in diabetic animals and humans and 3'-untranslated regions are known to play a regulatory role in gene expression.
Asunto(s)
Metilación de ADN , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Epigénesis Genética , Leucocitos/citología , Regiones no Traducidas 3' , Anciano , Glucemia/análisis , Índice de Masa Corporal , Proteínas Portadoras/metabolismo , Estudios de Cohortes , Islas de CpG , Estudios Transversales , Femenino , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Persona de Mediana Edad , Fumar , Encuestas y CuestionariosRESUMEN
Long-lasting oxidative stress exposure may lead to relatively stable epigenetic modifications of the DNA in order to activate anti-oxidative defence mechanisms. Oxidative stress related DNA methylation may therefore be associated (causally or as a by-product) with cancer. We measured derivatives of reactive oxygen metabolites (D-ROM), total thiol levels (TTL) and DNA methylation with the Illumina Infinium 450K BeadChip in three samples of German individuals aged ≥50 years: n = 1,000 ESTHER study baseline participants (DNA methylation only), n = 99 ESTHER eight-year follow-up participants and n = 142 participants of the BLITZ study. The correlation coefficient of methylation at cg10342304 and D-ROM in the ESTHER 8-year follow-up sample (r = -0.427; P = 1 × 10(-5)) was replicated with a P-value indicating statistical significance after correction for multiple testing in the BLITZ sample (r = -0.192; P = 0.022). The association was robust to adjusting for potential confounders. In the ESTHER baseline sample, the hazard ratio for cancer development in 11 years of follow-up comparing bottom and top quartile of DNA methylation at cg10342304 was 1.86 (95%-confidence-interval 1.01-3.43). In summary, this first epigenome-wide screening and replication study with oxidative status markers observed a negative correlation of D-ROM levels and DNA methylation at cg10342304 in two independent cohorts. This CpG site is located in the body region of the nucleoredoxin gene. The nucleoredoxin protein is a redox-dependent inhibitor of the Wnt/ß-catenin signaling pathway, a well-characterized cancer pathway. If the observed CpG-cancer association can be successfully replicated by other studies, this epigenetic marker could be an interesting biomarker of cancer risk.
Asunto(s)
Biomarcadores de Tumor/sangre , Epigénesis Genética , Neoplasias/sangre , Neoplasias/genética , Oxidación-Reducción , Anciano , Anciano de 80 o más Años , Estudios de Cohortes , Islas de CpG , Metilación de ADN , Replicación del ADN , Femenino , Estudio de Asociación del Genoma Completo , Alemania , Humanos , Masculino , Persona de Mediana Edad , Especies Reactivas de Oxígeno/sangreRESUMEN
BACKGROUND: Several epigenome-wide association studies (EWAS) of ambient particulate matter with aerodynamic diameter ≤ 2.5 µm (PM2.5) have been reported. However, EWAS of PM2.5 elements (PEs), reflecting different emission sources, are very limited. OBJECTIVES: We performed EWAS of short- and intermediate-term exposure to PM2.5 and 13 PEs. We hypothesized that significant changes in DNAm may vary by PM2.5 mass and its elements. METHODS: We repeatedly collected blood samples in the Normative Aging Study and measured leukocyte DNA methylation (DNAm) with the Illumina HumanMethylation450K BeadChip. We collected daily PM2.5 and 13 PEs at a fixed central site. To estimate the associations between each PE and DNAm at individual cytosine-phosphate-guanine (CpG) sites, we incorporated a distributed-lag (0-27 d) term in the setting of median regression with subject-specific intercept and examined cumulative lag associations. We also accounted for selection bias due to loss to follow-up and mortality prior to enrollment. Significantly differentially methylated probes (DMPs) were identified using Bonferroni correction for multiple testing. We further conducted regional and pathway analyses to identify significantly differentially methylated regions (DMRs) and pathways. RESULTS: We included 695 men with 1,266 visits between 1999 and 2013. The subjects had a mean age of 75 years. The significant DMPs, DMRs, and pathways varied by to PM2.5 total mass and PEs. For example, PM2.5 total mass was associated with 2,717 DMPs and 10,470 DMRs whereas Pb was associated with 3,173 DMPs and 637 DMRs. The identified pathways by PM2.5 mass were mostly involved in mood disorders, neuroplasticity, immunity, and inflammation, whereas the pathways associated with motor vehicles (BC, Cu, Pb, and Zn) were related with cardiovascular disease and cancer (e.g., "PPARs signaling"). CONCLUSIONS: PM2.5 and PE were associated with methylation changes at multiple probes and along multiple pathways, in ways that varied by particle components.
Asunto(s)
Contaminantes Atmosféricos , Metilación de ADN , Anciano , Envejecimiento , Contaminantes Atmosféricos/análisis , Contaminantes Atmosféricos/toxicidad , Epigenoma , Humanos , Leucocitos , Masculino , Material Particulado/análisis , Material Particulado/toxicidadRESUMEN
Saliva is a widely used biological sample, especially in pediatric research, containing a heterogenous mixture of immune and epithelial cells. Associations of exposure or disease with saliva DNA methylation can be influenced by cell-type proportions. Here, we developed a saliva cell-type DNA methylation reference panel to estimate interindividual cell-type heterogeneity in whole saliva studies. Saliva was collected from 22 children (7-16 years) and sorted into immune and epithelial cells, using size exclusion filtration and magnetic bead sorting. DNA methylation was measured using the Illumina MethylationEPIC BeadChip. We assessed cell-type differences in DNA methylation profiles and tested for enriched biological pathways. Immune and epithelial cells differed at 181,577 (22.8%) DNA methylation sites (t-test p < 6.28 × 10-8). Immune cell hypomethylated sites are mapped to genes enriched for immune pathways (p < 3.2 × 10-5). Epithelial cell hypomethylated sites were enriched for cornification (p = 5.2 × 10-4), a key process for hard palette formation. Saliva immune and epithelial cells have distinct DNA methylation profiles which can drive whole-saliva DNA methylation measures. A primary saliva DNA methylation reference panel, easily implemented with an R package, will allow estimates of cell proportions from whole saliva samples and improve epigenetic epidemiology studies by accounting for measurement heterogeneity by cell-type proportions.
Asunto(s)
Metilación de ADN , Saliva , Niño , Islas de CpG , Estudios Epidemiológicos , Epigénesis Genética , Epigenómica , HumanosRESUMEN
BACKGROUND: Both parental and neighbourhood socio-economic status (SES) are linked to poorer health independently of personal SES measures, but the biological mechanisms are unclear. Our objective was to examine these influences via epigenetic age acceleration (EAA)-the discrepancy between chronological and epigenetic ages. METHODS: We examined three USA-based [Coronary Artery Risk Disease in Adults (CARDIA) study, Fragile Families and Child Wellbeing Study (FFCWS) and Programming Research in Obesity, Growth, Environment and Social Stressors (PROGRESS)] and one Mexico-based (Project Viva) cohort. DNA methylation was measured using Illumina arrays, personal/parental SES by questionnaire and neighbourhood disadvantage from geocoded address. In CARDIA, we examined the most strongly associated personal, parental and neighbourhood SES measures with EAA (Hannum's method) at study years 15 and 20 separately and combined using a generalized estimating equation (GEE) and compared with other EAA measures (Horvath's EAA, PhenoAge and GrimAge calculators, and DunedinPoAm). RESULTS: EAA was associated with paternal education in CARDIA [GEEs: ßsome college = -1.01 years (-1.91, -0.11) and ßAsunto(s)
Envejecimiento
, Metilación de ADN
, Adolescente
, Adulto
, Envejecimiento/genética
, Niño
, Estudios de Cohortes
, Escolaridad
, Epigénesis Genética
, Femenino
, Humanos
, Masculino
, México/epidemiología
RESUMEN
Altered maternal haemoglobin levels during pregnancy are associated with pre-clinical and clinical conditions affecting the fetus. Evidence from animal models suggests that these associations may be partially explained by differential DNA methylation in the newborn with possible long-term consequences. To test this in humans, we meta-analyzed the epigenome-wide associations of maternal haemoglobin levels during pregnancy with offspring DNA methylation in 3,967 newborn cord blood and 1,534 children and 1,962 adolescent whole-blood samples derived from 10 cohorts. DNA methylation was measured using Illumina Infinium Methylation 450K or MethylationEPIC arrays covering 450,000 and 850,000 methylation sites, respectively. There was no statistical support for the association of maternal haemoglobin levels with offspring DNA methylation either at individual methylation sites or clustered in regions. For most participants, maternal haemoglobin levels were within the normal range in the current study, whereas adverse perinatal outcomes often arise at the extremes. Thus, this study does not rule out the possibility that associations with offspring DNA methylation might be seen in studies with more extreme maternal haemoglobin levels.
Asunto(s)
Metilación de ADN , Epigénesis Genética , Adolescente , Niño , Preescolar , Epigenoma , Epigenómica , Femenino , Sangre Fetal/metabolismo , Hemoglobinas/genética , Hemoglobinas/metabolismo , Humanos , Recién Nacido , EmbarazoRESUMEN
BACKGROUND: Sleep is important for healthy functioning in children. Numerous genetic and environmental factors, from conception onwards, may influence this phenotype. Epigenetic mechanisms such as DNA methylation have been proposed to underlie variation in sleep or may be an early-life marker of sleep disturbances. We examined if DNA methylation at birth or in school age is associated with parent-reported and actigraphy-estimated sleep outcomes in children. METHODS: We meta-analysed epigenome-wide association study results. DNA methylation was measured from cord blood at birth in 11 cohorts and from peripheral blood in children (4-13 years) in 8 cohorts. Outcomes included parent-reported sleep duration, sleep initiation and fragmentation problems, and actigraphy-estimated sleep duration, sleep onset latency and wake-after-sleep-onset duration. RESULTS: We found no associations between DNA methylation at birth and parent-reported sleep duration (n = 3658), initiation problems (n = 2504), or fragmentation (n = 1681) (p values above cut-off 4.0 × 10-8). Lower methylation at cg24815001 and cg02753354 at birth was associated with longer actigraphy-estimated sleep duration (p = 3.31 × 10-8, n = 577) and sleep onset latency (p = 8.8 × 10-9, n = 580), respectively. DNA methylation in childhood was not cross-sectionally associated with any sleep outcomes (n = 716-2539). CONCLUSION: DNA methylation, at birth or in childhood, was not associated with parent-reported sleep. Associations observed with objectively measured sleep outcomes could be studied further if additional data sets become available.
Asunto(s)
Metilación de ADN , Trastornos del Sueño-Vigilia , Epigénesis Genética , Epigenoma , Humanos , Sueño/genética , Trastornos del Sueño-Vigilia/genéticaRESUMEN
BACKGROUND: DNA methylation (DNAm) may play a role in age-related outcomes. It is not yet known which DNAm-based biomarkers of age acceleration (BoAA) has the strongest association with age-related endpoints. METHODS: We collected the blood samples from two independent cohorts: the Normative Ageing Study, and the Cooperative Health Research in the Region of Augsburg cohort. We measured epigenome-wide DNAm level, and generated five DNAm BoAA at baseline. We used Cox proportional hazards model to analyze the relationships between BoAA and all-cause death. We applied the Fine and Gray competing risk model to estimate the risk of BoAA on myocardial infarction (MI), stroke, and cancer, accounting for death of other reasons as the competing risks. We used random-effects meta-analyses to pool the individual results, with adjustment for multiple testing. FINDINGS: The mean chronological ages in the two cohorts were 74, and 61, respectively. Baseline GrimAgeAccel, and DNAm-related mortality risk score (DNAmRS) both had strong associations with all-cause death, MI, and stroke, independent from chronological age. For example, a one standard deviation (SD) increment in GrimAgeAccel was significantly associated with increased risk of all-cause death [hazard ratio (HR): 2.01; 95% confidence interval (CI), 1.15, 3.50], higher risk of MI (HR: 1.44; 95% CI, 1.16, 1.79), and elevated risk of stroke (HR: 1.42; 95% CI, 1.06, 1.91). There were no associations between any BoAA and cancer. INTERPRETATION: From the public health perspective, GrimAgeAccel is the most useful tool for identifying at-risk elderly, and evaluating the efficacy of anti-aging interventions. FUNDING: National Institute of Environmental Health Sciences of U.S., Harvard Chan-NIEHS Center for Environmental Health, German Federal Ministry of Education and Research, and the State of Bavaria in Germany.
Asunto(s)
Envejecimiento/genética , Biomarcadores , Causas de Muerte , Metilación de ADN , Epigénesis Genética , Adulto , Anciano , Anciano de 80 o más Años , Femenino , Humanos , Incidencia , Masculino , Persona de Mediana Edad , Infarto del Miocardio/epidemiología , Infarto del Miocardio/genética , Infarto del Miocardio/mortalidad , Neoplasias/epidemiología , Neoplasias/genética , Neoplasias/mortalidad , Modelos de Riesgos Proporcionales , Vigilancia en Salud Pública , Accidente Cerebrovascular/epidemiología , Accidente Cerebrovascular/genética , Accidente Cerebrovascular/mortalidadRESUMEN
DNA methylation microarrays have been the platform of choice for epigenome-wide association studies in epidemiology, but declining costs have rendered targeted bisulphite sequencing a feasible alternative. Nonetheless, the literature for researchers seeking guidance on which platform to choose is sparse. To fill this gap, we conducted a comparison study in which we processed cord blood samples from four newborns in duplicates using both the Illumina HumanMethylationEPIC BeadChip and the Illumina TruSeq Methyl Capture EPIC Kit, and evaluated both platforms in regard to coverage, reproducibility, and identification of differential methylation. We conclude that with current analytic goals microarrays still outperform bisulphite sequencing for precise quantification of DNA methylation.
Asunto(s)
Metilación de ADN , Epigenómica/métodos , Estudio de Asociación del Genoma Completo/métodos , Análisis de Secuencia de ADN/métodos , Adulto , Estudios de Cohortes , Epigenoma , Epigenómica/normas , Femenino , Estudio de Asociación del Genoma Completo/normas , Humanos , Recién Nacido , Sensibilidad y Especificidad , Análisis de Secuencia de ADN/normasRESUMEN
The effects of prenatal lead exposure on child development include impaired growth and cognitive function. DNA methylation might be involved in the underlying mechanisms and previous epigenome-wide association studies reported associations between lead exposure during pregnancy and cord blood methylation levels. However, it is unclear during which developmental stage lead exposure is most harmful. Cord blood methylation levels were assayed in 420 children from a Mexican pre-birth cohort using the Illumina Infinium MethylationEPIC microarray. Lead concentrations were measured in umbilical cord blood as well as in blood samples from the mothers collected at 2nd and 3rd trimester and delivery using inductively coupled plasma-mass spectrometry. In addition, maternal bone lead levels were measured in tibia and patella using X-ray fluorescence. Comprehensive quality control and preprocessing of microarray data was followed by an unbiased restriction to methylation sites with substantial variance. Methylation levels at 202 111 cytosine-phosphate-guanine sites were regressed on each exposure adjusting for child sex, leukocyte composition, batch variables, gestational age, birthweight-for-gestational-age, maternal age, maternal education and mode of delivery. We find no association between prenatal lead exposure and cord blood methylation. This null result is strengthened by a sensitivity analysis showing that in the same dataset known biomarkers for birthweight-for-gestational-age can be recovered and the fact that phenotypic associations with lead exposure have been described in the same cohort.
RESUMEN
Elderly individuals who are never smokers but have the same height and chronological age can have substantial differences in lung function. The underlying biological mechanisms are unclear. To evaluate the associations of different biomarkers of aging (BoA) and lung function, we performed a repeated-measures analysis in the Normative Aging Study using linear mixed-effect models. We generated GrimAgeAccel, PhenoAgeAccel, extrinsic and intrinsic epigenetic age acceleration using a publically available online calculator. We calculated Zhang's DNAmRiskScore based on 10 CpGs. We measured telomere length (TL) and mitochondrial DNA copy number (mtDNA-CN) using quantitative real-time polymerase chain reaction. A pulmonary function test was performed measuring forced expiratory volume in 1 second / forced vital capacity (FEV1/FVC), FEV1, and maximum mid-expiratory flow (MMEF). Epigenetic-based BoA were associated with lower lung function. For example, a one-year increase in GrimAgeAccel was associated with a 13.64 mL [95% confidence interval (CI), 5.11 to 22.16] decline in FEV1; a 0.2 increase in Zhang's DNAmRiskScore was associated with a 0.009 L/s (0.005 to 0.013) reduction in MMEF. No association was found between TL/mtDNA-CN and lung function. Overall, this paper shows that epigenetics might be a potential mechanism underlying pulmonary dysfunction in the elderly.
Asunto(s)
Envejecimiento/genética , Epigénesis Genética/fisiología , Pulmón/fisiología , Modelos Genéticos , Anciano , Anciano de 80 o más Años , Biomarcadores/análisis , ADN Mitocondrial/genética , Femenino , Volumen Espiratorio Forzado/genética , Dosificación de Gen , Humanos , Modelos Lineales , Masculino , Flujo Espiratorio Medio Máximo/genética , Persona de Mediana Edad , Homeostasis del Telómero/fisiología , Capacidad Vital/genéticaRESUMEN
BACKGROUND: DNA methylation microarrays are popular for epigenome-wide association studies (EWAS), but spurious values complicate downstream analysis and threaten replication. Conventional cut-offs for detection p values for filtering out undetected probes were demonstrated in a single previous study as insufficient leading to many apparent methylation calls in samples from females in probes targeting the Y-chromosome. We present an alternative approach to calculate more accurate detection p values utilizing non-specific background fluorescence. We evaluate and compare our proposed approach of filtering observations with conventional ones by assessing the detection of Y-chromosome probes among males and females in 2755 samples from 17 studies on the 450K microarray and masking of large outliers between technical replicates and their impact downstream via an EWAS reanalysis. RESULTS: In contrast to conventional approaches, ours marks most Y-chromosome probes in females as undetected while removing a median of only 0.14% of the data per sample, catches more (30% vs. 6%) of large outliers (more than 20 percentage point difference between technical replicates), and helps to identify strong associations previously obfuscated by outliers between whole blood DNA methylation and chronological age in a well-powered EWAS (n = 729). CONCLUSIONS: We provide guidance for filtering both 450K and EPIC microarrays as an essential preprocessing step to reduce spurious values. An implementation (including a function compatible with objects from the popular minfi package) was added to ewastools, an R package for comprehensive quality control of DNA methylation microarrays. Scripts to reproduce all analyses are available at doi.org/10.5281/zenodo.1443561 .
Asunto(s)
Cromosomas Humanos Y/genética , Metilación de ADN , Epigenómica/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Islas de CpG , Epigénesis Genética , Femenino , Humanos , Masculino , Análisis de Secuencia de ADNRESUMEN
Background: Mislabeled, contaminated or poorly performing samples can threaten power in methylation microarray analyses or even result in spurious associations. We describe a set of quality checks for the popular Illumina 450K and EPIC microarrays to identify problematic samples and demonstrate their application in publicly available datasets. Methods: Quality checks implemented here include 17 control metrics defined by the manufacturer, a sex check to detect mislabeled sex-discordant samples, and both an identity check for fingerprinting sample donors and a measure of sample contamination based on probes querying high-frequency SNPs. These checks were tested on 80 datasets comprising 8327 samples run on the 450K microarray from the GEO repository. Results: Nine hundred forty samples were flagged by at least one control metric and 133 samples from 20 datasets were assigned the wrong sex. In a dataset in which a subset of samples appear contaminated with a single source of DNA, we demonstrate that our measure based on outliers among SNP probes was strongly correlated (> 0.95) with another independent measure of contamination. Conclusions: A more complete examination of samples that may be mislabeled, contaminated, or have poor performance due to technical problems will improve downstream analyses and replication of findings. We demonstrate that quality control problems are prevalent in a public repository of DNA methylation data. We advocate for a more thorough quality control workflow in epigenome-wide association studies and provide a software package to perform the checks described in this work. Reproducible code and supplementary material are available at 10.5281/zenodo.1172730.
Asunto(s)
Metilación de ADN , Bases de Datos Genéticas/normas , Análisis de Secuencia por Matrices de Oligonucleótidos/normas , Islas de CpG , Epigenómica/normas , Genoma Humano , Humanos , Polimorfismo de Nucleótido Simple , Control de Calidad , Programas InformáticosRESUMEN
AIM: One concern in epigenome-wide studies investigating leukocyte DNA methylation is that observed associations may at least partly reflect differences in leukocyte composition (LC) rather than changes in methylation. We estimated the magnitude of confounding by LC for common risk factors and diseases. MATERIALS & METHODS: Variation of LC according to sex, race, age, smoking, alcohol consumption, BMI, cardiovascular fitness, hypertension, coronary heart disease and diabetes was analyzed using blood differentials from 4117 participants of NHANES. Furthermore, leukocyte DNA methylation levels of biomarkers of smoking, BMI, diabetes, age and sex were regressed on these outcomes in a sample of 989 participants of ESTHER, and regression coefficients with and without adjustment for estimated LC were compared. RESULTS: Aside from race and ages below 25 years, none of the investigated factors had substantial impact on LC. Adjusted and unadjusted coefficients were virtually identical. CONCLUSION: Confounding by LC might often be a minor issue.
Asunto(s)
Enfermedad Coronaria/genética , Metilación de ADN , Diabetes Mellitus/genética , Hipertensión/genética , Leucocitos/citología , Adulto , Anciano , Anciano de 80 o más Años , Índice de Masa Corporal , Enfermedad Coronaria/epidemiología , Diabetes Mellitus/epidemiología , Femenino , Predisposición Genética a la Enfermedad , Humanos , Hipertensión/epidemiología , Leucocitos/metabolismo , Masculino , Persona de Mediana EdadRESUMEN
BACKGROUND: Colorectal cancer (CRC) is the third most common cancer worldwide. If detected at an early stage, prognosis is good. Despite increasing evidence for the benefits of implemented screening programs, such as screening colonoscopy, compliance is rather low. Hence there is demand for non-invasive tests for the early detection of CRC with high acceptance in population-wide screening. The objective of this study was to identify and evaluate leukocyte DNA methylation patterns as a potential biomarker for early detection of CRC. METHODS: Blood samples of patients scheduled for a screening colonoscopy were collected before the procedure. Additionally, blood samples from CRC cases recruited in a clinical setting were collected. DNA was extracted from leukocytes, and DNA methylation was measured with the Infinium 450K BeadChip. In total, 46 CRC cases and 140 controls from the screening setting and 93 CRC cases from the clinical setting were measured. RESULTS: An epigenome-wide discovery revealed two CpG sites in the promoter region of KIAA1549L that were significantly differentially methylated between cases and controls. A third marker in the body region of BCL2 was discovered in a candidate approach testing biomarkers reported in the literature. Logistic regression models built on these three markers yielded an optimism-corrected c-statistic of 0.69 in the screening setting and 0.73 in the clinical setting. CONCLUSIONS: Although diagnostic performance of the DNA methylation signature identified in this first epigenome-wide association study of leukocyte DNA methylation with CRC in a screening setting is not competitive with established screening tests, the identified markers may contribute to multimarker panels for early detection of CRC.
Asunto(s)
Biomarcadores de Tumor/genética , Neoplasias Colorrectales/genética , Metilación de ADN , ADN de Neoplasias/sangre , Epigenómica/métodos , Leucocitos/fisiología , Anciano , Biomarcadores de Tumor/sangre , Neoplasias Colorrectales/sangre , Islas de CpG , ADN de Neoplasias/genética , Detección Precoz del Cáncer , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Leucocitos/metabolismo , Masculino , Persona de Mediana EdadRESUMEN
AIM: Whole-blood DNA methylation depends on the underlying leukocyte composition and confounding hereby is a major concern in epigenome-wide association studies. Cell counts are often missing or may not be feasible. Computational approaches estimate leukocyte composition from DNA methylation based on reference datasets of purified leukocytes. We explored the possibility to train such a model on whole-blood DNA methylation and cell counts without the need for purification. MATERIALS & METHODS: Using whole-blood DNA methylation and corresponding five-part cell counts from 2445 participants from the London Life Sciences Prospective Population Study, a model was trained on a subset of 175 subjects and evaluated on the remaining. RESULTS: Correlations between cell counts and estimated cell proportions were high (neutrophils 0.85, eosinophils 0.88, basophils 0.02, lymphocytes 0.84, monocytes 0.55) and estimated proportions explained more variance in whole-blood DNA methylation levels than counts. CONCLUSION: Our model provided precise estimates for the common cell types.
Asunto(s)
Metilación de ADN , Leucocitos/clasificación , Adulto , Anciano , Biomarcadores/sangre , Enfermedad Coronaria/sangre , Femenino , Humanos , Recuento de Leucocitos/métodos , Recuento de Leucocitos/normas , Leucocitos/metabolismo , Masculino , Persona de Mediana Edad , Estándares de ReferenciaRESUMEN
DNA methylation (DNAm) has been revealed to play a role in various diseases. Here we performed epigenome-wide screening and validation to identify mortality-related DNAm signatures in a general population-based cohort with up to 14 years follow-up. In the discovery panel in a case-cohort approach, 11,063 CpGs reach genome-wide significance (FDR<0.05). 58 CpGs, mapping to 38 well-known disease-related genes and 14 intergenic regions, are confirmed in a validation panel. A mortality risk score based on ten selected CpGs exhibits strong association with all-cause mortality, showing hazard ratios (95% CI) of 2.16 (1.10-4.24), 3.42 (1.81-6.46) and 7.36 (3.69-14.68), respectively, for participants with scores of 1, 2-5 and 5+ compared with a score of 0. These associations are confirmed in an independent cohort and are independent from the 'epigenetic clock'. In conclusion, DNAm of multiple disease-related genes are strongly linked to mortality outcomes. The DNAm-based risk score might be informative for risk assessment and stratification.