ABSTRACT
Coronary heart disease (CHD) is one of the leading causes of mortality and morbidity in the United States. Accurate time-to-event CHD prediction models with high-dimensional DNA methylation and clinical features may assist with early prediction and intervention strategies. We developed a state-of-the-art deep learning autoencoder survival analysis model (AESurv) to effectively analyze high-dimensional blood DNA methylation features and traditional clinical risk factors by learning low-dimensional representation of participants for time-to-event CHD prediction. We demonstrated the utility of our model in two cohort studies: the Strong Heart Study cohort (SHS), a prospective cohort studying cardiovascular disease and its risk factors among American Indians adults; the Women's Health Initiative (WHI), a prospective cohort study including randomized clinical trials and observational study to improve postmenopausal women's health with one of the main focuses on cardiovascular disease. Our AESurv model effectively learned participant representations in low-dimensional latent space and achieved better model performance (concordance index-C index of 0.864 ± 0.009 and time-to-event mean area under the receiver operating characteristic curve-AUROC of 0.905 ± 0.009) than other survival analysis models (Cox proportional hazard, Cox proportional hazard deep neural network survival analysis, random survival forest, and gradient boosting survival analysis models) in the SHS. We further validated the AESurv model in WHI and also achieved the best model performance. The AESurv model can be used for accurate CHD prediction and assist health care professionals and patients to perform early intervention strategies. We suggest using AESurv model for future time-to-event CHD prediction based on DNA methylation features.
Subject(s)
Coronary Disease , DNA Methylation , Humans , Coronary Disease/mortality , Female , Survival Analysis , Deep Learning , Risk Factors , Male , Middle Aged , Prospective StudiesABSTRACT
The statistical analysis of omics data poses a great computational challenge given their ultra-high-dimensional nature and frequent between-features correlation. In this work, we extended the iterative sure independence screening (ISIS) algorithm by pairing ISIS with elastic-net (Enet) and 2 versions of adaptive elastic-net (adaptive elastic-net (AEnet) and multistep adaptive elastic-net (MSAEnet)) to efficiently improve feature selection and effect estimation in omics research. We subsequently used genome-wide human blood DNA methylation data from American Indian participants in the Strong Heart Study (n = 2235 participants; measured in 1989-1991) to compare the performance (predictive accuracy, coefficient estimation, and computational efficiency) of ISIS-paired regularization methods with that of a bayesian shrinkage and traditional linear regression to identify an epigenomic multimarker of body mass index (BMI). ISIS-AEnet outperformed the other methods in prediction. In biological pathway enrichment analysis of genes annotated to BMI-related differentially methylated positions, ISIS-AEnet captured most of the enriched pathways in common for at least 2 of all the evaluated methods. ISIS-AEnet can favor biological discovery because it identifies the most robust biological pathways while achieving an optimal balance between bias and efficient feature selection. In the extended SIS R package, we also implemented ISIS paired with Cox and logistic regression for time-to-event and binary endpoints, respectively, and a bootstrap approach for the estimation of regression coefficients.
Subject(s)
Algorithms , Body Mass Index , DNA Methylation , Epigenomics , Humans , Epigenomics/methods , Female , Male , Bayes Theorem , Middle Aged , Epigenesis, Genetic , Aged , Biomarkers/bloodABSTRACT
PURPOSE: Liver cancer incidence among American Indians/Alaska Natives has risen over the past 20 years. Peripheral blood DNA methylation may be associated with liver cancer and could be used as a biomarker for cancer risk. We evaluated the association of blood DNA methylation with risk of liver cancer. METHODS: We conducted a prospective cohort study in 2324 American Indians, between age 45 and 75 years, from Arizona, Oklahoma, North Dakota and South Dakota who participated in the Strong Heart Study between 1989 and 1991. Liver cancer deaths (n = 21) were ascertained using death certificates obtained through 2017. The mean follow-up duration (SD) for non-cases was 25.1 (5.6) years and for cases, 11.0 (8.8) years. DNA methylation was assessed from blood samples collected at baseline using MethylationEPIC BeadChip 850 K arrays. We used Cox regression models adjusted for age, sex, center, body mass index, low-density lipoprotein cholesterol, smoking, alcohol consumption, and immune cell proportions to examine the associations. RESULTS: We identified 9 CpG sites associated with liver cancer. cg16057201 annotated to MRFAP1) was hypermethylated among cases vs. non-cases (hazard ratio (HR) for one standard deviation increase in methylation was 1.25 (95% CI 1.14, 1.37). The other eight CpGs were hypomethylated and the corresponding HRs (95% CI) ranged from 0.58 (0.44, 0.75) for cg04967787 (annotated to PPRC1) to 0.77 (0.67, 0.88) for cg08550308. We also assessed 7 differentially methylated CpG sites associated with liver cancer in previous studies. The adjusted HR for cg15079934 (annotated to LPS1) was 1.93 (95% CI 1.10, 3.39). CONCLUSIONS: Blood DNA methylation may be associated with liver cancer mortality and may be altered during the development of liver cancer.
Subject(s)
Indians, North American , Liver Neoplasms , Humans , Middle Aged , Aged , American Indian or Alaska Native , DNA Methylation , Prospective Studies , Indians, North American/genetics , Liver Neoplasms/epidemiology , Liver Neoplasms/geneticsABSTRACT
Dyslipidemia has been associated with depression, but individual lipid species associated with depression remain largely unknown. The temporal relationship between lipid metabolism and the development of depression also remains to be determined. We studied 3721 fasting plasma samples from 1978 American Indians attending two exams (2001-2003, 2006-2009, mean ~5.5 years apart) in the Strong Heart Family Study. Plasma lipids were repeatedly measured by untargeted liquid chromatography-mass spectrometry (LC-MS). Depressive symptoms were assessed using the 20-item Center for Epidemiologic Studies for Depression (CES-D). Participants at risk for depression were defined as total CES-D score ≥16. Generalized estimating equation (GEE) was used to examine the associations of lipid species with incident or prevalent depression, adjusting for covariates. The associations between changes in lipids and changes in depressive symptoms were additionally adjusted for baseline lipids. We found that lower levels of sphingomyelins and glycerophospholipids and higher level of lysophospholipids were significantly associated with incident and/or prevalent depression. Changes in sphingomyelins, glycerophospholipids, acylcarnitines, fatty acids and triacylglycerols were associated with changes in depressive symptoms and other psychosomatic traits. We also identified differential lipid networks associated with risk of depression. The observed alterations in lipid metabolism may affect depression through increasing the activities of acid sphingomyelinase and phospholipase A2, disturbing neurotransmitters and membrane signaling, enhancing inflammation, oxidative stress, and lipid peroxidation, and/or affecting energy storage in lipid droplets or membrane formation. These findings illuminate the mechanisms through which dyslipidemia may contribute to depression and provide initial evidence for targeting lipid metabolism in developing preventive and therapeutic interventions for depression.
Subject(s)
Depression , Dyslipidemias , Humans , Longitudinal Studies , Depression/diagnosis , American Indian or Alaska Native , Independent Living , Lipidomics , Sphingomyelins , GlycerophospholipidsABSTRACT
BACKGROUND: Epigenetic dysregulation has been proposed as a key mechanism for arsenic-related cardiovascular disease (CVD). We evaluated differentially methylated positions (DMPs) as potential mediators on the association between arsenic and CVD. METHODS: Blood DNA methylation was measured in 2321 participants (mean age 56.2, 58.6% women) of the Strong Heart Study, a prospective cohort of American Indians. Urinary arsenic species were measured using high-performance liquid chromatography coupled to inductively coupled plasma mass spectrometry. We identified DMPs that are potential mediators between arsenic and CVD. In a cross-species analysis, we compared those DMPs with differential liver DNA methylation following early-life arsenic exposure in the apoE knockout (apoE-/-) mouse model of atherosclerosis. RESULTS: A total of 20 and 13 DMPs were potential mediators for CVD incidence and mortality, respectively, several of them annotated to genes related to diabetes. Eleven of these DMPs were similarly associated with incident CVD in 3 diverse prospective cohorts (Framingham Heart Study, Women's Health Initiative, and Multi-Ethnic Study of Atherosclerosis). In the mouse model, differentially methylated regions in 20 of those genes and DMPs in 10 genes were associated with arsenic. CONCLUSIONS: Differential DNA methylation might be part of the biological link between arsenic and CVD. The gene functions suggest that diabetes might represent a relevant mechanism for arsenic-related cardiovascular risk in populations with a high burden of diabetes.
Subject(s)
Arsenic , Atherosclerosis , Cardiovascular Diseases , Animals , Apolipoproteins E , Arsenic/toxicity , Atherosclerosis/chemically induced , Atherosclerosis/genetics , Cardiovascular Diseases/chemically induced , Cardiovascular Diseases/genetics , DNA Methylation , Female , Humans , Male , Mice , Middle Aged , Prospective StudiesABSTRACT
The generalized estimating equations method (GEE) is commonly applied to analyze data obtained from family studies. GEE is well known for its robustness on misspecification of correlation structure. However, the unbalanced distribution of family sizes and complicated genetic relatedness structure within each family may challenge GEE performance. We focused our research on binary outcomes. To evaluate the performance of GEE, we conducted a series of simulations, on data generated adopting the kinship matrix (correlation structure within each family) from the Strong Heart Family Study (SHFS). We performed a fivefold cross-validation to further evaluate the GEE predictive power on data from the SHFS. A Bayesian modeling approach, with direct integration of the kinship matrix, was also included to contrast with GEE. Our simulation studies revealed that GEE performs well on a binary outcome from families having a relatively simple kinship structure. However, data with a binary outcome generated from families with complex kinship structures, especially with a large genetic variance, can challenge the performance of GEE.
ABSTRACT
The marmoset is a fundamental nonhuman primate model for the study of aging, neurobiology, and many other topics. Genetic management of captive marmoset colonies is complicated by frequent chimerism in the blood and other tissues, a lack of tools to enable cost-effective, genome-wide interrogation of variation, and historic mergers and migrations of animals between colonies. We implemented genotype-by-sequencing (GBS) of hair follicle derived DNA (a minimally chimeric DNA source) of 82 marmosets housed at the Southwest National Primate Research Center (SNPRC). Our primary goals were the genetic characterization of our marmoset population for pedigree verification and colony management and to inform the scientific community of the functional genetic makeup of this valuable resource. We used the GBS data to reconstruct the genetic legacy of recent mergers between colonies, to identify genetically related animals whose relationships were previously unknown due to incomplete pedigree information, and to show that animals in the SNPRC colony appear to exhibit low levels of inbreeding. Of the >99,000 single-nucleotide variants (SNVs) that we characterized, >9800 are located within gene regions known to harbor pathogenic variants of clinical significance in humans. Overall, we show the combination of low-resolution (sparse) genotyping using hair follicle DNA is a powerful strategy for the genetic management of captive marmoset colonies and for identifying potential SNVs for the development of biomedical research models.
Subject(s)
Callithrix , Genotype , Pedigree , Animals , Callithrix/genetics , Male , Female , Polymorphism, Single Nucleotide , Sequence Analysis, DNA , Inbreeding , Hair Follicle , Genotyping Techniques/methods , Genotyping Techniques/veterinaryABSTRACT
BACKGROUND. The confounder-corrected chemical shift-encoded MRI (CSE-MRI) sequence used to determine proton density fat fraction (PDFF) for hepatic fat quantification is not widely available. As an alternative, hepatic fat can be assessed by a two-point Dixon method to calculate signal fat fraction (FF) from conventional T1-weighted in- and opposed-phase (IOP) images, although signal FF is prone to biases, leading to inaccurate quantification. OBJECTIVE. The purpose of this study was to compare hepatic fat quantification by use of PDFF inferred from conventional T1-weighted IOP images and deep-learning convolutional neural networks (CNNs) with quantification by use of two-point Dixon signal FF with CSE-MRI PDFF as the reference standard. METHODS. This study entailed retrospective analysis of data from 292 participants (203 women, 89 men; mean age, 53.7 ± 12.0 [SD] years) enrolled at two sites from September 1, 2017, to December 18, 2019, in the Strong Heart Family Study (a prospective population-based study of American Indian communities). Participants underwent liver MRI (site A, 3 T; site B, 1.5 T) including T1-weighted IOP MRI and CSE-MRI (used to reconstruct CSE PDFF and CSE R2* maps). With CSE PDFF as reference, a CNN was trained in a random sample of 218 (75%) participants to infer voxel-by-voxel PDFF maps from T1-weighted IOP images; testing was performed in the other 74 (25%) participants. Parametric values from the entire liver were automatically extracted. Per-participant median CNN-inferred PDFF and median two-point Dixon signal FF were compared with reference median CSE-MRI PDFF by means of linear regression analysis, intraclass correlation coefficient (ICC), and Bland-Altman analysis. The code is publicly available at github.com/kang927/CNN-inference-of-PDFF-from-T1w-IOP-MR. RESULTS. In the 74 test-set participants, reference CSE PDFF ranged from 1% to 32% (mean, 11.3% ± 8.3% [SD]); reference CSE R2* ranged from 31 to 457 seconds-1 (mean, 62.4 ± 67.3 seconds-1 [SD]). Agreement metrics with reference to CSE PDFF for CNN-inferred PDFF were ICC = 0.99, bias = -0.19%, 95% limits of agreement (LoA) = (-2.80%, 2.71%) and for two-point Dixon signal FF were ICC = 0.93, bias = -1.11%, LoA = (-7.54%, 5.33%). CONCLUSION. Agreement with reference CSE PDFF was better for CNN-inferred PDFF from conventional T1-weighted IOP images than for two-point Dixon signal FF. Further investigation is needed in individuals with moderate-to-severe iron overload. CLINICAL IMPACT. Measurement of CNN-inferred PDFF from widely available T1-weighted IOP images may facilitate adoption of hepatic PDFF as a quantitative bio-marker for liver fat assessment, expanding opportunities to screen for hepatic steatosis and nonalcoholic fatty liver disease.
Subject(s)
Deep Learning , Non-alcoholic Fatty Liver Disease , Male , Humans , Female , Adult , Middle Aged , Aged , Protons , Retrospective Studies , Prospective Studies , Liver/diagnostic imaging , Non-alcoholic Fatty Liver Disease/diagnostic imaging , Magnetic Resonance Imaging/methodsABSTRACT
The Collaborative Cohort of Cohorts for COVID-19 Research (C4R) is a national prospective study of adults comprising 14 established US prospective cohort studies. Starting as early as 1971, investigators in the C4R cohort studies have collected data on clinical and subclinical diseases and their risk factors, including behavior, cognition, biomarkers, and social determinants of health. C4R links this pre-coronavirus disease 2019 (COVID-19) phenotyping to information on severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection and acute and postacute COVID-related illness. C4R is largely population-based, has an age range of 18-108 years, and reflects the racial, ethnic, socioeconomic, and geographic diversity of the United States. C4R ascertains SARS-CoV-2 infection and COVID-19 illness using standardized questionnaires, ascertainment of COVID-related hospitalizations and deaths, and a SARS-CoV-2 serosurvey conducted via dried blood spots. Master protocols leverage existing robust retention rates for telephone and in-person examinations and high-quality event surveillance. Extensive prepandemic data minimize referral, survival, and recall bias. Data are harmonized with research-quality phenotyping unmatched by clinical and survey-based studies; these data will be pooled and shared widely to expedite collaboration and scientific findings. This resource will allow evaluation of risk and resilience factors for COVID-19 severity and outcomes, including postacute sequelae, and assessment of the social and behavioral impact of the pandemic on long-term health trajectories.
Subject(s)
COVID-19 , Adolescent , Adult , Aged , Aged, 80 and over , COVID-19/epidemiology , Cohort Studies , Humans , Middle Aged , Pandemics , Prospective Studies , SARS-CoV-2 , United States/epidemiology , Young AdultABSTRACT
Dyslipidemia associates with and usually precedes the onset of chronic kidney disease (CKD), but a comprehensive assessment of molecular lipid species associated with risk of CKD is lacking. Here, we sought to identify fasting plasma lipids associated with risk of CKD among American Indians in the Strong Heart Family Study, a large-scale community-dwelling of individuals, followed by replication in Mexican Americans from the San Antonio Family Heart Study and Caucasians from the Australian Diabetes, Obesity and Lifestyle Study. We also performed repeated measurement analysis to examine the temporal relationship between the change in the lipidome and change in kidney function between baseline and follow-up of about five years apart. Network analysis was conducted to identify differential lipid classes associated with risk of CKD. In the discovery cohort, we found that higher baseline level of multiple lipid species, including glycerophospholipids, glycerolipids and sphingolipids, was significantly associated with increased risk of CKD, independent of age, sex, body mass index, diabetes and hypertension. Many lipid species were replicated in at least one external cohort at the individual lipid species and/or the class level. Longitudinal change in the plasma lipidome was significantly associated with change in the estimated glomerular filtration rate after adjusting for covariates, baseline lipids and the baseline rate. Network analysis identified distinct lipidomic signatures differentiating high from low-risk groups. Thus, our results demonstrated that disturbed lipid metabolism precedes the onset of CKD. These findings shed light on the mechanisms linking dyslipidemia to CKD and provide potential novel biomarkers for identifying individuals with early impaired kidney function at preclinical stages.
Subject(s)
Diabetes Mellitus , Dyslipidemias , Renal Insufficiency, Chronic , Humans , Lipidomics , Australia , Renal Insufficiency, Chronic/diagnosis , Renal Insufficiency, Chronic/epidemiology , Dyslipidemias/epidemiology , Glomerular Filtration Rate , Glycerophospholipids , Biomarkers , Sphingolipids , American Indian or Alaska NativeABSTRACT
BACKGROUND AND AIMS: Rates of cardiovascular disease (CVD) among American Indians (AI) have been increasing. Although we have observed an association between atherosclerosis and CVD in older adults, the potential association among young AI is unclear. Therefore, we aim to describe the prevalence of atherosclerosis among young AI and determine its association with CVD and all-cause mortality. METHODS AND RESULTS: We evaluated AI participants from the Strong Heart Family Study (SHFS), who were <40 years old and CVD free at the baseline examination, 2001-2003 (n = 1376). We used carotid ultrasound to detect baseline atherosclerotic plaque. We identified CVD events and all-cause mortality through 2019, with a median follow-up of 17.8 years. We used shared frailty Cox Proportional Hazards models to assess the association between atherosclerosis and time to CVD event or all-cause mortality, while controlling for covariates. Among 1376 participants, 71 (5.2%) had atherosclerosis at baseline. During follow-up, 120 (8.7%) had CVD events and 104 (7.6%) died from any cause. CVD incidence was higher in participants who had baseline atherosclerosis (13.51/1000 person-years) than in those who did not (4.95/1000 person-years, p = 0.0003). CVD risk and all-cause mortality were higher in participants with atherosclerosis, while controlling for covariates (CVD HR = 1.85, 95%CI = 1.02-3.37, p = 0.0420; all-cause mortality HR = 2.04, 95%CI = 1.07-3.89, p = 0.0291). CONCLUSIONS: Among young AI, atherosclerosis was independently associated with incident CVD and all-cause mortality later in life. Thus, atherosclerosis begins early in life and interventions in adolescents and young adults to slow the progression of disease could prevent or delay CVD events later in life.
Subject(s)
Atherosclerosis , Cardiovascular Diseases , Adolescent , Adult , Aged , Atherosclerosis/diagnostic imaging , Atherosclerosis/epidemiology , Cardiovascular Diseases/diagnostic imaging , Cardiovascular Diseases/epidemiology , Humans , Incidence , Proportional Hazards Models , Risk Factors , Young AdultABSTRACT
PURPOSE: Our study examined psychosocial risk and protective features affecting cardiovascular and mortality disparities in American Indians, including stress, anger, cynicism, trauma, depression, quality of life, and social support. METHODS: The Strong Heart Family Study cohort recruited American Indian adults from 12 communities over 3 regions in 2001-2003 (N = 2786). Psychosocial measures included Cohen Perceived Stress, Spielberger Anger Expression, Cook-Medley cynicism subscale, symptoms of post-traumatic stress disorder, Centers for Epidemiologic Studies Depression scale, Short Form 12-a quality of life scale, and the Social Support and Social Undermining scale. Cardiovascular events and all-cause mortality were evaluated by surveillance and physician adjudication through 2017. RESULTS: Participants were middle-aged, 40% male, with mean 12 years formal education. Depression symptoms were correlated with anger, cynicism, poor quality of life, isolation, criticism; better social support was correlated with lower cynicism, anger, and trauma. Adjusted time-to-event regressions found that depression, (poor) quality of life, and social isolation scores formed higher risk for mortality and cardiovascular events, and social support was associated with lower risk. Social support partially explained risk associations in causal mediation analyses. CONCLUSION: Altogether, our findings suggest that social support is associated with better mood and quality of life; and lower cynicism, stress, and disease risk-even when said risk may be increased by comorbidities. Future research should examine whether enhancing social support can prospectively reduce risk, as an efficient, cost-effective intervention opportunity that may be enacted at the community level.
Subject(s)
Cardiovascular Diseases , Adult , Cardiovascular Diseases/psychology , Depression/epidemiology , Female , Humans , Male , Middle Aged , Quality of Life , Social Support , Stress, Psychological/epidemiology , American Indian or Alaska NativeABSTRACT
Although hundreds of genome-wide association studies-implicated loci have been reported for adult obesity-related traits, less is known about the genetics specific for early-onset obesity and with only a few studies conducted in non-European populations to date. Searching for additional genetic variants associated with childhood obesity, we performed a trans-ancestral meta-analysis of 30 studies consisting of up to 13 005 cases (≥95th percentile of body mass index (BMI) achieved 2-18 years old) and 15 599 controls (consistently <50th percentile of BMI) of European, African, North/South American and East Asian ancestry. Suggestive loci were taken forward for replication in a sample of 1888 cases and 4689 controls from seven cohorts of European and North/South American ancestry. In addition to observing 18 previously implicated BMI or obesity loci, for both early and late onset, we uncovered one completely novel locus in this trans-ancestral analysis (nearest gene, METTL15). The variant was nominally associated with only the European subgroup analysis but had a consistent direction of effect in other ethnicities. We then utilized trans-ancestral Bayesian analysis to narrow down the location of the probable causal variant at each genome-wide significant signal. Of all the fine-mapped loci, we were able to narrow down the causative variant at four known loci to fewer than 10 single nucleotide polymorphisms (SNPs) (FAIM2, GNPDA2, MC4R and SEC16B loci). In conclusion, an ethnically diverse setting has enabled us to both identify an additional pediatric obesity locus and further fine-map existing loci.
Subject(s)
Chromosome Mapping/methods , Genome-Wide Association Study/methods , Pediatric Obesity/genetics , Polymorphism, Single Nucleotide , Wilms Tumor/genetics , Bayes Theorem , Case-Control Studies , Child , Female , Genetic Loci , Genetic Predisposition to Disease , Humans , MaleABSTRACT
BACKGROUND: Epidemiologic studies often use self-report as proxy for clinical history. However, whether self-report correctly identifies prevalence in minority populations with health disparities and poor health-care access is unknown. Furthermore, overlap of clinical vascular events with covert vascular brain injury (VBI), detected by imaging, is largely unexamined. METHODS: The Strong Heart Study recruited American Indians from 3 regions, with surveillance and adjudication of stroke events from 1989 to 2013. In 2010-2013, all 817 survivors, aged 65-95 years, underwent brain imaging, neurological history interview, and cognitive testing. VBI was defined as imaged infarct or hemorrhage. RESULTS: Adjudicated stroke was prevalent in 4% of participants and separately collected, self-reported stroke in 8%. Imaging-defined VBI was detected in 51% and not associated with any stroke event in 47%. Compared with adjudication, self-report had 76% sensitivity and 95% specificity. Participants with adjudicated or self-reported stroke had the poorest performance on cognitive testing; those with imaging-only (covert) VBI had intermediate performance. CONCLUSION: In this community-based cohort, self-report for prior stroke had good performance metrics. A majority of participants with VBI did not have overt, clinically recognized events but did have neurological or cognitive symptoms. Data collection methodology for studies in a resource-limited setting must balance practical limitations in costs, accuracy, feasibility, and research goals.
Subject(s)
Cerebrovascular Trauma , Physicians , Stroke , Cerebrovascular Trauma/diagnostic imaging , Cerebrovascular Trauma/epidemiology , Humans , Magnetic Resonance Imaging , Self Report , Stroke/diagnostic imaging , Stroke/epidemiologyABSTRACT
BACKGROUND: Whole-exome sequencing (WES) can expedite research on genetic variation in non-human primate (NHP) models of human diseases. However, NHP-specific reagents for exome capture are not available. This study reports the use of human-specific capture reagents in WES for olive baboons, marmosets, and vervet monkeys. METHODS: Exome capture was carried out using the SureSelect Human All Exon V6 panel from Agilent Technologies, followed by high-throughput sequencing. Capture of protein-coding genes and detection of single nucleotide variants were evaluated. RESULTS: Exome capture and sequencing results showed that more than 97% of old world and 93% of new world monkey protein coding genes were detected. Single nucleotide variants were detected across the genomes and missense variants were found in genes associated with human diseases. CONCLUSIONS: A cost-effective approach based on commercial, human-specific reagents can be used to perform WES for the discovery of genetic variants in these NHP species.
Subject(s)
Exome , High-Throughput Nucleotide Sequencing , Animals , Chlorocebus aethiops , Exome/genetics , Humans , Indicators and Reagents , Primates , Exome SequencingABSTRACT
BACKGROUND: Arsenic has been associated with hypertension, though it is unclear whether associations persist at the exposure concentrations (e.g. <100 µg/L) in drinking water occurring in parts of the Western United States. METHODS: We assessed associations between arsenic biomarkers and systolic blood pressure (SBP), diastolic blood pressure (DBP), and hypertension in the Strong Heart Family Study, a family-based cohort of American Indians from the Northern plains, Southern plains, and Southwest. We included 1910 participants from three study centers with complete baseline visit data (2001-2003) in the cross-sectional analysis of all three outcomes, and 1453 participants in the prospective analysis of incident hypertension (follow-up 2006-2009). We used generalized estimating equations with exchangeable correlation structure conditional on family membership to estimate the association of arsenic exposure biomarker levels with SBP or DBP (linear regressions) or hypertension prevalence and incidence (Poisson regressions), adjusting for urine creatinine, urine arsenobetaine, and measured confounders. RESULTS: We observed cross-sectional associations for a two-fold increase in inorganic and methylated urine arsenic species of 0.64 (95% CI: 0.07, 1.35) mm Hg for SBP, 0.49 (95% CI: 0.03, 1.02) mm Hg for DBP, and a prevalence ratio of 1.10 (95% CI: 1.01, 1.21) for hypertension in fully adjusted models. During follow-up, 14% of subjects developed hypertension. We observed non-monotonic relationships between quartiles of arsenic and incident hypertension. Effect estimates were null for incident hypertension with continuous exposure metrics. Stratification by study site revealed elevated associations in Arizona, the site with the highest arsenic levels, while results for Oklahoma and North and South Dakota were largely null. Blood pressure changes with increasing arsenic concentrations were larger for those with diabetes at baseline. CONCLUSIONS: Our results suggest a modest cross-sectional association of arsenic exposure biomarkers with blood pressure, and possible non-linear effects on incident hypertension.
Subject(s)
Arsenic , Hypertension , Indians, North American , Arizona , Arsenic/toxicity , Blood Pressure , Cross-Sectional Studies , Environmental Exposure/adverse effects , Humans , Hypertension/chemically induced , Hypertension/epidemiology , Oklahoma , Prospective Studies , South Dakota , United StatesABSTRACT
BACKGROUND: Our aim was to investigate if moderate to vigorous physical activity (MVPA), calcium intake interacts with bone mineral density (BMD)-related single nucleotide polymorphisms (SNPs) to influence BMD in 750 Hispanic children (4-19y) of the cross-sectional Viva La Familia Study. METHODS: Physical activity and dietary intake were measured by accelerometers and multiple-pass 24 h dietary recalls, respectively. Total body and lumbar spine BMD were measured by dual energy X-ray absorptiometry. A polygenic risk score (PRS) was computed based on SNPs identified in published literature. Regression analysis was conducted with PRSs, MVPA and calcium intake with total body and lumbar spine BMD. RESULTS: We found evidence of statistically significant interaction effects between the PRS and MVPA on total body BMD and lumbar spine BMD (p < 0.05). Higher PRS was associated with a lower total body BMD (ß = - 0.040 ± 0.009, p = 1.1 × 10- 5) and lumbar spine BMD (ß = - 0.042 ± 0.013, p = 0.0016) in low MVPA group, as compared to high MVPA group (ß = - 0.015 ± 0.006, p = 0.02; ß = 0.008 ± 0.01, p = 0.4, respectively). DISCUSSION: The study indicated that calcium intake does not modify the relationship between genetic variants and BMD, while it implied physical activity interacts with genetic variants to affect BMD in Hispanic children. Due to limited sample size of our study, future research on gene by environment interaction on bone health and functional studies to provide biological insights are needed. CONCLUSIONS: Bone health in Hispanic children with high genetic risk for low BMD is benefitted more by MVPA than children with low genetic risk. Our results may be useful to predict disease risk and tailor dietary and physical activity advice delivery to people, especially children.
Subject(s)
Bone Density , Exercise , Absorptiometry, Photon , Bone Density/genetics , Child , Cross-Sectional Studies , Hispanic or Latino/genetics , HumansABSTRACT
BACKGROUND: Elevated adiposity is often posited by medical and public health researchers to be a risk factor associated with cardiovascular disease, diabetes, and other diseases. These health challenges are now thought to be reflected in epigenetic modifications to DNA molecules, such as DNA methylation, which can alter gene expression. METHODS: Here we report the results of three Epigenome Wide Association Studies (EWAS) in which we assessed the differential methylation of DNA (obtained from peripheral blood) associated with three adiposity phenotypes (BMI, waist circumference, and impedance-measured percent body fat) among American Indian adult participants in the Strong Heart Study. RESULTS: We found differential methylation at 8264 CpG sites associated with at least one of our three response variables. Of the three adiposity proxies we measured, waist circumference had the highest number of associated differentially methylated CpGs, while percent body fat was associated with the lowest. Because both waist circumference and percent body fat relate to physiology, we focused interpretations on these variables. We found a low degree of overlap between these two variables in our gene ontology enrichment and Differentially Methylated Region analyses, supporting that waist circumference and percent body fat measurements represent biologically distinct concepts. CONCLUSIONS: We interpret these general findings to indicate that highly significant regions of the genome (DMR) and synthesis pathways (GO) in waist circumference analyses are more likely to be associated with the presence of visceral/abdominal fat than more general measures of adiposity. Our findings confirmed numerous CpG sites previously found to be differentially methylated in association with adiposity phenotypes, while we also found new differentially methylated CpG sites and regions not previously identified.
Subject(s)
Adiposity/genetics , CpG Islands , DNA Methylation , Epigenome , Aged , Body Mass Index , Female , Gene Ontology , Genome-Wide Association Study , Heart Disease Risk Factors , Humans , Male , Middle Aged , Phenotype , Prospective Studies , Waist Circumference , American Indian or Alaska NativeABSTRACT
BACKGROUND: Diet plays a key role in development of diabetes, and there has been recent interest in better understanding the association of dairy food intake with diabetes. OBJECTIVE: This study examined the associations of full-fat and low-fat dairy food intake with incident diabetes among American Indians-a population with a high burden of diabetes. METHODS: The study included participants from the Strong Heart Family Study (SHFS), a family-based study of cardiovascular disease in American Indians, free of diabetes at baseline (2001-2003) (n = 1623). Participants were 14-86-y-old at baseline and 60.8% were female. Dairy food intake was assessed using a Block food frequency questionnaire. Incident diabetes was defined using American Diabetes Association criteria. Parametric survival models with a Weibull distribution were used to evaluate the associations of full-fat and low-fat dairy food intake with incident diabetes. Serving sizes were defined as 250 mL for milk and 42.5 g for cheese. RESULTS: We identified 277 cases of diabetes during a mean follow-up of 11 y. Reported intake of dairy foods was low [median full-fat dairy food intake: 0.11 serving/1000 kcal; median low-fat dairy food intake: 0.03 serving/1000 kcal]. Participants who reported the highest full-fat dairy food intake had a lower risk of diabetes compared to those who reported the lowest full-fat food dairy intake [HR (95% CI): 0.79 (0.59, 1.06); P-trend = 0.03, comparing extreme tertiles, after adjustment for age, sex, site, physical activity, education, smoking, diet quality, and low-fat dairy food intake]. Low-fat dairy food intake was not associated with diabetes. CONCLUSIONS: American Indians who participated in the SHFS reported low dairy food intake. Participants who reported higher full-fat dairy food intake had a lower risk of diabetes than participants who reported lower intake. These findings may be of interest to populations with low dairy food intake.
Subject(s)
Dairy Products , Diabetes Mellitus/epidemiology , Dietary Fats/administration & dosage , Indians, North American , Adolescent , Adult , Aged , Aged, 80 and over , Cohort Studies , Female , Humans , Male , Middle Aged , Risk Factors , United States/epidemiology , Young AdultABSTRACT
BACKGROUND: Inorganic arsenic exposure is ubiquitous and both exposure and inter-individual differences in its metabolism have been associated with cardiometabolic risk. A more efficient arsenic metabolism profile (lower MMA%, higher DMA%) has been associated with reduced risk for arsenic-related health outcomes. This profile, however, has also been associated with increased risk for diabetes-related outcomes. OBJECTIVES: The mechanism behind these conflicting associations is unclear; we hypothesized the one-carbon metabolism (OCM) pathway may play a role. METHODS: We evaluated the influence of OCM on the relationship between arsenic metabolism and diabetes-related outcomes (HOMA2-IR, waist circumference, fasting plasma glucose) using metabolomic data from an OCM-specific and P180 metabolite panel measured in plasma, arsenic metabolism measured in urine, and HOMA2-IR and FPG measured in fasting plasma. Samples were drawn from baseline visits (2001-2003) in 59 participants from the Strong Heart Family Study, a family-based cohort study of American Indians aged ≥14 years from Arizona, Oklahoma, and North/South Dakota. RESULTS: In unadjusted analyses, a 5% increase in DMA% was associated with higher HOMA2-IR (geometric mean ratio (GMR)= 1.13 (95% CI: 1.03, 1.25)) and waist circumference (mean difference=3.66 (0.95, 6.38). MMA% was significantly associated with lower HOMA2-IR and waist circumference. After adjustment for OCM-related metabolites (SAM, SAH, cysteine, glutamate, lysophosphatidylcholine 18.2, and three phosphatidlycholines), associations were attenuated and no longer significant. CONCLUSIONS: These preliminary results indicate that the association of lower MMA% and higher DMA% with diabetes-related outcomes may be influenced by OCM status, either through confounding, reverse causality, or mediation.