RESUMEN
The use of omic modalities to dissect the molecular underpinnings of common diseases and traits is becoming increasingly common. But multi-omic traits can be genetically predicted, which enables highly cost-effective and powerful analyses for studies that do not have multi-omics1. Here we examine a large cohort (the INTERVAL study2; n = 50,000 participants) with extensive multi-omic data for plasma proteomics (SomaScan, n = 3,175; Olink, n = 4,822), plasma metabolomics (Metabolon HD4, n = 8,153), serum metabolomics (Nightingale, n = 37,359) and whole-blood Illumina RNA sequencing (n = 4,136), and use machine learning to train genetic scores for 17,227 molecular traits, including 10,521 that reach Bonferroni-adjusted significance. We evaluate the performance of genetic scores through external validation across cohorts of individuals of European, Asian and African American ancestries. In addition, we show the utility of these multi-omic genetic scores by quantifying the genetic control of biological pathways and by generating a synthetic multi-omic dataset of the UK Biobank3 to identify disease associations using a phenome-wide scan. We highlight a series of biological insights with regard to genetic mechanisms in metabolism and canonical pathway associations with disease; for example, JAK-STAT signalling and coronary atherosclerosis. Finally, we develop a portal ( https://www.omicspred.org/ ) to facilitate public access to all genetic scores and validation results, as well as to serve as a platform for future extensions and enhancements of multi-omic genetic scores.
Asunto(s)
Enfermedad de la Arteria Coronaria , Multiómica , Humanos , Enfermedad de la Arteria Coronaria/genética , Enfermedad de la Arteria Coronaria/metabolismo , Metabolómica/métodos , Fenotipo , Proteómica/métodos , Aprendizaje Automático , Negro o Afroamericano/genética , Asiático/genética , Pueblo Europeo/genética , Reino Unido , Conjuntos de Datos como Asunto , Internet , Reproducibilidad de los Resultados , Estudios de Cohortes , Proteoma/análisis , Proteoma/metabolismo , Metaboloma , Plasma/metabolismo , Bases de Datos FactualesRESUMEN
Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10-20% (14-24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.
Asunto(s)
Estatura , Mapeo Cromosómico , Polimorfismo de Nucleótido Simple , Humanos , Estatura/genética , Frecuencia de los Genes/genética , Genoma Humano/genética , Estudio de Asociación del Genoma Completo , Haplotipos/genética , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Europa (Continente)/etnología , Tamaño de la Muestra , FenotipoRESUMEN
BACKGROUND: Metabolic syndrome (MetS) is a precursor to cardiovascular diseases and type 2 diabetes. Existing MetS prediction models relied heavily on biochemical measures and those based on non-invasive predictors such as lifestyle behaviours were limited. We aim to (1) develop a weighted lifestyle risk index for MetS and (2) externally validate this index using two Asian-based cohorts in Singapore. METHODS: Using data from the Multi-Ethnic Cohort (MEC) 1 (n = 2873, 41% male), multiple logistic regression was used to identify predictors associated with MetS. A weighted lifestyle risk index was generated using coefficients of the selected predictors in the development cohort (MEC1). Subsequently, the performance of the lifestyle risk index in predicting the occurrence of MetS within 10 years was assessed by discrimination and calibration in an external validation cohort (MEC2) (n = 6070, 43% male). RESULTS: A lifestyle risk index for MetS with nine predictors was developed (age, sex, ethnicity, having a family history of diabetes, BMI, diet, physical activity, smoking status, and screen time). This index demonstrated acceptable discrimination in the development cohort [AUC (95% CI) = 0.74 (0.71, 0.76)] and the validation cohort [AUC (95% CI) = 0.79 (0.77, 0.81)]. CONCLUSION: This lifestyle risk index exhibits potential for risk stratification in population-based screening programmes. Future research could apply a similar methodology to develop disease-specific lifestyle risk indices using nationwide registry-based data.
Asunto(s)
Diabetes Mellitus Tipo 2 , Síndrome Metabólico , Humanos , Masculino , Femenino , Síndrome Metabólico/diagnóstico , Síndrome Metabólico/epidemiología , Factores de Riesgo , Diabetes Mellitus Tipo 2/diagnóstico , Estilo de Vida , DietaRESUMEN
Whole-exome sequencing (WES) has been widely used to study the role of protein-coding variants in genetic diseases. Non-coding regions, typically covered by sparse off-target data, are often discarded by conventional WES analyses. Here, we develop a genotype calling pipeline named WEScall to analyse both target and off-target data. We leverage linkage disequilibrium shared within study samples and from an external reference panel to improve genotyping accuracy. In an application to WES of 2527 Chinese and Malays, WEScall can reduce the genotype discordance rate from 0.26% (SE= 6.4 × 10-6) to 0.08% (SE = 3.6 × 10-6) across 1.1 million single nucleotide polymorphisms (SNPs) in the deeply sequenced target regions. Furthermore, we obtain genotypes at 0.70% (SE = 3.0 × 10-6) discordance rate across 5.2 million off-target SNPs, which had ~1.2× mean sequencing depth. Using this dataset, we perform genome-wide association studies of 10 metabolic traits. Despite of our small sample size, we identify 10 loci at genome-wide significance (P < 5 × 10-8), including eight well-established loci. The two novel loci, both associated with glycated haemoglobin levels, are GPATCH8-SLC4A1 (rs369762319, P = 2.56 × 10-12) and ROR2 (rs1201042, P = 3.24 × 10-8). Finally, using summary statistics from UK Biobank and Biobank Japan, we show that polygenic risk prediction can be significantly improved for six out of nine traits by incorporating off-target data (P < 0.01). These results demonstrate WEScall as a useful tool to facilitate WES studies with decent amounts of off-target data.
Asunto(s)
Secuenciación del Exoma/métodos , Predisposición Genética a la Enfermedad , Genotipo , Proteína 1 de Intercambio de Anión de Eritrocito/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Desequilibrio de Ligamiento , Proteínas Musculares/genética , Polimorfismo de Nucleótido SimpleRESUMEN
BACKGROUND: Evidence is accumulating that intake of animal-based and plant-based proteins has different effects on cardiometabolic health, but less is known about the health effect of isocaloric substitution of animal-based and plant-based proteins. Data from Asian populations are limited. OBJECTIVES: This study aimed to evaluate the effects of isocaloric substitution of total plant-based proteins for total and various animal-based protein food groups and to evaluate the effects of substituting protein from legumes and pulses for various animal-based protein food groups on cardiovascular disease (CVD) risk factors and predicted 10-y CVD risk. METHODS: We conducted a cross-sectional analysis using data collected from 9211 Singapore residents (aged 21-75 y) from the Singapore Multi-Ethnic Cohort. Data on sociodemographic and lifestyle factors were collected using questionnaires. Dietary intakes were assessed using a validated FFQ. BMI, waist circumference, and blood pressure were measured during a physical examination, and blood samples were collected to measure lipid profiles. Associations were assessed by substitution models using a multiple linear regression analysis. RESULTS: Isocaloric substitution of total plant-based proteins for total and all specific animal-based protein food groups were associated with lower BMI (ß: -0.30; 95% CI: -0.38, -0.22), waist circumference (ß: -0.85; 95% CI: -1.04, -0.66), and LDL cholesterol concentrations (ß: -0.06; 95% CI: -0.08, -0.05) (P < 0.0056). Replacement of processed meat and processed seafood proteins with total plant-based proteins was associated with improvement in most CVD risk factors and predicted 10-y CVD risk. Replacement of oily fish with legume proteins was associated with lower HDL cholesterol and higher TG concentrations. CONCLUSIONS: The substitution of plant-based proteins for animal-based proteins, especially from processed meat and processed seafood, was inversely associated with the established CVD risk factors such as BMI, waist circumference, and lipid concentrations and predicted 10-y CVD risk. These findings warrant further investigation in independent studies in other Asian populations.
Asunto(s)
Enfermedades Cardiovasculares , Proteínas de Plantas , Animales , Factores de Riesgo , Factores de Riesgo Cardiometabólico , Estudios Transversales , Verduras , Lípidos , DietaRESUMEN
Metabolites are small intermediate products of cellular metabolism perturbed in a variety of complex disorders. Identifying genetic markers associated with metabolite concentrations could delineate disease-related metabolic pathways in humans. We tested genetic variants for associations with 136 metabolites in 1954 Chinese from Singapore. At a conservative genome-wide threshold (3.7 × 10-10), we detected 1899 variant-metabolite associations at 16 genetic loci. Three loci (ABCA7, A4GALT, GSTM2) represented novel associations with metabolites, with the strongest association observed between ABCA7 and d18:1/24:1 dihexosylceramide. Among 13 replicated loci, we identified six new variants independent of previously reported metabolite or lipid signals. We observed variant-metabolite associations at two loci (ABCA7, CHCHD2) that have been linked to neurodegenerative diseases. At SGPP1 and SPTLC3 loci, genetic variants showed preferential selectivity for sphingolipids with d16 (rather than d18) sphingosine backbone, including sphingosine-1-phosphate (S1P). Our results provide new genetic associations for metabolites and highlight the role of metabolites as intermediate modulators in disease metabolic pathways.
Asunto(s)
Enfermedad de Alzheimer/genética , Pueblo Asiatico/genética , Glicoesfingolípidos/metabolismo , Enfermedad de Parkinson/genética , Esfingolípidos/metabolismo , Transportadoras de Casetes de Unión a ATP/genética , Transportadoras de Casetes de Unión a ATP/metabolismo , Enfermedad de Alzheimer/metabolismo , Carnitina/análogos & derivados , Carnitina/metabolismo , China , Proteínas de Unión al ADN/genética , Proteínas de Unión al ADN/metabolismo , Femenino , Galactosiltransferasas/genética , Galactosiltransferasas/metabolismo , Sitios Genéticos , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Glutatión Transferasa/genética , Glutatión Transferasa/metabolismo , Glicoesfingolípidos/genética , Humanos , Inhibidores de Hidroximetilglutaril-CoA Reductasas/metabolismo , Transportador 1 de Anión Orgánico Específico del Hígado/genética , Transportador 1 de Anión Orgánico Específico del Hígado/metabolismo , Lisofosfolípidos/metabolismo , Masculino , Proteínas de la Membrana/genética , Proteínas de la Membrana/metabolismo , Persona de Mediana Edad , Enfermedad de Parkinson/metabolismo , Monoéster Fosfórico Hidrolasas/genética , Monoéster Fosfórico Hidrolasas/metabolismo , Serina/metabolismo , Serina C-Palmitoiltransferasa/genética , Serina C-Palmitoiltransferasa/metabolismo , Esfingolípidos/química , Esfingosina/análogos & derivados , Esfingosina/metabolismo , Espectrometría de Masas en Tándem , Factores de Transcripción/genética , Factores de Transcripción/metabolismoRESUMEN
BACKGROUND: Family history, and genetic and non-genetic risk factors can stratify women according to their individual risk of developing breast cancer. The extent of overlap between these risk predictors is not clear. METHODS: In this case-only analysis involving 7600 Asian breast cancer patients diagnosed between age 30 and 75 years, we examined identification of high-risk patients based on positive family history, the Gail model 5-year absolute risk [5yAR] above 1.3%, breast cancer predisposition genes (protein-truncating variants [PTV] in ATM, BRCA1, BRCA2, CHEK2, PALB2, BARD1, RAD51C, RAD51D, or TP53), and polygenic risk score (PRS) 5yAR above 1.3%. RESULTS: Correlation between 5yAR (at age of diagnosis) predicted by PRS and the Gail model was low (r=0.27). Fifty-three percent of breast cancer patients (n=4041) were considered high risk by one or more classification criteria. Positive family history, PTV carriership, PRS, or the Gail model identified 1247 (16%), 385 (5%), 2774 (36%), and 1592 (21%) patients who were considered at high risk, respectively. In a subset of 3227 women aged below 50 years, the four models studied identified 470 (15%), 213 (7%), 769 (24%), and 325 (10%) unique patients who were considered at high risk, respectively. For younger women, PRS and PTVs together identified 745 (59% of 1276) high-risk individuals who were not identified by the Gail model or family history. CONCLUSIONS: Family history and genetic and non-genetic risk stratification tools have the potential to complement one another to identify women at high risk.
Asunto(s)
Neoplasias de la Mama , Pueblo Asiatico , Neoplasias de la Mama/diagnóstico , Neoplasias de la Mama/epidemiología , Neoplasias de la Mama/genética , Femenino , Predisposición Genética a la Enfermedad/genética , Humanos , Masculino , Medición de RiesgoRESUMEN
Despite the dramatic underrepresentation of non-European populations in human genetics studies, researchers continue to exclude participants of non-European ancestry, as well as variants rare in European populations, even when these data are available. This practice perpetuates existing research disparities and can lead to important and large effect size associations being missed. Here, we conducted genome-wide association studies (GWAS) of 31 serum and urine biomarker quantitative traits in African (n = 9354), East Asian (n = 2559), and South Asian (n = 9823) ancestry UK Biobank (UKBB) participants. We adjusted for all known GWAS catalog variants for each trait, as well as novel signals identified in a recent European ancestry-focused analysis of UKBB participants. We identify 7 novel signals in African ancestry and 2 novel signals in South Asian ancestry participants (p < 1.61E-10). Many of these signals are highly plausible, including a cis pQTL for the gene encoding gamma-glutamyl transferase and PIEZO1 and G6PD variants with impacts on HbA1c through likely erythrocytic mechanisms. This work illustrates the importance of using the genetic data we already have in diverse populations, with novel discoveries possible in even modest sample sizes.
Asunto(s)
Bancos de Muestras Biológicas/estadística & datos numéricos , Biomarcadores/metabolismo , Estudio de Asociación del Genoma Completo/métodos , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo/genética , Alelos , Pueblo Asiatico/genética , Biomarcadores/sangre , Biomarcadores/orina , Población Negra/genética , Femenino , Frecuencia de los Genes , Predisposición Genética a la Enfermedad/etnología , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Genotipo , Humanos , Masculino , Fenotipo , Reino Unido , Población Blanca/genéticaRESUMEN
Educational attainment is widely used as a surrogate for socioeconomic status (SES). Low SES is a risk factor for hypertension and high blood pressure (BP). To identify novel BP loci, we performed multi-ancestry meta-analyses accounting for gene-educational attainment interactions using two variables, "Some College" (yes/no) and "Graduated College" (yes/no). Interactions were evaluated using both a 1 degree of freedom (DF) interaction term and a 2DF joint test of genetic and interaction effects. Analyses were performed for systolic BP, diastolic BP, mean arterial pressure, and pulse pressure. We pursued genome-wide interrogation in Stage 1 studies (N = 117 438) and follow-up on promising variants in Stage 2 studies (N = 293 787) in five ancestry groups. Through combined meta-analyses of Stages 1 and 2, we identified 84 known and 18 novel BP loci at genome-wide significance level (P < 5 × 10-8). Two novel loci were identified based on the 1DF test of interaction with educational attainment, while the remaining 16 loci were identified through the 2DF joint test of genetic and interaction effects. Ten novel loci were identified in individuals of African ancestry. Several novel loci show strong biological plausibility since they involve physiologic systems implicated in BP regulation. They include genes involved in the central nervous system-adrenal signaling axis (ZDHHC17, CADPS, PIK3C2G), vascular structure and function (GNB3, CDON), and renal function (HAS2 and HAS2-AS1, SLIT3). Collectively, these findings suggest a role of educational attainment or SES in further dissection of the genetic architecture of BP.
Asunto(s)
Estudio de Asociación del Genoma Completo , Hipertensión , Presión Sanguínea/genética , Epistasis Genética , Sitios Genéticos , Humanos , Hipertensión/genética , Polimorfismo de Nucleótido SimpleRESUMEN
The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of the heritability of this disease. Here, to test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole-genome sequencing in 2,657 European individuals with and without diabetes, and exome sequencing in 12,940 individuals from five ancestry groups. To increase statistical power, we expanded the sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support the idea that lower-frequency variants have a major role in predisposition to type 2 diabetes.
Asunto(s)
Diabetes Mellitus Tipo 2/genética , Predisposición Genética a la Enfermedad/genética , Variación Genética/genética , Alelos , Análisis Mutacional de ADN , Europa (Continente)/etnología , Exoma , Estudio de Asociación del Genoma Completo , Técnicas de Genotipaje , Humanos , Tamaño de la MuestraRESUMEN
Elevated blood pressure (BP), a leading cause of global morbidity and mortality, is influenced by both genetic and lifestyle factors. Cigarette smoking is one such lifestyle factor. Across five ancestries, we performed a genome-wide gene-smoking interaction study of mean arterial pressure (MAP) and pulse pressure (PP) in 129 913 individuals in stage 1 and follow-up analysis in 480 178 additional individuals in stage 2. We report here 136 loci significantly associated with MAP and/or PP. Of these, 61 were previously published through main-effect analysis of BP traits, 37 were recently reported by us for systolic BP and/or diastolic BP through gene-smoking interaction analysis and 38 were newly identified (P < 5 × 10-8, false discovery rate < 0.05). We also identified nine new signals near known loci. Of the 136 loci, 8 showed significant interaction with smoking status. They include CSMD1 previously reported for insulin resistance and BP in the spontaneously hypertensive rats. Many of the 38 new loci show biologic plausibility for a role in BP regulation. SLC26A7 encodes a chloride/bicarbonate exchanger expressed in the renal outer medullary collecting duct. AVPR1A is widely expressed, including in vascular smooth muscle cells, kidney, myocardium and brain. FHAD1 is a long non-coding RNA overexpressed in heart failure. TMEM51 was associated with contractile function in cardiomyocytes. CASP9 plays a central role in cardiomyocyte apoptosis. Identified only in African ancestry were 30 novel loci. Our findings highlight the value of multi-ancestry investigations, particularly in studies of interaction with lifestyle factors, where genomic and lifestyle differences may contribute to novel findings.
Asunto(s)
Presión Arterial/genética , Interacción Gen-Ambiente , Hipertensión/genética , Polimorfismo Genético , Grupos Raciales/genética , Fumar/efectos adversos , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Antiportadores/genética , Presión Sanguínea/genética , Caspasa 9/genética , Etnicidad/genética , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Hipertensión/etiología , Masculino , Proteínas de la Membrana/genética , Persona de Mediana Edad , Receptores de Vasopresinas/genética , Transportadores de Sulfato/genética , Proteínas Supresoras de Tumor/genética , Adulto JovenRESUMEN
Genome-wide association analysis advanced understanding of blood pressure (BP), a major risk factor for vascular conditions such as coronary heart disease and stroke. Accounting for smoking behavior may help identify BP loci and extend our knowledge of its genetic architecture. We performed genome-wide association meta-analyses of systolic and diastolic BP incorporating gene-smoking interactions in 610,091 individuals. Stage 1 analysis examined â¼18.8 million SNPs and small insertion/deletion variants in 129,913 individuals from four ancestries (European, African, Asian, and Hispanic) with follow-up analysis of promising variants in 480,178 additional individuals from five ancestries. We identified 15 loci that were genome-wide significant (p < 5 × 10-8) in stage 1 and formally replicated in stage 2. A combined stage 1 and 2 meta-analysis identified 66 additional genome-wide significant loci (13, 35, and 18 loci in European, African, and trans-ancestry, respectively). A total of 56 known BP loci were also identified by our results (p < 5 × 10-8). Of the newly identified loci, ten showed significant interaction with smoking status, but none of them were replicated in stage 2. Several loci were identified in African ancestry, highlighting the importance of genetic studies in diverse populations. The identified loci show strong evidence for regulatory features and support shared pathophysiology with cardiometabolic and addiction traits. They also highlight a role in BP regulation for biological candidates such as modulators of vascular structure and function (CDKN1B, BCAR1-CFDP1, PXDN, EEA1), ciliopathies (SDCCAG8, RPGRIP1L), telomere maintenance (TNKS, PINX1, AKTIP), and central dopaminergic signaling (MSRA, EBF2).
Asunto(s)
Presión Sanguínea/genética , Sitios Genéticos , Estudio de Asociación del Genoma Completo , Grupos Raciales/genética , Fumar/genética , Estudios de Cohortes , Diástole/genética , Epistasis Genética , Femenino , Humanos , Masculino , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo/genética , Reproducibilidad de los Resultados , Sístole/genéticaRESUMEN
PURPOSE OF REVIEW: Prevalence of type 2 diabetes (T2D) and progression of complications differ between worldwide populations. While obesity is a major contributing risk factor, variations in physiological manifestations, e.g., developing T2D at lower body mass index in some populations, suggest other contributing factors. Early T2D genetic associations were mostly discovered in European ancestry populations. This review describes the progression of genetic discoveries associated with T2D in individuals of East Asian ancestry in the last 10 years and highlights the shared genetic susceptibility between the population groups and additional insights into genetic contributions to T2D. RECENT FINDINGS: Through increased sample size and power, new genetic associations with T2D were discovered in East Asian ancestry populations, often with higher allele frequencies than European ancestry populations. As we continue to generate maps of T2D-associated variants across diverse populations, there will be a critical need to expand and diversify other omics resources to enable integration for clinical translation.
Asunto(s)
Diabetes Mellitus Tipo 2 , Pueblo Asiatico/genética , Diabetes Mellitus Tipo 2/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
BACKGROUND: Hypertriglyceridemia has emerged as a critical coronary artery disease (CAD) risk factor. Rare loss-of-function (LoF) variants in apolipoprotein C-III have been reported to reduce triglycerides (TG) and are cardioprotective in American Indians and Europeans. However, there is a lack of data in other Europeans and non-Europeans. Also, whether genetically increased plasma TG due to ApoC-III is causally associated with increased CAD risk is still unclear and inconsistent. The objectives of this study were to verify the cardioprotective role of earlier reported six LoF variants of APOC3 in South Asians and other multi-ethnic cohorts and to evaluate the causal association of TG raising common variants for increasing CAD risk. METHODS: We performed gene-centric and Mendelian randomization analyses and evaluated the role of genetic variation encompassing APOC3 for affecting circulating TG and the risk for developing CAD. RESULTS: One rare LoF variant (rs138326449) with a 37% reduction in TG was associated with lowered risk for CAD in Europeans (p = 0.007), but we could not confirm this association in Asian Indians (p = 0.641). Our data could not validate the cardioprotective role of other five LoF variants analysed. A common variant rs5128 in the APOC3 was strongly associated with elevated TG levels showing a p-value 2.8 × 10- 424. Measures of plasma ApoC-III in a small subset of Sikhs revealed a 37% increase in ApoC-III concentrations among homozygous mutant carriers than the wild-type carriers of rs5128. A genetically instrumented per 1SD increment of plasma TG level of 15 mg/dL would cause a mild increase (3%) in the risk for CAD (p = 0.042). CONCLUSIONS: Our results highlight the challenges of inclusion of rare variant information in clinical risk assessment and the generalizability of implementation of ApoC-III inhibition for treating atherosclerotic disease. More studies would be needed to confirm whether genetically raised TG and ApoC-III concentrations would increase CAD risk.
Asunto(s)
Apolipoproteína C-III/genética , Enfermedad de la Arteria Coronaria/genética , Variación Genética , Anciano , Alelos , Enfermedad de la Arteria Coronaria/etnología , Europa (Continente)/epidemiología , Femenino , Estudios de Asociación Genética , Genotipo , Heterocigoto , Humanos , India/epidemiología , Masculino , Análisis de la Aleatorización Mendeliana , Persona de Mediana Edad , Mutación , Riesgo , Análisis de Secuencia de ADN , Triglicéridos/sangreRESUMEN
A major challenge in evaluating the contribution of rare variants to complex disease is identifying enough copies of the rare alleles to permit informative statistical analysis. To investigate the contribution of rare variants to the risk of type 2 diabetes (T2D) and related traits, we performed deep whole-genome analysis of 1,034 members of 20 large Mexican-American families with high prevalence of T2D. If rare variants of large effect accounted for much of the diabetes risk in these families, our experiment was powered to detect association. Using gene expression data on 21,677 transcripts for 643 pedigree members, we identified evidence for large-effect rare-variant cis-expression quantitative trait loci that could not be detected in population studies, validating our approach. However, we did not identify any rare variants of large effect associated with T2D, or the related traits of fasting glucose and insulin, suggesting that large-effect rare variants account for only a modest fraction of the genetic risk of these traits in this sample of families. Reliable identification of large-effect rare variants will require larger samples of extended pedigrees or different study designs that further enrich for such variants.
Asunto(s)
Diabetes Mellitus Tipo 2/genética , Predisposición Genética a la Enfermedad/genética , Variación Genética , Americanos Mexicanos/genética , Diabetes Mellitus Tipo 2/etnología , Diabetes Mellitus Tipo 2/patología , Salud de la Familia , Femenino , Frecuencia de los Genes , Predisposición Genética a la Enfermedad/etnología , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Humanos , Masculino , Linaje , Fenotipo , Sitios de Carácter Cuantitativo/genética , Secuenciación Completa del Genoma/métodosRESUMEN
Comprehensive metabolite profiling captures many highly heritable traits, including amino acid levels, which are potentially sensitive biomarkers for disease pathogenesis. To better understand the contribution of genetic variation to amino acid levels, we performed single variant and gene-based tests of association between nine serum amino acids (alanine, glutamine, glycine, histidine, isoleucine, leucine, phenylalanine, tyrosine, and valine) and 16.6 million genotyped and imputed variants in 8545 non-diabetic Finnish men from the METabolic Syndrome In Men (METSIM) study with replication in Northern Finland Birth Cohort (NFBC1966). We identified five novel loci associated with amino acid levels (P = < 5×10-8): LOC157273/PPP1R3B with glycine (rs9987289, P = 2.3×10-26); ZFHX3 (chr16:73326579, minor allele frequency (MAF) = 0.42%, P = 3.6×10-9), LIPC (rs10468017, P = 1.5×10-8), and WWOX (rs9937914, P = 3.8×10-8) with alanine; and TRIB1 with tyrosine (rs28601761, P = 8×10-9). Gene-based tests identified two novel genes harboring missense variants of MAF <1% that show aggregate association with amino acid levels: PYCR1 with glycine (Pgene = 1.5×10-6) and BCAT2 with valine (Pgene = 7.4×10-7); neither gene was implicated by single variant association tests. These findings are among the first applications of gene-based tests to identify new loci for amino acid levels. In addition to the seven novel gene associations, we identified five independent signals at established amino acid loci, including two rare variant signals at GLDC (rs138640017, MAF=0.95%, Pconditional = 5.8×10-40) with glycine levels and HAL (rs141635447, MAF = 0.46%, Pconditional = 9.4×10-11) with histidine levels. Examination of all single variant association results in our data revealed a strong inverse relationship between effect size and MAF (Ptrend<0.001). These novel signals provide further insight into the molecular mechanisms of amino acid metabolism and potentially, their perturbations in disease.
Asunto(s)
Aminoácidos/metabolismo , Estudio de Asociación del Genoma Completo/métodos , Finlandia , Frecuencia de los Genes/genética , Genotipo , Humanos , Masculino , Persona de Mediana EdadRESUMEN
Genome-wide association studies (GWASs) have identified many genetic variations associated with type 2 diabetes mellitus (T2DM) in Asians, but understanding the functional genetic variants that influence traits is often a complex process. In this study, fine mapping and other analytical strategies were performed to investigate the effects of G protein signaling modulator 1 (GPSM1) on insulin resistance in skeletal muscle. A total of 128 single-nucleotide polymorphisms (SNPs) within GPSM1 were analysed in 21,897 T2DM cases and 32,710 healthy controls from seven GWASs. The SNP rs28539249 in intron 9 of GPSM1 showed a nominally significant association with T2DM in Asians (OR = 1.07, 95% CI = 1.04-1.10, P < 10-4). The GPSM1 mRNA was increased in skeletal muscle and correlated with T2DM traits across obese mice model. An eQTL for the cis-acting regulation of GPSM1 expression in human skeletal muscle was identified for rs28539249, and the increased GPSM1 expression related with T2DM traits within GEO datasets. Another independent Asian cohort showed that rs28539249 is associated with the skeletal muscle expression of CACFD1, GTF3C5, SARDH, and FAM163B genes, which are functionally enriched for endoplasmic reticulum stress (ERS) and unfolded protein response (UPR) pathways. Moreover, rs28539249 locus was predicted to disrupt regulatory regions in human skeletal muscle with enriched epigenetic marks and binding affinity for CTCF. Supershift EMSA assays followed luciferase assays demonstrated the CTCF specifically binding to rs28539249-C allele leading to decreased transcriptional activity. Thus, the post-GWAS annotation confirmed the Asian-specific association of genetic variant in GPSM1 with T2DM, suggesting a role for the variant in the regulation in skeletal muscle.
Asunto(s)
Diabetes Mellitus Experimental , Diabetes Mellitus Tipo 2 , Predisposición Genética a la Enfermedad , Inhibidores de Disociación de Guanina Nucleótido , Músculo Esquelético/metabolismo , Polimorfismo de Nucleótido Simple , Animales , Pueblo Asiatico , Diabetes Mellitus Experimental/genética , Diabetes Mellitus Experimental/metabolismo , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Estudio de Asociación del Genoma Completo , Inhibidores de Disociación de Guanina Nucleótido/genética , Inhibidores de Disociación de Guanina Nucleótido/metabolismo , Humanos , RatonesRESUMEN
Epidemiology studies suggested that low birthweight was associated with a higher risk of hypertension in later life. However, little is known about the causality of such associations. In our study, we evaluated the causal association of low birthweight with adulthood hypertension following a standard analytic protocol using the study-level data of 183,433 participants from 60 studies (CHARGE-BIG consortium), as well as that with blood pressure using publicly available summary-level genome-wide association data from EGG consortium of 153,781 participants, ICBP consortium and UK Biobank cohort together of 757,601 participants. We used seven SNPs as the instrumental variable in the study-level analysis and 47 SNPs in the summary-level analysis. In the study-level analyses, decreased birthweight was associated with a higher risk of hypertension in adults (the odds ratio per 1 standard deviation (SD) lower birthweight, 1.22; 95% CI 1.16 to 1.28), while no association was found between genetically instrumented birthweight and hypertension risk (instrumental odds ratio for causal effect per 1 SD lower birthweight, 0.97; 95% CI 0.68 to 1.41). Such results were consistent with that from the summary-level analyses, where the genetically determined low birthweight was not associated with blood pressure measurements either. One SD lower genetically determined birthweight was not associated with systolic blood pressure (ß = - 0.76, 95% CI - 2.45 to 1.08 mmHg), 0.06 mmHg lower diastolic blood pressure (ß = - 0.06, 95% CI - 0.93 to 0.87 mmHg), or pulse pressure (ß = - 0.65, 95% CI - 1.38 to 0.69 mmHg, all p > 0.05). Our findings suggest that the inverse association of birthweight with hypertension risk from observational studies was not supported by large Mendelian randomization analyses.
Asunto(s)
Peso al Nacer , Presión Sanguínea/genética , Hipertensión/epidemiología , Hipertensión/genética , Análisis de la Aleatorización Mendeliana/métodos , Adulto , Peso al Nacer/genética , Peso al Nacer/fisiología , Presión Sanguínea/fisiología , Índice de Masa Corporal , Femenino , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Recién Nacido de Bajo Peso , Recién Nacido , Masculino , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
Knowledge of biological relatedness between samples is important for many genetic studies. In large-scale human genetic association studies, the estimated kinship is used to remove cryptic relatedness, control for family structure, and estimate trait heritability. However, estimation of kinship is challenging for sparse sequencing data, such as those from off-target regions in target sequencing studies, where genotypes are largely uncertain or missing. Existing methods often assume accurate genotypes at a large number of markers across the genome. We show that these methods, without accounting for the genotype uncertainty in sparse sequencing data, can yield a strong downward bias in kinship estimation. We develop a computationally efficient method called SEEKIN to estimate kinship for both homogeneous samples and heterogeneous samples with population structure and admixture. Our method models genotype uncertainty and leverages linkage disequilibrium through imputation. We test SEEKIN on a whole exome sequencing dataset (WES) of Singapore Chinese and Malays, which involves substantial population structure and admixture. We show that SEEKIN can accurately estimate kinship coefficient and classify genetic relatedness using off-target sequencing data down sampled to ~0.15X depth. In application to the full WES dataset without down sampling, SEEKIN also outperforms existing methods by properly analyzing shallow off-target data (~0.75X). Using both simulated and real phenotypes, we further illustrate how our method improves estimation of trait heritability for WES studies.
Asunto(s)
Bases de Datos Genéticas , Genética de Población/métodos , Genoma Humano , Análisis de Secuencia de ADN , Pueblo Asiatico/genética , Biología Computacional , Exoma , Estudios de Asociación Genética , Genotipo , Técnicas de Genotipaje , Humanos , Desequilibrio de Ligamiento , Modelos Genéticos , Programas InformáticosRESUMEN
Lipid and lipoprotein subclasses are associated with metabolic and cardiovascular diseases, yet the genetic contributions to variability in subclass traits are not fully understood. We conducted single-variant and gene-based association tests between 15.1M variants from genome-wide and exome array and imputed genotypes and 72 lipid and lipoprotein traits in 8,372 Finns. After accounting for 885 variants at 157 previously identified lipid loci, we identified five novel signals near established loci at HIF3A, ADAMTS3, PLTP, LCAT, and LIPG. Four of the signals were identified with a low-frequency (0.005Asunto(s)
Frecuencia de los Genes/genética
, Metabolismo de los Lípidos/genética
, Lípidos/genética
, Lipoproteínas/genética
, Polimorfismo de Nucleótido Simple/genética
, Triglicéridos/genética
, Población Blanca/genética
, HDL-Colesterol/genética
, Exoma/genética
, Finlandia
, Estudio de Asociación del Genoma Completo/métodos
, Genotipo
, Humanos
, Masculino
, Persona de Mediana Edad
, Análisis de Componente Principal/métodos