1 - 20 of 238
1.
medRxiv ; 2024 May 10.
Article En | MEDLINE | ID: mdl-38766261

The etiology of prostate cancer, the second most common cancer in men globally, has a strong heritable component. While rare coding germline variants in several genes have been identified as risk factors from candidate gene and linkage studies, the exome-wide spectrum of causal rare variants remains to be fully explored. To more comprehensively address their contribution, we analysed data from 37,184 prostate cancer cases and 331,329 male controls from five cohorts with germline exome/genome sequencing and one cohort with imputed array data from a population enriched in low-frequency deleterious variants. Our gene-level collapsing analysis revealed that rare damaging variants in SAMHD1 as well as genes in the DNA damage response pathway (BRCA2, ATM and CHEK2) are associated with the risk of overall prostate cancer. We also found that rare damaging variants in AOX1 and BRCA2 were associated with increased severity of prostate cancer in a case-only analysis of aggressive versus non-aggressive prostate cancer. At the single-variant level, we found rare non-synonymous variants in three genes (HOXB13, CHEK2, BIK) significantly associated with increased risk of overall prostate cancer and in four genes (ANO7, SPDL1, AR, TERT) with decreased risk. Altogether, this study provides deeper insights into the genetic architecture and biological basis of prostate cancer risk and severity.

2.
Nature ; 2024 May 20.
Article En | MEDLINE | ID: mdl-38768635

Rare coding variants that significantly impact function provide insights into the biology of a gene [1-3]. However, ascertaining their frequency requires large sample sizes [4-8]. Here, we present a catalogue of human protein-coding variation, derived from exome sequencing of 983,578 individuals across diverse populations. 23% of the Regeneron Genetics Center Million Exome data (RGC-ME) comes from non-European individuals of African, East Asian, Indigenous American, Middle Eastern, and South Asian ancestry. This catalogue includes over 10.4 million missense and 1.1 million predicted loss-of-function (pLOF) variants. We identify individuals with rare biallelic pLOF variants in 4,848 genes, 1,751 of which have not been previously reported. From precise quantitative estimates of selection against heterozygous loss-of-function, we identify 3,988 loss-of-function intolerant genes, including 86 that were previously assessed as tolerant and 1,153 lacking established disease annotation. We also define regions of missense depletion at high resolution. Notably, 1,482 genes have regions depleted of missense variants despite being tolerant to pLOF variants. Finally, we estimate that 3% of individuals have a clinically actionable genetic variant, and that 11,773 variants reported in ClinVar with unknown significance are likely to be deleterious cryptic splice sites. To facilitate variant interpretation and genetics-informed precision medicine, we make this important resource of coding variation from the RGC-ME accessible via a public variant allele frequency browser.

3.
Nat Genet ; 56(4): 579-584, 2024 Apr.
Article En | MEDLINE | ID: mdl-38575728

Obesity is a major risk factor for many common diseases and has a substantial heritable component. To identify new genetic determinants, we performed exome-sequence analyses for adult body mass index (BMI) in up to 587,027 individuals. We identified rare loss-of-function variants in two genes (BSN and APBA1) with effects substantially larger than those of well-established obesity genes such as MC4R. In contrast to most other obesity-related genes, rare variants in BSN and APBA1 were not associated with normal variation in childhood adiposity. Furthermore, BSN protein-truncating variants (PTVs) magnified the influence of common genetic variants associated with BMI, with a common variant polygenic score exhibiting an effect twice as large in BSN PTV carriers than in noncarriers. Finally, we explored the plasma proteomic signatures of BSN PTV carriers as well as the functional consequences of BSN deletion in human induced pluripotent stem cell-derived hypothalamic neurons. Collectively, our findings implicate degenerative processes in synaptic function in the etiology of adult-onset obesity.


Diabetes Mellitus, Type 2 , Induced Pluripotent Stem Cells , Liver Diseases , Nerve Tissue Proteins , Adult , Humans , Adaptor Proteins, Signal Transducing/genetics , Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease , Nerve Tissue Proteins/genetics , Obesity/complications , Obesity/genetics , Proteomics
5.
Br J Gen Pract ; 2024 Feb 19.
Article En | MEDLINE | ID: mdl-38373851

BACKGROUND: UK cardiovascular disease (CVD) incidence and mortality have declined in recent decades but socioeconomic inequalities persist. AIM: To present a new CVD model, and project health outcomes and the impact of guideline-recommended statin treatment across quintiles of socioeconomic deprivation in the UK. DESIGN AND SETTING: A lifetime microsimulation model was developed using 117 896 participants in 16 statin trials, 501 854 UK Biobank (UKB) participants, and quality-of-life data from national health surveys. METHOD: A CVD microsimulation model was developed using risk equations for myocardial infarction, stroke, coronary revascularisation, cancer, and vascular and non-vascular death, estimated using trial data. The authors calibrated and further developed this model in the UKB cohort, including further characteristics and a diabetes risk equation, and validated the model in UKB and Whitehall II cohorts. The model was used to predict CVD incidence, life expectancy, quality-adjusted life years (QALYs), and the impact of UK guideline-recommended statin treatment across socioeconomic deprivation quintiles. RESULTS: Age, sex, socioeconomic deprivation, smoking, hypertension, diabetes, and cardiovascular events were key CVD risk determinants. Model-predicted event rates corresponded well to observed rates across participant categories. The model projected strong gradients in remaining life expectancy, with 4-5-year (5-8 QALYs) gaps between the least and most socioeconomically deprived quintiles. Guideline-recommended statin treatment was projected to increase QALYs, with larger gains in quintiles of higher deprivation. CONCLUSION: The study demonstrated the potential of guideline-recommended statin treatment to reduce socioeconomic inequalities. This CVD model is a novel resource for individualised long-term projections of health outcomes of CVD treatments.

6.
Eur J Prev Cardiol ; 2024 Jan 10.
Article En | MEDLINE | ID: mdl-38198221

AIM: Lowering low-density lipoprotein cholesterol (LDL-C) through PCSK9 inhibition represents a new therapeutic approach to preventing and treating cardiovascular disease (CVD). Phenome-wide analyses of PCSK9 genetic variants in large biobanks can help to identify unexpected effects of PCSK9 inhibition. METHODS: In the prospective China Kadoorie Biobank, we constructed a genetic score using three variants at the PCSK9 locus associated with directly-measured LDL-C (PCSK9-GS). Logistic regression gave estimated odds ratios (ORs) for PCSK9-GS associations with CVD and non-CVD outcomes, scaled to 1 SD lower LDL-C. RESULTS: PCSK9-GS was associated with lower risks of carotid plaque (n=8340 cases; OR=0.61 [95% CI: 0.45-0.83]; P=0.0015), major occlusive vascular events (n=15,752; 0.80 [0.67-0.95]; P=0.011), and ischaemic stroke (n=11,467; 0.80 [0.66-0.98]; P=0.029). However, PCSK9-GS was also associated with higher risk of hospitalisation with chronic obstructive pulmonary disease (COPD: n=6836; 1.38 [1.08-1.76]; P=0.0089), and with even higher risk of fatal exacerbations among individuals with pre-existing COPD (n=730; 3.61 [1.71-7.60]; P=7.3×10⁻⁴). We also replicated associations for a PCSK9 variant, reported in UK Biobank, with increased risks of acute upper respiratory tract infection (URTI) (pooled OR after meta-analysis 1.87 [1.38-2.54]; P=5.4×10⁻⁵) and self-reported asthma (pooled OR 1.17 [1.04-1.30]; P=0.0071). There was no association of a polygenic LDL-C score with COPD hospitalisation, COPD exacerbation, or URTI. CONCLUSIONS: LDL-C-lowering PCSK9 genetic variants are associated with lower risk of subclinical and clinical atherosclerotic vascular disease, but higher risks of respiratory diseases. Pharmacovigilance studies may be required to monitor patients treated with therapeutic PCSK9 inhibitors for exacerbations of respiratory diseases or respiratory tract infections.


Genetic analyses of over 100,000 participants of the China Kadoorie Biobank, mimicking the effect of new drugs intended to reduce cholesterol by targeting the PCSK9 protein, have identified potential severe effects of lower PCSK9 activity in patients with existing respiratory disease. PCSK9 genetic variants that are associated with lower cholesterol and reduced rates of cardiovascular disease are also associated with increased risk of a range of respiratory diseases, including asthma, upper respiratory tract infections, and hospitalisation with chronic obstructive pulmonary disease (COPD). These genetic variants are not associated with whether or not individuals have COPD; instead they are specifically associated with an increase in the chance of those who already have COPD being hospitalised and even dying, suggesting that careful monitoring of such patients should be considered during development of and treatment with anti-PCSK9 medication.
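
As a rough consistency check on figures like those in the abstract above, a two-sided P value can be approximated from an odds ratio and its 95% CI by assuming normality on the log-odds scale. The sketch below is illustrative only: the function name `p_from_or_ci` is hypothetical, and this is not the authors' analysis code.

```python
import math


def p_from_or_ci(odds_ratio, ci_lower, ci_upper, z_crit=1.96):
    """Approximate z statistic and two-sided P value from an OR and its 95% CI.

    Assumes the CI is symmetric on the log scale (normal approximation).
    """
    # Standard error of log(OR), recovered from the CI width on the log scale.
    se = (math.log(ci_upper) - math.log(ci_lower)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided tail probability of a standard normal: erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p


# Carotid plaque estimate quoted in the abstract: OR 0.61 (95% CI 0.45-0.83).
z, p = p_from_or_ci(0.61, 0.45, 0.83)
print(f"z = {z:.2f}, approximate P = {p:.4f}")
```

Applied to the carotid plaque estimate, this approximation gives a P value close to the reported 0.0015. Small differences for other outcomes are expected because the published ORs and CIs are rounded.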

7.
Sci Transl Med ; 16(729): eadf4428, 2024 Jan 10.
Article En | MEDLINE | ID: mdl-38198570

Population-based prospective studies, such as UK Biobank, are valuable for generating and testing hypotheses about the potential causes of human disease. We describe how UK Biobank's study design, data access policies, and approaches to statistical analysis can help to minimize error and improve the interpretability of research findings, with implications for other population-based prospective studies being established worldwide.


Biological Specimen Banks , UK Biobank , Humans , Prospective Studies , Research Design , Data Analysis
9.
J Am Coll Cardiol ; 82(20): 1906-1920, 2023 11 14.
Article En | MEDLINE | ID: mdl-37940228

BACKGROUND: Integrated analyses of plasma proteomic and genetic markers in prospective studies can clarify the causal relevance of proteins and discover novel targets for ischemic heart disease (IHD) and other diseases. OBJECTIVES: The purpose of this study was to examine associations of proteomics and genetics data with IHD in population studies to discover novel preventive treatments. METHODS: We conducted a nested case-cohort study in the China Kadoorie Biobank (CKB) involving 1,971 incident IHD cases and 2,001 subcohort participants who were genotyped and free of prior cardiovascular disease. We measured 1,463 proteins in the stored baseline samples using the OLINK EXPLORE panel. Cox regression yielded adjusted HRs for IHD associated with individual proteins after accounting for multiple testing. Moreover, cis-protein quantitative trait loci (pQTLs) identified for proteins in genome-wide association studies of CKB and of UK Biobank were used as instrumental variables in separate 2-sample Mendelian randomization (MR) studies involving the global CARDIOGRAM+C4D consortium (210,842 IHD cases and 1,378,170 controls). RESULTS: Overall, 361 proteins were significantly associated at false discovery rate <0.05 with risk of IHD (349 positively, 12 inversely) in CKB, including N-terminal prohormone of brain natriuretic peptide and proprotein convertase subtilisin/kexin type 9. Of these 361 proteins, 212 had cis-pQTLs in CKB, and MR analyses of 198 variants in CARDIOGRAM+C4D identified 13 proteins that showed potentially causal associations with IHD. Independent MR analyses of 307 cis-pQTLs identified in Europeans replicated associations for 4 proteins (FURIN, proteinase-activated receptor-1, asialoglycoprotein receptor-1, and matrix metalloproteinase-3). Further downstream analyses showed that FURIN, which is highly expressed in endothelial cells, is a potential novel target and matrix metalloproteinase-3 a potential repurposing target for IHD. CONCLUSIONS: Integrated analyses of proteomic and genetic data in Chinese and European adults provided causal support for FURIN and multiple other proteins as potential novel drug targets for treatment of IHD.


Furin , Myocardial Ischemia , Adult , Humans , Cohort Studies , Endothelial Cells , Genome-Wide Association Study , Matrix Metalloproteinases , Myocardial Ischemia/drug therapy , Myocardial Ischemia/genetics , Myocardial Ischemia/epidemiology , Prospective Studies , Proteomics , Risk Factors , Case-Control Studies
10.
Article En | MEDLINE | ID: mdl-37923370

BACKGROUND: Little is known about the persistence of antibodies after the first year following SARS-CoV-2 infection. We aimed to determine the proportion of individuals that maintain detectable levels of SARS-CoV-2 antibodies over an 18-month period following infection. METHODS: Population-based prospective study of 20 000 UK Biobank participants and their adult relatives recruited in May 2020. The proportion of SARS-CoV-2 cases testing positive for immunoglobulin G (IgG) antibodies against the spike protein (IgG-S), and the nucleocapsid protein (IgG-N), was calculated at varying intervals following infection. RESULTS: Overall, 20 195 participants were recruited. Their median age was 56 years (IQR 39-68), 56% were female and 88% were of white ethnicity. The proportion of SARS-CoV-2 cases with IgG-S antibodies following infection remained high (92%, 95% CI 90%-93%) at 6 months after infection. Levels of IgG-N antibodies following infection gradually decreased from 92% (95% CI 88%-95%) at 3 months to 72% (95% CI 70%-75%) at 18 months. There was no strong evidence of heterogeneity in antibody persistence by age, sex, ethnicity or socioeconomic deprivation. CONCLUSION: This study adds to the limited evidence on the long-term persistence of antibodies following SARS-CoV-2 infection, with likely implications for waning immunity following infection and the use of IgG-N in population surveys.

11.
Nature ; 622(7984): 784-793, 2023 Oct.
Article En | MEDLINE | ID: mdl-37821707

The Mexico City Prospective Study is a prospective cohort of more than 150,000 adults recruited two decades ago from the urban districts of Coyoacán and Iztapalapa in Mexico City [1]. Here we generated genotype and exome-sequencing data for all individuals and whole-genome sequencing data for 9,950 selected individuals. We describe high levels of relatedness and substantial heterogeneity in ancestry composition across individuals. Most sequenced individuals had admixed Indigenous American, European and African ancestry, with extensive admixture from Indigenous populations in central, southern and southeastern Mexico. Indigenous Mexican segments of the genome had lower levels of coding variation but an excess of homozygous loss-of-function variants compared with segments of African and European origin. We estimated ancestry-specific allele frequencies at 142 million genomic variants, with an effective sample size of 91,856 for Indigenous Mexican ancestry at exome variants, all available through a public browser. Using whole-genome sequencing, we developed an imputation reference panel that outperforms existing panels at common variants in individuals with high proportions of central, southern and southeastern Indigenous Mexican ancestry. Our work illustrates the value of genetic studies in diverse populations and provides foundational imputation and allele frequency resources for future genetic studies in Mexico and in the United States, where the Hispanic/Latino population is predominantly of Mexican descent.


Exome Sequencing , Genome, Human , Genotype , Hispanic or Latino , Adult , Humans , Africa/ethnology , Americas/ethnology , Europe/ethnology , Gene Frequency/genetics , Genetics, Population , Genome, Human/genetics , Genotyping Techniques , Hispanic or Latino/genetics , Homozygote , Loss of Function Mutation/genetics , Mexico , Prospective Studies
12.
Int J Epidemiol ; 52(6): 1862-1869, 2023 Dec 25.
Article En | MEDLINE | ID: mdl-37898918

BACKGROUND: The relevance of folic acid for stroke prevention in low-folate populations such as in China is uncertain. Genetic studies of the methylenetetrahydrofolate reductase (MTHFR) C677T polymorphism, which increases plasma homocysteine (tHcy) levels, could clarify the causal relevance of elevated tHcy levels for stroke, ischaemic heart disease (IHD) and other diseases in populations without folic acid fortification. METHODS: In the prospective China Kadoorie Biobank, 156 253 participants were genotyped for MTHFR and 12 240 developed a stroke during the 12-year follow-up. Logistic regression was used to estimate region-specific odds ratios (ORs) for total stroke and stroke types, IHD and other diseases comparing TT genotype for MTHFR C677T (two thymine alleles at position 677 of MTHFR C677T polymorphism) vs CC (two cytosine alleles) after adjustment for age and sex, and these were combined using inverse-variance weighting. RESULTS: Overall, 21% of participants had TT genotypes, but this varied from 5% to 41% across the 10 study regions. Individuals with TT genotypes had 13% (adjusted OR 1.13, 95% CI 1.09-1.17) higher risks of any stroke [with a 2-fold stronger association with intracerebral haemorrhage (1.24, 1.17-1.32) than for ischaemic stroke (1.11, 1.07-1.15)] than the reference CC genotype. In contrast, MTHFR C677T was unrelated to risk of IHD or any other non-vascular diseases, including cancer, diabetes and chronic obstructive lung disease. CONCLUSIONS: In Chinese adults, the MTHFR C677T polymorphism was associated with higher risks of stroke. The findings warrant corroboration by further trials of folic acid and implementation of mandatory folic acid fortification programmes for stroke prevention in low-folate populations.


Brain Ischemia , Coronary Artery Disease , Stroke , Adult , Humans , Methylenetetrahydrofolate Reductase (NADPH2)/genetics , Prospective Studies , Stroke/epidemiology , Stroke/genetics , Folic Acid , Genotype , Homocysteine/genetics
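
The abstract above states that region-specific odds ratios were combined using inverse-variance weighting. A minimal sketch of that general technique follows, assuming each regional estimate is supplied as an OR with a 95% CI. The function name `pool_inverse_variance` and the input values are hypothetical illustrations, not data or code from the study.

```python
import math


def pool_inverse_variance(estimates, z_crit=1.96):
    """Fixed-effect inverse-variance pooling of odds ratios.

    estimates: iterable of (odds_ratio, ci_lower, ci_upper) with 95% CIs.
    Returns (pooled_or, pooled_ci_lower, pooled_ci_upper).
    """
    weighted_sum = 0.0
    total_weight = 0.0
    for odds_ratio, ci_lower, ci_upper in estimates:
        log_or = math.log(odds_ratio)
        # Standard error of log(OR) recovered from the CI width.
        se = (math.log(ci_upper) - math.log(ci_lower)) / (2 * z_crit)
        weight = 1.0 / se ** 2  # inverse of the variance
        weighted_sum += weight * log_or
        total_weight += weight
    pooled_log_or = weighted_sum / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return (
        math.exp(pooled_log_or),
        math.exp(pooled_log_or - z_crit * pooled_se),
        math.exp(pooled_log_or + z_crit * pooled_se),
    )


# Hypothetical regional estimates (made up for illustration only).
regions = [(1.10, 0.98, 1.23), (1.18, 1.05, 1.33), (1.12, 0.95, 1.32)]
pooled_or, lower, upper = pool_inverse_variance(regions)
print(f"pooled OR {pooled_or:.2f} (95% CI {lower:.2f}-{upper:.2f})")
```

More precise regional estimates (narrower CIs) receive larger weights, so the pooled OR lies closest to the best-estimated regions and its CI is narrower than any single region's.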
13.
Nat Commun ; 14(1): 5419, 2023 09 05.
Article En | MEDLINE | ID: mdl-37669985

Recently, large-scale genomic projects such as All of Us and the UK Biobank have introduced a new research paradigm where data are stored centrally in cloud-based Trusted Research Environments (TREs). To characterize the advantages and drawbacks of different TRE attributes in facilitating cross-cohort analysis, we conduct a Genome-Wide Association Study of standard lipid measures using two approaches: meta-analysis and pooled analysis. Comparison of full summary data from both approaches with an external study shows strong correlation of known loci with lipid levels (R² ~ 83-97%). Importantly, 90 variants meet the significance threshold only in the meta-analysis and 64 variants are significant only in pooled analysis, with approximately 20% of variants in each of those groups being most prevalent in non-European, non-Asian ancestry individuals. These findings have important implications, as technical and policy choices lead to cross-cohort analyses generating similar, but not identical results, particularly for non-European ancestral populations.


Genome-Wide Association Study , Population Health , Humans , Genomics , Policy , Lipids
14.
Eur J Epidemiol ; 38(10): 1089-1103, 2023 Oct.
Article En | MEDLINE | ID: mdl-37676424

Adiposity is associated with multiple diseases and traits, but little is known about the causal relevance and mechanisms underlying these associations. Large-scale proteomic profiling, especially when integrated with genetic data, can clarify mechanisms linking adiposity with disease outcomes. We examined the associations of adiposity with plasma levels of 1463 proteins in 3977 Chinese adults, using measured and genetically-instrumented BMI. We further used two-sample bi-directional MR analyses to assess if certain proteins influenced adiposity, along with other (e.g. enrichment) analyses to clarify possible mechanisms underlying the observed associations. Overall, the mean (SD) baseline BMI was 23.9 (3.3) kg/m², with only 6% being obese (i.e. BMI ≥ 30 kg/m²). Measured and genetically-instrumented BMI were significantly associated at FDR < 0.05 with levels of 1096 (positive/inverse: 826/270) and 307 (positive/inverse: 270/37) proteins, respectively, with FABP4, LEP, IL1RN, LSP1, GOLM2, TNFRSF6B, and ADAMTS15 showing the strongest positive and PON3, NCAN, LEPR, IGFBP2 and MOG showing the strongest inverse genetic associations. These associations were largely linear, in adiposity-to-protein direction, and replicated (> 90%) in Europeans of UKB (mean BMI 27.4 kg/m²). Enrichment analyses of the top > 50 BMI-associated proteins demonstrated their involvement in atherosclerosis, lipid metabolism, tumour progression and inflammation. Two-sample bi-directional MR analyses using cis-pQTLs identified in CKB GWAS found that eight proteins (ITIH3, LRP11, SCAMP3, NUDT5, OGN, EFEMP1, TXNDC15, PRDX6) significantly affect levels of BMI, with NUDT5 also showing bi-directional association. The findings among relatively lean Chinese adults identified novel pathways by which adiposity may increase disease risks and novel potential targets for treatment of obesity and obesity-related diseases.


Adiposity , East Asian People , Humans , Adult , Adiposity/genetics , Proteomics , Body Mass Index , Obesity/genetics , Obesity/complications , Mendelian Randomization Analysis , Polymorphism, Single Nucleotide , Extracellular Matrix Proteins/genetics , Carrier Proteins/genetics , Membrane Proteins/genetics
15.
J Epidemiol Community Health ; 78(1): 3-10, 2023 12 08.
Article En | MEDLINE | ID: mdl-37699665

BACKGROUND: The social determinants of ethnic disparities in risk of SARS-CoV-2 infection during the first wave of the pandemic in the UK remain unclear. METHODS: In May 2020, a total of 20 195 adults were recruited from the general population into the UK Biobank SARS-CoV-2 Serology Study. Between mid-May and mid-November 2020, participants provided monthly blood samples. At the end of the study, participants completed a questionnaire on social factors during different periods of the pandemic. Logistic regression yielded ORs for the association between ethnicity and SARS-CoV-2 immunoglobulin G antibodies (indicating prior infection) using blood samples collected in July 2020, immediately after the first wave. RESULTS: After exclusions, 14 571 participants (mean age 56; 58% women) returned a blood sample in July, of whom 997 (7%) had SARS-CoV-2 antibodies. Seropositivity was strongly related to ethnicity: compared with those of White ethnicity, ORs (adjusted for age and sex) for Black, South Asian, Chinese, Mixed and Other ethnic groups were 2.66 (95% CI 1.94-3.60), 1.66 (1.15-2.34), 0.99 (0.42-1.99), 1.42 (1.03-1.91) and 1.79 (1.27-2.47), respectively. Additional adjustment for social factors reduced the overall likelihood ratio statistics for ethnicity by two-thirds (67%; mostly from occupational factors and UK region of residence); more precise measurement of social factors may have further reduced the association. CONCLUSIONS: This study identifies social factors that are likely to account for much of the ethnic disparities in SARS-CoV-2 infection during the first wave in the UK, and highlights the particular relevance of occupation and residential region in the pathway between ethnicity and SARS-CoV-2 infection.


COVID-19 , Adult , Humans , Female , Middle Aged , Male , SARS-CoV-2 , Social Factors , Biological Specimen Banks , Social Determinants of Health , Surveys and Questionnaires
16.
Cell Genom ; 3(8): 100361, 2023 Aug 09.
Article En | MEDLINE | ID: mdl-37601966

The China Kadoorie Biobank (CKB) is a population-based prospective cohort of >512,000 adults recruited from 2004 to 2008 from 10 geographically diverse regions across China. Detailed data from questionnaires and physical measurements were collected at baseline, with additional measurements at three resurveys involving ∼5% of surviving participants. Analyses of genome-wide genotyping, for >100,000 participants using custom-designed Axiom arrays, reveal extensive relatedness, recent consanguinity, and signatures reflecting large-scale population movements from recent Chinese history. Systematic genome-wide association studies of incident disease, captured through electronic linkage to death and disease registries and to the national health insurance system, replicate established disease loci and identify 14 novel disease associations. Together with studies of candidate drug targets and disease risk factors and contributions to international genetics consortia, these demonstrate the breadth, depth, and quality of the CKB data. Ongoing high-throughput omics assays of collected biosamples and planned whole-genome sequencing will further enhance the scientific value of this biobank.

17.
Lancet Public Health ; 8(9): e670-e679, 2023 09.
Article En | MEDLINE | ID: mdl-37633676

BACKGROUND: Social inequalities in adult mortality have been reported across diverse populations, but there is no large-scale prospective evidence from Mexico. We aimed to quantify social, including educational, inequalities in mortality among adults in Mexico City. METHODS: The Mexico City Prospective Study recruited 150 000 adults aged 35 years and older from two districts of Mexico City between 1998 and 2004. Participants were followed up until Jan 1, 2021 for cause-specific mortality. Cox regression analysis yielded rate ratios (RRs) for death at ages 35-74 years associated with education and examined, in exploratory analyses, the mediating effects of lifestyle and related risk factors. FINDINGS: Among 143 478 participants aged 35-74 years, there was a strong inverse association of education with premature death. Compared with participants with tertiary education, after adjustment for age and sex, those with no education had about twice the mortality rate (RR 1·84; 95% CI 1·71-1·98), equivalent to approximately 6 years lower life expectancy, with an RR of 1·78 (1·67-1·90) among participants with incomplete primary, 1·62 (1·53-1·72) with complete primary, and 1·34 (1·25-1·42) with secondary education. Education was most strongly associated with death from renal disease and acute diabetic crises (RR 3·65; 95% CI 3·05-4·38 for no education vs tertiary education) and from infectious diseases (2·67; 2·00-3·56), but there was an apparent higher rate of death from all specific causes studied with lower education, with the exception of cancer for which there was little association. Lifestyle factors (ie, smoking, alcohol drinking, and leisure time physical activity) and related physiological correlates (ie, adiposity, diabetes, and blood pressure) accounted for about four-fifths of the association of education with premature mortality. 
INTERPRETATION: In this Mexican population there were marked educational inequalities in premature adult mortality, which appeared to largely be accounted for by lifestyle and related risk factors. Effective interventions to reduce these risk factors could reduce inequalities and have a major impact on premature mortality. FUNDING: Wellcome Trust, the Mexican Health Ministry, the National Council of Science and Technology for Mexico, Cancer Research UK, British Heart Foundation, and the UK Medical Research Council Population Health Research Unit.


Mortality, Premature , Adult , Humans , Prospective Studies , Cause of Death , Mexico/epidemiology , Educational Status
18.
Nat Genet ; 55(7): 1138-1148, 2023 07.
Article En | MEDLINE | ID: mdl-37308787

Human genetic studies of smoking behavior have been thus far largely limited to common variants. Studying rare coding variants has the potential to identify drug targets. We performed an exome-wide association study of smoking phenotypes in up to 749,459 individuals and discovered a protective association in CHRNB2, encoding the β2 subunit of the α4β2 nicotinic acetylcholine receptor. Rare predicted loss-of-function and likely deleterious missense variants in CHRNB2 in aggregate were associated with a 35% decreased odds for smoking heavily (odds ratio (OR) = 0.65, confidence interval (CI) = 0.56-0.76, P = 1.9 × 10⁻⁸). An independent common variant association in the protective direction (rs2072659; OR = 0.96; CI = 0.94-0.98; P = 5.3 × 10⁻⁶) was also evident, suggesting an allelic series. Our findings in humans align with decades-old experimental observations in mice that β2 loss abolishes nicotine-mediated neuronal responses and attenuates nicotine self-administration. Our genetic discovery will inspire future drug designs targeting CHRNB2 in the brain for the treatment of nicotine addiction.


Nicotine , Tobacco Use Disorder , Humans , Animals , Mice , Smoking/genetics , Tobacco Use Disorder/genetics , Phenotype , Odds Ratio
19.
Nat Med ; 29(6): 1476-1486, 2023 Jun.
Article En | MEDLINE | ID: mdl-37291211

Alcohol consumption accounts for ~3 million annual deaths worldwide, but uncertainty persists about its relationships with many diseases. We investigated the associations of alcohol consumption with 207 diseases in the 12-year China Kadoorie Biobank of >512,000 adults (41% men), including 168,050 genotyped for ALDH2-rs671 and ADH1B-rs1229984, with >1.1 million ICD-10 coded hospitalized events. At baseline, 33% of men drank alcohol regularly. Among men, alcohol intake was positively associated with 61 diseases, including 33 not defined by the World Health Organization as alcohol-related, such as cataract (n = 2,028; hazard ratio 1.21; 95% confidence interval 1.09-1.33, per 280 g per week) and gout (n = 402; 1.57, 1.33-1.86). Genotype-predicted mean alcohol intake was positively associated with established (n = 28,564; 1.14, 1.09-1.20) and new alcohol-associated (n = 16,138; 1.06, 1.01-1.12) diseases, and with specific diseases such as liver cirrhosis (n = 499; 2.30, 1.58-3.35), stroke (n = 12,176; 1.38, 1.27-1.49) and gout (n = 338; 2.33, 1.49-3.62), but not ischemic heart disease (n = 8,408; 1.04, 0.94-1.14). Among women, 2% drank alcohol, resulting in low power to assess associations of self-reported alcohol intake with disease risks, but genetic findings in women suggested the excess male risks were not due to pleiotropic genotypic effects. Among Chinese men, alcohol consumption increased multiple disease risks, highlighting the need to strengthen preventive measures to reduce alcohol intake.


Alcohol Drinking , East Asian People , Gout , Adult , Female , Humans , Male , Alcohol Drinking/adverse effects , Alcohol Drinking/epidemiology , Alcohol Drinking/ethnology , Alcohol Drinking/genetics , Aldehyde Dehydrogenase, Mitochondrial/genetics , East Asian People/statistics & numerical data , Ethanol , Genotype , Risk Factors , Disease/ethnology , Disease/etiology , Disease/genetics , China/epidemiology
20.
bioRxiv ; 2023 Nov 02.
Article En | MEDLINE | ID: mdl-37214792

Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants. Individuals of African, Admixed American, East Asian, Middle Eastern, and South Asian ancestry account for 20% of this Exome dataset. Our catalog of variants includes approximately 10.5 million missense (54% novel) and 1.1 million predicted loss-of-function (pLOF) variants (65% novel, 53% observed only once). We identified individuals with rare homozygous pLOF variants in 4,874 genes, and for 1,838 of these this work is the first to document at least one pLOF homozygote. Additional insights from the RGC-ME dataset include 1) improved estimates of selection against heterozygous loss-of-function and identification of 3,459 genes intolerant to loss-of-function, 83 of which were previously assessed as tolerant to loss-of-function and 1,241 that lack disease annotations; 2) identification of regions depleted of missense variation in 457 genes that are tolerant to loss-of-function; 3) functional interpretation for 10,708 variants of unknown or conflicting significance reported in ClinVar as cryptic splice sites using splicing score thresholds based on empirical variant deleteriousness scores derived from RGC-ME; and 4) an observation that approximately 3% of sequenced individuals carry a clinically actionable genetic variant in the ACMG SF 3.1 list of genes. We make this important resource of coding variation available to the public through a variant allele frequency browser. 
We anticipate that this report and the RGC-ME dataset will serve as a valuable reference for understanding rare coding variation and help advance precision medicine efforts.

...