RESUMO
Rare coding variants that substantially affect function provide insights into the biology of a gene1-3. However, ascertaining the frequency of such variants requires large sample sizes4-8. Here we present a catalogue of human protein-coding variation, derived from exome sequencing of 983,578 individuals across diverse populations. In total, 23% of the Regeneron Genetics Center Million Exome (RGC-ME) data come from individuals of African, East Asian, Indigenous American, Middle Eastern and South Asian ancestry. The catalogue includes more than 10.4 million missense and 1.1 million predicted loss-of-function (pLOF) variants. We identify individuals with rare biallelic pLOF variants in 4,848 genes, 1,751 of which have not been previously reported. From precise quantitative estimates of selection against heterozygous loss of function (LOF), we identify 3,988 LOF-intolerant genes, including 86 that were previously assessed as tolerant and 1,153 that lack established disease annotation. We also define regions of missense depletion at high resolution. Notably, 1,482 genes have regions that are depleted of missense variants despite being tolerant of pLOF variants. Finally, we estimate that 3% of individuals have a clinically actionable genetic variant, and that 11,773 variants reported in ClinVar with unknown significance are likely to be deleterious cryptic splice sites. To facilitate variant interpretation and genetics-informed precision medicine, we make this resource of coding variation from the RGC-ME dataset publicly accessible through a variant allele frequency browser.
Assuntos
Exoma , Variação Genética , Proteínas , Humanos , Alelos , Exoma/genética , Sequenciamento do Exoma , Frequência do Gene , Variação Genética/genética , Heterozigoto , Mutação com Perda de Função/genética , Mutação de Sentido Incorreto/genética , Fases de Leitura Aberta/genética , Proteínas/genética , Sítios de Splice de RNA/genética , Medicina de PrecisãoRESUMO
Clonal haematopoiesis involves the expansion of certain blood cell lineages and has been associated with ageing and adverse health outcomes1-5. Here we use exome sequence data on 628,388 individuals to identify 40,208 carriers of clonal haematopoiesis of indeterminate potential (CHIP). Using genome-wide and exome-wide association analyses, we identify 24 loci (21 of which are novel) where germline genetic variation influences predisposition to CHIP, including missense variants in the lymphocytic antigen coding gene LY75, which are associated with reduced incidence of CHIP. We also identify novel rare variant associations with clonal haematopoiesis and telomere length. Analysis of 5,041 health traits from the UK Biobank (UKB) found relationships between CHIP and severe COVID-19 outcomes, cardiovascular disease, haematologic traits, malignancy, smoking, obesity, infection and all-cause mortality. Longitudinal and Mendelian randomization analyses revealed that CHIP is associated with solid cancers, including non-melanoma skin cancer and lung cancer, and that CHIP linked to DNMT3A is associated with the subsequent development of myeloid but not lymphoid leukaemias. Additionally, contrary to previous findings from the initial 50,000 UKB exomes6, our results in the full sample do not support a role for IL-6 inhibition in reducing the risk of cardiovascular disease among CHIP carriers. Our findings demonstrate that CHIP represents a complex set of heterogeneous phenotypes with shared and unique germline genetic causes and varied clinical implications.
Assuntos
COVID-19 , Doenças Cardiovasculares , Humanos , Hematopoiese Clonal/genética , Doenças Cardiovasculares/epidemiologia , Doenças Cardiovasculares/genéticaRESUMO
BACKGROUND: Exome sequencing in hundreds of thousands of persons may enable the identification of rare protein-coding genetic variants associated with protection from human diseases like liver cirrhosis, providing a strategy for the discovery of new therapeutic targets. METHODS: We performed a multistage exome sequencing and genetic association analysis to identify genes in which rare protein-coding variants were associated with liver phenotypes. We conducted in vitro experiments to further characterize associations. RESULTS: The multistage analysis involved 542,904 persons with available data on liver aminotransferase levels, 24,944 patients with various types of liver disease, and 490,636 controls without liver disease. We found that rare coding variants in APOB, ABCB4, SLC30A10, and TM6SF2 were associated with increased aminotransferase levels and an increased risk of liver disease. We also found that variants in CIDEB, which encodes a structural protein found in hepatic lipid droplets, had a protective effect. The burden of rare predicted loss-of-function variants plus missense variants in CIDEB (combined carrier frequency, 0.7%) was associated with decreased alanine aminotransferase levels (beta per allele, -1.24 U per liter; 95% confidence interval [CI], -1.66 to -0.83; P = 4.8×10-9) and with 33% lower odds of liver disease of any cause (odds ratio per allele, 0.67; 95% CI, 0.57 to 0.79; P = 9.9×10-7). Rare coding variants in CIDEB were associated with a decreased risk of liver disease across different underlying causes and different degrees of severity, including cirrhosis of any cause (odds ratio per allele, 0.50; 95% CI, 0.36 to 0.70). Among 3599 patients who had undergone bariatric surgery, rare coding variants in CIDEB were associated with a decreased nonalcoholic fatty liver disease activity score (beta per allele in score units, -0.98; 95% CI, -1.54 to -0.41 [scores range from 0 to 8, with higher scores indicating more severe disease]). In human hepatoma cell lines challenged with oleate, CIDEB small interfering RNA knockdown prevented the buildup of large lipid droplets. CONCLUSIONS: Rare germline mutations in CIDEB conferred substantial protection from liver disease. (Funded by Regeneron Pharmaceuticals.).
Assuntos
Proteínas Reguladoras de Apoptose , Mutação em Linhagem Germinativa , Hepatopatias , Proteínas Reguladoras de Apoptose/genética , Proteínas Reguladoras de Apoptose/metabolismo , Predisposição Genética para Doença/genética , Predisposição Genética para Doença/prevenção & controle , Humanos , Fígado/metabolismo , Hepatopatias/genética , Hepatopatias/metabolismo , Hepatopatias/prevenção & controle , Transaminases/genética , Sequenciamento do ExomaRESUMO
Hypersensitivity reactions to drugs are often unpredictable and can be life threatening, underscoring a need for understanding their underlying mechanisms and risk factors. The extent to which germline genetic variation influences the risk of commonly reported drug allergies such as penicillin allergy remains largely unknown. We extracted data from the electronic health records of more than 600,000 participants from the UK, Estonian, and Vanderbilt University Medical Center's BioVU biobanks to study the role of genetic variation in the occurrence of self-reported penicillin hypersensitivity reactions. We used imputed SNP to HLA typing data from these cohorts to further fine map the human leukocyte antigen (HLA) association and replicated our results in 23andMe's research cohort involving a total of 1.12 million individuals. Genome-wide meta-analysis of penicillin allergy revealed two loci, including one located in the HLA region on chromosome 6. This signal was further fine-mapped to the HLA-B∗55:01 allele (OR 1.41 95% CI 1.33-1.49, p value 2.04 × 10-31) and confirmed by independent replication in 23andMe's research cohort (OR 1.30 95% CI 1.25-1.34, p value 1.00 × 10-47). The lead SNP was also associated with lower lymphocyte counts and in silico follow-up suggests a potential effect on T-lymphocytes at HLA-B∗55:01. We also observed a significant hit in PTPN22 and the GWAS results correlated with the genetics of rheumatoid arthritis and psoriasis. We present robust evidence for the role of an allele of the major histocompatibility complex (MHC) I gene HLA-B in the occurrence of penicillin allergy.
Assuntos
Artrite Reumatoide/genética , Hipersensibilidade a Drogas/genética , Antígenos HLA-B/genética , Polimorfismo de Nucleotídeo Único , Proteína Tirosina Fosfatase não Receptora Tipo 22/genética , Psoríase/genética , Adulto , Alelos , Artrite Reumatoide/complicações , Artrite Reumatoide/imunologia , Cromossomos Humanos Par 6/química , Hipersensibilidade a Drogas/complicações , Hipersensibilidade a Drogas/etiologia , Hipersensibilidade a Drogas/imunologia , Registros Eletrônicos de Saúde , Europa (Continente) , Feminino , Expressão Gênica , Loci Gênicos , Predisposição Genética para Doença , Genoma Humano , Estudo de Associação Genômica Ampla , Antígenos HLA-B/imunologia , Teste de Histocompatibilidade , Humanos , Masculino , Penicilinas/efeitos adversos , Proteína Tirosina Fosfatase não Receptora Tipo 22/imunologia , Psoríase/complicações , Psoríase/imunologia , Autorrelato , Linfócitos T/imunologia , Linfócitos T/patologia , Estados UnidosRESUMO
Erectile dysfunction (ED) is a common condition affecting more than 20% of men over 60 years, yet little is known about its genetic architecture. We performed a genome-wide association study of ED in 6,175 case subjects among 223,805 European men and identified one locus at 6q16.3 (lead variant rs57989773, OR 1.20 per C-allele; p = 5.71 × 10-14), located between MCHR2 and SIM1. In silico analysis suggests SIM1 to confer ED risk through hypothalamic dysregulation. Mendelian randomization provides evidence that genetic risk of type 2 diabetes mellitus is a cause of ED (OR 1.11 per 1-log unit higher risk of type 2 diabetes). These findings provide insights into the biological underpinnings and the causes of ED and may help prioritize the development of future therapies for this common disorder.
Assuntos
Diabetes Mellitus Tipo 2/complicações , Diabetes Mellitus Tipo 2/genética , Disfunção Erétil/etiologia , Disfunção Erétil/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Hipotálamo/patologia , Alelos , Fatores de Transcrição Hélice-Alça-Hélice Básicos/genética , Cromossomos Humanos Par 6/genética , Simulação por Computador , Europa (Continente) , Humanos , Masculino , Proteínas Repressoras/genéticaRESUMO
Obesity traits are causally implicated with risk of cardiometabolic diseases. It remains unclear whether there are similar causal effects of obesity traits on other non-communicable diseases. Also, it is largely unexplored whether there are any sex-specific differences in the causal effects of obesity traits on cardiometabolic diseases and other leading causes of death. We constructed sex-specific genetic risk scores (GRS) for three obesity traits; body mass index (BMI), waist-hip ratio (WHR), and WHR adjusted for BMI, including 565, 324, and 337 genetic variants, respectively. These GRSs were then used as instrumental variables to assess associations between the obesity traits and leading causes of mortality in the UK Biobank using Mendelian randomization. We also investigated associations with potential mediators, including smoking, glycemic and blood pressure traits. Sex-differences were subsequently assessed by Cochran's Q-test (Phet). A Mendelian randomization analysis of 228,466 women and 195,041 men showed that obesity causes coronary artery disease, stroke (particularly ischemic), chronic obstructive pulmonary disease, lung cancer, type 2 and 1 diabetes mellitus, non-alcoholic fatty liver disease, chronic liver disease, and acute and chronic renal failure. Higher BMI led to higher risk of type 2 diabetes in women than in men (Phet = 1.4×10-5). Waist-hip-ratio led to a higher risk of chronic obstructive pulmonary disease (Phet = 3.7×10-6) and higher risk of chronic renal failure (Phet = 1.0×10-4) in men than women. Obesity traits have an etiological role in the majority of the leading global causes of death. Sex differences exist in the effects of obesity traits on risk of type 2 diabetes, chronic obstructive pulmonary disease, and renal failure, which may have downstream implications for public health.
Assuntos
Causas de Morte , Doença da Artéria Coronariana/genética , Diabetes Mellitus Tipo 2/genética , Obesidade/genética , Doença Pulmonar Obstrutiva Crônica/genética , Adiposidade/genética , Idoso , Pressão Sanguínea/genética , Índice de Massa Corporal , Doença da Artéria Coronariana/complicações , Doença da Artéria Coronariana/mortalidade , Doença da Artéria Coronariana/patologia , Diabetes Mellitus Tipo 2/complicações , Diabetes Mellitus Tipo 2/mortalidade , Diabetes Mellitus Tipo 2/patologia , Feminino , Estudo de Associação Genômica Ampla , Humanos , Masculino , Análise da Randomização Mendeliana , Pessoa de Meia-Idade , Neoplasias , Obesidade/complicações , Obesidade/mortalidade , Obesidade/patologia , Polimorfismo de Nucleotídeo Único , Doença Pulmonar Obstrutiva Crônica/complicações , Doença Pulmonar Obstrutiva Crônica/mortalidade , Doença Pulmonar Obstrutiva Crônica/patologia , Fatores de Risco , Acidente Vascular Cerebral , Relação Cintura-QuadrilRESUMO
In this study, we leveraged the combined evidence of rare coding variants and common alleles to identify therapeutic targets for osteoporosis. We undertook a large-scale multiancestry exome-wide association study for estimated bone mineral density, which showed that the burden of rare coding alleles in 19 genes was associated with estimated bone mineral density (P < 3.6 × 10-7). These genes were highly enriched for a set of known causal genes for osteoporosis (65-fold; P = 2.5 × 10-5). Exome-wide significant genes had 96-fold increased odds of being the top ranked effector gene at a given GWAS locus (P = 1.8 × 10-10). By integrating proteomics Mendelian randomization evidence, we prioritized CD109 (cluster of differentiation 109) as a gene for which heterozygous loss of function is associated with higher bone density. CRISPR-Cas9 editing of CD109 in SaOS-2 osteoblast-like cell lines showed that partial CD109 knockdown led to increased mineralization. This study demonstrates that the convergence of common and rare variants, proteomics and CRISPR can highlight new bone biology to guide therapeutic development.
Assuntos
Predisposição Genética para Doença , Osteoporose , Humanos , Sequenciamento do Exoma , Osteoporose/genética , Densidade Óssea/genética , Alelos , Fatores de Transcrição/genética , Estudo de Associação Genômica AmplaRESUMO
Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants. Individuals of African, Admixed American, East Asian, Middle Eastern, and South Asian ancestry account for 20% of this Exome dataset. Our catalog of variants includes approximately 10.5 million missense (54% novel) and 1.1 million predicted loss-of-function (pLOF) variants (65% novel, 53% observed only once). We identified individuals with rare homozygous pLOF variants in 4,874 genes, and for 1,838 of these this work is the first to document at least one pLOF homozygote. Additional insights from the RGC-ME dataset include 1) improved estimates of selection against heterozygous loss-of-function and identification of 3,459 genes intolerant to loss-of-function, 83 of which were previously assessed as tolerant to loss-of-function and 1,241 that lack disease annotations; 2) identification of regions depleted of missense variation in 457 genes that are tolerant to loss-of-function; 3) functional interpretation for 10,708 variants of unknown or conflicting significance reported in ClinVar as cryptic splice sites using splicing score thresholds based on empirical variant deleteriousness scores derived from RGC-ME; and 4) an observation that approximately 3% of sequenced individuals carry a clinically actionable genetic variant in the ACMG SF 3.1 list of genes. We make this important resource of coding variation available to the public through a variant allele frequency browser. We anticipate that this report and the RGC-ME dataset will serve as a valuable reference for understanding rare coding variation and help advance precision medicine efforts.
RESUMO
OBJECTIVE: Osteoarthritis (OA) is a progressive disease for which there is no disease modifying therapy. Vitamin K levels and vitamin K antagonism have been associated with risk and progression of OA which may have direct implications for clinical management, but these observational findings are susceptible to confounding. We aimed to estimate the causal association between vitamin K and OA risk using Mendelian randomisation (MR). METHODS: We used data from the largest genome-wide association study (GWAS) of OA to date (up to 826,690 individuals) to estimate the effect of genetically predicted vitamin K level (instrumented using four variants derived from a GWAS of 2,138 individuals) on risk of all OA types, knee, hip, spine, hand OA, and total joint replacement. We employed the inverse-variance weighted method for the primary analysis and, in a series of sensitivity analyses, adjusted for sub-genome wide significant instruments and tested for potential bias from pleiotropy. RESULTS: We showed that genetically predicted vitamin K levels were not causally associated with risk of OA overall (OR 0.98 per unit increase in log-transformed vitamin K1; 95%CI 0.96-1.01), knee (OR 0.98; 0.92-1.03), hip (OR 0.97; 0.88-1.07), spine (OR 0.97; 0.90-1.04), hand OA (OR 0.97; 0.91-1.04) or joint replacement (OR 0.96; 0.89-1.04). Results were similar across all sensitivity analyses. CONCLUSION: We found little evidence of a causal association between genetically predicted vitamin K and OA risk. Larger genetic and interventional studies of vitamin K are required to confirm our findings.
Assuntos
Análise da Randomização Mendeliana , Osteoartrite , Estudo de Associação Genômica Ampla , Humanos , Osteoartrite/genética , Polimorfismo de Nucleotídeo Único , Vitamina KRESUMO
Body fat distribution is a major, heritable risk factor for cardiometabolic disease, independent of overall adiposity. Using exome-sequencing in 618,375 individuals (including 160,058 non-Europeans) from the UK, Sweden and Mexico, we identify 16 genes associated with fat distribution at exome-wide significance. We show 6-fold larger effect for fat-distribution associated rare coding variants compared with fine-mapped common alleles, enrichment for genes expressed in adipose tissue and causal genes for partial lipodystrophies, and evidence of sex-dimorphism. We describe an association with favorable fat distribution (p = 1.8 × 10-09), favorable metabolic profile and protection from type 2 diabetes (~28% lower odds; p = 0.004) for heterozygous protein-truncating mutations in INHBE, which encodes a circulating growth factor of the activin family, highly and specifically expressed in hepatocytes. Our results suggest that inhibin ßE is a liver-expressed negative regulator of adipose storage whose blockade may be beneficial in fat distribution-associated metabolic disease.
Assuntos
Diabetes Mellitus Tipo 2 , Subunidades beta de Inibinas/genética , Tecido Adiposo , Adiposidade/genética , Diabetes Mellitus Tipo 2/genética , Exoma/genética , Humanos , MutaçãoRESUMO
Polycystic ovary syndrome (PCOS) is a common complex disease in women with a strong genetic component and downstream consequences for reproductive, metabolic and psychological health. There are currently 19 known PCOS risk loci, primarily identified in women of Han Chinese or European ancestry, and 14 of these risk loci were identified or replicated in a genome-wide association study of PCOS performed in up to 10,074 cases and 103,164 controls of European descent. However, for most of these loci the gene responsible for the association is unknown. We therefore use a Bayesian colocalization approach (Coloc) to highlight genes in PCOS-associated regions that may have a role in mediating the disease risk. We evaluated the posterior probabilities of evidence consistent with shared causal variants between 14 PCOS genetic risk loci and intermediate cellular phenotypes in one protein (N = 3301) and two expression quantitative trait locus datasets (N = 31,684 and N = 80-491). Through these analyses, we identified seven proteins or genes with evidence of a possibly shared causal variant for almost 30% of known PCOS signals, including follicle stimulating hormone and ERBB3, IKZF4, RPS26, SUOX, ZFP36L2, and C8orf49. Several of these potential effector proteins and genes have been implicated in the hypothalamic-pituitary-gonadal signalling pathway and provide an avenue for functional follow-up in order to demonstrate a causal role in PCOS pathophysiology.
Assuntos
Síndrome do Ovário Policístico/genética , Locos de Características Quantitativas , Feminino , Humanos , Fator de Transcrição Ikaros/genética , Oxirredutases atuantes sobre Doadores de Grupo Enxofre/genética , Síndrome do Ovário Policístico/metabolismo , Proteoma/genética , Proteoma/metabolismo , Receptor ErbB-3/genética , Proteínas Ribossômicas/genética , Fatores de Transcrição/genéticaRESUMO
Triangulation of evidence from clinical trials and human genetics suggests a potential excess risk of cardiovascular disease events arising from therapeutic inhibition of sclerostin.
Assuntos
Doenças Cardiovasculares , Doenças Cardiovasculares/genética , Genética Humana , HumanosRESUMO
Inhibition of sclerostin is a therapeutic approach to lowering fracture risk in patients with osteoporosis. However, data from phase 3 randomized controlled trials (RCTs) of romosozumab, a first-in-class monoclonal antibody that inhibits sclerostin, suggest an imbalance of serious cardiovascular events, and regulatory agencies have issued marketing authorizations with warnings of cardiovascular disease. Here, we meta-analyze published and unpublished cardiovascular outcome trial data of romosozumab and investigate whether genetic variants that mimic therapeutic inhibition of sclerostin are associated with higher risk of cardiovascular disease. Meta-analysis of up to three RCTs indicated a probable higher risk of cardiovascular events with romosozumab. Scaled to the equivalent dose of romosozumab (210 milligrams per month; 0.09 grams per square centimeter of higher bone mineral density), the SOST genetic variants were associated with lower risk of fracture and osteoporosis (commensurate with the therapeutic effect of romosozumab) and with a higher risk of myocardial infarction and/or coronary revascularization and major adverse cardiovascular events. The same variants were also associated with increased risk of type 2 diabetes mellitus and higher systolic blood pressure and central adiposity. Together, our findings indicate that inhibition of sclerostin may elevate cardiovascular risk, warranting a rigorous evaluation of the cardiovascular safety of romosozumab and other sclerostin inhibitors.