Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 64
Filtrar
1.
Nature ; 631(8021): 583-592, 2024 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-38768635

RESUMEN

Rare coding variants that substantially affect function provide insights into the biology of a gene1-3. However, ascertaining the frequency of such variants requires large sample sizes4-8. Here we present a catalogue of human protein-coding variation, derived from exome sequencing of 983,578 individuals across diverse populations. In total, 23% of the Regeneron Genetics Center Million Exome (RGC-ME) data come from individuals of African, East Asian, Indigenous American, Middle Eastern and South Asian ancestry. The catalogue includes more than 10.4 million missense and 1.1 million predicted loss-of-function (pLOF) variants. We identify individuals with rare biallelic pLOF variants in 4,848 genes, 1,751 of which have not been previously reported. From precise quantitative estimates of selection against heterozygous loss of function (LOF), we identify 3,988 LOF-intolerant genes, including 86 that were previously assessed as tolerant and 1,153 that lack established disease annotation. We also define regions of missense depletion at high resolution. Notably, 1,482 genes have regions that are depleted of missense variants despite being tolerant of pLOF variants. Finally, we estimate that 3% of individuals have a clinically actionable genetic variant, and that 11,773 variants reported in ClinVar with unknown significance are likely to be deleterious cryptic splice sites. To facilitate variant interpretation and genetics-informed precision medicine, we make this resource of coding variation from the RGC-ME dataset publicly accessible through a variant allele frequency browser.


Asunto(s)
Variación Genética , Humanos , Variación Genética/genética , Secuenciación del Exoma , Mutación con Pérdida de Función/genética , Exoma/genética , Heterocigoto , Mutación Missense/genética , Frecuencia de los Genes , Alelos , Sistemas de Lectura Abierta/genética
2.
Nat Genet ; 55(8): 1277-1287, 2023 08.
Artículo en Inglés | MEDLINE | ID: mdl-37558884

RESUMEN

In this study, we leveraged the combined evidence of rare coding variants and common alleles to identify therapeutic targets for osteoporosis. We undertook a large-scale multiancestry exome-wide association study for estimated bone mineral density, which showed that the burden of rare coding alleles in 19 genes was associated with estimated bone mineral density (P < 3.6 × 10-7). These genes were highly enriched for a set of known causal genes for osteoporosis (65-fold; P = 2.5 × 10-5). Exome-wide significant genes had 96-fold increased odds of being the top ranked effector gene at a given GWAS locus (P = 1.8 × 10-10). By integrating proteomics Mendelian randomization evidence, we prioritized CD109 (cluster of differentiation 109) as a gene for which heterozygous loss of function is associated with higher bone density. CRISPR-Cas9 editing of CD109 in SaOS-2 osteoblast-like cell lines showed that partial CD109 knockdown led to increased mineralization. This study demonstrates that the convergence of common and rare variants, proteomics and CRISPR can highlight new bone biology to guide therapeutic development.


Asunto(s)
Predisposición Genética a la Enfermedad , Osteoporosis , Humanos , Secuenciación del Exoma , Osteoporosis/genética , Densidad Ósea/genética , Alelos , Factores de Transcripción/genética , Estudio de Asociación del Genoma Completo
3.
bioRxiv ; 2023 Nov 02.
Artículo en Inglés | MEDLINE | ID: mdl-37214792

RESUMEN

Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants. Individuals of African, Admixed American, East Asian, Middle Eastern, and South Asian ancestry account for 20% of this Exome dataset. Our catalog of variants includes approximately 10.5 million missense (54% novel) and 1.1 million predicted loss-of-function (pLOF) variants (65% novel, 53% observed only once). We identified individuals with rare homozygous pLOF variants in 4,874 genes, and for 1,838 of these this work is the first to document at least one pLOF homozygote. Additional insights from the RGC-ME dataset include 1) improved estimates of selection against heterozygous loss-of-function and identification of 3,459 genes intolerant to loss-of-function, 83 of which were previously assessed as tolerant to loss-of-function and 1,241 that lack disease annotations; 2) identification of regions depleted of missense variation in 457 genes that are tolerant to loss-of-function; 3) functional interpretation for 10,708 variants of unknown or conflicting significance reported in ClinVar as cryptic splice sites using splicing score thresholds based on empirical variant deleteriousness scores derived from RGC-ME; and 4) an observation that approximately 3% of sequenced individuals carry a clinically actionable genetic variant in the ACMG SF 3.1 list of genes. We make this important resource of coding variation available to the public through a variant allele frequency browser. We anticipate that this report and the RGC-ME dataset will serve as a valuable reference for understanding rare coding variation and help advance precision medicine efforts.

5.
Nature ; 612(7939): 301-309, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36450978

RESUMEN

Clonal haematopoiesis involves the expansion of certain blood cell lineages and has been associated with ageing and adverse health outcomes1-5. Here we use exome sequence data on 628,388 individuals to identify 40,208 carriers of clonal haematopoiesis of indeterminate potential (CHIP). Using genome-wide and exome-wide association analyses, we identify 24 loci (21 of which are novel) where germline genetic variation influences predisposition to CHIP, including missense variants in the lymphocytic antigen coding gene LY75, which are associated with reduced incidence of CHIP. We also identify novel rare variant associations with clonal haematopoiesis and telomere length. Analysis of 5,041 health traits from the UK Biobank (UKB) found relationships between CHIP and severe COVID-19 outcomes, cardiovascular disease, haematologic traits, malignancy, smoking, obesity, infection and all-cause mortality. Longitudinal and Mendelian randomization analyses revealed that CHIP is associated with solid cancers, including non-melanoma skin cancer and lung cancer, and that CHIP linked to DNMT3A is associated with the subsequent development of myeloid but not lymphoid leukaemias. Additionally, contrary to previous findings from the initial 50,000 UKB exomes6, our results in the full sample do not support a role for IL-6 inhibition in reducing the risk of cardiovascular disease among CHIP carriers. Our findings demonstrate that CHIP represents a complex set of heterogeneous phenotypes with shared and unique germline genetic causes and varied clinical implications.


Asunto(s)
COVID-19 , Enfermedades Cardiovasculares , Humanos , Hematopoyesis Clonal/genética , Enfermedades Cardiovasculares/epidemiología , Enfermedades Cardiovasculares/genética
6.
Nat Commun ; 13(1): 4844, 2022 08 23.
Artículo en Inglés | MEDLINE | ID: mdl-35999217

RESUMEN

Body fat distribution is a major, heritable risk factor for cardiometabolic disease, independent of overall adiposity. Using exome-sequencing in 618,375 individuals (including 160,058 non-Europeans) from the UK, Sweden and Mexico, we identify 16 genes associated with fat distribution at exome-wide significance. We show 6-fold larger effect for fat-distribution associated rare coding variants compared with fine-mapped common alleles, enrichment for genes expressed in adipose tissue and causal genes for partial lipodystrophies, and evidence of sex-dimorphism. We describe an association with favorable fat distribution (p = 1.8 × 10-09), favorable metabolic profile and protection from type 2 diabetes (~28% lower odds; p = 0.004) for heterozygous protein-truncating mutations in INHBE, which encodes a circulating growth factor of the activin family, highly and specifically expressed in hepatocytes. Our results suggest that inhibin ßE is a liver-expressed negative regulator of adipose storage whose blockade may be beneficial in fat distribution-associated metabolic disease.


Asunto(s)
Diabetes Mellitus Tipo 2 , Subunidades beta de Inhibinas/genética , Tejido Adiposo , Adiposidad/genética , Diabetes Mellitus Tipo 2/genética , Exoma/genética , Humanos , Mutación
7.
N Engl J Med ; 387(4): 332-344, 2022 07 28.
Artículo en Inglés | MEDLINE | ID: mdl-35939579

RESUMEN

BACKGROUND: Exome sequencing in hundreds of thousands of persons may enable the identification of rare protein-coding genetic variants associated with protection from human diseases like liver cirrhosis, providing a strategy for the discovery of new therapeutic targets. METHODS: We performed a multistage exome sequencing and genetic association analysis to identify genes in which rare protein-coding variants were associated with liver phenotypes. We conducted in vitro experiments to further characterize associations. RESULTS: The multistage analysis involved 542,904 persons with available data on liver aminotransferase levels, 24,944 patients with various types of liver disease, and 490,636 controls without liver disease. We found that rare coding variants in APOB, ABCB4, SLC30A10, and TM6SF2 were associated with increased aminotransferase levels and an increased risk of liver disease. We also found that variants in CIDEB, which encodes a structural protein found in hepatic lipid droplets, had a protective effect. The burden of rare predicted loss-of-function variants plus missense variants in CIDEB (combined carrier frequency, 0.7%) was associated with decreased alanine aminotransferase levels (beta per allele, -1.24 U per liter; 95% confidence interval [CI], -1.66 to -0.83; P = 4.8×10-9) and with 33% lower odds of liver disease of any cause (odds ratio per allele, 0.67; 95% CI, 0.57 to 0.79; P = 9.9×10-7). Rare coding variants in CIDEB were associated with a decreased risk of liver disease across different underlying causes and different degrees of severity, including cirrhosis of any cause (odds ratio per allele, 0.50; 95% CI, 0.36 to 0.70). Among 3599 patients who had undergone bariatric surgery, rare coding variants in CIDEB were associated with a decreased nonalcoholic fatty liver disease activity score (beta per allele in score units, -0.98; 95% CI, -1.54 to -0.41 [scores range from 0 to 8, with higher scores indicating more severe disease]). In human hepatoma cell lines challenged with oleate, CIDEB small interfering RNA knockdown prevented the buildup of large lipid droplets. CONCLUSIONS: Rare germline mutations in CIDEB conferred substantial protection from liver disease. (Funded by Regeneron Pharmaceuticals.).


Asunto(s)
Proteínas Reguladoras de la Apoptosis , Mutación de Línea Germinal , Hepatopatías , Proteínas Reguladoras de la Apoptosis/genética , Proteínas Reguladoras de la Apoptosis/metabolismo , Predisposición Genética a la Enfermedad/genética , Predisposición Genética a la Enfermedad/prevención & control , Humanos , Hígado/metabolismo , Hepatopatías/genética , Hepatopatías/metabolismo , Hepatopatías/prevención & control , Transaminasas/genética , Secuenciación del Exoma
8.
Pharmacogenomics J ; 22(3): 160-165, 2022 05.
Artículo en Inglés | MEDLINE | ID: mdl-35149777

RESUMEN

Sarilumab is a human monoclonal antibody against interleukin (IL)-6Rα that has been approved for the treatment of adult patients with moderately to severely active rheumatoid arthritis (RA) and an inadequate response or intolerance to one or more disease-modifying antirheumatic drugs (DMARDs). Mild liver function test abnormalities have been observed in patients treated with sarilumab. We describe a genome-wide association study of bilirubin elevations in RA patients treated with sarilumab. Array genotyping and exome sequencing were performed on DNA samples from 1075 patients. Variants in the UGT1A1 gene were strongly associated with maximum bilirubin elevations in sarilumab-treated patients (rs4148325; p = 2.88 × 10-41) but were not associated with aminotransferase elevations. No other independent loci showed evidence of association with bilirubin elevations after sarilumab treatment. These findings suggest that most bilirubin increases during sarilumab treatment are related to genetic variation in UGT1A1 rather than underlying liver injury.


Asunto(s)
Antirreumáticos , Artritis Reumatoide , Adulto , Anticuerpos Monoclonales Humanizados , Antirreumáticos/uso terapéutico , Artritis Reumatoide/tratamiento farmacológico , Artritis Reumatoide/genética , Bilirrubina/uso terapéutico , Estudio de Asociación del Genoma Completo , Glucuronosiltransferasa/genética , Humanos , Resultado del Tratamiento
9.
Genet Med ; 24(3): 703-711, 2022 03.
Artículo en Inglés | MEDLINE | ID: mdl-34906480

RESUMEN

PURPOSE: Recurrent pathogenic copy number variants (pCNVs) have large-effect impacts on brain function and represent important etiologies of neurodevelopmental psychiatric disorders (NPDs), including autism and schizophrenia. Patterns of health care utilization in adults with pCNVs have gone largely unstudied and are likely to differ in significant ways from those of children. METHODS: We compared the prevalence of NPDs and electronic health record-based medical conditions in 928 adults with 26 pCNVs to a demographically-matched cohort of pCNV-negative controls from >135,000 patient-participants in Geisinger's MyCode Community Health Initiative. We also evaluated 3 quantitative health care utilization measures (outpatient, inpatient, and emergency department visits) in both groups. RESULTS: Adults with pCNVs (24.9%) were more likely than controls (16.0%) to have a documented NPD. They had significantly higher rates of several chronic diseases, including diabetes (29.3% in participants with pCNVs vs 20.4% in participants without pCNVs) and dementia (2.2% in participants with pCNVs vs 1.0% participants without pCNVs), and twice as many annual emergency department visits. CONCLUSION: These findings highlight the potential for genetic information-specifically, pCNVs-to inform the study of health care outcomes and utilization in adults. If, as our findings suggest, adults with pCNVs have poorer health and require disproportionate health care resources, early genetic diagnosis paired with patient-centered interventions may help to anticipate problems, improve outcomes, and reduce the associated economic burden.


Asunto(s)
Variaciones en el Número de Copia de ADN , Atención a la Salud , Adulto , Niño , Estudios de Cohortes , Variaciones en el Número de Copia de ADN/genética , Humanos , Aceptación de la Atención de Salud , Prevalencia
10.
Nature ; 599(7886): 628-634, 2021 11.
Artículo en Inglés | MEDLINE | ID: mdl-34662886

RESUMEN

A major goal in human genetics is to use natural variation to understand the phenotypic consequences of altering each protein-coding gene in the genome. Here we used exome sequencing1 to explore protein-altering variants and their consequences in 454,787 participants in the UK Biobank study2. We identified 12 million coding variants, including around 1 million loss-of-function and around 1.8 million deleterious missense variants. When these were tested for association with 3,994 health-related traits, we found 564 genes with trait associations at P ≤ 2.18 × 10-11. Rare variant associations were enriched in loci from genome-wide association studies (GWAS), but most (91%) were independent of common variant signals. We discovered several risk-increasing associations with traits related to liver disease, eye disease and cancer, among others, as well as risk-lowering associations for hypertension (SLC9A3R2), diabetes (MAP3K15, FAM234A) and asthma (SLC27A3). Six genes were associated with brain imaging phenotypes, including two involved in neural development (GBE1, PLD1). Of the signals available and powered for replication in an independent cohort, 81% were confirmed; furthermore, association signals were generally consistent across individuals of European, Asian and African ancestry. We illustrate the ability of exome sequencing to identify gene-trait associations, elucidate gene function and pinpoint effector genes that underlie GWAS signals at scale.


Asunto(s)
Bancos de Muestras Biológicas , Bases de Datos Genéticas , Secuenciación del Exoma , Exoma/genética , África/etnología , Asia/etnología , Asma/genética , Diabetes Mellitus/genética , Europa (Continente)/etnología , Oftalmopatías/genética , Femenino , Predisposición Genética a la Enfermedad/genética , Variación Genética , Estudio de Asociación del Genoma Completo , Humanos , Hipertensión/genética , Hepatopatías/genética , Masculino , Mutación , Neoplasias/genética , Carácter Cuantitativo Heredable , Reino Unido
11.
Science ; 373(6550)2021 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-34210852

RESUMEN

Large-scale human exome sequencing can identify rare protein-coding variants with a large impact on complex traits such as body adiposity. We sequenced the exomes of 645,626 individuals from the United Kingdom, the United States, and Mexico and estimated associations of rare coding variants with body mass index (BMI). We identified 16 genes with an exome-wide significant association with BMI, including those encoding five brain-expressed G protein-coupled receptors (CALCR, MC4R, GIPR, GPR151, and GPR75). Protein-truncating variants in GPR75 were observed in ~4/10,000 sequenced individuals and were associated with 1.8 kilograms per square meter lower BMI and 54% lower odds of obesity in the heterozygous state. Knock out of Gpr75 in mice resulted in resistance to weight gain and improved glycemic control in a high-fat diet model. Inhibition of GPR75 may provide a therapeutic strategy for obesity.


Asunto(s)
Índice de Masa Corporal , Exoma/genética , Obesidad/genética , Receptores Acoplados a Proteínas G/genética , Animales , Variación Genética , Humanos , Ratones , Ratones Noqueados , Análisis de Secuencia de ADN , Aumento de Peso/genética
12.
Nat Genet ; 53(7): 942-948, 2021 07.
Artículo en Inglés | MEDLINE | ID: mdl-34183854

RESUMEN

The UK Biobank Exome Sequencing Consortium (UKB-ESC) is a private-public partnership between the UK Biobank (UKB) and eight biopharmaceutical companies that will complete the sequencing of exomes for all ~500,000 UKB participants. Here, we describe the early results from ~200,000 UKB participants and the features of this project that enabled its success. The biopharmaceutical industry has increasingly used human genetics to improve success in drug discovery. Recognizing the need for large-scale human genetics data, as well as the unique value of the data access and contribution terms of the UKB, the UKB-ESC was formed. As a result, exome data from 200,643 UKB enrollees are now available. These data include ~10 million exonic variants-a rich resource of rare coding variation that is particularly valuable for drug discovery. The UKB-ESC precompetitive collaboration has further strengthened academic and industry ties and has provided teams with an opportunity to interact with and learn from the wider research community.


Asunto(s)
Bancos de Muestras Biológicas , Descubrimiento de Drogas , Secuenciación del Exoma , Genética Humana , Investigación , Descubrimiento de Drogas/métodos , Genómica/métodos , Humanos , Reino Unido
13.
Genet Epidemiol ; 45(6): 664-681, 2021 09.
Artículo en Inglés | MEDLINE | ID: mdl-34184762

RESUMEN

Serum alanine aminotransferase (ALT) and aspartate aminotransferase (AST) are biomarkers for liver health. Here we report the largest genome-wide association analysis to date of serum ALT and AST levels in over 388k people of European ancestry from UK biobank and DiscovEHR. Eleven million imputed markers with a minor allele frequency (MAF) ≥ 0.5% were analyzed. Overall, 300 ALT and 336 AST independent genome-wide significant associations were identified. Among them, 81 ALT and 61 AST associations are reported for the first time. Genome-wide interaction study identified 9 ALT and 12 AST independent associations significantly modified by body mass index (BMI), including several previously reported potential liver disease therapeutic targets, for example, PNPLA3, HSD17B13, and MARC1. While further work is necessary to understand the effect of ALT and AST-associated variants on liver disease, the weighted burden of significant BMI-modified signals is significantly associated with liver disease outcomes. In summary, this study identifies genetic associations which offer an important step forward in understanding the genetic architecture of serum ALT and AST levels. Significant interactions between BMI and genetic loci not only highlight the important role of adiposity in liver damage but also shed light on the genetic etiology of liver disease in obese individuals.


Asunto(s)
Alanina Transaminasa/sangre , Aspartato Aminotransferasas/sangre , Índice de Masa Corporal , Estudio de Asociación del Genoma Completo , Humanos
14.
Sci Rep ; 11(1): 5595, 2021 03 10.
Artículo en Inglés | MEDLINE | ID: mdl-33692434

RESUMEN

Inflammatory bowel disease (IBD), clinically defined as Crohn's disease (CD), ulcerative colitis (UC), or IBD-unclassified, results in chronic inflammation of the gastrointestinal tract in genetically susceptible hosts. Pediatric onset IBD represents ≥ 25% of all IBD diagnoses and often presents with intestinal stricturing, perianal disease, and failed response to conventional treatments. NOD2 was the first and is the most replicated locus associated with adult IBD, to date. However, its role in pediatric onset IBD is not well understood. We performed whole-exome sequencing on a cohort of 1,183 patients with pediatric onset IBD (ages 0-18.5 years). We identified 92 probands with biallelic rare and low frequency NOD2 variants accounting for approximately 8% of our cohort, suggesting a Mendelian inheritance pattern of disease. Additionally, we investigated the contribution of recessive inheritance of NOD2 alleles in adult IBD patients from a large clinical population cohort. We found that recessive inheritance of NOD2 variants explains ~ 7% of cases in this adult IBD cohort, including ~ 10% of CD cases, confirming the observations from our pediatric IBD cohort. Exploration of EHR data showed that several of these adult IBD patients obtained their initial IBD diagnosis before 18 years of age, consistent with early onset disease. While it has been previously reported that carriers of more than one NOD2 risk alleles have increased susceptibility to Crohn's Disease (CD), our data formally demonstrate that recessive inheritance of NOD2 alleles is a mechanistic driver of early onset IBD, specifically CD, likely due to loss of NOD2 protein function. Collectively, our findings show that recessive inheritance of rare and low frequency deleterious NOD2 variants account for 7-10% of CD cases and implicate NOD2 as a Mendelian disease gene for early onset Crohn's Disease.


Asunto(s)
Colitis Ulcerosa/genética , Enfermedad de Crohn/genética , Mutación , Proteína Adaptadora de Señalización NOD2/genética , Adolescente , Adulto , Edad de Inicio , Niño , Preescolar , Colitis Ulcerosa/metabolismo , Enfermedad de Crohn/metabolismo , Femenino , Humanos , Lactante , Recién Nacido , Masculino
15.
Nat Med ; 27(1): 66-72, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33432171

RESUMEN

The clinical impact of rare loss-of-function variants has yet to be determined for most genes. Integration of DNA sequencing data with electronic health records (EHRs) could enhance our understanding of the contribution of rare genetic variation to human disease1. By leveraging 10,900 whole-exome sequences linked to EHR data in the Penn Medicine Biobank, we addressed the association of the cumulative effects of rare predicted loss-of-function variants for each individual gene on human disease on an exome-wide scale, as assessed using a set of diverse EHR phenotypes. After discovering 97 genes with exome-by-phenome-wide significant phenotype associations (P < 10-6), we replicated 26 of these in the Penn Medicine Biobank, as well as in three other medical biobanks and the population-based UK Biobank. Of these 26 genes, five had associations that have been previously reported and represented positive controls, whereas 21 had phenotype associations not previously reported, among which were genes implicated in glaucoma, aortic ectasia, diabetes mellitus, muscular dystrophy and hearing loss. These findings show the value of aggregating rare predicted loss-of-function variants into 'gene burdens' for identifying new gene-disease associations using EHR phenotypes in a medical biobank. We suggest that application of this approach to even larger numbers of individuals will provide the statistical power required to uncover unexplored relationships between rare genetic variation and disease phenotypes.


Asunto(s)
Registros Electrónicos de Salud , Exoma , Genotipo , Fenotipo , Anciano , Biología Computacional , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Polimorfismo de Nucleótido Simple , Secuenciación del Exoma
16.
Bioinformatics ; 36(22-23): 5537-5538, 2021 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-33300997

RESUMEN

SUMMARY: Variant Call Format (VCF), the prevailing representation for germline genotypes in population sequencing, suffers rapid size growth as larger cohorts are sequenced and more rare variants are discovered. We present Sparse Project VCF (spVCF), an evolution of VCF with judicious entropy reduction and run-length encoding, delivering >10× size reduction for modern studies with practically minimal information loss. spVCF interoperates with VCF efficiently, including tabix-based random access. We demonstrate its effectiveness with the DiscovEHR and UK Biobank whole-exome sequencing cohorts. AVAILABILITY AND IMPLEMENTATION: Apache-licensed reference implementation: github.com/mlin/spVCF. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Genómica , Programas Informáticos , Secuencia de Bases , Genotipo , Células Germinativas
17.
Nature ; 586(7831): 749-756, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-33087929

RESUMEN

The UK Biobank is a prospective study of 502,543 individuals, combining extensive phenotypic and genotypic data with streamlined access for researchers around the world1. Here we describe the release of exome-sequence data for the first 49,960 study participants, revealing approximately 4 million coding variants (of which around 98.6% have a frequency of less than 1%). The data include 198,269 autosomal predicted loss-of-function (LOF) variants, a more than 14-fold increase compared to the imputed sequence. Nearly all genes (more than 97%) had at least one carrier with a LOF variant, and most genes (more than 69%) had at least ten carriers with a LOF variant. We illustrate the power of characterizing LOF variants in this population through association analyses across 1,730 phenotypes. In addition to replicating established associations, we found novel LOF variants with large effects on disease traits, including PIEZO1 on varicose veins, COL6A1 on corneal resistance, MEPE on bone density, and IQGAP2 and GMPR on blood cell traits. We further demonstrate the value of exome sequencing by surveying the prevalence of pathogenic variants of clinical importance, and show that 2% of this population has a medically actionable variant. Furthermore, we characterize the penetrance of cancer in carriers of pathogenic BRCA1 and BRCA2 variants. Exome sequences from the first 49,960 participants highlight the promise of genome sequencing in large population-based studies and are now accessible to the scientific community.


Asunto(s)
Bases de Datos Genéticas , Secuenciación del Exoma , Exoma/genética , Mutación con Pérdida de Función/genética , Fenotipo , Anciano , Densidad Ósea/genética , Colágeno Tipo VI/genética , Demografía , Femenino , Genes BRCA1 , Genes BRCA2 , Genotipo , Humanos , Canales Iónicos/genética , Masculino , Persona de Mediana Edad , Neoplasias/genética , Penetrancia , Fragmentos de Péptidos/genética , Reino Unido , Várices/genética , Proteínas Activadoras de ras GTPasa/genética
18.
JAMA Psychiatry ; 77(12): 1276-1285, 2020 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-32697297

RESUMEN

Importance: Population screening for medically relevant genomic variants that cause diseases such as hereditary cancer and cardiovascular disorders is increasing to facilitate early disease detection or prevention. Neuropsychiatric disorders (NPDs) are common, complex disorders with clear genetic causes; yet, access to genetic diagnosis is limited. We explored whether inclusion of NPD in population-based genomic screening programs is warranted by assessing 3 key factors: prevalence, penetrance, and personal utility. Objective: To evaluate the suitability of including pathogenic copy number variants (CNVs) associated with NPD in population screening by determining their prevalence and penetrance and exploring the personal utility of disclosing results. Design, Setting, and Participants: In this cohort study, the frequency of 31 NPD CNVs was determined in patient-participants via exome data. Associated clinical phenotypes were assessed using linked electronic health records. Nine CNVs were selected for disclosure by licensed genetic counselors, and participants' psychosocial reactions were evaluated using a mixed-methods approach. A primarily adult population receiving medical care at Geisinger, a large integrated health care system in the United States with the only population-based genomic screening program approved for medically relevant results disclosure, was included. The cohort was identified from the Geisinger MyCode Community Health Initiative. Exome and linked electronic health record data were available for this cohort, which was recruited from February 2007 to April 2017. Data were collected for the qualitative analysis April 2017 through February 2018. Analysis began February 2018 and ended December 2019. Main Outcomes and Measures: The planned outcomes of this study include (1) prevalence estimate of NPD-associated CNVs in an unselected health care system population; (2) penetrance estimate of NPD diagnoses in CNV-positive individuals; and (3) qualitative themes that describe participants' responses to receiving NPD-associated genomic results. Results: Of 90 595 participants with CNV data, a pathogenic CNV was identified in 708 (0.8%; 436 women [61.6%]; mean [SD] age, 50.04 [18.74] years). Seventy percent (n = 494) had at least 1 associated clinical symptom. Of these, 28.8% (204) of CNV-positive individuals had an NPD code in their electronic health record, compared with 13.3% (11 835 of 89 887) of CNV-negative individuals (odds ratio, 2.21; 95% CI, 1.86-2.61; P < .001); 66.4% (470) of CNV-positive individuals had a history of depression and anxiety compared with 54.6% (49 118 of 89 887) of CNV-negative individuals (odds ratio, 1.53; 95% CI, 1.31-1.80; P < .001). 16p13.11 (71 [0.078%]) and 22q11.2 (108 [0.119%]) were the most prevalent deletions and duplications, respectively. Only 5.8% of individuals (41 of 708) had a previously known genetic diagnosis. Results disclosure was completed for 141 individuals. Positive participant responses included poignant reactions to learning a medical reason for lifelong cognitive and psychiatric disabilities. Conclusions and Relevance: This study informs critical factors central to the development of population-based genomic screening programs and supports the inclusion of NPD in future designs to promote equitable access to clinically useful genomic information.


Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Prestación Integrada de Atención de Salud , Pruebas Genéticas , Tamizaje Masivo , Trastornos Mentales/genética , Trastornos Neurocognitivos/genética , Satisfacción del Paciente , Penetrancia , Adulto , Estudios de Cohortes , Registros Electrónicos de Salud , Femenino , Humanos , Masculino , Tamizaje Masivo/normas , Trastornos Mentales/epidemiología , Persona de Mediana Edad , Trastornos Neurocognitivos/epidemiología , Pennsylvania/epidemiología , Prevalencia , Secuenciación del Exoma
19.
Diabetes ; 69(2): 249-258, 2020 02.
Artículo en Inglés | MEDLINE | ID: mdl-31836692

RESUMEN

Lipodystrophies are a group of disorders characterized by absence or loss of adipose tissue and abnormal fat distribution, commonly accompanied by metabolic dysregulation. Although considered rare disorders, their prevalence in the general population is not well understood. We aimed to evaluate the clinical and genetic prevalence of lipodystrophy disorders in a large clinical care cohort. We interrogated the electronic health record (EHR) information of >1.3 million adults from the Geisinger Health System for lipodystrophy diagnostic codes. We estimate a clinical prevalence of disease of 1 in 20,000 individuals. We performed genetic analyses in individuals with available genomic data to identify variants associated with inherited lipodystrophies and examined their EHR for comorbidities associated with lipodystrophy. We identified 16 individuals carrying the p.R482Q pathogenic variant in LMNA associated with Dunnigan familial partial lipodystrophy. Four had a clinical diagnosis of lipodystrophy, whereas the remaining had no documented clinical diagnosis despite having accompanying metabolic abnormalities. We observed a lipodystrophy-associated variant carrier frequency of 1 in 3,082 individuals in our cohort with substantial burden of metabolic dysregulation. We estimate a genetic prevalence of disease of ∼1 in 7,000 in the general population. Partial lipodystrophy is an underdiagnosed condition. and its prevalence, as defined molecularly, is higher than previously reported. Genetically guided stratification of patients with common metabolic disorders, like diabetes and dyslipidemia, is an important step toward precision medicine.


Asunto(s)
Registros Electrónicos de Salud , Lipodistrofia/epidemiología , Lipodistrofia/genética , Vigilancia de la Población , Adulto , Femenino , Predisposición Genética a la Enfermedad , Genómica , Humanos , Masculino , Estados Unidos/epidemiología
20.
Neurol Genet ; 5(5): e358, 2019 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-31583275

RESUMEN

OBJECTIVE: To estimate the prevalence of TOR1A sequence variants associated with DYT1 dystonia. METHODS: We determined the frequency of the common trinucleotide deletion that causes DYT1 in the Genome Aggregation Database and the Penn Medicine Biobank, totaling exomes from over 135,000 individuals. We also evaluated the prevalence of other possible pathogenic variants in this gene and asked whether the D216H polymorphism is linked to a higher diagnostic rate for dystonia independent of the DYT1-causing mutation. RESULTS: The estimated range of prevalence of the most common pathogenic variant that causes DYT1 is ∼17.6-26.1 carriers per 100,000 individuals. Based on the different data sets used, we predict that there are between 54,366 and 80,891 mutation carriers in the United States, which, due to the reduced penetrance of this variant, would translate into 16,475-24,513 DYT1 patients. CONCLUSIONS: Our data provide a prevalence estimate of the most common DYT1 mutation in the general population. This information is specifically important for those with interest in the development of precision therapeutics for dystonia.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA