Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 37
Filtrar
1.
Nature ; 2024 May 20.
Artículo en Inglés | MEDLINE | ID: mdl-38768635

RESUMEN

Rare coding variants that significantly impact function provide insights into the biology of a gene1-3. However, ascertaining their frequency requires large sample sizes4-8. Here, we present a catalogue of human protein-coding variation, derived from exome sequencing of 983,578 individuals across diverse populations. 23% of the Regeneron Genetics Center Million Exome data (RGC-ME) comes from non-European individuals of African, East Asian, Indigenous American, Middle Eastern, and South Asian ancestry. This catalogue includes over 10.4 million missense and 1.1 million predicted loss-of-function (pLOF) variants. We identify individuals with rare biallelic pLOF variants in 4,848 genes, 1,751 of which have not been previously reported. From precise quantitative estimates of selection against heterozygous loss-of-function, we identify 3,988 loss-of-function intolerant genes, including 86 that were previously assessed as tolerant and 1,153 lacking established disease annotation. We also define regions of missense depletion at high resolution. Notably, 1,482 genes have regions depleted of missense variants despite being tolerant to pLOF variants. Finally, we estimate that 3% of individuals have a clinically actionable genetic variant, and that 11,773 variants reported in ClinVar with unknown significance are likely to be deleterious cryptic splice sites. To facilitate variant interpretation and genetics-informed precision medicine, we make this important resource of coding variation from the RGC-ME accessible via a public variant allele frequency browser.

3.
Nature ; 622(7984): 784-793, 2023 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-37821707

RESUMEN

The Mexico City Prospective Study is a prospective cohort of more than 150,000 adults recruited two decades ago from the urban districts of Coyoacán and Iztapalapa in Mexico City1. Here we generated genotype and exome-sequencing data for all individuals and whole-genome sequencing data for 9,950 selected individuals. We describe high levels of relatedness and substantial heterogeneity in ancestry composition across individuals. Most sequenced individuals had admixed Indigenous American, European and African ancestry, with extensive admixture from Indigenous populations in central, southern and southeastern Mexico. Indigenous Mexican segments of the genome had lower levels of coding variation but an excess of homozygous loss-of-function variants compared with segments of African and European origin. We estimated ancestry-specific allele frequencies at 142 million genomic variants, with an effective sample size of 91,856 for Indigenous Mexican ancestry at exome variants, all available through a public browser. Using whole-genome sequencing, we developed an imputation reference panel that outperforms existing panels at common variants in individuals with high proportions of central, southern and southeastern Indigenous Mexican ancestry. Our work illustrates the value of genetic studies in diverse populations and provides foundational imputation and allele frequency resources for future genetic studies in Mexico and in the United States, where the Hispanic/Latino population is predominantly of Mexican descent.


Asunto(s)
Secuenciación del Exoma , Genoma Humano , Genotipo , Hispánicos o Latinos , Adulto , Humanos , África/etnología , Américas/etnología , Europa (Continente)/etnología , Frecuencia de los Genes/genética , Genética de Población , Genoma Humano/genética , Técnicas de Genotipaje , Hispánicos o Latinos/genética , Homocigoto , Mutación con Pérdida de Función/genética , México , Estudios Prospectivos
4.
Nat Genet ; 55(7): 1138-1148, 2023 07.
Artículo en Inglés | MEDLINE | ID: mdl-37308787

RESUMEN

Human genetic studies of smoking behavior have been thus far largely limited to common variants. Studying rare coding variants has the potential to identify drug targets. We performed an exome-wide association study of smoking phenotypes in up to 749,459 individuals and discovered a protective association in CHRNB2, encoding the ß2 subunit of the α4ß2 nicotine acetylcholine receptor. Rare predicted loss-of-function and likely deleterious missense variants in CHRNB2 in aggregate were associated with a 35% decreased odds for smoking heavily (odds ratio (OR) = 0.65, confidence interval (CI) = 0.56-0.76, P = 1.9 × 10-8). An independent common variant association in the protective direction ( rs2072659 ; OR = 0.96; CI = 0.94-0.98; P = 5.3 × 10-6) was also evident, suggesting an allelic series. Our findings in humans align with decades-old experimental observations in mice that ß2 loss abolishes nicotine-mediated neuronal responses and attenuates nicotine self-administration. Our genetic discovery will inspire future drug designs targeting CHRNB2 in the brain for the treatment of nicotine addiction.


Asunto(s)
Nicotina , Tabaquismo , Humanos , Animales , Ratones , Fumar/genética , Tabaquismo/genética , Fenotipo , Oportunidad Relativa
5.
bioRxiv ; 2023 Nov 02.
Artículo en Inglés | MEDLINE | ID: mdl-37214792

RESUMEN

Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants. Individuals of African, Admixed American, East Asian, Middle Eastern, and South Asian ancestry account for 20% of this Exome dataset. Our catalog of variants includes approximately 10.5 million missense (54% novel) and 1.1 million predicted loss-of-function (pLOF) variants (65% novel, 53% observed only once). We identified individuals with rare homozygous pLOF variants in 4,874 genes, and for 1,838 of these this work is the first to document at least one pLOF homozygote. Additional insights from the RGC-ME dataset include 1) improved estimates of selection against heterozygous loss-of-function and identification of 3,459 genes intolerant to loss-of-function, 83 of which were previously assessed as tolerant to loss-of-function and 1,241 that lack disease annotations; 2) identification of regions depleted of missense variation in 457 genes that are tolerant to loss-of-function; 3) functional interpretation for 10,708 variants of unknown or conflicting significance reported in ClinVar as cryptic splice sites using splicing score thresholds based on empirical variant deleteriousness scores derived from RGC-ME; and 4) an observation that approximately 3% of sequenced individuals carry a clinically actionable genetic variant in the ACMG SF 3.1 list of genes. We make this important resource of coding variation available to the public through a variant allele frequency browser. We anticipate that this report and the RGC-ME dataset will serve as a valuable reference for understanding rare coding variation and help advance precision medicine efforts.

6.
Front Public Health ; 10: 1011388, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36276400

RESUMEN

The prevalence of proximate risk factors for active tuberculosis (TB) in areas of high prevalence of latent tuberculosis infection (LTBI) is not clearly understood. We aimed at assessing the prevalence of non-communicable multi-morbidity focusing on diabetes mellitus (DM), malnutrition, and hypertension (HTN) as common risk factors of LTBI progressing to active TB. In a cross-sectional study, 2,351 adults (45% male and 55% female) from villages in the Kancheepuram district of South India were enrolled between 2013 and 2020. DM was defined as HbA1c >6.4%, undernutrition was defined as low body mass index (LBMI) <18.5 kg/m2, obesity was classified as BMI ≥25 kg/m2, HTN was reported as systolic pressure >130 mmHg, and LTBI was defined as positive (≥ 0.35 international units/ml) by QuantiFERON Gold In-Tube assay. A total of 1,226 individuals (52%) were positive for LTBI out of 2351 tested individuals. The prevalence of DM and pre-diabetes mellitus (PDM) was 21 and 35%, respectively, HTN was 15% in latent tuberculosis (LTB)-infected individuals. The association of DM [odds ratio (OR)]; adjusted odds ratio (aOR) (OR = 1.26, 95% CI: 1.13-1.65; aOR = 1.19, 95% CI: 1.10-1.58), PDM (OR = 1.11, 95% CI: 1.0-1.35), and HTN (OR = 1.28, 95% CI: 1.11-1.62; aOR = 1.18, 95% CI: 1.0-1.56) poses as risk factors of LTBI progression to active TB. The prevalence of LBMI 9% (OR = 1.07, 95% CI: 0.78-1.48) and obesity 42% (OR = 0.85, 95% CI: 0.70-1.03) did not show any statistically significant association with LTB-infected individuals. The present evidence of a high burden of multi-morbidity suggests that proximate risk factors of active TB in LTBI can be managed by nutrition and lifestyle modification.


Asunto(s)
Diabetes Mellitus , Hipertensión , Tuberculosis Latente , Desnutrición , Estado Prediabético , Tuberculosis , Adulto , Masculino , Femenino , Humanos , Tuberculosis Latente/epidemiología , Estudios Transversales , Prevalencia , Hemoglobina Glucada/análisis , Tuberculosis/epidemiología , Estado Prediabético/epidemiología , Diabetes Mellitus/epidemiología , Factores de Riesgo , India/epidemiología , Obesidad/epidemiología
7.
Nat Commun ; 13(1): 4844, 2022 08 23.
Artículo en Inglés | MEDLINE | ID: mdl-35999217

RESUMEN

Body fat distribution is a major, heritable risk factor for cardiometabolic disease, independent of overall adiposity. Using exome-sequencing in 618,375 individuals (including 160,058 non-Europeans) from the UK, Sweden and Mexico, we identify 16 genes associated with fat distribution at exome-wide significance. We show 6-fold larger effect for fat-distribution associated rare coding variants compared with fine-mapped common alleles, enrichment for genes expressed in adipose tissue and causal genes for partial lipodystrophies, and evidence of sex-dimorphism. We describe an association with favorable fat distribution (p = 1.8 × 10-09), favorable metabolic profile and protection from type 2 diabetes (~28% lower odds; p = 0.004) for heterozygous protein-truncating mutations in INHBE, which encodes a circulating growth factor of the activin family, highly and specifically expressed in hepatocytes. Our results suggest that inhibin ßE is a liver-expressed negative regulator of adipose storage whose blockade may be beneficial in fat distribution-associated metabolic disease.


Asunto(s)
Diabetes Mellitus Tipo 2 , Subunidades beta de Inhibinas/genética , Tejido Adiposo , Adiposidad/genética , Diabetes Mellitus Tipo 2/genética , Exoma/genética , Humanos , Mutación
8.
N Engl J Med ; 387(4): 332-344, 2022 07 28.
Artículo en Inglés | MEDLINE | ID: mdl-35939579

RESUMEN

BACKGROUND: Exome sequencing in hundreds of thousands of persons may enable the identification of rare protein-coding genetic variants associated with protection from human diseases like liver cirrhosis, providing a strategy for the discovery of new therapeutic targets. METHODS: We performed a multistage exome sequencing and genetic association analysis to identify genes in which rare protein-coding variants were associated with liver phenotypes. We conducted in vitro experiments to further characterize associations. RESULTS: The multistage analysis involved 542,904 persons with available data on liver aminotransferase levels, 24,944 patients with various types of liver disease, and 490,636 controls without liver disease. We found that rare coding variants in APOB, ABCB4, SLC30A10, and TM6SF2 were associated with increased aminotransferase levels and an increased risk of liver disease. We also found that variants in CIDEB, which encodes a structural protein found in hepatic lipid droplets, had a protective effect. The burden of rare predicted loss-of-function variants plus missense variants in CIDEB (combined carrier frequency, 0.7%) was associated with decreased alanine aminotransferase levels (beta per allele, -1.24 U per liter; 95% confidence interval [CI], -1.66 to -0.83; P = 4.8×10-9) and with 33% lower odds of liver disease of any cause (odds ratio per allele, 0.67; 95% CI, 0.57 to 0.79; P = 9.9×10-7). Rare coding variants in CIDEB were associated with a decreased risk of liver disease across different underlying causes and different degrees of severity, including cirrhosis of any cause (odds ratio per allele, 0.50; 95% CI, 0.36 to 0.70). Among 3599 patients who had undergone bariatric surgery, rare coding variants in CIDEB were associated with a decreased nonalcoholic fatty liver disease activity score (beta per allele in score units, -0.98; 95% CI, -1.54 to -0.41 [scores range from 0 to 8, with higher scores indicating more severe disease]). In human hepatoma cell lines challenged with oleate, CIDEB small interfering RNA knockdown prevented the buildup of large lipid droplets. CONCLUSIONS: Rare germline mutations in CIDEB conferred substantial protection from liver disease. (Funded by Regeneron Pharmaceuticals.).


Asunto(s)
Proteínas Reguladoras de la Apoptosis , Mutación de Línea Germinal , Hepatopatías , Proteínas Reguladoras de la Apoptosis/genética , Proteínas Reguladoras de la Apoptosis/metabolismo , Predisposición Genética a la Enfermedad/genética , Predisposición Genética a la Enfermedad/prevención & control , Humanos , Hígado/metabolismo , Hepatopatías/genética , Hepatopatías/metabolismo , Hepatopatías/prevención & control , Transaminasas/genética , Secuenciación del Exoma
9.
PLoS Negl Trop Dis ; 16(7): e0010561, 2022 07.
Artículo en Inglés | MEDLINE | ID: mdl-35857754

RESUMEN

BACKGROUND: The prevalence of Strongyloides stercoralis infection is estimated to be 30-100 million worldwide, although this an underestimate. Most cases remain undiagnosed due to the asymptomatic nature of the infection. We wanted to estimate the seroprevalence of S. stercoralis infection in a South Indian adult population. METHODS: To this end, we performed community-based screening of 2351 individuals (aged 18-65) in Kanchipuram District of Tamil Nadu between 2013 and 2020. Serological testing for S. stercoralis was performed using the NIE ELISA. RESULTS: Our data shows a seroprevalence of 33% (768/2351) for S. stercoralis infection which had a higher prevalence among males 36% (386/1069) than among females 29.8% (382/1282). Adults aged ≥55 (aOR = 1.65, 95% CI: 1.25-2.18) showed higher adjusted odds of association compared with other age groups. Eosinophil levels (39%) (aOR = 1.43, 95% CI: 1.19-1.74) and hemoglobin levels (24%) (aOR = 1.25, 95% CI: 1.11-1.53) were significantly associated with S. stercoralis infection. In contrast, low BMI (aOR = 1.15, 95% CI: 0.82-1.61) or the presence of diabetes mellitus (OR = 1.18, 95% CI: 0.83-1.69) was not associated with S. stercoralis seropositivity. CONCLUSIONS: Our study provides evidence for a very high baseline prevalence of S. stercoralis infection in South Indian communities and this information could provide realistic and concrete planning of control measures.


Asunto(s)
Strongyloides stercoralis , Estrongiloidiasis , Adulto , Animales , Heces , Femenino , Humanos , India/epidemiología , Masculino , Prevalencia , Estudios Seroepidemiológicos , Estrongiloidiasis/diagnóstico , Estrongiloidiasis/epidemiología
10.
Commun Biol ; 5(1): 540, 2022 06 03.
Artículo en Inglés | MEDLINE | ID: mdl-35661827

RESUMEN

To better understand the genetics of hearing loss, we performed a genome-wide association meta-analysis with 125,749 cases and 469,497 controls across five cohorts. We identified 53/c loci affecting hearing loss risk, including common coding variants in COL9A3 and TMPRSS3. Through exome sequencing of 108,415 cases and 329,581 controls, we observed rare coding associations with 11 Mendelian hearing loss genes, including additive effects in known hearing loss genes GJB2 (Gly12fs; odds ratio [OR] = 1.21, P = 4.2 × 10-11) and SLC26A5 (gene burden; OR = 1.96, P = 2.8 × 10-17). We also identified hearing loss associations with rare coding variants in FSCN2 (OR = 1.14, P = 1.9 × 10-15) and KLHDC7B (OR = 2.14, P = 5.2 × 10-30). Our results suggest a shared etiology between Mendelian and common hearing loss in adults. This work illustrates the potential of large-scale exome sequencing to elucidate the genetic architecture of common disorders where both common and rare variation contribute to risk.


Asunto(s)
Estudio de Asociación del Genoma Completo , Pérdida Auditiva , Exoma/genética , Variación Genética , Estudio de Asociación del Genoma Completo/métodos , Pérdida Auditiva/genética , Humanos , Proteínas de la Membrana/genética , Proteínas de Neoplasias/genética , Serina Endopeptidasas/genética , Secuenciación del Exoma
11.
Nat Genet ; 54(4): 382-392, 2022 04.
Artículo en Inglés | MEDLINE | ID: mdl-35241825

RESUMEN

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) enters human host cells via angiotensin-converting enzyme 2 (ACE2) and causes coronavirus disease 2019 (COVID-19). Here, through a genome-wide association study, we identify a variant (rs190509934, minor allele frequency 0.2-2%) that downregulates ACE2 expression by 37% (P = 2.7 × 10-8) and reduces the risk of SARS-CoV-2 infection by 40% (odds ratio = 0.60, P = 4.5 × 10-13), providing human genetic evidence that ACE2 expression levels influence COVID-19 risk. We also replicate the associations of six previously reported risk variants, of which four were further associated with worse outcomes in individuals infected with the virus (in/near LZTFL1, MHC, DPP9 and IFNAR2). Lastly, we show that common variants define a risk score that is strongly associated with severe disease among cases and modestly improves the prediction of disease severity relative to demographic and clinical factors alone.


Asunto(s)
COVID-19 , Enzima Convertidora de Angiotensina 2/genética , COVID-19/genética , Estudio de Asociación del Genoma Completo , Humanos , Factores de Riesgo , SARS-CoV-2/genética
12.
Nature ; 599(7886): 628-634, 2021 11.
Artículo en Inglés | MEDLINE | ID: mdl-34662886

RESUMEN

A major goal in human genetics is to use natural variation to understand the phenotypic consequences of altering each protein-coding gene in the genome. Here we used exome sequencing1 to explore protein-altering variants and their consequences in 454,787 participants in the UK Biobank study2. We identified 12 million coding variants, including around 1 million loss-of-function and around 1.8 million deleterious missense variants. When these were tested for association with 3,994 health-related traits, we found 564 genes with trait associations at P ≤ 2.18 × 10-11. Rare variant associations were enriched in loci from genome-wide association studies (GWAS), but most (91%) were independent of common variant signals. We discovered several risk-increasing associations with traits related to liver disease, eye disease and cancer, among others, as well as risk-lowering associations for hypertension (SLC9A3R2), diabetes (MAP3K15, FAM234A) and asthma (SLC27A3). Six genes were associated with brain imaging phenotypes, including two involved in neural development (GBE1, PLD1). Of the signals available and powered for replication in an independent cohort, 81% were confirmed; furthermore, association signals were generally consistent across individuals of European, Asian and African ancestry. We illustrate the ability of exome sequencing to identify gene-trait associations, elucidate gene function and pinpoint effector genes that underlie GWAS signals at scale.


Asunto(s)
Bancos de Muestras Biológicas , Bases de Datos Genéticas , Secuenciación del Exoma , Exoma/genética , África/etnología , Asia/etnología , Asma/genética , Diabetes Mellitus/genética , Europa (Continente)/etnología , Oftalmopatías/genética , Femenino , Predisposición Genética a la Enfermedad/genética , Variación Genética , Estudio de Asociación del Genoma Completo , Humanos , Hipertensión/genética , Hepatopatías/genética , Masculino , Mutación , Neoplasias/genética , Carácter Cuantitativo Heredable , Reino Unido
13.
Science ; 373(6550)2021 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-34210852

RESUMEN

Large-scale human exome sequencing can identify rare protein-coding variants with a large impact on complex traits such as body adiposity. We sequenced the exomes of 645,626 individuals from the United Kingdom, the United States, and Mexico and estimated associations of rare coding variants with body mass index (BMI). We identified 16 genes with an exome-wide significant association with BMI, including those encoding five brain-expressed G protein-coupled receptors (CALCR, MC4R, GIPR, GPR151, and GPR75). Protein-truncating variants in GPR75 were observed in ~4/10,000 sequenced individuals and were associated with 1.8 kilograms per square meter lower BMI and 54% lower odds of obesity in the heterozygous state. Knock out of Gpr75 in mice resulted in resistance to weight gain and improved glycemic control in a high-fat diet model. Inhibition of GPR75 may provide a therapeutic strategy for obesity.


Asunto(s)
Índice de Masa Corporal , Exoma/genética , Obesidad/genética , Receptores Acoplados a Proteínas G/genética , Animales , Variación Genética , Humanos , Ratones , Ratones Noqueados , Análisis de Secuencia de ADN , Aumento de Peso/genética
14.
Nat Genet ; 53(7): 942-948, 2021 07.
Artículo en Inglés | MEDLINE | ID: mdl-34183854

RESUMEN

The UK Biobank Exome Sequencing Consortium (UKB-ESC) is a private-public partnership between the UK Biobank (UKB) and eight biopharmaceutical companies that will complete the sequencing of exomes for all ~500,000 UKB participants. Here, we describe the early results from ~200,000 UKB participants and the features of this project that enabled its success. The biopharmaceutical industry has increasingly used human genetics to improve success in drug discovery. Recognizing the need for large-scale human genetics data, as well as the unique value of the data access and contribution terms of the UKB, the UKB-ESC was formed. As a result, exome data from 200,643 UKB enrollees are now available. These data include ~10 million exonic variants-a rich resource of rare coding variation that is particularly valuable for drug discovery. The UKB-ESC precompetitive collaboration has further strengthened academic and industry ties and has provided teams with an opportunity to interact with and learn from the wider research community.


Asunto(s)
Bancos de Muestras Biológicas , Descubrimiento de Drogas , Secuenciación del Exoma , Genética Humana , Investigación , Descubrimiento de Drogas/métodos , Genómica/métodos , Humanos , Reino Unido
15.
Nature ; 586(7831): 749-756, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-33087929

RESUMEN

The UK Biobank is a prospective study of 502,543 individuals, combining extensive phenotypic and genotypic data with streamlined access for researchers around the world1. Here we describe the release of exome-sequence data for the first 49,960 study participants, revealing approximately 4 million coding variants (of which around 98.6% have a frequency of less than 1%). The data include 198,269 autosomal predicted loss-of-function (LOF) variants, a more than 14-fold increase compared to the imputed sequence. Nearly all genes (more than 97%) had at least one carrier with a LOF variant, and most genes (more than 69%) had at least ten carriers with a LOF variant. We illustrate the power of characterizing LOF variants in this population through association analyses across 1,730 phenotypes. In addition to replicating established associations, we found novel LOF variants with large effects on disease traits, including PIEZO1 on varicose veins, COL6A1 on corneal resistance, MEPE on bone density, and IQGAP2 and GMPR on blood cell traits. We further demonstrate the value of exome sequencing by surveying the prevalence of pathogenic variants of clinical importance, and show that 2% of this population has a medically actionable variant. Furthermore, we characterize the penetrance of cancer in carriers of pathogenic BRCA1 and BRCA2 variants. Exome sequences from the first 49,960 participants highlight the promise of genome sequencing in large population-based studies and are now accessible to the scientific community.


Asunto(s)
Bases de Datos Genéticas , Secuenciación del Exoma , Exoma/genética , Mutación con Pérdida de Función/genética , Fenotipo , Anciano , Densidad Ósea/genética , Colágeno Tipo VI/genética , Demografía , Femenino , Genes BRCA1 , Genes BRCA2 , Genotipo , Humanos , Canales Iónicos/genética , Masculino , Persona de Mediana Edad , Neoplasias/genética , Penetrancia , Fragmentos de Péptidos/genética , Reino Unido , Várices/genética , Proteínas Activadoras de ras GTPasa/genética
16.
Am J Hum Genet ; 102(5): 874-889, 2018 05 03.
Artículo en Inglés | MEDLINE | ID: mdl-29727688

RESUMEN

Large-scale human genetics studies are ascertaining increasing proportions of populations as they continue growing in both number and scale. As a result, the amount of cryptic relatedness within these study cohorts is growing rapidly and has significant implications on downstream analyses. We demonstrate this growth empirically among the first 92,455 exomes from the DiscovEHR cohort and, via a custom simulation framework we developed called SimProgeny, show that these measures are in line with expectations given the underlying population and ascertainment approach. For example, within DiscovEHR we identified ∼66,000 close (first- and second-degree) relationships, involving 55.6% of study participants. Our simulation results project that >70% of the cohort will be involved in these close relationships, given that DiscovEHR scales to 250,000 recruited individuals. We reconstructed 12,574 pedigrees by using these relationships (including 2,192 nuclear families) and leveraged them for multiple applications. The pedigrees substantially improved the phasing accuracy of 20,947 rare, deleterious compound heterozygous mutations. Reconstructed nuclear families were critical for identifying 3,415 de novo mutations in ∼1,783 genes. Finally, we demonstrate the segregation of known and suspected disease-causing mutations, including a tandem duplication that occurs in LDLR and causes familial hypercholesterolemia, through reconstructed pedigrees. In summary, this work highlights the prevalence of cryptic relatedness expected among large healthcare population-genomic studies and demonstrates several analyses that are uniquely enabled by large amounts of cryptic relatedness.


Asunto(s)
Exoma/genética , Medicina de Precisión , Estudios de Cohortes , Simulación por Computador , Registros Electrónicos de Salud , Exones/genética , Familia , Femenino , Genética de Población , Geografía , Heterocigoto , Humanos , Masculino , Mutación/genética , Linaje , Fenotipo , Reproducibilidad de los Resultados
17.
Nucleic Acids Res ; 46(7): 3326-3338, 2018 04 20.
Artículo en Inglés | MEDLINE | ID: mdl-29562350

RESUMEN

Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumscribed the universe of all possible uORFs based on coding gene sequence motifs and identified 1.3 million unique uORFs. To determine which of these are likely to be biologically relevant, we built a simple Bayesian classifier using 89 attributes of uORFs labeled as active in ribosome profiling experiments. This allowed us to extrapolate to a comprehensive catalog of likely functional uORFs. We validated our predictions using in vivo protein levels and ribosome occupancy from 46 individuals. This is a substantially larger catalog of functional uORFs than has previously been reported. Our ranked list of likely active uORFs allows researchers to test their hypotheses regarding the role of uORFs in health and disease. We demonstrate several examples of biological interest through the application of our catalog to somatic mutations in cancer and disease-associated germline variants in humans.


Asunto(s)
Sistemas de Lectura Abierta/genética , Biosíntesis de Proteínas/genética , ARN Mensajero/genética , Ribosomas/genética , Teorema de Bayes , Biología Computacional , Humanos , Mutación/genética
18.
N Engl J Med ; 378(12): 1096-1106, 2018 03 22.
Artículo en Inglés | MEDLINE | ID: mdl-29562163

RESUMEN

BACKGROUND: Elucidation of the genetic factors underlying chronic liver disease may reveal new therapeutic targets. METHODS: We used exome sequence data and electronic health records from 46,544 participants in the DiscovEHR human genetics study to identify genetic variants associated with serum levels of alanine aminotransferase (ALT) and aspartate aminotransferase (AST). Variants that were replicated in three additional cohorts (12,527 persons) were evaluated for association with clinical diagnoses of chronic liver disease in DiscovEHR study participants and two independent cohorts (total of 37,173 persons) and with histopathological severity of liver disease in 2391 human liver samples. RESULTS: A splice variant (rs72613567:TA) in HSD17B13, encoding the hepatic lipid droplet protein hydroxysteroid 17-beta dehydrogenase 13, was associated with reduced levels of ALT (P=4.2×10-12) and AST (P=6.2×10-10). Among DiscovEHR study participants, this variant was associated with a reduced risk of alcoholic liver disease (by 42% [95% confidence interval {CI}, 20 to 58] among heterozygotes and by 53% [95% CI, 3 to 77] among homozygotes), nonalcoholic liver disease (by 17% [95% CI, 8 to 25] among heterozygotes and by 30% [95% CI, 13 to 43] among homozygotes), alcoholic cirrhosis (by 42% [95% CI, 14 to 61] among heterozygotes and by 73% [95% CI, 15 to 91] among homozygotes), and nonalcoholic cirrhosis (by 26% [95% CI, 7 to 40] among heterozygotes and by 49% [95% CI, 15 to 69] among homozygotes). Associations were confirmed in two independent cohorts. The rs72613567:TA variant was associated with a reduced risk of nonalcoholic steatohepatitis, but not steatosis, in human liver samples. The rs72613567:TA variant mitigated liver injury associated with the risk-increasing PNPLA3 p.I148M allele and resulted in an unstable and truncated protein with reduced enzymatic activity. CONCLUSIONS: A loss-of-function variant in HSD17B13 was associated with a reduced risk of chronic liver disease and of progression from steatosis to steatohepatitis. (Funded by Regeneron Pharmaceuticals and others.).


Asunto(s)
17-Hidroxiesteroide Deshidrogenasas/genética , Hígado Graso/genética , Predisposición Genética a la Enfermedad , Hepatopatías/genética , Mutación con Pérdida de Función , 17-Hidroxiesteroide Deshidrogenasas/metabolismo , Alanina Transaminasa/sangre , Aspartato Aminotransferasas/sangre , Biomarcadores/sangre , Enfermedad Crónica , Progresión de la Enfermedad , Femenino , Variación Genética , Genotipo , Humanos , Modelos Lineales , Hígado/patología , Hepatopatías/patología , Masculino , Análisis de Secuencia de ARN , Secuenciación del Exoma
19.
Nucleic Acids Res ; 45(18): 10393-10402, 2017 Oct 13.
Artículo en Inglés | MEDLINE | ID: mdl-28977528

RESUMEN

Nonsynonymous single nucleotide variants (nsSNVs) constitute about 50% of known disease-causing mutations and understanding their functional impact is an area of active research. Existing algorithms predict pathogenicity of nsSNVs; however, they are unable to differentiate heterozygous, dominant disease-causing variants from heterozygous carrier variants that lead to disease only in the homozygous state. Here, we present MAPPIN (Method for Annotating, Predicting Pathogenicity, and mode of Inheritance for Nonsynonymous variants), a prediction method which utilizes a random forest algorithm to distinguish between nsSNVs with dominant, recessive, and benign effects. We apply MAPPIN to a set of Mendelian disease-causing mutations and accurately predict pathogenicity for all mutations. Furthermore, MAPPIN predicts mode of inheritance correctly for 70.3% of nsSNVs. MAPPIN also correctly predicts pathogenicity for 87.3% of mutations from the Deciphering Developmental Disorders Study with a 78.5% accuracy for mode of inheritance. When tested on a larger collection of mutations from the Human Gene Mutation Database, MAPPIN is able to significantly discriminate between mutations in known dominant and recessive genes. Finally, we demonstrate that MAPPIN outperforms CADD and Eigen in predicting disease inheritance modes for all validation datasets. To our knowledge, MAPPIN is the first nsSNV pathogenicity prediction algorithm that provides mode of inheritance predictions, adding another layer of information for variant prioritization.


Asunto(s)
Mapeo Cromosómico/métodos , Biología Computacional/métodos , Predisposición Genética a la Enfermedad , Patrón de Herencia , Anotación de Secuencia Molecular , Mutación , Algoritmos , Animales , Bases de Datos Genéticas , Enfermedad/genética , Predicción , Variación Genética , Humanos , Ratones , Polimorfismo de Nucleótido Simple
20.
Nat Commun ; 8(1): 382, 2017 08 29.
Artículo en Inglés | MEDLINE | ID: mdl-28851873

RESUMEN

Variants predicted to result in the loss of function of human genes have attracted interest because of their clinical impact and surprising prevalence in healthy individuals. Here, we present ALoFT (annotation of loss-of-function transcripts), a method to annotate and predict the disease-causing potential of loss-of-function variants. Using data from Mendelian disease-gene discovery projects, we show that ALoFT can distinguish between loss-of-function variants that are deleterious as heterozygotes and those causing disease only in the homozygous state. Investigation of variants discovered in healthy populations suggests that each individual carries at least two heterozygous premature stop alleles that could potentially lead to disease if present as homozygotes. When applied to de novo putative loss-of-function variants in autism-affected families, ALoFT distinguishes between deleterious variants in patients and benign variants in unaffected siblings. Finally, analysis of somatic variants in >6500 cancer exomes shows that putative loss-of-function variants predicted to be deleterious by ALoFT are enriched in known driver genes.Variants causing loss of function (LoF) of human genes have clinical implications. Here, the authors present a method to predict disease-causing potential of LoF variants, ALoFT (annotation of Loss-of-Function Transcripts) and show its application to interpreting LoF variants in different contexts.


Asunto(s)
Mutación con Pérdida de Función , Anotación de Secuencia Molecular , Trastorno Autístico/genética , Bases de Datos Genéticas , Exoma , Predisposición Genética a la Enfermedad , Humanos , Neoplasias/genética , Polimorfismo de Nucleótido Simple
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...