Results 1 - 20 of 588
1.
Cell ; 185(23): 4409-4427.e18, 2022 11 10.
Article in English | MEDLINE | ID: mdl-36368308

ABSTRACT

Fully understanding autism spectrum disorder (ASD) genetics requires whole-genome sequencing (WGS). We present the latest release of the Autism Speaks MSSNG resource, which includes WGS data from 5,100 individuals with ASD and 6,212 non-ASD parents and siblings (total n = 11,312). Examining a wide variety of genetic variants in MSSNG and the Simons Simplex Collection (SSC; n = 9,205), we identified ASD-associated rare variants in 718/5,100 individuals with ASD from MSSNG (14.1%) and 350/2,419 from SSC (14.5%). Considering genomic architecture, 52% were nuclear sequence-level variants, 46% were nuclear structural variants (including copy-number variants, inversions, large insertions, uniparental isodisomies, and tandem repeat expansions), and 2% were mitochondrial variants. Our study provides a guidebook for exploring genotype-phenotype correlations in families who carry ASD-associated rare variants and serves as an entry point to the expanded studies required to dissect the etiology in the ∼85% of the ASD population that remain idiopathic.


Subject(s)
Autism Spectrum Disorder , Autistic Disorder , Humans , Autism Spectrum Disorder/genetics , Genetic Predisposition to Disease , DNA Copy Number Variations/genetics , Genomics
2.
Annu Rev Pharmacol Toxicol ; 64: 33-51, 2024 Jan 23.
Article in English | MEDLINE | ID: mdl-37506333

ABSTRACT

Interindividual variability in genes encoding drug-metabolizing enzymes, transporters, receptors, and human leukocyte antigens has a major impact on a patient's response to drugs with regard to efficacy and safety. Enabled by both technological and conceptual advances, the field of pharmacogenomics is developing rapidly. Major progress in omics profiling methods has enabled novel genotypic and phenotypic characterization of patients and biobanks. These developments are paralleled by advances in machine learning, which have allowed us to parse the immense wealth of data and establish novel genetic markers and polygenic models for drug selection and dosing. Pharmacogenomics has recently become more widespread in clinical practice to personalize treatment and to develop new drugs tailored to specific patient populations. In this review, we provide an overview of the latest developments in the field and discuss the way forward, including how to address the missing heritability, develop novel polygenic models, and further improve the clinical implementation of pharmacogenomics.


Subject(s)
Membrane Transport Proteins , Pharmacogenetics , Humans , Technology
3.
Am J Hum Genet ; 111(5): 990-995, 2024 05 02.
Article in English | MEDLINE | ID: mdl-38636510

ABSTRACT

Since genotype imputation was introduced, researchers have been relying on the estimated imputation quality from imputation software to perform post-imputation quality control (QC). However, this quality estimate (denoted as Rsq) performs less well for lower-frequency variants. We recently published MagicalRsq, a machine-learning-based imputation quality calibration, which leverages additional typed markers from the same cohort and outperforms Rsq as a QC metric. In this work, we extended the original MagicalRsq to allow cross-cohort model training and named the new model MagicalRsq-X. We removed the cohort-specific estimated minor allele frequency and included linkage disequilibrium scores and recombination rates as additional features. Leveraging whole-genome sequencing data from TOPMed, specifically participants in the BioMe, JHS, WHI, and MESA studies, we performed comprehensive cross-cohort evaluations for predominantly European and African ancestral individuals based on their inferred global ancestry with the 1000 Genomes and Human Genome Diversity Project data as reference. Our results suggest MagicalRsq-X outperforms Rsq in almost every setting, with 7.3%-14.4% improvement in squared Pearson correlation with true R², corresponding to 85-218 K variant gains. We further developed a metric to quantify the genetic distance of a target cohort relative to a reference cohort and showed that this metric largely explained the performance of MagicalRsq-X models. Finally, we found MagicalRsq-X saved up to 53 known genome-wide significant variants in one of the largest blood cell trait GWASs that would be missed using the original Rsq for QC. In conclusion, MagicalRsq-X shows superiority for post-imputation QC and benefits genetic studies by distinguishing well and poorly imputed lower-frequency variants.
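
The evaluation metric named in this abstract, the squared Pearson correlation between a quality estimate and the true R², can be illustrated with a minimal sketch. This is not MagicalRsq-X itself; the function name `squared_pearson` and the per-variant values below are hypothetical and chosen only for illustration.

```python
def squared_pearson(x, y):
    """Return the squared Pearson correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squared deviations around the means.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov * cov / (var_x * var_y)


# Hypothetical per-variant values (not taken from the paper).
estimated_quality = [0.20, 0.45, 0.60, 0.85, 0.95]
true_r2 = [0.10, 0.50, 0.55, 0.80, 0.99]
print(round(squared_pearson(estimated_quality, true_r2), 3))
```

A higher value means the quality estimate tracks the true imputation accuracy more closely, which is the sense in which the abstract compares Rsq with MagicalRsq-X.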


Subject(s)
Gene Frequency , Genotype , Polymorphism, Single Nucleotide , Software , Humans , Cohort Studies , Linkage Disequilibrium , Genome-Wide Association Study/methods , Genome, Human , Quality Control , Machine Learning , Whole Genome Sequencing/standards , Whole Genome Sequencing/methods
4.
Hum Mol Genet ; 2024 Jul 07.
Article in English | MEDLINE | ID: mdl-38970828

ABSTRACT

Systemic sclerosis (SSc) is a heterogeneous, rare autoimmune fibrosing disorder affecting connective tissue. The etiology of systemic sclerosis is largely unknown, and many genes have been suggested as susceptibility loci of modest impact by genome-wide association studies (GWAS). Multiple factors can contribute to the pathological process of the disease, which makes it more difficult to identify possible disease-causing genetic alterations. In this study, we applied whole-genome sequencing (WGS) to 101 indexed family trios, supplemented with transcriptome sequencing of cultured fibroblast cells from four patients and five family controls where available. Single-nucleotide variants (SNVs) and copy-number variants (CNVs) were examined, with emphasis on de novo variants. We also performed an enrichment test for rare variants in candidate genes previously proposed in association with systemic sclerosis. We identified 42 exonic and 34 ncRNA de novo SNV changes in 101 trios, from a total of over 6000 de novo variants genome-wide. We observed more de novo variants than expected in the PRKXP1 gene. We observed the same phenomenon in the NEK7 gene, along with increased expression in the patient group. Additionally, we observed significant enrichment of rare variants in candidate genes in the patient cohort, further supporting the complex, multifactorial etiology of systemic sclerosis. Our findings identify new candidate genes, including PRKXP1 and NEK7, for future studies in SSc. The rare-variant enrichment in candidate genes previously proposed in association with SSc suggests that more effort should go into investigating the pathogenetic mechanisms associated with those candidate genes.

5.
Am J Hum Genet ; 110(10): 1704-1717, 2023 10 05.
Article in English | MEDLINE | ID: mdl-37802043

ABSTRACT

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions in lipid metabolism. Large-scale whole-genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess more associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with measurement of blood lipids and lipoproteins (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare-variant aggregate association tests using the STAAR (variant-set test for association using annotation information) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare-coding variants in nearby protein-coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500-kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variation and rare protein-coding variation at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNAs.


Subject(s)
RNA, Long Noncoding , Humans , RNA, Long Noncoding/genetics , Genome-Wide Association Study , Precision Medicine , Whole Genome Sequencing/methods , Lipids/genetics , Polymorphism, Single Nucleotide/genetics
6.
Am J Hum Genet ; 110(5): 762-773, 2023 05 04.
Article in English | MEDLINE | ID: mdl-37019109

ABSTRACT

The ongoing release of large-scale sequencing data in the UK Biobank allows for the identification of associations between rare variants and complex traits. SAIGE-GENE+ is a valid approach to conducting set-based association tests for quantitative and binary traits. However, for ordinal categorical phenotypes, applying SAIGE-GENE+ by treating the trait as quantitative or by binarizing it can cause inflated type I error rates or power loss. In this study, we propose a scalable and accurate method for rare-variant association tests, POLMM-GENE, in which we used a proportional odds logistic mixed model to characterize ordinal categorical phenotypes while adjusting for sample relatedness. POLMM-GENE fully utilizes the categorical nature of phenotypes and can therefore control type I error rates well while remaining powerful. In the analyses of UK Biobank 450k whole-exome-sequencing data for five ordinal categorical traits, POLMM-GENE identified 54 gene-phenotype associations.
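
To make the proportional odds idea concrete, here is a minimal sketch of how a cumulative logit model turns a linear predictor into ordinal category probabilities. It covers only the fixed-effects link, P(Y ≤ j) = logistic(cutpoint_j − η); the random effect that POLMM-GENE uses to adjust for sample relatedness is omitted, and the sign convention, function names, and cutpoints are assumptions made for illustration.

```python
from math import exp


def logistic(z):
    """Standard logistic function."""
    return 1.0 / (1.0 + exp(-z))


def category_probabilities(cutpoints, eta):
    """Return P(Y = j) for each ordinal category.

    Assumes the convention P(Y <= j) = logistic(cutpoints[j] - eta), with
    strictly increasing cutpoints; J cutpoints define J + 1 categories.
    """
    if any(b <= a for a, b in zip(cutpoints, cutpoints[1:])):
        raise ValueError("cutpoints must be strictly increasing")
    # Cumulative probabilities, closed with 1.0 for the top category.
    cumulative = [logistic(c - eta) for c in cutpoints] + [1.0]
    probabilities = []
    previous = 0.0
    for value in cumulative:
        probabilities.append(value - previous)
        previous = value
    return probabilities


# Hypothetical cutpoints for a three-level ordinal trait.
print(category_probabilities([-1.0, 1.0], eta=0.0))
print(category_probabilities([-1.0, 1.0], eta=2.0))
```

Under this convention a larger linear predictor shifts probability toward the higher categories while a single set of coefficients applies across all cutpoints, which is what lets the model use the ordering instead of binarizing the trait.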


Subject(s)
Exome , Genome-Wide Association Study , Genome-Wide Association Study/methods , Exome/genetics , Biological Specimen Banks , Phenotype , Data Analysis , United Kingdom
7.
Am J Hum Genet ; 110(5): 863-879, 2023 05 04.
Article in English | MEDLINE | ID: mdl-37146589

ABSTRACT

Deleterious mutations in the X-linked gene encoding ornithine transcarbamylase (OTC) cause the most common urea cycle disorder, OTC deficiency. This rare but highly actionable disease can present with severe neonatal onset in males or with later onset in either sex. Individuals with neonatal onset appear normal at birth but rapidly develop hyperammonemia, which can progress to cerebral edema, coma, and death, outcomes ameliorated by rapid diagnosis and treatment. Here, we develop a high-throughput functional assay for human OTC and individually measure the impact of 1,570 variants, 84% of all SNV-accessible missense mutations. Comparison to existing clinical significance calls demonstrated that our assay distinguishes known benign from pathogenic variants and variants with neonatal onset from late-onset disease presentation. This functional stratification allowed us to identify score ranges corresponding to clinically relevant levels of impairment of OTC activity. Examining the results of our assay in the context of protein structure further allowed us to identify a 13 amino acid domain, the SMG loop, whose function appears to be required in human cells but not in yeast. Finally, inclusion of our data as PS3 evidence under the current ACMG guidelines, in a pilot reclassification of 34 variants with complete loss of activity, would change the classification of 22 from variants of unknown significance to clinically actionable likely pathogenic variants. These results illustrate how large-scale functional assays are especially powerful when applied to rare genetic diseases.


Subject(s)
Hyperammonemia , Ornithine Carbamoyltransferase Deficiency Disease , Ornithine Carbamoyltransferase , Humans , Amino Acid Substitution , Hyperammonemia/etiology , Hyperammonemia/genetics , Mutation, Missense/genetics , Ornithine Carbamoyltransferase/genetics , Ornithine Carbamoyltransferase Deficiency Disease/genetics , Ornithine Carbamoyltransferase Deficiency Disease/diagnosis , Ornithine Carbamoyltransferase Deficiency Disease/therapy
8.
Genet Epidemiol ; 2024 Mar 12.
Article in English | MEDLINE | ID: mdl-38472164

ABSTRACT

Genome-wide association studies (GWAS) have provided an abundance of information about the genetic variants and their loci that are associated with complex traits and diseases. However, due to linkage disequilibrium (LD) and noncoding regions of loci, it remains a challenge to pinpoint the causal genes. Gene network-based approaches, paired with network diffusion methods, have been proposed to prioritize causal genes and to boost statistical power in GWAS based on the assumption that trait-associated genes are clustered in a gene network. Due to the difficulty in mapping trait-associated variants to genes in GWAS, this assumption has never been directly or rigorously tested empirically. On the other hand, whole-exome sequencing (WES) data focus on the protein-coding regions, directly identifying trait-associated genes. In this study, we tested the assumption by leveraging the recently available exome-based association statistics from the UK Biobank WES data along with two types of networks. We found that almost all trait-associated genes were significantly more proximal to each other than randomly selected genes within both networks. These results support the assumption that trait-associated genes are clustered in gene networks, which can be further leveraged to boost the power of GWAS, for example by introducing less stringent p value thresholds.
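
The comparison described above, whether trait-associated genes sit closer together in a network than randomly selected genes, can be sketched as a simple permutation test. This is only an illustration under stated assumptions: the abstract does not specify the proximity measure or the sampling scheme, so mean pairwise shortest-path distance, uniform random gene sets, a connected unweighted graph, and all names and toy data below are hypothetical.

```python
import random
from collections import deque
from itertools import combinations


def shortest_path_lengths(graph, source):
    """Breadth-first distances from source in an unweighted graph {node: set(neighbors)}."""
    distances = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in distances:
                distances[neighbor] = distances[node] + 1
                queue.append(neighbor)
    return distances


def mean_pairwise_distance(graph, genes):
    """Mean shortest-path distance over all pairs of genes (graph assumed connected)."""
    genes = list(genes)
    distances = {gene: shortest_path_lengths(graph, gene) for gene in genes}
    pairs = list(combinations(genes, 2))
    return sum(distances[a][b] for a, b in pairs) / len(pairs)


def proximity_p_value(graph, trait_genes, n_permutations=1000, seed=0):
    """Empirical p value: how often a random gene set is at least as tightly clustered."""
    rng = random.Random(seed)
    observed = mean_pairwise_distance(graph, trait_genes)
    nodes = sorted(graph)
    hits = 0
    for _ in range(n_permutations):
        sample = rng.sample(nodes, len(trait_genes))
        if mean_pairwise_distance(graph, sample) <= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return observed, (hits + 1) / (n_permutations + 1)


# Toy network: a path of 20 hypothetical genes, with three adjacent "trait" genes.
toy_graph = {i: set() for i in range(20)}
for i in range(19):
    toy_graph[i].add(i + 1)
    toy_graph[i + 1].add(i)

observed, p_value = proximity_p_value(toy_graph, [0, 1, 2])
print(observed, p_value)
```

In practice the null gene sets would often be matched on properties such as node degree; uniform sampling is used here only to keep the sketch short.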

9.
Am J Hum Genet ; 109(4): 680-691, 2022 04 07.
Article in English | MEDLINE | ID: mdl-35298919

ABSTRACT

Identification of rare-variant associations is crucial to full characterization of the genetic architecture of complex traits and diseases. Essential in this process is the evaluation of novel methods in simulated data that mirror the distribution of rare variants and haplotype structure in real data. Additionally, importing real-variant annotation enables in silico comparison of methods, such as rare-variant association tests and polygenic scoring methods, that focus on putative causal variants. Existing simulation methods are either unable to employ real-variant annotation or severely under- or overestimate the number of singletons and doubletons, thereby reducing the ability to generalize simulation results to real studies. We present RAREsim, a flexible and accurate rare-variant simulation algorithm. Using parameters and haplotypes derived from real sequencing data, RAREsim efficiently simulates the expected variant distribution and enables real-variant annotations. We highlight RAREsim's utility across various genetic regions, sample sizes, ancestries, and variant classes.


Subject(s)
Genetic Variation , Research Design , Computer Simulation , Genetic Variation/genetics , Haplotypes/genetics , Humans , Models, Genetic , Multifactorial Inheritance
10.
Am J Hum Genet ; 109(6): 1055-1064, 2022 06 02.
Article in English | MEDLINE | ID: mdl-35588732

ABSTRACT

Polygenic risk scores (PRSs) quantify the contribution of multiple genetic loci to an individual's likelihood of a complex trait or disease. However, existing PRSs estimate this likelihood with common genetic variants, excluding the impact of rare variants. Here, we report on a method to identify rare variants associated with outlier gene expression and integrate their impact into PRS predictions for body mass index (BMI), obesity, and bariatric surgery. Between the top and bottom 10%, we observed a 20.8% increase in risk for obesity (p = 3 × 10⁻¹⁴), 62.3% increase in risk for severe obesity (p = 1 × 10⁻⁶), and median 5.29 years earlier onset for bariatric surgery (p = 0.008), as a function of expression outlier-associated rare variant burden when controlling for common variant PRS. We show that these predictions were more significant than integrating the effects of rare protein-truncating variants (PTVs), observing a mean 19% increase in phenotypic variance explained with expression outlier-associated rare variants when compared with PTVs (p = 2 × 10⁻¹⁵). We replicated these findings by using data from the Million Veteran Program and demonstrated that PRSs across multiple traits and diseases can benefit from the inclusion of expression outlier-associated rare variants identified through population-scale transcriptome sequencing.
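
For readers unfamiliar with how a common-variant PRS is formed, the standard construction is a weighted sum of effect-allele dosages. The sketch below shows only that baseline; it does not reproduce the paper's rare-variant method, and the variant identifiers, weights, and function name are hypothetical.

```python
def polygenic_score(dosages, weights):
    """Weighted sum of effect-allele dosages.

    dosages: {variant_id: 0, 1, or 2 copies of the effect allele} for one person.
    weights: {variant_id: per-allele effect size}, e.g. from a GWAS.
    Variants missing from dosages are treated as 0 copies (a simplifying assumption).
    """
    return sum(weight * dosages.get(variant, 0) for variant, weight in weights.items())


# Hypothetical weights and genotypes, for illustration only.
weights = {"variant_a": 0.5, "variant_b": 0.25, "variant_c": 1.0}
person = {"variant_a": 2, "variant_b": 1, "variant_c": 0}
print(polygenic_score(person, weights))
```

A rare-variant burden such as the one described in the abstract would enter a downstream model as an additional predictor alongside this score, not as a replacement for it.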


Subject(s)
Multifactorial Inheritance , Obesity , Body Mass Index , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Multifactorial Inheritance/genetics , Obesity/genetics , Phenotype , Risk Factors
11.
Eur Heart J ; 2024 Jul 19.
Article in English | MEDLINE | ID: mdl-39028637

ABSTRACT

Atrial fibrillation (AF) is a globally prevalent cardiac arrhythmia with significant genetic underpinnings, as highlighted by recent large-scale genetic studies. A prominent clinical and genetic overlap exists between AF, heritable ventricular cardiomyopathies, and arrhythmia syndromes, underlining the potential of AF as an early indicator of severe ventricular disease in younger individuals. Indeed, several recent studies have demonstrated meaningful yields of rare pathogenic variants among early-onset AF patients (∼4%-11%), most notably for cardiomyopathy genes in which rare variants are considered clinically actionable. Genetic testing thus presents a promising opportunity to identify monogenetic defects linked to AF and inherited cardiac conditions, such as cardiomyopathy, and may contribute to prognosis and management in early-onset AF patients. A first step towards recognizing this monogenic contribution was taken with the Class IIb recommendation for genetic testing in AF patients aged 45 years or younger by the 2023 American College of Cardiology/American Heart Association guidelines for AF. By identifying pathogenic genetic variants known to underlie inherited cardiomyopathies and arrhythmia syndromes, a personalized care pathway can be developed, encompassing more tailored screening, cascade testing, and potentially genotype-informed prognosis and preventive measures. However, this can only be ensured by frameworks that are developed and supported by all stakeholders. Ambiguity in test results, such as variants of uncertain significance, remains a major challenge, and as many as ∼60% of people with early-onset AF might carry such variants. Patient education (including pretest counselling), training of genetic teams, selection of high-confidence genes, and careful reporting are strategies to mitigate this.
Further challenges to implementation include financial barriers, insurability issues, workforce limitations, and the need for standardized definitions in a fast-moving field. Moreover, the prevailing genetic evidence largely rests on populations of European descent, underscoring the need for diverse research cohorts and international collaboration. Embracing these challenges and the potential of genetic testing may improve AF care. However, further research (mechanistic, translational, and clinical) is urgently needed.

12.
Diabetologia ; 67(2): 327-332, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38051360

ABSTRACT

AIMS/HYPOTHESIS: GLIS3 encodes a transcription factor involved in pancreatic beta cell development and function. Rare pathogenic, bi-allelic mutations in GLIS3 cause syndromic neonatal diabetes whereas frequent SNPs at this locus associate with common type 2 diabetes risk. Because rare, functional variants located in other susceptibility genes for type 2 diabetes have already been shown to strongly increase individual risk for common type 2 diabetes, we aimed to investigate the contribution of rare pathogenic GLIS3 variants to type 2 diabetes. METHODS: GLIS3 was sequenced in 5471 individuals from the Rare Variants Involved in Diabetes and Obesity (RaDiO) study. Variant pathogenicity was assessed following the criteria established by the American College of Medical Genetics and Genomics (ACMG). To address the pathogenic strong criterion number 3 (PS3), we conducted functional investigations of these variants using luciferase assays, focusing on the capacity of GLIS family zinc finger 3 (GLIS3) to bind to and activate the INS promoter. The association between rare pathogenic or likely pathogenic (P/LP) variants and type 2 diabetes risk (and other metabolic traits) was then evaluated. A meta-analysis combining association results from RaDiO, the 52K study (43,125 individuals) and the TOPMed study (44,083 individuals) was finally performed. RESULTS: Through targeted resequencing of GLIS3, we identified 105 rare variants that were carried by 395 participants from RaDiO. Among them, 49 variants decreased the activation of the INS promoter. Following ACMG criteria, 18 rare variants were classified as P/LP, showing an enrichment in the last two exons compared with the remaining exons (p<5×10⁻⁶; OR>3.5).
The burden of these P/LP variants was markedly higher in individuals with type 2 diabetes (p=3.0×10⁻³; OR 3.9 [95% CI 1.4, 12]), whereas adiposity, age at type 2 diabetes diagnosis and cholesterol levels were similar between variant carriers and non-carriers with type 2 diabetes. Interestingly, all carriers with type 2 diabetes were sensitive to oral sulfonylureas. A total of 7 P/LP variants were identified in both 52K and TOPMed studies. The meta-analysis of association studies obtained from RaDiO, 52K and TOPMed showed an enrichment of P/LP GLIS3 variants in individuals with type 2 diabetes (p=5.6×10⁻⁵; OR 2.1 [95% CI 1.4, 2.9]). CONCLUSIONS/INTERPRETATION: Rare P/LP GLIS3 variants do contribute to type 2 diabetes risk. The variants located in the distal part of the protein could have a direct effect on its functional activity by impacting its transactivation domain, by homology with the mouse GLIS3 protein. Furthermore, rare P/LP GLIS3 variants seem to have a direct clinical effect on beta cell function, which could be improved by increasing insulin secretion via the use of sulfonylureas.


Subject(s)
Diabetes Mellitus, Type 2 , Insulin-Secreting Cells , Mice , Animals , Infant, Newborn , Humans , Diabetes Mellitus, Type 2/drug therapy , Diabetes Mellitus, Type 2/genetics , Transcription Factors/genetics , Transcription Factors/metabolism , Gene Expression Regulation , Insulin-Secreting Cells/metabolism , Mutation , DNA-Binding Proteins/metabolism , Repressor Proteins/metabolism , Trans-Activators/metabolism
13.
Genet Epidemiol ; 47(6): 450-460, 2023 09.
Article in English | MEDLINE | ID: mdl-37158367

ABSTRACT

Current software packages for the analysis and the simulations of rare variants are only available for binary and continuous traits. Ravages provides solutions in a single R package to perform rare variant association tests for multicategory, binary and continuous phenotypes, to simulate datasets under different scenarios and to compute statistical power. Association tests can be run in the whole genome thanks to C++ implementation of most of the functions, using either RAVA-FIRST, a recently developed strategy to filter and analyse genome-wide rare variants, or user-defined candidate regions. Ravages also includes a simulation module that generates genetic data for cases who can be stratified into several subgroups and for controls. Through comparisons with existing programmes, we show that Ravages complements existing tools and will be useful to study the genetic architecture of complex diseases. Ravages is available on the CRAN at https://cran.r-project.org/web/packages/Ravages/ and maintained on Github at https://github.com/genostats/Ravages.


Subject(s)
Genetic Variation , Models, Genetic , Humans , Computer Simulation , Phenotype , Software
14.
Genet Epidemiol ; 47(1): 78-94, 2023 02.
Article in English | MEDLINE | ID: mdl-36047334

ABSTRACT

Linkage analysis maps genetic loci for a heritable trait by identifying genomic regions with excess relatedness among individuals with similar trait values. Analysis may be conducted on related individuals from families, or on samples of unrelated individuals from a population. For allelically heterogeneous traits, population-based linkage analysis can be more powerful than genotypic-association analysis. Here, we focus on linkage analysis in a population sample, but use sequences rather than individuals as our unit of observation. Earlier investigations of sequence-based linkage mapping relied on known sequence relatedness, whereas we infer relatedness from the sequence data. We propose two ways to associate similarity in relatedness of sequences with similarity in their trait values and compare the resulting linkage methods to two genotypic-association methods. We also introduce a procedure to label case sequences as potential carriers or noncarriers of causal variants after an association has been found. This post hoc labeling of case sequences is based on inferred relatedness to other case sequences. Our simulation results indicate that methods based on sequence relatedness improve localization and perform as well as genotypic-association methods for detecting rare causal variants. Sequence-based linkage analysis therefore has potential to fine-map allelically heterogeneous disease traits.


Subject(s)
Models, Genetic , Quantitative Trait Loci , Humans , Chromosome Mapping/methods , Phenotype , Genotype , Genetic Linkage , Linkage Disequilibrium
15.
Genet Epidemiol ; 47(7): 475-495, 2023 10.
Article in English | MEDLINE | ID: mdl-37341229

ABSTRACT

The risk of congenital heart defects (CHDs) may be influenced by maternal genes, fetal genes, and their interactions. Existing methods commonly test the effects of maternal and fetal variants one-at-a-time and may have reduced statistical power to detect genetic variants with low minor allele frequencies. In this article, we propose a gene-based association test of interactions for maternal-fetal genotypes (GATI-MFG) using a case-mother and control-mother design. GATI-MFG can integrate the effects of multiple variants within a gene or genomic region and evaluate the joint effect of maternal and fetal genotypes while allowing for their interactions. In simulation studies, GATI-MFG had improved statistical power over alternative methods, such as the single-variant test and functional data analysis (FDA) under various disease scenarios. We further applied GATI-MFG to a two-phase genome-wide association study of CHDs for the testing of both common variants and rare variants using 947 CHD case mother-infant pairs and 1306 control mother-infant pairs from the National Birth Defects Prevention Study (NBDPS). After Bonferroni adjustment for 23,035 genes, two genes on chromosome 17, TMEM107 (p = 1.64e-06) and CTC1 (p = 2.0e-06), were identified for significant association with CHD in common variants analysis. Gene TMEM107 regulates ciliogenesis and ciliary protein composition and was found to be associated with heterotaxy. Gene CTC1 plays an essential role in protecting telomeres from degradation, which was suggested to be associated with cardiogenesis. Overall, GATI-MFG outperformed the single-variant test and FDA in the simulations, and the results of application to NBDPS samples are consistent with existing literature supporting the association of TMEM107 and CTC1 with CHDs.
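
The Bonferroni adjustment mentioned in this abstract can be checked with a short calculation. The abstract gives the number of genes tested (23,035) and the two p values; the family-wise significance level of 0.05 used below is an assumption, since the abstract does not state it.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


N_GENES = 23035  # number of genes tested, as reported in the abstract
ALPHA = 0.05     # assumed family-wise error rate; not stated in the abstract

threshold = bonferroni_threshold(ALPHA, N_GENES)

# p values as reported in the abstract.
reported = {"TMEM107": 1.64e-06, "CTC1": 2.0e-06}
for gene, p_value in reported.items():
    print(gene, p_value, p_value < threshold)
```

Under that assumption the per-gene threshold is about 2.17 × 10⁻⁶, so both reported p values fall below it, consistent with the abstract describing the two genes as significant after adjustment.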


Subject(s)
Genome-Wide Association Study , Heart Defects, Congenital , Female , Humans , Models, Genetic , Genotype , Heart Defects, Congenital/genetics , Mothers , Case-Control Studies
16.
Genet Epidemiol ; 47(6): 409-431, 2023 09.
Article in English | MEDLINE | ID: mdl-37101379

ABSTRACT

In genetic studies, many phenotypes have multiple naturally ordered discrete values. The phenotypes can be correlated with each other. If multiple correlated ordinal traits are analyzed simultaneously, the power of analysis may increase significantly while the false positives can be controlled well. In this study, we propose bivariate functional ordinal linear regression (BFOLR) models using latent regressions with cumulative logit link or probit link to perform a gene-based analysis for bivariate ordinal traits and sequencing data. In the proposed BFOLR models, genetic variant data are viewed as stochastic functions of physical positions, and the genetic effects are treated as a function of physical positions. The BFOLR models take the correlation of the two ordinal traits into account via latent variables. The BFOLR models are built upon functional data analysis which can be revised to analyze the bivariate ordinal traits and high-dimension genetic data. The methods are flexible and can analyze three types of genetic data: (1) rare variants only, (2) common variants only, and (3) a combination of rare and common variants. Extensive simulation studies show that the likelihood ratio tests of the BFOLR models control type I errors well and have good power performance. The BFOLR models are applied to analyze Age-Related Eye Disease Study data, in which two genes, CFH and ARMS2, are found to strongly associate with eye drusen size, drusen area, age-related macular degeneration (AMD) categories, and AMD severity scale.


Subject(s)
Macular Degeneration , Models, Genetic , Humans , Phenotype , Macular Degeneration/genetics , Computer Simulation , Linear Models
17.
Am J Hum Genet ; 108(8): 1401-1408, 2021 08 05.
Article in English | MEDLINE | ID: mdl-34216550

ABSTRACT

Precise interpretation of the effects of rare protein-truncating variants (PTVs) is important for accurate determination of variant impact. Current methods for assessing the ability of PTVs to induce nonsense-mediated decay (NMD) focus primarily on the position of the variant in the transcript. We used RNA sequencing of the Genotype Tissue Expression v.8 cohort to compute the efficiency of NMD using allelic imbalance for 2,320 rare (genome aggregation database minor allele frequency ≤ 1%) PTVs across 809 individuals in 49 tissues. We created an interpretable predictive model using penalized logistic regression in order to evaluate the comprehensive influence of variant annotation, tissue, and inter-individual variation on NMD. We found that variant position, allele frequency, the inclusion of ultra-rare and singleton variants, and conservation were predictive of allelic imbalance. Furthermore, we found that NMD effects were highly concordant across tissues and individuals. Due to this high consistency, we demonstrate in silico that utilizing peripheral tissues or cell lines provides accurate prediction of NMD for PTVs.


Subject(s)
Codon, Nonsense/genetics , Gene Expression Regulation , Genetic Diseases, Inborn/pathology , Genetic Variation , Mutation , Nonsense Mediated mRNA Decay , RNA, Messenger/genetics , Gene Frequency , Genetic Diseases, Inborn/genetics , Humans
18.
Am J Hum Genet ; 108(12): 2354-2367, 2021 12 02.
Article in English | MEDLINE | ID: mdl-34822764

ABSTRACT

Whole-genome sequencing studies applied to large populations or biobanks with extensive phenotyping raise new analytic challenges. The need to consider many variants at a locus or group of genes simultaneously and the potential to study many correlated phenotypes with shared genetic architecture provide opportunities for discovery not addressed by the traditional one variant, one phenotype association study. Here, we introduce a Bayesian model comparison approach called MRP (multiple rare variants and phenotypes) for rare-variant association studies that considers correlation, scale, and direction of genetic effects across a group of genetic variants, phenotypes, and studies, requiring only summary statistic data. We apply our method to exome sequencing data (n = 184,698) across 2,019 traits from the UK Biobank, aggregating signals in genes. MRP demonstrates an ability to recover signals such as associations between PCSK9 and LDL cholesterol levels. We additionally find MRP effective in conducting meta-analyses in exome data. Non-biomarker findings include associations between MC1R and red hair color and skin color, IL17RA and monocyte count, and IQGAP2 and mean platelet volume. Finally, we apply MRP in a multi-phenotype setting; after clustering the 35 biomarker phenotypes based on genetic correlation estimates, we find that joint analysis of these phenotypes results in substantial power gains for gene-trait associations, such as in TNFRSF13B in one of the clusters containing diabetes- and lipid-related traits. Overall, we show that the MRP model comparison approach improves upon useful features from widely used meta-analysis approaches for rare-variant association analyses and prioritizes protective modifiers of disease risk.


Subject(s)
Genetic Variation , Genome-Wide Association Study , Models, Genetic , Bayes Theorem , Female , Humans , Male , Phenotype
19.
Am J Hum Genet ; 108(8): 1478-1487, 2021 08 05.
Article in English | MEDLINE | ID: mdl-34197731

ABSTRACT

Idiopathic achalasia (IA) is a severe motility disorder characterized by neuronal degeneration in the myenteric plexus, but the etiology remains largely unknown. We performed whole-exome sequencing (WES) in 100 IA-affected individuals and 313 non-IA control subjects and validated the results in 230 IA-affected individuals and 1,760 non-IA control subjects. Common missense variants rs1705003 (CUTA, GenBank: NC_000006.11:g.33385953A>G) and rs1126511 (HLA-DPB1, GenBank: NC_000006.11:g.33048466G>T) at 6p21.32 were reproducibly associated with increased risk of IA (rs1126511: OR = 1.83, p = 2.34 × 10⁻⁹; rs1705003: OR = 2.37, p = 3.21 × 10⁻⁷), meeting exome-wide significance. Both variants can affect the expression of their target genes at the transcript level. An array-based association analysis in 280 affected individuals and 1,121 control subjects determined the same signal at 6p21.32. Further conditional analyses supported that the two missense variants identified in the WES-based association study were potential causal variants of IA. For rare variants, the top genes identified by gene-based analysis were significantly enriched in nerve and muscle phenotypic genes in the mouse. Moreover, the functional rare variants in these genes tended to co-occur in IA-affected individuals. In an independent cohort, we successfully validated three rare variants (CREB5, GenBank: NC_000007.13:g.28848865G>T; ESYT3, GenBank: NC_000003.11:g.138183253C>T; and LPIN1, GenBank: NC_000002.11:g.11925128A>G) that heighten the risk of developing IA. Our study identified and validated two common variants and three rare variants associated with IA in immunologic and neurological genes, providing new insight into the etiology of IA.


Subject(s)
Cyclic AMP Response Element-Binding Protein/genetics , Esophageal Achalasia/pathology , Exome Sequencing/methods , Exome , Genetic Predisposition to Disease , Genetic Variation , Phosphatidate Phosphatase/genetics , Synaptotagmins/genetics , Case-Control Studies , Esophageal Achalasia/genetics , Genetic Testing , Humans , Phenotype
20.
Am J Hum Genet ; 108(1): 100-114, 2021 01 07.
Article in English | MEDLINE | ID: mdl-33352116

ABSTRACT

Chiari I malformation (CM1), the displacement of the cerebellum through the foramen magnum into the spinal canal, is one of the most common pediatric neurological conditions. Individuals with CM1 can present with neurological symptoms, including severe headaches and sensory or motor deficits, often as a consequence of brainstem compression or syringomyelia (SM). We conducted whole-exome sequencing (WES) on 668 CM1 probands and 232 family members and performed gene-burden and de novo enrichment analyses. A significant enrichment of rare and de novo non-synonymous variants in chromodomain (CHD) genes was observed among individuals with CM1 (combined p = 2.4 × 10⁻¹⁰), including 3 de novo loss-of-function variants in CHD8 (LOF enrichment p = 1.9 × 10⁻¹⁰) and a significant burden of rare transmitted variants in CHD3 (p = 1.8 × 10⁻⁶). Overall, individuals with CM1 were found to have significantly increased head circumference (p = 2.6 × 10⁻⁹), with many harboring CHD rare variants having macrocephaly. Finally, haploinsufficiency for chd8 in zebrafish led to macrocephaly and posterior hindbrain displacement reminiscent of CM1. These results implicate chromodomain genes and excessive brain growth in CM1 pathogenesis.


Subject(s)
Arnold-Chiari Malformation/genetics , DNA-Binding Proteins/genetics , Polymorphism, Single Nucleotide/genetics , Adult , Animals , Arnold-Chiari Malformation/pathology , Brain/pathology , Case-Control Studies , Female , Haploinsufficiency/genetics , Humans , Magnetic Resonance Imaging/methods , Male , Syringomyelia/genetics , Exome Sequencing/methods , Zebrafish/genetics