Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
1.
Am J Hum Genet ; 111(8): 1656-1672, 2024 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-39043182

RESUMO

Pathogenic variants in the JAG1 gene are a primary cause of the multi-system disorder Alagille syndrome. Although variant detection rates are high for this disease, there is uncertainty associated with the classification of missense variants that leads to reduced diagnostic yield. Consequently, up to 85% of reported JAG1 missense variants have uncertain or conflicting classifications. We generated a library of 2,832 JAG1 nucleotide variants within exons 1-7, a region with a high number of reported missense variants, and designed a high-throughput assay to measure JAG1 membrane expression, a requirement for normal function. After calibration using a set of 175 known or predicted pathogenic and benign variants included within the variant library, 486 variants were characterized as functionally abnormal (n = 277 abnormal and n = 209 likely abnormal), of which 439 (90.3%) were missense. We identified divergent membrane expression occurring at specific residues, indicating that loss of the wild-type residue itself does not drive pathogenicity, a finding supported by structural modeling data and with broad implications for clinical variant classification both for Alagille syndrome and globally across other disease genes. Of 144 uncertain variants reported in patients undergoing clinical or research testing, 27 had functionally abnormal membrane expression, and inclusion of our data resulted in the reclassification of 26 to likely pathogenic. Functional evidence augments the classification of genomic variants, reducing uncertainty and improving diagnostics. Inclusion of this repository of functional evidence during JAG1 variant reclassification will significantly affect resolution of variant pathogenicity, making a critical impact on the molecular diagnosis of Alagille syndrome.


Assuntos
Síndrome de Alagille , Proteína Jagged-1 , Mutação de Sentido Incorreto , Síndrome de Alagille/genética , Proteína Jagged-1/genética , Humanos , Éxons/genética
2.
Genome Res ; 34(10): 1500-1513, 2024 Oct 29.
Artigo em Inglês | MEDLINE | ID: mdl-39327030

RESUMO

The human major histocompatibility complex (MHC) is a ∼4 Mb genomic segment on Chromosome 6 that plays a pivotal role in the immune response. Despite its importance in various traits and diseases, its complex nature makes it challenging to accurately characterize on a routine basis. We present a novel approach allowing targeted sequencing and de novo haplotypic assembly of the MHC region in heterozygous samples, using long-read sequencing technologies. Our approach is validated using two reference samples, two family trios, and an African-American sample. We achieved excellent coverage (96.6%-99.9% with at least 30× depth) and high accuracy (99.89%-99.99%) for the different haplotypes. This methodology offers a reliable and cost-effective method for sequencing and fully characterizing the MHC without the need for whole-genome sequencing, facilitating broader studies on this important genomic segment and having significant implications in immunology, genetics, and medicine.


Assuntos
Haplótipos , Heterozigoto , Complexo Principal de Histocompatibilidade , Humanos , Complexo Principal de Histocompatibilidade/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento Completo do Genoma/métodos , Análise de Sequência de DNA/métodos , Genoma Humano
3.
Am J Hum Genet ; 104(2): 299-309, 2019 02 07.
Artigo em Inglês | MEDLINE | ID: mdl-30686509

RESUMO

Different parts of a gene can be of differential importance to development and health. This regional heterogeneity is also apparent in the distribution of disease-associated mutations, which often cluster in particular regions of disease-associated genes. The ability to precisely estimate functionally important sub-regions of genes will be key in correctly deciphering relationships between genetic variation and disease. Previous methods have had some success using standing human variation to characterize this variability in importance by measuring sub-regional intolerance, i.e., the depletion in functional variation from expectation within a given region of a gene. However, the ability to precisely estimate local intolerance was restricted by the fact that only information within a given sub-region is used, leading to instability in local estimates, especially for small regions. We show that borrowing information across regions using a Bayesian hierarchical model stabilizes estimates, leading to lower variability and improved predictive utility. Specifically, our approach more effectively identifies regions enriched for ClinVar pathogenic variants. We also identify significant correlations between sub-region intolerance and the distribution of pathogenic variation in disease-associated genes, with AUCs for classifying de novo missense variants in Online Mendelian Inheritance in Man (OMIM) genes of up to 0.86 using exonic sub-regions and 0.91 using sub-regions defined by protein domains. This result immediately suggests that considering the intolerance of regions in which variants are found may improve diagnostic interpretation. We also illustrate the utility of integrating regional intolerance into gene-level disease association tests with a study of known disease-associated genes for epileptic encephalopathy.


Assuntos
Componentes do Gene/genética , Modelos Genéticos , Mutação/genética , Espasmos Infantis/genética , Espasmos Infantis/patologia , Teorema de Bayes , Éxons/genética , Humanos
4.
Hum Genomics ; 15(1): 44, 2021 07 13.
Artigo em Inglês | MEDLINE | ID: mdl-34256850

RESUMO

BACKGROUND: Previous research in autism and other neurodevelopmental disorders (NDDs) has indicated an important contribution of protein-coding (coding) de novo variants (DNVs) within specific genes. The role of de novo noncoding variation has been observable as a general increase in genetic burden but has yet to be resolved to individual functional elements. In this study, we assessed whole-genome sequencing data in 2671 families with autism (discovery cohort of 516 families, replication cohort of 2155 families). We focused on DNVs in enhancers with characterized in vivo activity in the brain and identified an excess of DNVs in an enhancer named hs737. RESULTS: We adapted the fitDNM statistical model to work in noncoding regions and tested enhancers for excess of DNVs in families with autism. We found only one enhancer (hs737) with nominal significance in the discovery (p = 0.0172), replication (p = 2.5 × 10-3), and combined dataset (p = 1.1 × 10-4). Each individual with a DNV in hs737 had shared phenotypes including being male, intact cognitive function, and hypotonia or motor delay. Our in vitro assessment of the DNVs showed they all reduce enhancer activity in a neuronal cell line. By epigenomic analyses, we found that hs737 is brain-specific and targets the transcription factor gene EBF3 in human fetal brain. EBF3 is genome-wide significant for coding DNVs in NDDs (missense p = 8.12 × 10-35, loss-of-function p = 2.26 × 10-13) and is widely expressed in the body. Through characterization of promoters bound by EBF3 in neuronal cells, we saw enrichment for binding to NDD genes (p = 7.43 × 10-6, OR = 1.87) involved in gene regulation. Individuals with coding DNVs have greater phenotypic severity (hypotonia, ataxia, and delayed development syndrome [HADDS]) in comparison to individuals with noncoding DNVs that have autism and hypotonia. CONCLUSIONS: In this study, we identify DNVs in the hs737 enhancer in individuals with autism. Through multiple approaches, we find hs737 targets the gene EBF3 that is genome-wide significant in NDDs. By assessment of noncoding variation and the genes they affect, we are beginning to understand their impact on gene regulatory networks in NDDs.


Assuntos
Transtorno Autístico/genética , Predisposição Genética para Doença , Hipotonia Muscular/genética , Transtornos do Neurodesenvolvimento/genética , Fatores de Transcrição/genética , Transtorno Autístico/epidemiologia , Transtorno Autístico/patologia , Elementos Facilitadores Genéticos/genética , Exoma/genética , Feminino , Redes Reguladoras de Genes/genética , Humanos , Masculino , Hipotonia Muscular/epidemiologia , Hipotonia Muscular/patologia , Mutação/genética , Transtornos do Neurodesenvolvimento/epidemiologia , Transtornos do Neurodesenvolvimento/patologia , Neurônios/metabolismo , Neurônios/patologia
5.
Am J Hum Genet ; 100(1): 31-39, 2017 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-28017371

RESUMO

Mixed models have become the tool of choice for genetic association studies; however, standard mixed model methods may be poorly calibrated or underpowered under family sampling bias and/or case-control ascertainment. Previously, we introduced a liability threshold-based mixed model association statistic (LTMLM) to address case-control ascertainment in unrelated samples. Here, we consider family-biased case-control ascertainment, where case and control subjects are ascertained non-randomly with respect to family relatedness. Previous work has shown that this type of ascertainment can severely bias heritability estimates; we show here that it also impacts mixed model association statistics. We introduce a family-based association statistic (LT-Fam) that is robust to this problem. Similar to LTMLM, LT-Fam is computed from posterior mean liabilities (PML) under a liability threshold model; however, LT-Fam uses published narrow-sense heritability estimates to avoid the problem of biased heritability estimation, enabling correct calibration. In simulations with family-biased case-control ascertainment, LT-Fam was correctly calibrated (average χ2 = 1.00-1.02 for null SNPs), whereas the Armitage trend test (ATT), standard mixed model association (MLM), and case-control retrospective association test (CARAT) were mis-calibrated (e.g., average χ2 = 0.50-1.22 for MLM, 0.89-2.65 for CARAT). LT-Fam also attained higher power than other methods in some settings. In 1,259 type 2 diabetes-affected case subjects and 5,765 control subjects from the CARe cohort, downsampled to induce family-biased ascertainment, LT-Fam was correctly calibrated whereas ATT, MLM, and CARAT were again mis-calibrated. Our results highlight the importance of modeling family sampling bias in case-control datasets with related samples.


Assuntos
Família , Estudos de Associação Genética/métodos , Modelos Genéticos , Viés , Calibragem , Diabetes Mellitus Tipo 2/genética , Genótipo , Humanos , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Estudos Retrospectivos
6.
Am J Hum Genet ; 96(5): 720-30, 2015 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-25892111

RESUMO

We introduce a liability-threshold mixed linear model (LTMLM) association statistic for case-control studies and show that it has a well-controlled false-positive rate and more power than existing mixed-model methods for diseases with low prevalence. Existing mixed-model methods suffer a loss in power under case-control ascertainment, but no solution has been proposed. Here, we solve this problem by using a χ(2) score statistic computed from posterior mean liabilities (PMLs) under the liability-threshold model. Each individual's PML is conditional not only on that individual's case-control status but also on every individual's case-control status and the genetic relationship matrix (GRM) obtained from the data. The PMLs are estimated with a multivariate Gibbs sampler; the liability-scale phenotypic covariance matrix is based on the GRM, and a heritability parameter is estimated via Haseman-Elston regression on case-control phenotypes and then transformed to the liability scale. In simulations of unrelated individuals, the LTMLM statistic was correctly calibrated and achieved higher power than existing mixed-model methods for diseases with low prevalence, and the magnitude of the improvement depended on sample size and severity of case-control ascertainment. In a Wellcome Trust Case Control Consortium 2 multiple sclerosis dataset with >10,000 samples, LTMLM was correctly calibrated and attained a 4.3% improvement (p = 0.005) in χ(2) statistics over existing mixed-model methods at 75 known associated SNPs, consistent with simulations. Larger increases in power are expected at larger sample sizes. In conclusion, case-control studies of diseases with low prevalence can achieve power higher than that in existing mixed-model methods.


Assuntos
Estudos de Associação Genética , Modelos Genéticos , Modelos Teóricos , Estudos de Casos e Controles , Mapeamento Cromossômico , Simulação por Computador , Humanos , Esclerose Múltipla/genética , Esclerose Múltipla/patologia , Fenótipo , Polimorfismo de Nucleotídeo Único , Tamanho da Amostra
7.
Am J Hum Genet ; 97(4): 576-92, 2015 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-26430803

RESUMO

Polygenic risk scores have shown great promise in predicting complex disease risk and will become more accurate as training sample sizes increase. The standard approach for calculating risk scores involves linkage disequilibrium (LD)-based marker pruning and applying a p value threshold to association statistics, but this discards information and can reduce predictive accuracy. We introduce LDpred, a method that infers the posterior mean effect size of each marker by using a prior on effect sizes and LD information from an external reference panel. Theory and simulations show that LDpred outperforms the approach of pruning followed by thresholding, particularly at large sample sizes. Accordingly, predicted R(2) increased from 20.1% to 25.3% in a large schizophrenia dataset and from 9.8% to 12.0% in a large multiple sclerosis dataset. A similar relative improvement in accuracy was observed for three additional large disease datasets and for non-European schizophrenia samples. The advantage of LDpred over existing methods will grow as sample sizes increase.


Assuntos
Desequilíbrio de Ligação/genética , Modelos Teóricos , Herança Multifatorial/genética , Esclerose Múltipla/genética , Polimorfismo de Nucleotídeo Único/genética , Esquizofrenia/genética , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Fenótipo , Prognóstico , Locos de Características Quantitativas
9.
Genome Biol Evol ; 16(2)2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38302106

RESUMO

Regions under balancing selection are characterized by dense polymorphisms and multiple persistent haplotypes, along with other sequence complexities. Successful identification of these patterns depends on both the statistical approach and the quality of sequencing. To address this challenge, at first, a new statistical method called LD-ABF was developed, employing efficient Bayesian techniques to effectively test for balancing selection. LD-ABF demonstrated the most robust detection of selection in a variety of simulation scenarios, compared against a range of existing tests/tools (Tajima's D, HKA, Dng, BetaScan, and BalLerMix). Furthermore, the impact of the quality of sequencing on detection of balancing selection was explored, as well, using: (i) SNP genotyping and exome data, (ii) targeted high-resolution HLA genotyping (IHIW), and (iii) whole-genome long-read sequencing data (Pangenome). In the analysis of SNP genotyping and exome data, we identified known targets and 38 new selection signatures in genes not previously linked to balancing selection. To further investigate the impact of sequencing quality on detection of balancing selection, a detailed investigation of the MHC was performed with high-resolution HLA typing data. Higher quality sequencing revealed the HLA-DQ genes consistently demonstrated strong selection signatures otherwise not observed from the sparser SNP array and exome data. The HLA-DQ selection signature was also replicated in the Pangenome samples using considerably less samples but, with high-quality long-read sequence data. The improved statistical method, coupled with higher quality sequencing, leads to more consistent identification of selection and enhanced localization of variants under selection, particularly in complex regions.


Assuntos
Antígenos HLA-DQ , Polimorfismo de Nucleotídeo Único , Frequência do Gene , Desequilíbrio de Ligação , Teorema de Bayes , Haplótipos , Antígenos HLA-DQ/genética
10.
Front Genet ; 14: 1004138, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36911412

RESUMO

Introduction: Components of the immune response have previously been associated with the pathophysiology of atopic dermatitis (AD), specifically the Human Leukocyte Antigen (HLA) Class II region via genome-wide association studies, however the exact elements have not been identified. Methods: This study examines the genetic variation of HLA Class II genes using next generation sequencing (NGS) and evaluates the resultant amino acids, with particular attention on binding site residues, for associations with AD. The Genetics of AD cohort was used to evaluate HLA Class II allelic variation on 464 subjects with AD and 384 controls. Results: Statistically significant associations with HLA-DP α and ß alleles and specific amino acids were found, some conferring susceptibility to AD and others with a protective effect. Evaluation of polymorphic residues in DP binding pockets revealed the critical role of P1 and P6 (P1: α31M + (ß84G or ß84V) [protection]; α31Q + ß84D [susceptibility] and P6: α11A + ß11G [protection]) and were replicated with a national cohort of children consisting of 424 AD subjects. Independently, AD susceptibility-associated residues were associated with the G polymorphism of SNP rs9277534 in the 3' UTR of the HLA-DPB1 gene, denoting higher expression of these HLA-DP alleles, while protection-associated residues were associated with the A polymorphism, denoting lower expression. Discussion: These findings lay the foundation for evaluating non-self-antigens suspected to be associated with AD as they potentially interact with particular HLA Class II subcomponents, forming a complex involved in the pathophysiology of AD. It is possible that a combination of structural HLA-DP components and levels of expression of these components contribute to AD pathophysiology.

11.
Genetics ; 221(2)2022 05 31.
Artigo em Inglês | MEDLINE | ID: mdl-35385101

RESUMO

Genomic regions subject to purifying selection are more likely to carry disease-causing mutations than regions not under selection. Cross species conservation is often used to identify such regions but with limited resolution to detect selection on short evolutionary timescales such as that occurring in only one species. In contrast, genetic intolerance looks for depletion of variation relative to expectation within a species, allowing species-specific features to be identified. When estimating the intolerance of noncoding sequence, methods strongly leverage variant frequency distributions. As the expected distributions depend on ancestry, if not properly controlled for, ancestral population source may obfuscate signals of selection. We demonstrate that properly incorporating ancestry in intolerance estimation greatly improved variant classification. We provide a genome-wide intolerance map that is conditional on ancestry and likely to be particularly valuable for variant prioritization.


Assuntos
Genoma Humano , Genômica , Evolução Biológica , Genética Populacional , Humanos , Seleção Genética
12.
Transplantation ; 105(3): 637-647, 2021 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-32301906

RESUMO

BACKGROUND: HLA molecular mismatch (MM) is a risk factor for de novo donor-specific antibody (dnDSA) development in solid organ transplantation. HLA expression differences have also been associated with adverse outcomes in hematopoietic cell transplantation. We sought to study both MM and expression in assessing dnDSA risk. METHODS: One hundred three HLA-DP-mismatched solid organ transplantation pairs were retrospectively analyzed. MM was computed using amino acids (aa), eplets, and, supplementarily, Grantham/Epstein scores. DPB1 alleles were classified as rs9277534-A (low-expression) or rs9277534-G (high-expression) linked. To determine the associations between risk factors and dnDSA, logistic regression, linkage disequilibrium (LD), and population-based analyses were performed. RESULTS: A high-risk AA:GX (recipient:donor) expression combination (X = A or G) demonstrated strong association with HLA-DP dnDSA (P = 0.001). MM was also associated with HLA-DP dnDSA when evaluated by itself (eplet P = 0.007, aa P = 0.003, Grantham P = 0.005, Epstein P = 0.004). When attempting to determine the relative individual effects of the risk factors in multivariable analysis, only AA:GX expression status retained a strong association (relative risk = 18.6, P = 0.007 with eplet; relative risk = 15.8, P = 0.02 with aa), while MM was no longer significant (eplet P = 0.56, aa P = 0.51). Importantly, these risk factors are correlated, due to LD between the expression-tagging single-nucleotide polymorphism and polymorphisms along HLA-DPB1. CONCLUSIONS: The MM and expression risk factors each appear to be strong predictors of HLA-DP dnDSA and to possess clinical utility; however, these two risk factors are closely correlated. These metrics may represent distinct ways of characterizing a common overlapping dnDSA risk profile, but they are not independent. Further, we demonstrate the importance and detailed implications of LD effects in dnDSA risk assessment and possibly transplantation overall.


Assuntos
Rejeição de Enxerto/imunologia , Cadeias beta de HLA-DP/biossíntese , Isoanticorpos/imunologia , Transplante de Rim/efeitos adversos , Doadores de Tecidos , Seguimentos , Cadeias beta de HLA-DP/imunologia , Transplante de Células-Tronco Hematopoéticas/métodos , Teste de Histocompatibilidade , Humanos , Desequilíbrio de Ligação , Estudos Retrospectivos
13.
Cancer Epidemiol Biomarkers Prev ; 20(11): 2450-6, 2011 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-21930957

RESUMO

BACKGROUND: The United States has experienced an alarming and unexplained increase in the incidence of esophageal adenocarcinoma (EAC) since the 1970s. A concurrent increase in obesity has led some to suggest a relationship between the two trends. We explore the extent of this relationship. METHODS: Using a previously validated disease simulation model of white males in the United States, we estimated EAC incidence 1973 to 2005 given constant obesity prevalence and low population progression rates consistent with the early 1970s. Introducing only the observed, rising obesity prevalence, we calculated the incremental incidence caused by obesity. We compared these with EAC incidence data from the National Cancer Institute's Surveillance, Epidemiology, and End Results (SEER) registry to determine obesity's contribution to the rise therein. Incidences were converted to absolute numbers of cases using U.S. population data. RESULTS: Using constant obesity prevalence, we projected a total of 30,555 EAC cases cumulatively over 1973 to 2005 and 1,151 in 2005 alone. Incorporating the observed obesity trend resulted in 35,767 cumulative EACs and 1,608 in 2005. Estimates derived from SEER data showed 111,223 cumulative and 7,173 cases in 2005. We conclude that the rise in obesity accounted for 6.5% of the increase in EAC cases that occurred from 1973 to 2005 and 7.6% in the year 2005. CONCLUSION: Using published OR for EAC among obese individuals, we found that only a small percentage of the rise in EAC incidence is attributable to secular trends in obesity. IMPACT: Other factors, alone and in combination, should be explored as causes of the EAC epidemic.


Assuntos
Adenocarcinoma/epidemiologia , Neoplasias Esofágicas/epidemiologia , Obesidade/epidemiologia , Adenocarcinoma/etnologia , Adenocarcinoma/etiologia , Simulação por Computador , Neoplasias Esofágicas/etnologia , Neoplasias Esofágicas/etiologia , Feminino , Humanos , Incidência , Masculino , Modelos Estatísticos , Obesidade/complicações , Obesidade/etnologia , Sistema de Registros , Fatores de Risco , Programa de SEER , Estados Unidos/epidemiologia , População Branca
14.
PLoS One ; 5(3): e9483, 2010 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-20208996

RESUMO

BACKGROUND: The incidence of esophageal adenocarcinoma (EAC) has risen rapidly in the U.S. and western world. The aim of the study was to begin the investigation of this rapid rise by developing, calibrating, and validating a mathematical disease simulation model of EAC using available epidemiologic data. METHODS: The model represents the natural history of EAC, including the essential biologic health states from normal mucosa to detected cancer. Progression rates between health states were estimated via calibration, which identified distinct parameter sets producing model outputs that fit epidemiologic data; specifically, the prevalence of pre-cancerous lesions and EAC cancer incidence from the published literature and Surveillance, Epidemiology, and End Results (SEER) data. As an illustrative example of a clinical and policy application, the calibrated and validated model retrospectively analyzed the potential benefit of an aspirin chemoprevention program. RESULTS: Model outcomes approximated calibration targets; results of the model's fit and validation are presented. Approximately 7,000 cases of EAC could have been prevented over a 30-year period if all white males started aspirin chemoprevention at age 40 in 1965. CONCLUSIONS: The model serves as the foundation for future analyses to determine a cost-effective screening and management strategy to prevent EAC morbidity and mortality.


Assuntos
Adenocarcinoma/epidemiologia , Neoplasias Esofágicas/epidemiologia , Adenocarcinoma/etnologia , Adenocarcinoma/prevenção & controle , Adulto , Algoritmos , Anticarcinógenos/uso terapêutico , Aspirina/uso terapêutico , Calibragem , Análise Custo-Benefício , Progressão da Doença , Neoplasias Esofágicas/etnologia , Neoplasias Esofágicas/prevenção & controle , Humanos , Masculino , Cadeias de Markov , Modelos Teóricos , Estados Unidos , População Branca
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA