Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 107
Filter
1.
Nature ; 631(8019): 134-141, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38867047

ABSTRACT

Mosaic loss of the X chromosome (mLOX) is the most common clonal somatic alteration in leukocytes of female individuals1,2, but little is known about its genetic determinants or phenotypic consequences. Here, to address this, we used data from 883,574 female participants across 8 biobanks; 12% of participants exhibited detectable mLOX in approximately 2% of leukocytes. Female participants with mLOX had an increased risk of myeloid and lymphoid leukaemias. Genetic analyses identified 56 common variants associated with mLOX, implicating genes with roles in chromosomal missegregation, cancer predisposition and autoimmune diseases. Exome-sequence analyses identified rare missense variants in FBXO10 that confer a twofold increased risk of mLOX. Only a small fraction of associations was shared with mosaic Y chromosome loss, suggesting that distinct biological processes drive formation and clonal expansion of sex chromosome missegregation. Allelic shift analyses identified X chromosome alleles that are preferentially retained in mLOX, demonstrating variation at many loci under cellular selection. A polygenic score including 44 allelic shift loci correctly inferred the retained X chromosomes in 80.7% of mLOX cases in the top decile. Our results support a model in which germline variants predispose female individuals to acquiring mLOX, with the allelic content of the X chromosome possibly shaping the magnitude of clonal expansion.


Subject(s)
Aneuploidy , Chromosomes, Human, X , Clone Cells , Leukocytes , Mosaicism , Adult , Female , Humans , Male , Middle Aged , Alleles , Autoimmune Diseases/genetics , Biological Specimen Banks , Chromosome Segregation/genetics , Chromosomes, Human, X/genetics , Chromosomes, Human, Y/genetics , Clone Cells/metabolism , Clone Cells/pathology , Exome/genetics , F-Box Proteins/genetics , Genetic Predisposition to Disease/genetics , Germ-Line Mutation , Leukemia/genetics , Leukocytes/metabolism , Models, Genetic , Multifactorial Inheritance/genetics , Mutation, Missense/genetics
2.
Nat Rev Genet ; 23(9): 533-546, 2022 09.
Article in English | MEDLINE | ID: mdl-35501396

ABSTRACT

Human genetics can inform the biology and epidemiology of coronavirus disease 2019 (COVID-19) by pinpointing causal mechanisms that explain why some individuals become more severely affected by the disease upon infection by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus. Large-scale genetic association studies, encompassing both rare and common genetic variants, have used different study designs and multiple disease phenotype definitions to identify several genomic regions associated with COVID-19. Along with a multitude of follow-up studies, these findings have increased our understanding of disease aetiology and provided routes for management of COVID-19. Important emergent opportunities include the clinical translatability of genetic risk prediction, the repurposing of existing drugs, exploration of variable host effects of different viral strains, study of inter-individual variability in vaccination response and understanding the long-term consequences of SARS-CoV-2 infection. Beyond the current pandemic, these transferrable opportunities are likely to affect the study of many infectious diseases.


Subject(s)
COVID-19 , COVID-19/epidemiology , COVID-19/genetics , Humans , Molecular Epidemiology , Pandemics , SARS-CoV-2/genetics
3.
Nature ; 604(7906): 509-516, 2022 04.
Article in English | MEDLINE | ID: mdl-35396579

ABSTRACT

Rare coding variation has historically provided the most direct connections between gene function and disease pathogenesis. By meta-analysing the whole exomes of 24,248 schizophrenia cases and 97,322 controls, we implicate ultra-rare coding variants (URVs) in 10 genes as conferring substantial risk for schizophrenia (odds ratios of 3-50, P < 2.14 × 10-6) and 32 genes at a false discovery rate of <5%. These genes have the greatest expression in central nervous system neurons and have diverse molecular functions that include the formation, structure and function of the synapse. The associations of the NMDA (N-methyl-D-aspartate) receptor subunit GRIN2A and AMPA (α-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) receptor subunit GRIA3 provide support for dysfunction of the glutamatergic system as a mechanistic hypothesis in the pathogenesis of schizophrenia. We observe an overlap of rare variant risk among schizophrenia, autism spectrum disorders1, epilepsy and severe neurodevelopmental disorders2, although different mutation types are implicated in some shared genes. Most genes described here, however, are not implicated in neurodevelopment. We demonstrate that genes prioritized from common variant analyses of schizophrenia are enriched in rare variant risk3, suggesting that common and rare genetic risk factors converge at least partially on the same underlying pathogenic biological processes. Even after excluding significantly associated genes, schizophrenia cases still carry a substantial excess of URVs, which indicates that more risk genes await discovery using this approach.


Subject(s)
Mutation , Neurodevelopmental Disorders , Schizophrenia , Case-Control Studies , Exome , Genetic Predisposition to Disease/genetics , Humans , Neurodevelopmental Disorders/genetics , Receptors, N-Methyl-D-Aspartate/genetics , Schizophrenia/genetics
4.
Am J Hum Genet ; 111(7): 1431-1447, 2024 Jul 11.
Article in English | MEDLINE | ID: mdl-38908374

ABSTRACT

Methods of estimating polygenic scores (PGSs) from genome-wide association studies are increasingly utilized. However, independent method evaluation is lacking, and method comparisons are often limited. Here, we evaluate polygenic scores derived via seven methods in five biobank studies (totaling about 1.2 million participants) across 16 diseases and quantitative traits, building on a reference-standardized framework. We conducted meta-analyses to quantify the effects of method choice, hyperparameter tuning, method ensembling, and the target biobank on PGS performance. We found that no single method consistently outperformed all others. PGS effect sizes were more variable between biobanks than between methods within biobanks when methods were well tuned. Differences between methods were largest for the two investigated autoimmune diseases, seropositive rheumatoid arthritis and type 1 diabetes. For most methods, cross-validation was more reliable for tuning hyperparameters than automatic tuning (without the use of target data). For a given target phenotype, elastic net models combining PGS across methods (ensemble PGS) tuned in the UK Biobank provided consistent, high, and cross-biobank transferable performance, increasing PGS effect sizes (ß coefficients) by a median of 5.0% relative to LDpred2 and MegaPRS (the two best-performing single methods when tuned with cross-validation). Our interactively browsable online-results and open-source workflow prspipe provide a rich resource and reference for the analysis of polygenic scoring methods across biobanks.


Subject(s)
Biological Specimen Banks , Genome-Wide Association Study , Multifactorial Inheritance , Humans , Multifactorial Inheritance/genetics , Phenotype , Diabetes Mellitus, Type 1/genetics , Polymorphism, Single Nucleotide , Machine Learning
5.
Nature ; 595(7865): 107-113, 2021 07.
Article in English | MEDLINE | ID: mdl-33915569

ABSTRACT

COVID-19, which is caused by SARS-CoV-2, can result in acute respiratory distress syndrome and multiple organ failure1-4, but little is known about its pathophysiology. Here we generated single-cell atlases of 24 lung, 16 kidney, 16 liver and 19 heart autopsy tissue samples and spatial atlases of 14 lung samples from donors who died of COVID-19. Integrated computational analysis uncovered substantial remodelling in the lung epithelial, immune and stromal compartments, with evidence of multiple paths of failed tissue regeneration, including defective alveolar type 2 differentiation and expansion of fibroblasts and putative TP63+ intrapulmonary basal-like progenitor cells. Viral RNAs were enriched in mononuclear phagocytic and endothelial lung cells, which induced specific host programs. Spatial analysis in lung distinguished inflammatory host responses in lung regions with and without viral RNA. Analysis of the other tissue atlases showed transcriptional alterations in multiple cell types in heart tissue from donors with COVID-19, and mapped cell types and genes implicated with disease severity based on COVID-19 genome-wide association studies. Our foundational dataset elucidates the biological effect of severe SARS-CoV-2 infection across the body, a key step towards new treatments.


Subject(s)
COVID-19/pathology , COVID-19/virology , Kidney/pathology , Liver/pathology , Lung/pathology , Myocardium/pathology , SARS-CoV-2/pathogenicity , Adult , Aged , Aged, 80 and over , Atlases as Topic , Autopsy , Biological Specimen Banks , COVID-19/genetics , COVID-19/immunology , Endothelial Cells , Epithelial Cells/pathology , Epithelial Cells/virology , Female , Fibroblasts , Genome-Wide Association Study , Heart/virology , Humans , Inflammation/pathology , Inflammation/virology , Kidney/virology , Liver/virology , Lung/virology , Male , Middle Aged , Organ Specificity , Phagocytes , Pulmonary Alveoli/pathology , Pulmonary Alveoli/virology , RNA, Viral/analysis , Regeneration , SARS-CoV-2/immunology , Single-Cell Analysis , Viral Load
6.
Nature ; 581(7809): 434-443, 2020 05.
Article in English | MEDLINE | ID: mdl-32461654

ABSTRACT

Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes1. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.


Subject(s)
Exome/genetics , Genes, Essential/genetics , Genetic Variation/genetics , Genome, Human/genetics , Adult , Brain/metabolism , Cardiovascular Diseases/genetics , Cohort Studies , Databases, Genetic , Female , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study , Humans , Loss of Function Mutation/genetics , Male , Mutation Rate , Proprotein Convertase 9/genetics , RNA, Messenger/genetics , Reproducibility of Results , Exome Sequencing , Whole Genome Sequencing
7.
Am J Hum Genet ; 109(1): 24-32, 2022 01 06.
Article in English | MEDLINE | ID: mdl-34861179

ABSTRACT

Genetic correlation is an important parameter in efforts to understand the relationships among complex traits. Current methods that analyze individual genotype data for estimating genetic correlation are challenging to scale to large datasets. Methods that analyze summary data, while being computationally efficient, tend to yield estimates of genetic correlation with reduced precision. We propose SCORE (scalable genetic correlation estimator), a randomized method of moments estimator of genetic correlation that is both scalable and accurate. SCORE obtains more precise estimates of genetic correlations relative to summary-statistic methods that can be applied at scale; it achieves a 44% reduction in standard error relative to LD-score regression (LDSC) and a 20% reduction relative to high-definition likelihood (HDL) (averaged over all simulations). The efficiency of SCORE enables computation of genetic correlations on the UK Biobank dataset, consisting of ≈300 K individuals and ≈500 K SNPs, in a few h (orders of magnitude faster than methods that analyze individual data, such as GCTA). Across 780 pairs of traits in 291,273 unrelated white British individuals in the UK Biobank, SCORE identifies significant genetic correlation between 200 additional pairs of traits over LDSC (beyond the 245 pairs identified by both).


Subject(s)
Biological Specimen Banks , Genetic Association Studies , Genetic Background , Models, Genetic , Phenotype , Algorithms , Genetic Variation , Humans , Multifactorial Inheritance , Reproducibility of Results , United Kingdom
8.
PLoS Genet ; 18(11): e1010367, 2022 11.
Article in English | MEDLINE | ID: mdl-36327219

ABSTRACT

Host genetics is a key determinant of COVID-19 outcomes. Previously, the COVID-19 Host Genetics Initiative genome-wide association study used common variants to identify multiple loci associated with COVID-19 outcomes. However, variants with the largest impact on COVID-19 outcomes are expected to be rare in the population. Hence, studying rare variants may provide additional insights into disease susceptibility and pathogenesis, thereby informing therapeutics development. Here, we combined whole-exome and whole-genome sequencing from 21 cohorts across 12 countries and performed rare variant exome-wide burden analyses for COVID-19 outcomes. In an analysis of 5,085 severe disease cases and 571,737 controls, we observed that carrying a rare deleterious variant in the SARS-CoV-2 sensor toll-like receptor TLR7 (on chromosome X) was associated with a 5.3-fold increase in severe disease (95% CI: 2.75-10.05, p = 5.41x10-7). This association was consistent across sexes. These results further support TLR7 as a genetic determinant of severe disease and suggest that larger studies on rare variants influencing COVID-19 outcomes could provide additional insights.


Subject(s)
COVID-19 , Exome , Humans , Exome/genetics , Genome-Wide Association Study , COVID-19/genetics , Genetic Predisposition to Disease , Toll-Like Receptor 7/genetics , SARS-CoV-2/genetics
9.
Hum Mol Genet ; 31(23): 3945-3966, 2022 11 28.
Article in English | MEDLINE | ID: mdl-35848942

ABSTRACT

Given the highly variable clinical phenotype of Coronavirus disease 2019 (COVID-19), a deeper analysis of the host genetic contribution to severe COVID-19 is important to improve our understanding of underlying disease mechanisms. Here, we describe an extended genome-wide association meta-analysis of a well-characterized cohort of 3255 COVID-19 patients with respiratory failure and 12 488 population controls from Italy, Spain, Norway and Germany/Austria, including stratified analyses based on age, sex and disease severity, as well as targeted analyses of chromosome Y haplotypes, the human leukocyte antigen region and the SARS-CoV-2 peptidome. By inversion imputation, we traced a reported association at 17q21.31 to a ~0.9-Mb inversion polymorphism that creates two highly differentiated haplotypes and characterized the potential effects of the inversion in detail. Our data, together with the 5th release of summary statistics from the COVID-19 Host Genetics Initiative including non-Caucasian individuals, also identified a new locus at 19q13.33, including NAPSA, a gene which is expressed primarily in alveolar cells responsible for gas exchange in the lung.


Subject(s)
COVID-19 , Humans , COVID-19/genetics , SARS-CoV-2/genetics , Genome-Wide Association Study , Haplotypes , Polymorphism, Genetic
10.
Bioinformatics ; 39(9)2023 09 02.
Article in English | MEDLINE | ID: mdl-37647640

ABSTRACT

MOTIVATION: Existing methods for simulating synthetic genotype and phenotype datasets have limited scalability, constraining their usability for large-scale analyses. Moreover, a systematic approach for evaluating synthetic data quality and a benchmark synthetic dataset for developing and evaluating methods for polygenic risk scores are lacking. RESULTS: We present HAPNEST, a novel approach for efficiently generating diverse individual-level genotypic and phenotypic data. In comparison to alternative methods, HAPNEST shows faster computational speed and a lower degree of relatedness with reference panels, while generating datasets that preserve key statistical properties of real data. These desirable synthetic data properties enabled us to generate 6.8 million common variants and nine phenotypes with varying degrees of heritability and polygenicity across 1 million individuals. We demonstrate how HAPNEST can facilitate biobank-scale analyses through the comparison of seven methods to generate polygenic risk scoring across multiple ancestry groups and different genetic architectures. AVAILABILITY AND IMPLEMENTATION: A synthetic dataset of 1 008 000 individuals and nine traits for 6.8 million common variants is available at https://www.ebi.ac.uk/biostudies/studies/S-BSST936. The HAPNEST software for generating synthetic datasets is available as Docker/Singularity containers and open source Julia and C code at https://github.com/intervene-EU-H2020/synthetic_data.


Subject(s)
Benchmarking , Data Accuracy , Humans , Genotype , Phenotype , Multifactorial Inheritance
11.
Article in English | MEDLINE | ID: mdl-38503536

ABSTRACT

OBJECTIVES: Rheumatic diseases may impair reproductive success and pregnancy outcomes, but systematic evaluations across diseases are lacking. We conducted a nationwide cohort study to examine the impact of rheumatic diseases on reproductive health measures, comparing the impacts with those of other immune-mediated diseases (IMDs). METHODS: Out of all of the 5 339 804 Finnish citizens, individuals born 1964-1984 and diagnosed with any of the 19 IMDs before age 30 (women) or 35 (men) were matched with 20 controls by birth year, sex, and education. We used data from nationwide health registers to study the impact of IMDs on reproductive health measures, such as reproductive success and, for women, ever having experienced adverse maternal and perinatal outcomes. RESULTS: Several of the rheumatic diseases, particularly SLE, JIA, and seropositive RA, were associated with higher rates of childlessness and fewer children. The risks for pre-eclampsia, newborns being small for gestational age, preterm delivery, non-elective Caesarean sections, and need of neonatal intensive care were increased in many IMDs. Particularly, SLE, SS, type 1 diabetes, and Addison's disease showed >2-fold risks for some of these outcomes. In most rheumatic diseases, moderate (1.1-1.5-fold) risk increases were observed for diverse adverse pregnancy outcomes, with similar effects in IBD, celiac disease, asthma, ITP, and psoriasis. CONCLUSION: Rheumatic diseases have a broad impact on reproductive health, with effects comparable with that of several other IMDs. Of the rheumatic diseases, SLE and SS conferred the largest risk increases on perinatal adverse event outcomes.

12.
Nature ; 550(7675): 239-243, 2017 10 11.
Article in English | MEDLINE | ID: mdl-29022581

ABSTRACT

Rare genetic variants are abundant in humans and are expected to contribute to individual disease risk. While genetic association studies have successfully identified common genetic variants associated with susceptibility, these studies are not practical for identifying rare variants. Efforts to distinguish pathogenic variants from benign rare variants have leveraged the genetic code to identify deleterious protein-coding alleles, but no analogous code exists for non-coding variants. Therefore, ascertaining which rare variants have phenotypic effects remains a major challenge. Rare non-coding variants have been associated with extreme gene expression in studies using single tissues, but their effects across tissues are unknown. Here we identify gene expression outliers, or individuals showing extreme expression levels for a particular gene, across 44 human tissues by using combined analyses of whole genomes and multi-tissue RNA-sequencing data from the Genotype-Tissue Expression (GTEx) project v6p release. We find that 58% of underexpression and 28% of overexpression outliers have nearby conserved rare variants compared to 8% of non-outliers. Additionally, we developed RIVER (RNA-informed variant effect on regulation), a Bayesian statistical model that incorporates expression data to predict a regulatory effect for rare variants with higher accuracy than models using genomic annotations alone. Overall, we demonstrate that rare variants contribute to large gene expression changes across tissues and provide an integrative method for interpretation of rare variants in individual genomes.


Subject(s)
Gene Expression Profiling , Genetic Variation/genetics , Organ Specificity/genetics , Bayes Theorem , Female , Genome, Human/genetics , Genomics , Genotype , Humans , Male , Models, Genetic , Sequence Analysis, RNA
16.
Am J Hum Genet ; 102(6): 1204-1211, 2018 06 07.
Article in English | MEDLINE | ID: mdl-29861106

ABSTRACT

There is a limited understanding about the impact of rare protein-truncating variants across multiple phenotypes. We explore the impact of this class of variants on 13 quantitative traits and 10 diseases using whole-exome sequencing data from 100,296 individuals. Protein-truncating variants in genes intolerant to this class of mutations increased risk of autism, schizophrenia, bipolar disorder, intellectual disability, and ADHD. In individuals without these disorders, there was an association with shorter height, lower education, increased hospitalization, and reduced age at enrollment. Gene sets implicated from GWASs did not show a significant protein-truncating variants burden beyond what was captured by established Mendelian genes. In conclusion, we provide a thorough investigation of the impact of rare deleterious coding variants on complex traits, suggesting widespread pleiotropic risk.


Subject(s)
Mutation/genetics , Open Reading Frames/genetics , Databases, Genetic , Ethnicity/genetics , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Phenotype , Proteins/genetics
17.
Psychol Med ; 51(9): 1451-1458, 2021 07.
Article in English | MEDLINE | ID: mdl-32063240

ABSTRACT

BACKGROUND: Although accurate differentiation between bipolar disorder (BD) and unipolar major depressive disorder (MDD) has important prognostic and therapeutic implications, the distinction is often challenging based on clinical grounds alone. In this study, we tested whether psychiatric polygenic risk scores (PRSs) improve clinically based classification models of BD v. MDD diagnosis. METHODS: Our sample included 843 BD and 930 MDD subjects similarly genotyped and phenotyped using the same standardized interview. We performed multivariate modeling and receiver operating characteristic analysis, testing the incremental effect of PRSs on a baseline model with clinical symptoms and features known to associate with BD compared with MDD status. RESULTS: We found a strong association between a BD diagnosis and PRSs drawn from BD (R2 = 3.5%, p = 4.94 × 10-12) and schizophrenia (R2 = 3.2%, p = 5.71 × 10-11) genome-wide association meta-analyses. Individuals with top decile BD PRS had a significantly increased risk for BD v. MDD compared with those in the lowest decile (odds ratio 3.39, confidence interval 2.19-5.25). PRSs discriminated BD v. MDD to a degree comparable with many individual symptoms and clinical features previously shown to associate with BD. When compared with the full composite model with all symptoms and clinical features PRSs provided modestly improved discriminatory ability (ΔC = 0.011, p = 6.48 × 10-4). CONCLUSIONS: Our study demonstrates that psychiatric PRSs provide modest independent discrimination between BD and MDD cases, suggesting that PRSs could ultimately have utility in subjects at the extremes of the distribution and/or subjects for whom clinical symptoms are poorly measured or yet to manifest.


Subject(s)
Bipolar Disorder/diagnosis , Depressive Disorder, Major/diagnosis , Multifactorial Inheritance , Adult , Bipolar Disorder/genetics , Depressive Disorder, Major/genetics , Female , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Male , Middle Aged , Risk Factors , Schizophrenia/diagnosis
18.
PLoS Genet ; 14(5): e1007329, 2018 05.
Article in English | MEDLINE | ID: mdl-29795570

ABSTRACT

As part of a broader collaborative network of exome sequencing studies, we developed a jointly called data set of 5,685 Ashkenazi Jewish exomes. We make publicly available a resource of site and allele frequencies, which should serve as a reference for medical genetics in the Ashkenazim (hosted in part at https://ibd.broadinstitute.org, also available in gnomAD at http://gnomad.broadinstitute.org). We estimate that 34% of protein-coding alleles present in the Ashkenazi Jewish population at frequencies greater than 0.2% are significantly more frequent (mean 15-fold) than their maximum frequency observed in other reference populations. Arising via a well-described founder effect approximately 30 generations ago, this catalog of enriched alleles can contribute to differences in genetic risk and overall prevalence of diseases between populations. As validation we document 148 AJ enriched protein-altering alleles that overlap with "pathogenic" ClinVar alleles (table available at https://github.com/macarthur-lab/clinvar/blob/master/output/clinvar.tsv), including those that account for 10-100 fold differences in prevalence between AJ and non-AJ populations of some rare diseases, especially recessive conditions, including Gaucher disease (GBA, p.Asn409Ser, 8-fold enrichment); Canavan disease (ASPA, p.Glu285Ala, 12-fold enrichment); and Tay-Sachs disease (HEXA, c.1421+1G>C, 27-fold enrichment; p.Tyr427IlefsTer5, 12-fold enrichment). We next sought to use this catalog, of well-established relevance to Mendelian disease, to explore Crohn's disease, a common disease with an estimated two to four-fold excess prevalence in AJ. We specifically attempt to evaluate whether strong acting rare alleles, particularly protein-truncating or otherwise large effect-size alleles, enriched by the same founder-effect, contribute excess genetic risk to Crohn's disease in AJ, and find that ten rare genetic risk factors in NOD2 and LRRK2 are enriched in AJ (p < 0.005), including several novel contributing alleles, show evidence of association to CD. Independently, we find that genomewide common variant risk defined by GWAS shows a strong difference between AJ and non-AJ European control population samples (0.97 s.d. higher, p<10-16). Taken together, the results suggest coordinated selection in AJ population for higher CD risk alleles in general. The results and approach illustrate the value of exome sequencing data in case-control studies along with reference data sets like ExAC (sites VCF available via FTP at ftp.broadinstitute.org/pub/ExAC_release/release0.3/) to pinpoint genetic variation that contributes to variable disease predisposition across populations.


Subject(s)
Crohn Disease/genetics , Genetic Predisposition to Disease/genetics , Jews/genetics , Rare Diseases/genetics , Algorithms , Crohn Disease/epidemiology , Genetics, Population , Genome-Wide Association Study , Haplotypes , Humans , Models, Genetic , Molecular Epidemiology , Polymorphism, Single Nucleotide , Rare Diseases/epidemiology
19.
Bioinformatics ; 35(21): 4478-4479, 2019 11 01.
Article in English | MEDLINE | ID: mdl-31086968

ABSTRACT

MOTIVATION: The correct classification of missense variants as benign or pathogenic remains challenging. Pathogenic variants are expected to have higher deleterious prediction scores than benign variants in the same gene. However, most of the existing variant annotation tools do not reference the score range of benign population variants on gene level. RESULTS: We present a web-application, Variant Score Ranker, which enables users to rapidly annotate variants and perform gene-specific variant score ranking on the population level. We also provide an intuitive example of how gene- and population-calibrated variant ranking scores can improve epilepsy variant prioritization. AVAILABILITY AND IMPLEMENTATION: http://vsranker.broadinstitute.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Mutation, Missense , Software
20.
Nature ; 514(7520): 92-97, 2014 Oct 02.
Article in English | MEDLINE | ID: mdl-25231870

ABSTRACT

Age at menarche is a marker of timing of puberty in females. It varies widely between individuals, is a heritable trait and is associated with risks for obesity, type 2 diabetes, cardiovascular disease, breast cancer and all-cause mortality. Studies of rare human disorders of puberty and animal models point to a complex hypothalamic-pituitary-hormonal regulation, but the mechanisms that determine pubertal timing and underlie its links to disease risk remain unclear. Here, using genome-wide and custom-genotyping arrays in up to 182,416 women of European descent from 57 studies, we found robust evidence (P < 5 × 10(-8)) for 123 signals at 106 genomic loci associated with age at menarche. Many loci were associated with other pubertal traits in both sexes, and there was substantial overlap with genes implicated in body mass index and various diseases, including rare disorders of puberty. Menarche signals were enriched in imprinted regions, with three loci (DLK1-WDR25, MKRN3-MAGEL2 and KCNK9) demonstrating parent-of-origin-specific associations concordant with known parental expression patterns. Pathway analyses implicated nuclear hormone receptors, particularly retinoic acid and γ-aminobutyric acid-B2 receptor signalling, among novel mechanisms that regulate pubertal timing in humans. Our findings suggest a genetic architecture involving at least hundreds of common variants in the coordinated timing of the pubertal transition.


Subject(s)
Alleles , Genetic Loci/genetics , Menarche/genetics , Parents , Adolescent , Age Factors , Body Mass Index , Breast Neoplasms/genetics , Calcium-Binding Proteins , Cardiovascular Diseases/genetics , Child , Diabetes Mellitus, Type 2/genetics , Europe/ethnology , Female , Genome-Wide Association Study , Genomic Imprinting/genetics , Humans , Hypothalamo-Hypophyseal System/physiology , Intercellular Signaling Peptides and Proteins/genetics , Male , Membrane Proteins/genetics , Obesity/genetics , Ovary/physiology , Polymorphism, Single Nucleotide/genetics , Potassium Channels, Tandem Pore Domain/genetics , Proteins/genetics , Quantitative Trait Loci/genetics , Receptors, GABA-B/metabolism , Receptors, Retinoic Acid/metabolism , Ribonucleoproteins/genetics , Ubiquitin-Protein Ligases
SELECTION OF CITATIONS
SEARCH DETAIL