Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 351
Filter
Add more filters

Publication year range
1.
Nature ; 631(8019): 134-141, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38867047

ABSTRACT

Mosaic loss of the X chromosome (mLOX) is the most common clonal somatic alteration in leukocytes of female individuals1,2, but little is known about its genetic determinants or phenotypic consequences. Here, to address this, we used data from 883,574 female participants across 8 biobanks; 12% of participants exhibited detectable mLOX in approximately 2% of leukocytes. Female participants with mLOX had an increased risk of myeloid and lymphoid leukaemias. Genetic analyses identified 56 common variants associated with mLOX, implicating genes with roles in chromosomal missegregation, cancer predisposition and autoimmune diseases. Exome-sequence analyses identified rare missense variants in FBXO10 that confer a twofold increased risk of mLOX. Only a small fraction of associations was shared with mosaic Y chromosome loss, suggesting that distinct biological processes drive formation and clonal expansion of sex chromosome missegregation. Allelic shift analyses identified X chromosome alleles that are preferentially retained in mLOX, demonstrating variation at many loci under cellular selection. A polygenic score including 44 allelic shift loci correctly inferred the retained X chromosomes in 80.7% of mLOX cases in the top decile. Our results support a model in which germline variants predispose female individuals to acquiring mLOX, with the allelic content of the X chromosome possibly shaping the magnitude of clonal expansion.


Subject(s)
Aneuploidy , Chromosomes, Human, X , Clone Cells , Leukocytes , Mosaicism , Adult , Female , Humans , Male , Middle Aged , Alleles , Autoimmune Diseases/genetics , Biological Specimen Banks , Chromosome Segregation/genetics , Chromosomes, Human, X/genetics , Chromosomes, Human, Y/genetics , Clone Cells/metabolism , Clone Cells/pathology , Exome/genetics , F-Box Proteins/genetics , Genetic Predisposition to Disease/genetics , Germ-Line Mutation , Leukemia/genetics , Leukocytes/metabolism , Models, Genetic , Multifactorial Inheritance/genetics , Mutation, Missense/genetics
2.
Nature ; 603(7899): 95-102, 2022 03.
Article in English | MEDLINE | ID: mdl-35197637

ABSTRACT

Genome-wide association studies (GWAS) have identified thousands of genetic variants linked to the risk of human disease. However, GWAS have so far remained largely underpowered in relation to identifying associations in the rare and low-frequency allelic spectrum and have lacked the resolution to trace causal mechanisms to underlying genes1. Here we combined whole-exome sequencing in 392,814 UK Biobank participants with imputed genotypes from 260,405 FinnGen participants (653,219 total individuals) to conduct association meta-analyses for 744 disease endpoints across the protein-coding allelic frequency spectrum, bridging the gap between common and rare variant studies. We identified 975 associations, with more than one-third being previously unreported. We demonstrate population-level relevance for mutations previously ascribed to causing single-gene disorders, map GWAS associations to likely causal genes, explain disease mechanisms, and systematically relate disease associations to levels of 117 biomarkers and clinical-stage drug targets. Combining sequencing and genotyping in two population biobanks enabled us to benefit from increased power to detect and explain disease associations, validate findings through replication and propose medical actionability for rare genetic variants. Our study provides a compendium of protein-coding variant associations for future insights into disease biology and drug discovery.


Subject(s)
Genome-Wide Association Study , Proteins , Gene Frequency/genetics , Genetic Predisposition to Disease/genetics , Genotype , Humans , Polymorphism, Single Nucleotide/genetics , Proteins/genetics , Exome Sequencing
3.
Blood ; 143(23): 2425-2432, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38498041

ABSTRACT

ABSTRACT: The factor V Leiden (FVL; rs6025) and prothrombin G20210A (PTGM; rs1799963) polymorphisms are 2 of the most well-studied genetic risk factors for venous thromboembolism (VTE). However, double heterozygosity (DH) for FVL and PTGM remains poorly understood, with previous studies showing marked disagreement regarding thrombosis risk conferred by the DH genotype. Using multidimensional data from the UK Biobank (UKB) and FinnGen biorepositories, we evaluated the clinical impact of DH carrier status across 937 939 individuals. We found that 662 participants (0.07%) were DH carriers. After adjustment for age, sex, and ancestry, DH individuals experienced a markedly elevated risk of VTE compared with wild-type individuals (odds ratio [OR] = 5.24; 95% confidence interval [CI], 4.01-6.84; P = 4.8 × 10-34), which approximated the risk conferred by FVL homozygosity. A secondary analysis restricted to UKB participants (N = 445 144) found that effect size estimates for the DH genotype remained largely unchanged (OR = 4.53; 95% CI, 3.42-5.90; P < 1 × 10-16) after adjustment for commonly cited VTE risk factors, such as body mass index, blood type, and markers of inflammation. In contrast, the DH genotype was not associated with a significantly higher risk of any arterial thrombosis phenotype, including stroke, myocardial infarction, and peripheral artery disease. In summary, we leveraged population-scale genomic data sets to conduct, to our knowledge, the largest study to date on the DH genotype and were able to establish far more precise effect size estimates than previously possible. Our findings indicate that the DH genotype may occur as frequently as FVL homozygosity and may confer a similarly increased risk of VTE.


Subject(s)
Biological Specimen Banks , Factor V , Heterozygote , Prothrombin , Humans , Prothrombin/genetics , Factor V/genetics , Female , Male , Middle Aged , United Kingdom/epidemiology , Aged , Risk Factors , Venous Thromboembolism/genetics , Venous Thromboembolism/epidemiology , Adult , Thrombosis/genetics , Thrombosis/epidemiology , Thrombosis/etiology , Genetic Predisposition to Disease , Genotype , Polymorphism, Single Nucleotide , UK Biobank
4.
Nature ; 586(7831): 769-775, 2020 10.
Article in English | MEDLINE | ID: mdl-33057200

ABSTRACT

Myeloproliferative neoplasms (MPNs) are blood cancers that are characterized by the excessive production of mature myeloid cells and arise from the acquisition of somatic driver mutations in haematopoietic stem cells (HSCs). Epidemiological studies indicate a substantial heritable component of MPNs that is among the highest known for cancers1. However, only a limited number of genetic risk loci have been identified, and the underlying biological mechanisms that lead to the acquisition of MPNs remain unclear. Here, by conducting a large-scale genome-wide association study (3,797 cases and 1,152,977 controls), we identify 17 MPN risk loci (P < 5.0 × 10-8), 7 of which have not been previously reported. We find that there is a shared genetic architecture between MPN risk and several haematopoietic traits from distinct lineages; that there is an enrichment for MPN risk variants within accessible chromatin of HSCs; and that increased MPN risk is associated with longer telomere length in leukocytes and other clonal haematopoietic states-collectively suggesting that MPN risk is associated with the function and self-renewal of HSCs. We use gene mapping to identify modulators of HSC biology linked to MPN risk, and show through targeted variant-to-function assays that CHEK2 and GFI1B have roles in altering the function of HSCs to confer disease risk. Overall, our results reveal a previously unappreciated mechanism for inherited MPN risk through the modulation of HSC function.


Subject(s)
Genetic Predisposition to Disease/genetics , Hematopoietic Stem Cells/pathology , Myeloproliferative Disorders/genetics , Myeloproliferative Disorders/pathology , Neoplasms/genetics , Neoplasms/pathology , Cell Lineage/genetics , Cell Self Renewal , Checkpoint Kinase 2/genetics , Female , Humans , Leukocytes/pathology , Male , Proto-Oncogene Proteins/genetics , Repressor Proteins/genetics , Risk , Telomere Homeostasis
5.
PLoS Genet ; 19(10): e1010982, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37871108

ABSTRACT

BACKGROUND: Preterm birth (<37 weeks of gestation) is a major cause of neonatal death and morbidity. Up to 40% of the variation in timing of birth results from genetic factors, mostly due to the maternal genome. METHODS: We conducted a genome-wide meta-analysis of gestational duration and spontaneous preterm birth in 68,732 and 98,370 European mothers, respectively. RESULTS: The meta-analysis detected 15 loci associated with gestational duration, and four loci associated with preterm birth. Seven of the associated loci were novel. The loci mapped to several biologically plausible genes, for example HAND2 whose expression was previously shown to decrease during gestation, associated with gestational duration, and GC (Vitamin D-binding protein), associated with preterm birth. Downstream in silico-analysis suggested regulatory roles as underlying mechanisms for the associated loci. LD score regression found birth weight measures as the most strongly correlated traits, highlighting the unique nature of spontaneous preterm birth phenotype. Tissue expression and colocalization analysis revealed reproductive tissues and immune cell types as the most relevant sites of action. CONCLUSION: We report novel genetic risk loci that associate with preterm birth or gestational duration, and reproduce findings from previous genome-wide association studies. Altogether, our findings provide new insight into the genetic background of preterm birth. Better characterization of the causal genetic mechanisms will be important to public health as it could suggest new strategies to treat and prevent preterm birth.


Subject(s)
Premature Birth , Female , Infant, Newborn , Humans , Premature Birth/genetics , Genome-Wide Association Study/methods , Mothers , Phenotype , Birth Weight
6.
Am J Hum Genet ; 109(12): 2152-2162, 2022 12 01.
Article in English | MEDLINE | ID: mdl-36347255

ABSTRACT

Family history is the standard indirect measure of inherited susceptibility in clinical care, whereas polygenic risk scores (PRSs) have more recently demonstrated potential for more directly capturing genetic risk in many diseases. Few studies have systematically compared how these overlap and complement each other across common diseases. Within FinnGen (N = 306,418), we leverage family relationships, up to 50 years of nationwide registries, and genome-wide genotyping to examine the interplay of family history and genome-wide PRSs. We explore the dynamic for three types of family history across 24 common diseases: first- and second-degree family history and parental causes of death. Covering a large proportion of the burden of non-communicable diseases in adults, we show that family history and PRS are independent and not interchangeable measures, but instead provide complementary information on inherited disease susceptibility. The PRSs explained on average 10% of the effect of first-degree family history, and first-degree family history 3% of PRSs, and PRS effects were independent of both early- and late-onset family history. The PRS stratified the risk similarly in individuals with and without family history. In most diseases, including coronary artery disease, glaucoma, and type 2 diabetes, a positive family history with a high PRS was associated with a considerably elevated risk, whereas a low PRS compensated completely for the risk implied by positive family history. This study provides a catalogue of risk estimates for both family history of disease and PRSs and highlights opportunities for a more comprehensive way of assessing inherited disease risk across common diseases.


Subject(s)
Diabetes Mellitus, Type 2 , Genome-Wide Association Study , Adult , Humans , Diabetes Mellitus, Type 2/genetics , Multifactorial Inheritance/genetics , Genetic Predisposition to Disease , Medical History Taking , Risk Factors
7.
Am J Hum Genet ; 109(10): 1727-1741, 2022 10 06.
Article in English | MEDLINE | ID: mdl-36055244

ABSTRACT

Transcriptomics data have been integrated with genome-wide association studies (GWASs) to help understand disease/trait molecular mechanisms. The utility of metabolomics, integrated with transcriptomics and disease GWASs, to understand molecular mechanisms for metabolite levels or diseases has not been thoroughly evaluated. We performed probabilistic transcriptome-wide association and locus-level colocalization analyses to integrate transcriptomics results for 49 tissues in 706 individuals from the GTEx project, metabolomics results for 1,391 plasma metabolites in 6,136 Finnish men from the METSIM study, and GWAS results for 2,861 disease traits in 260,405 Finnish individuals from the FinnGen study. We found that genetic variants that regulate metabolite levels were more likely to influence gene expression and disease risk compared to the ones that do not. Integrating transcriptomics with metabolomics results prioritized 397 genes for 521 metabolites, including 496 previously identified gene-metabolite pairs with strong functional connections and suggested 33.3% of such gene-metabolite pairs shared the same causal variants with genetic associations of gene expression. Integrating transcriptomics and metabolomics individually with FinnGen GWAS results identified 1,597 genes for 790 disease traits. Integrating transcriptomics and metabolomics jointly with FinnGen GWAS results helped pinpoint metabolic pathways from genes to diseases. We identified putative causal effects of UGT1A1/UGT1A4 expression on gallbladder disorders through regulating plasma (E,E)-bilirubin levels, of SLC22A5 expression on nasal polyps and plasma carnitine levels through distinct pathways, and of LIPC expression on age-related macular degeneration through glycerophospholipid metabolic pathways. Our study highlights the power of integrating multiple sets of molecular traits and GWAS results to deepen understanding of disease pathophysiology.


Subject(s)
Genome-Wide Association Study , Transcriptome , Bilirubin , Carnitine , Glycerophospholipids , Humans , Male , Metabolomics , Quantitative Trait Loci/genetics , Solute Carrier Family 22 Member 5/genetics , Transcriptome/genetics
8.
Mol Psychiatry ; 29(9): 2733-2741, 2024 Sep.
Article in English | MEDLINE | ID: mdl-38556557

ABSTRACT

Genetic factors contribute to the susceptibility of psychotic disorders, but less is known how they affect psychotic disease-course development. Utilizing polygenic scores (PGSs) in combination with longitudinal healthcare data with decades of follow-up we investigated the contributing genetics to psychotic disease-course severity and diagnostic shifts in the SUPER-Finland study, encompassing 10 403 genotyped individuals with a psychotic disorder. To longitudinally track the study participants' past disease-course severity, we created a psychiatric hospitalization burden metric using the full-coverage and nation-wide Finnish in-hospital registry (data from 1969 and onwards). Using a hierarchical model, ranking the psychotic diagnoses according to clinical severity, we show that high schizophrenia PGS (SZ-PGS) was associated with progression from lower ranked psychotic disorders to schizophrenia (OR = 1.32 [1.23-1.43], p = 1.26e-12). This development manifested already at psychotic illness onset as a higher psychiatric hospitalization burden, the proxy for disease-course severity. In schizophrenia (n = 5 479), both a high SZ-PGS and a low educational attainment PGS (EA-PGS) were associated with increased psychiatric hospitalization burden (p = 1.00e-04 and p = 4.53e-10). The SZ-PGS and the EA-PGS associated with distinct patterns of hospital usage. In individuals with high SZ-PGS, the increased hospitalization burden was composed of longer individual hospital stays, while low EA-PGS associated with shorter but more frequent hospital visits. The negative effect of a low EA-PGS was found to be partly mediated via substance use disorder, a major risk factor for hospitalizations. In conclusion, we show that high SZ-PGS and low EA-PGS both impacted psychotic disease-course development negatively but resulted in different disease-course trajectories.


Subject(s)
Disease Progression , Genetic Predisposition to Disease , Hospitalization , Multifactorial Inheritance , Psychotic Disorders , Schizophrenia , Severity of Illness Index , Humans , Psychotic Disorders/genetics , Psychotic Disorders/epidemiology , Schizophrenia/genetics , Schizophrenia/epidemiology , Male , Female , Finland/epidemiology , Adult , Multifactorial Inheritance/genetics , Middle Aged , Genetic Predisposition to Disease/genetics , Cohort Studies , Longitudinal Studies , Genotype , Registries
9.
Nat Rev Genet ; 20(11): 693-701, 2019 11.
Article in English | MEDLINE | ID: mdl-31455890

ABSTRACT

Human genomics is undergoing a step change from being a predominantly research-driven activity to one driven through health care as many countries in Europe now have nascent precision medicine programmes. To maximize the value of the genomic data generated, these data will need to be shared between institutions and across countries. In recognition of this challenge, 21 European countries recently signed a declaration to transnationally share data on at least 1 million human genomes by 2022. In this Roadmap, we identify the challenges of data sharing across borders and demonstrate that European research infrastructures are well-positioned to support the rapid implementation of widespread genomic data access.


Subject(s)
Biomedical Research , Genome, Human , Human Genome Project , Europe , Humans
11.
Nature ; 572(7769): 323-328, 2019 08.
Article in English | MEDLINE | ID: mdl-31367044

ABSTRACT

Exome-sequencing studies have generally been underpowered to identify deleterious alleles with a large effect on complex traits as such alleles are mostly rare. Because the population of northern and eastern Finland has expanded considerably and in isolation following a series of bottlenecks, individuals of these populations have numerous deleterious alleles at a relatively high frequency. Here, using exome sequencing of nearly 20,000 individuals from these regions, we investigate the role of rare coding variants in clinically relevant quantitative cardiometabolic traits. Exome-wide association studies for 64 quantitative traits identified 26 newly associated deleterious alleles. Of these 26 alleles, 19 are either unique to or more than 20 times more frequent in Finnish individuals than in other Europeans and show geographical clustering comparable to Mendelian disease mutations that are characteristic of the Finnish population. We estimate that sequencing studies of populations without this unique history would require hundreds of thousands to millions of participants to achieve comparable association power.


Subject(s)
Exome Sequencing , Genetic Association Studies/methods , Genetic Predisposition to Disease/genetics , Genetic Variation/genetics , Quantitative Trait Loci/genetics , Alleles , Cholesterol, HDL/genetics , Cluster Analysis , Endpoint Determination , Finland , Geographic Mapping , Humans , Multifactorial Inheritance/genetics , Reproducibility of Results
13.
BMC Genomics ; 25(1): 256, 2024 Mar 07.
Article in English | MEDLINE | ID: mdl-38454350

ABSTRACT

BACKGROUND: Congenital heart defects (CHD) are structural defects of the heart affecting approximately 1% of newborns. They exhibit low penetrance and non-Mendelian patterns of inheritance as varied and complex traits. While genetic factors are known to play an important role in the development of CHD, the specific genetics remain unknown for the majority of patients. To elucidate the underlying genetic risk, we performed a genome wide association study (GWAS) of CHDs in general and specific CHD subgroups using the FinnGen Release 10 (R10) (N > 393,000), followed by functional fine-mapping through eQTL and co-localization analyses using the GTEx database. RESULTS: We discovered three genome-wide significant loci associated with general CHD. Two of them were located in chromosome 17: 17q21.32 (rs2316327, intronic: LRRC37A2, Odds ratio (OR) [95% Confidence Interval (CI)] = 1.17[1.12-1.23], p = 1.5 × 10-9) and 17q25.3 (rs1293973611, nearest: BAHCC1, OR[95%CI] = 4.48[2.80-7.17], p = 7.0 × 10-10), respectively, and in addition to general CHD, the rs1293973611 locus was associated with the septal defect subtype. The third locus was in band 1p21.2 (rs35046143, nearest: PALMD, OR[95%CI] = 1.15[1.09-1.21], p = 7.1 × 10-9), and it was associated with general CHD and left-sided lesions. In the subgroup analysis, two additional loci were associated with septal defects (rs75230966 and rs6824295), and one with left-sided lesions (rs1305393195). In the eQTL analysis the variants rs2316327 (general CHD), and rs75230966 (septal defects) both located in 17q21.32 (with a LD r2 of 0.41) were both predicted to significantly associate with the expression of WNT9B in the atrial appendage tissue category. This effect was further confirmed by co-localization analysis, which also implicated WNT3 expression in the atrial appendage. A meta-analysis of general CHD together with the UK Biobank (combined N = 881,678) provided a different genome-wide significant locus in LRRC37A2; rs16941382 (OR[95%CI] = 1.15[1.11-1.20], p = 1.5 × 10-9) which is in significant LD with rs2316327. CONCLUSIONS: Our results of general CHD and different CHD subcategories identified a complex risk locus on chromosome 17 near BAHCC1 and LRRC37A2, interacting with the genes WNT9B, WNT3 and MYL4, may constitute potential novel CHD risk associated loci, warranting future experimental tests to determine their role.


Subject(s)
Genome-Wide Association Study , Heart Defects, Congenital , Humans , Infant, Newborn , Genetic Predisposition to Disease , Heart Defects, Congenital/genetics , Risk Factors , Databases, Genetic
14.
Lab Invest ; 104(4): 100325, 2024 04.
Article in English | MEDLINE | ID: mdl-38220043

ABSTRACT

Formalin-fixed paraffin-embedded (FFPE) tissues stored in biobanks and pathology archives are a vast but underutilized source for molecular studies on different diseases. Beyond being the "gold standard" for preservation of diagnostic human tissues, FFPE samples retain similar genetic information as matching blood samples, which could make FFPE samples an ideal resource for genomic analysis. However, research on this resource has been hindered by the perception that DNA extracted from FFPE samples is of poor quality. Here, we show that germline disease-predisposing variants and polygenic risk scores (PRS) can be identified from FFPE normal tissue (FFPE-NT) DNA with high accuracy. We optimized the performance of FFPE-NT DNA on a genome-wide array containing 657,675 variants. Via a series of testing and validation phases, we established a protocol for FFPE-NT genotyping with results comparable with blood genotyping. The median call rate of FFPE-NT samples in the validation phase was 99.85% (range 98.26%-99.94%) and median concordance with matching blood samples was 99.79% (range 98.85%-99.9%). We also demonstrated that a rare pathogenic PALB2 genetic variant predisposing to cancer can be correctly identified in FFPE-NT samples. We further imputed the FFPE-NT genotype data and calculated the FFPE-NT genome-wide PRS in 3 diseases and 4 disease risk variables. In all cases, FFPE-NT and matching blood PRS were highly concordant (all Pearson's r > 0.95). The ability to precisely genotype FFPE-NT on a genome-wide array enables translational genomics applications of archived FFPE-NT samples with the possibility to link to corresponding phenotypes and longitudinal health data.


Subject(s)
Formaldehyde , Genetic Risk Score , Humans , Genotype , Tissue Fixation/methods , DNA/genetics , Paraffin Embedding/methods
15.
Am J Hum Genet ; 108(4): 583-596, 2021 04 01.
Article in English | MEDLINE | ID: mdl-33798444

ABSTRACT

The contribution of genome structural variation (SV) to quantitative traits associated with cardiometabolic diseases remains largely unknown. Here, we present the results of a study examining genetic association between SVs and cardiometabolic traits in the Finnish population. We used sensitive methods to identify and genotype 129,166 high-confidence SVs from deep whole-genome sequencing (WGS) data of 4,848 individuals. We tested the 64,572 common and low-frequency SVs for association with 116 quantitative traits and tested candidate associations using exome sequencing and array genotype data from an additional 15,205 individuals. We discovered 31 genome-wide significant associations at 15 loci, including 2 loci at which SVs have strong phenotypic effects: (1) a deletion of the ALB promoter that is greatly enriched in the Finnish population and causes decreased serum albumin level in carriers (p = 1.47 × 10-54) and is also associated with increased levels of total cholesterol (p = 1.22 × 10-28) and 14 additional cholesterol-related traits, and (2) a multi-allelic copy number variant (CNV) at PDPR that is strongly associated with pyruvate (p = 4.81 × 10-21) and alanine (p = 6.14 × 10-12) levels and resides within a structurally complex genomic region that has accumulated many rearrangements over evolutionary time. We also confirmed six previously reported associations, including five led by stronger signals in single nucleotide variants (SNVs) and one linking recurrent HP gene deletion and cholesterol levels (p = 6.24 × 10-10), which was also found to be strongly associated with increased glycoprotein level (p = 3.53 × 10-35). Our study confirms that integrating SVs in trait-mapping studies will expand our knowledge of genetic factors underlying disease risk.


Subject(s)
Cardiovascular Diseases/genetics , Genomic Structural Variation/genetics , Alleles , Cholesterol/blood , DNA Copy Number Variations/genetics , Female , Finland , Genome, Human/genetics , Genotype , High-Throughput Nucleotide Sequencing , Humans , Male , Mitochondrial Proteins/genetics , Promoter Regions, Genetic/genetics , Pyruvate Dehydrogenase (Lipoamide)-Phosphatase/genetics , Pyruvic Acid/metabolism , Serum Albumin, Human/genetics
16.
Am J Hum Genet ; 108(4): 722-738, 2021 04 01.
Article in English | MEDLINE | ID: mdl-33798445

ABSTRACT

Progressive myoclonus epilepsies (PMEs) comprise a group of clinically and genetically heterogeneous rare diseases. Over 70% of PME cases can now be molecularly solved. Known PME genes encode a variety of proteins, many involved in lysosomal and endosomal function. We performed whole-exome sequencing (WES) in 84 (78 unrelated) unsolved PME-affected individuals, with or without additional family members, to discover novel causes. We identified likely disease-causing variants in 24 out of 78 (31%) unrelated individuals, despite previous genetic analyses. The diagnostic yield was significantly higher for individuals studied as trios or families (14/28) versus singletons (10/50) (OR = 3.9, p value = 0.01, Fisher's exact test). The 24 likely solved cases of PME involved 18 genes. First, we found and functionally validated five heterozygous variants in NUS1 and DHDDS and a homozygous variant in ALG10, with no previous disease associations. All three genes are involved in dolichol-dependent protein glycosylation, a pathway not previously implicated in PME. Second, we independently validate SEMA6B as a dominant PME gene in two unrelated individuals. Third, in five families, we identified variants in established PME genes; three with intronic or copy-number changes (CLN6, GBA, NEU1) and two very rare causes (ASAH1, CERS1). Fourth, we found a group of genes usually associated with developmental and epileptic encephalopathies, but here, remarkably, presenting as PME, with or without prior developmental delay. Our systematic analysis of these cases suggests that the small residuum of unsolved cases will most likely be a collection of very rare, genetically heterogeneous etiologies.


Subject(s)
Dolichols/metabolism , Mutation/genetics , Myoclonic Epilepsies, Progressive/genetics , Adolescent , Adult , Age of Onset , Child , Child, Preschool , Cohort Studies , DNA Copy Number Variations/genetics , Female , Glycosylation , Humans , Introns/genetics , Male , Middle Aged , Myoclonic Epilepsies, Progressive/classification , Exome Sequencing , Young Adult
17.
Gastroenterology ; 165(4): 861-873, 2023 10.
Article in English | MEDLINE | ID: mdl-37453564

ABSTRACT

BACKGROUND & AIMS: Small intestinal neuroendocrine tumor (SI-NET) is a rare disease, but its incidence has increased over the past 4 decades. Understanding the genetic risk factors underlying SI-NETs can help in disease prevention and may provide clinically beneficial markers for diagnosis. Here the results of the largest genome-wide association study of SI-NETs performed to date with 405 cases and 614,666 controls are reported. METHODS: Samples from 307 patients with SI-NETs and 287,137 controls in the FinnGen study were used for the identification of SI-NET risk-associated genetic variants. The results were also meta-analyzed with summary statistics from the UK Biobank (n = 98 patients with SI-NET and n = 327,529 controls). RESULTS: We identified 6 genome-wide significant (P < 5 × 10-8) loci associated with SI-NET risk, of which 4 (near SEMA6A, LGR5, CDKAL1, and FERMT2) are novel and 2 (near LTA4H-ELK and in KIF16B) have been reported previously. Interestingly, the top hit (rs200138614; P = 1.80 × 10-19) was a missense variant (p.Cys712Phe) in the LGR5 gene, a bona-fide marker of adult intestinal stem cells and a potentiator of canonical WNT signaling. The association was validated in an independent Finnish collection of 70 patients with SI-NETs, as well as in the UK Biobank exome sequence data (n = 92 cases and n = 392,814 controls). Overexpression of LGR5 p.Cys712Phe in intestinal organoids abolished the ability of R-Spondin1 to support organoid growth, indicating that the mutation perturbed R-Spondin-LGR5 signaling. CONCLUSIONS: Our study is the largest genome-wide association study to date on SI-NETs and reported 4 new associated genome-wide association study loci, including a novel missense mutation (rs200138614, p.Cys712Phe) in LGR5, a canonical marker of adult intestinal stem cells.


Subject(s)
Intestinal Neoplasms , Neuroendocrine Tumors , Adult , Humans , Neuroendocrine Tumors/genetics , Neuroendocrine Tumors/pathology , Mutation, Missense , Genome-Wide Association Study , Intestinal Neoplasms/genetics , Intestinal Neoplasms/pathology , Receptors, G-Protein-Coupled/genetics , Kinesins/genetics
18.
Mol Psychiatry ; 28(2): 891-900, 2023 02.
Article in English | MEDLINE | ID: mdl-36253440

ABSTRACT

Suicide is a worldwide health crisis. We aimed to identify genetic risk variants associated with suicide death and suicidal behavior. Meta-analysis for suicide death was performed using 3765 cases from Utah and matching 6572 controls of European ancestry. Meta-analysis for suicidal behavior using data across five cohorts (n = 8315 cases and 256,478 psychiatric or populational controls of European ancestry) was also performed. One locus in neuroligin 1 (NLGN1) passing the genome-wide significance threshold for suicide death was identified (top SNP rs73182688, with p = 5.48 × 10-8 before and p = 4.55 × 10-8 after mtCOJO analysis conditioning on MDD to remove genetic effects on suicide mediated by MDD). Conditioning on suicidal attempts did not significantly change the association strength (p = 6.02 × 10-8), suggesting suicide death specificity. NLGN1 encodes a member of a family of neuronal cell surface proteins. Members of this family act as splice site-specific ligands for beta-neurexins and may be involved in synaptogenesis. The NRXN-NLGN pathway was previously implicated in suicide, autism, and schizophrenia. We additionally identified ROBO2 and ZNF28 associations with suicidal behavior in the meta-analysis across five cohorts in gene-based association analysis using MAGMA. Lastly, we replicated two loci including variants near SOX5 and LOC101928519 associated with suicidal attempts identified in the ISGC and MVP meta-analysis using the independent FinnGen samples. Suicide death and suicidal behavior showed positive genetic correlations with depression, schizophrenia, pain, and suicidal attempt, and negative genetic correlation with educational attainment. These correlations remained significant after conditioning on depression, suggesting pleiotropic effects among these traits. Bidirectional generalized summary-data-based Mendelian randomization analysis suggests that genetic risk for the suicidal attempt and suicide death are both bi-directionally causal for MDD.


Subject(s)
Suicidal Ideation , Suicide , Humans , Genome-Wide Association Study , Suicide/psychology , Suicide, Attempted/psychology , Risk Factors
19.
PLoS Genet ; 17(4): e1009501, 2021 04.
Article in English | MEDLINE | ID: mdl-33909604

ABSTRACT

Protein-truncating variants (PTVs) affecting dyslipidemia risk may point to therapeutic targets for cardiometabolic disease. Our objective was to identify PTVs that were associated with both lipid levels and the risk of coronary artery disease (CAD) or type 2 diabetes (T2D) and assess their possible associations with risks of other diseases. To achieve this aim, we leveraged the enrichment of PTVs in the Finnish population and tested the association of low-frequency PTVs in 1,209 genes with serum lipid levels in the Finrisk Study (n = 23,435). We then tested which of the lipid-associated PTVs were also associated with the risks of T2D or CAD, as well as 2,683 disease endpoints curated in the FinnGen Study (n = 218,792). Two PTVs were associated with both lipid levels and the risk of CAD or T2D: triglyceride-lowering variants in ANGPTL8 (-24.0[-30.4 to -16.9] mg/dL per rs760351239-T allele, P = 3.4 × 10-9) and ANGPTL4 (-14.4[-18.6 to -9.8] mg/dL per rs746226153-G allele, P = 4.3 × 10-9). The risk of T2D was lower in carriers of the ANGPTL4 PTV (OR = 0.70[0.60-0.81], P = 2.2 × 10-6) than noncarriers. The odds of CAD were 47% lower in carriers of a PTV in ANGPTL8 (OR = 0.53[0.37-0.76], P = 4.5 × 10-4) than noncarriers. Finally, the phenome-wide scan of the ANGPTL8 PTV showed that the ANGPTL8 PTV carriers were less likely to use statin therapy (68,782 cases, OR = 0.52[0.40-0.68], P = 1.7 × 10-6) compared to noncarriers. Our findings provide genetic evidence of potential long-term efficacy and safety of therapeutic targeting of dyslipidemias.


Subject(s)
Angiopoietin-like Proteins/genetics , Coronary Artery Disease/drug therapy , Diabetes Mellitus, Type 2/drug therapy , Dyslipidemias/drug therapy , Peptide Hormones/genetics , Aged , Angiopoietin-Like Protein 8 , Coronary Artery Disease/blood , Coronary Artery Disease/genetics , Coronary Artery Disease/pathology , Diabetes Mellitus, Type 2/blood , Diabetes Mellitus, Type 2/genetics , Diabetes Mellitus, Type 2/pathology , Dyslipidemias/blood , Dyslipidemias/genetics , Dyslipidemias/pathology , Female , Genetic Predisposition to Disease , Humans , Hydroxymethylglutaryl-CoA Reductase Inhibitors/administration & dosage , Hydroxymethylglutaryl-CoA Reductase Inhibitors/adverse effects , Male , Middle Aged , Polymorphism, Single Nucleotide/genetics , Risk Factors , Triglycerides/blood
20.
Genome Res ; 30(1): 62-71, 2020 01.
Article in English | MEDLINE | ID: mdl-31871067

ABSTRACT

Missense variant interpretation is challenging. Essential regions for protein function are conserved among gene-family members, and genetic variants within these regions are potentially more likely to confer risk to disease. Here, we generated 2871 gene-family protein sequence alignments involving 9990 genes and performed missense variant burden analyses to identify novel essential protein regions. We mapped 2,219,811 variants from the general population into these alignments and compared their distribution with 76,153 missense variants from patients. With this gene-family approach, we identified 465 regions enriched for patient variants spanning 41,463 amino acids in 1252 genes. As a comparison, by testing the same genes individually, we identified fewer patient variant enriched regions, involving only 2639 amino acids and 215 genes. Next, we selected de novo variants from 6753 patients with neurodevelopmental disorders and 1911 unaffected siblings and observed an 8.33-fold enrichment of patient variants in our identified regions (95% C.I. = 3.90-Inf, P-value = 2.72 × 10-11). By using the complete ClinVar variant set, we found that missense variants inside the identified regions are 106-fold more likely to be classified as pathogenic in comparison to benign classification (OR = 106.15, 95% C.I = 70.66-Inf, P-value < 2.2 × 10-16). All pathogenic variant enriched regions (PERs) identified are available online through "PER viewer," a user-friendly online platform for interactive data mining, visualization, and download. In summary, our gene-family burden analysis approach identified novel PERs in protein sequences. This annotation can empower variant interpretation.


Subject(s)
Chromosome Mapping , Genetic Predisposition to Disease , Genetic Variation , Multigene Family , Alleles , Amino Acid Sequence , Amino Acid Substitution , Computational Biology/methods , Female , Genome-Wide Association Study , Humans , Male , Mutation, Missense , Software , User-Computer Interface
SELECTION OF CITATIONS
SEARCH DETAIL