RESUMO
We present the largest exome sequencing study of autism spectrum disorder (ASD) to date (n = 35,584 total samples, 11,986 with ASD). Using an enhanced analytical framework to integrate de novo and case-control rare variation, we identify 102 risk genes at a false discovery rate of 0.1 or less. Of these genes, 49 show higher frequencies of disruptive de novo variants in individuals ascertained to have severe neurodevelopmental delay, whereas 53 show higher frequencies in individuals ascertained to have ASD; comparing ASD cases with mutations in these groups reveals phenotypic differences. Expressed early in brain development, most risk genes have roles in regulation of gene expression or neuronal communication (i.e., mutations effect neurodevelopmental and neurophysiological changes), and 13 fall within loci recurrently hit by copy number variants. In cells from the human cortex, expression of risk genes is enriched in excitatory and inhibitory neuronal lineages, consistent with multiple paths to an excitatory-inhibitory imbalance underlying ASD.
Assuntos
Transtorno Autístico/genética , Córtex Cerebral/crescimento & desenvolvimento , Sequenciamento do Exoma/métodos , Regulação da Expressão Gênica no Desenvolvimento , Neurobiologia/métodos , Estudos de Casos e Controles , Linhagem da Célula , Estudos de Coortes , Exoma , Feminino , Frequência do Gene , Predisposição Genética para Doença , Humanos , Masculino , Mutação de Sentido Incorreto , Neurônios/metabolismo , Fenótipo , Fatores Sexuais , Análise de Célula Única/métodosRESUMO
Circular RNAs (circRNAs) are an intriguing class of RNA due to their covalently closed structure, high stability, and implicated roles in gene regulation. Here, we used an exome capture RNA sequencing protocol to detect and characterize circRNAs across >2,000 cancer samples. When compared against Ribo-Zero and RNase R, capture sequencing significantly enhanced the enrichment of circRNAs and preserved accurate circular-to-linear ratios. Using capture sequencing, we built the most comprehensive catalog of circRNA species to date: MiOncoCirc, the first database to be composed primarily of circRNAs directly detected in tumor tissues. Using MiOncoCirc, we identified candidate circRNAs to serve as biomarkers for prostate cancer and were able to detect circRNAs in urine. We further detected a novel class of circular transcripts, termed read-through circRNAs, that involved exons originating from different genes. MiOncoCirc will serve as a valuable resource for the development of circRNAs as diagnostic or therapeutic targets across cancer types.
Assuntos
Perfilação da Expressão Gênica/métodos , Neoplasias/genética , RNA/genética , Biomarcadores Tumorais/genética , Bases de Dados Genéticas , Regulação Neoplásica da Expressão Gênica/genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , MicroRNAs/genética , RNA/metabolismo , RNA Circular , Análise de Sequência de RNA/métodos , Sequenciamento do Exoma/métodosRESUMO
Genetic alterations in signaling pathways that control cell-cycle progression, apoptosis, and cell growth are common hallmarks of cancer, but the extent, mechanisms, and co-occurrence of alterations in these pathways differ between individual tumors and tumor types. Using mutations, copy-number changes, mRNA expression, gene fusions and DNA methylation in 9,125 tumors profiled by The Cancer Genome Atlas (TCGA), we analyzed the mechanisms and patterns of somatic alterations in ten canonical pathways: cell cycle, Hippo, Myc, Notch, Nrf2, PI-3-Kinase/Akt, RTK-RAS, TGFß signaling, p53 and ß-catenin/Wnt. We charted the detailed landscape of pathway alterations in 33 cancer types, stratified into 64 subtypes, and identified patterns of co-occurrence and mutual exclusivity. Eighty-nine percent of tumors had at least one driver alteration in these pathways, and 57% percent of tumors had at least one alteration potentially targetable by currently available drugs. Thirty percent of tumors had multiple targetable alterations, indicating opportunities for combination therapy.
Assuntos
Bases de Dados Genéticas , Neoplasias/patologia , Transdução de Sinais/genética , Genes Neoplásicos , Humanos , Neoplasias/genética , Fosfatidilinositol 3-Quinases/genética , Fosfatidilinositol 3-Quinases/metabolismo , Fator de Crescimento Transformador beta/genética , Fator de Crescimento Transformador beta/metabolismo , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/metabolismo , Proteínas Wnt/genética , Proteínas Wnt/metabolismoRESUMO
Diffuse large B cell lymphoma (DLBCL) is the most common form of blood cancer and is characterized by a striking degree of genetic and clinical heterogeneity. This heterogeneity poses a major barrier to understanding the genetic basis of the disease and its response to therapy. Here, we performed an integrative analysis of whole-exome sequencing and transcriptome sequencing in a cohort of 1,001 DLBCL patients to comprehensively define the landscape of 150 genetic drivers of the disease. We characterized the functional impact of these genes using an unbiased CRISPR screen of DLBCL cell lines to define oncogenes that promote cell growth. A prognostic model comprising these genetic alterations outperformed current established methods: cell of origin, the International Prognostic Index comprising clinical variables, and dual MYC and BCL2 expression. These results comprehensively define the genetic drivers and their functional roles in DLBCL to identify new therapeutic opportunities in the disease.
Assuntos
Sistemas CRISPR-Cas , Perfilação da Expressão Gênica , Linfoma Difuso de Grandes Células B/genética , Antineoplásicos/administração & dosagem , Linhagem Celular Tumoral , Células Cultivadas , Exoma , Feminino , Humanos , Linfoma Difuso de Grandes Células B/tratamento farmacológico , Masculino , Rituximab/administração & dosagemRESUMO
Twin studies suggest that additive genetic effects account for about a quarter of the variance in handedness. Recently, Schijven et al. used exome-wide sequencing to provide evidence for a role of rare protein-coding variants in handedness. These included the gene encoding beta-tubulin, TUBB4B, suggesting that microtubules are relevant for handedness ontogenesis.
Assuntos
Lateralidade Funcional , Tubulina (Proteína) , Tubulina (Proteína)/genética , Humanos , Lateralidade Funcional/genética , Variação Genética , Microtúbulos/genéticaRESUMO
Gene-based burden tests are a popular and powerful approach for analysis of exome-wide association studies. These approaches combine sets of variants within a gene into a single burden score that is then tested for association. Typically, a range of burden scores are calculated and tested across a range of annotation classes and frequency bins. Correlation between these tests can complicate the multiple testing correction and hamper interpretation of the results. We introduce a method called the sparse burden association test (SBAT) that tests the joint set of burden scores under the assumption that causal burden scores act in the same effect direction. The method simultaneously assesses the significance of the model fit and selects the set of burden scores that best explain the association at the same time. Using simulated data, we show that the method is well calibrated and highlight scenarios where the test outperforms existing gene-based tests. We apply the method to 73 quantitative traits from the UK Biobank, showing that SBAT is a valuable additional gene-based test when combined with other existing approaches. This test is implemented in the REGENIE software.
Assuntos
Estudo de Associação Genômica Ampla , Humanos , Estudo de Associação Genômica Ampla/métodos , Análise dos Mínimos Quadrados , Software , Modelos Genéticos , Exoma/genética , Variação Genética , Simulação por ComputadorRESUMO
Copy number variants (CNVs) are significant contributors to the pathogenicity of rare genetic diseases and, with new innovative methods, can now reliably be identified from exome sequencing. Challenges still remain in accurate classification of CNV pathogenicity. CNV calling using GATK-gCNV was performed on exomes from a cohort of 6,633 families (15,759 individuals) with heterogeneous phenotypes and variable prior genetic testing collected at the Broad Institute Center for Mendelian Genomics of the Genomics Research to Elucidate the Genetics of Rare Diseases consortium and analyzed using the seqr platform. The addition of CNV detection to exome analysis identified causal CNVs for 171 families (2.6%). The estimated sizes of CNVs ranged from 293 bp to 80 Mb. The causal CNVs consisted of 140 deletions, 15 duplications, 3 suspected complex structural variants (SVs), 3 insertions, and 10 complex SVs, the latter two groups being identified by orthogonal confirmation methods. To classify CNV variant pathogenicity, we used the 2020 American College of Medical Genetics and Genomics/ClinGen CNV interpretation standards and developed additional criteria to evaluate allelic and functional data as well as variants on the X chromosome to further advance the framework. We interpreted 151 CNVs as likely pathogenic/pathogenic and 20 CNVs as high-interest variants of uncertain significance. Calling CNVs from existing exome data increases the diagnostic yield for individuals undiagnosed after standard testing approaches, providing a higher-resolution alternative to arrays at a fraction of the cost of genome sequencing. Our improvements to the classification approach advances the systematic framework to assess the pathogenicity of CNVs.
Assuntos
Variações do Número de Cópias de DNA , Sequenciamento do Exoma , Exoma , Doenças Raras , Humanos , Variações do Número de Cópias de DNA/genética , Doenças Raras/genética , Doenças Raras/diagnóstico , Exoma/genética , Masculino , Feminino , Estudos de Coortes , Testes Genéticos/métodosRESUMO
Bicuspid aortic valve (BAV) is the most common congenital heart lesion with an estimated population prevalence of 1%. We hypothesize that specific gene variants predispose to early-onset complications of BAV (EBAV). We analyzed whole-exome sequences (WESs) to identify rare coding variants that contribute to BAV disease in 215 EBAV-affected families. Predicted damaging variants in candidate genes with moderate or strong supportive evidence to cause developmental cardiac phenotypes were present in 107 EBAV-affected families (50% of total), including genes that cause BAV (9%) or heritable thoracic aortic disease (HTAD, 19%). After appropriate filtration, we also identified 129 variants in 54 candidate genes that are associated with autosomal-dominant congenital heart phenotypes, including recurrent deleterious variation of FBN2, MYH6, channelopathy genes, and type 1 and 5 collagen genes. These findings confirm our hypothesis that unique rare genetic variants drive early-onset presentations of BAV disease.
Assuntos
Valva Aórtica , Doença da Válvula Aórtica Bicúspide , Sequenciamento do Exoma , Doenças das Valvas Cardíacas , Linhagem , Humanos , Doença da Válvula Aórtica Bicúspide/genética , Doença da Válvula Aórtica Bicúspide/patologia , Valva Aórtica/anormalidades , Valva Aórtica/patologia , Doenças das Valvas Cardíacas/genética , Masculino , Feminino , Predisposição Genética para Doença , Idade de Início , Fenótipo , Exoma/genética , Adulto , Cadeias Pesadas de Miosina/genética , Fibrilina-2/genética , Miosinas Cardíacas/genéticaRESUMO
Histone deacetylase 3 (HDAC3) is a crucial epigenetic modulator essential for various developmental and physiological functions. Although its dysfunction is increasingly recognized in abnormal phenotypes, to our knowledge, there have been no established reports of human diseases directly linked to HDAC3 dysfunction. Using trio exome sequencing and extensive phenotypic analysis, we correlated heterozygous de novo variants in HDAC3 with a neurodevelopmental disorder having variable clinical presentations, frequently associated with intellectual disability, developmental delay, epilepsy, and musculoskeletal abnormalities. In a cohort of six individuals, we identified missense variants in HDAC3 (c.277G>A [p.Asp93Asn], c.328G>A [p.Ala110Thr], c.601C>T [p.Pro201Ser], c. 797T>C [p.Leu266Ser], c.799G>A [p.Gly267Ser], and c.1075C>T [p.Arg359Cys]), all located in evolutionarily conserved sites and confirmed as de novo. Experimental studies identified defective deacetylation activity in the p.Asp93Asn, p.Pro201Ser, p.Leu266Ser, and p.Gly267Ser variants, positioned near the enzymatic pocket. In addition, proteomic analysis employing co-immunoprecipitation revealed that the disrupted interactions with molecules involved in the CoREST and NCoR complexes, particularly in the p.Ala110Thr variant, consist of a central pathogenic mechanism. Moreover, immunofluorescence analysis showed diminished nuclear to cytoplasmic fluorescence ratio in the p.Ala110Thr, p.Gly267Ser, and p.Arg359Cys variants, indicating impaired nuclear localization. Taken together, our study highlights that de novo missense variants in HDAC3 are associated with a broad spectrum of neurodevelopmental disorders, which emphasizes the complex role of HDAC3 in histone deacetylase activity, multi-protein complex interactions, and nuclear localization for proper physiological functions. These insights open new avenues for understanding the molecular mechanisms of HDAC3-related disorders and may inform future therapeutic strategies.
Assuntos
Epigênese Genética , Histona Desacetilases , Mutação de Sentido Incorreto , Transtornos do Neurodesenvolvimento , Humanos , Histona Desacetilases/genética , Histona Desacetilases/metabolismo , Mutação de Sentido Incorreto/genética , Transtornos do Neurodesenvolvimento/genética , Masculino , Feminino , Pré-Escolar , Criança , Deficiência Intelectual/genética , Sequenciamento do Exoma , Adolescente , Deficiências do Desenvolvimento/genética , Fenótipo , Lactente , Correpressor 1 de Receptor Nuclear/genética , Correpressor 1 de Receptor Nuclear/metabolismoRESUMO
Selenophosphate synthetase (SEPHS) plays an essential role in selenium metabolism. Two mammalian SEPHS paralogues, SEPHS1 and SEPHS2, share high sequence identity and structural homology with SEPHS. Here, we report nine individuals from eight families with developmental delay, growth and feeding problems, hypotonia, and dysmorphic features, all with heterozygous missense variants in SEPHS1. Eight of these individuals had a recurrent variant at amino acid position 371 of SEPHS1 (p.Arg371Trp, p.Arg371Gln, and p.Arg371Gly); seven of these variants were known to be de novo. Structural modeling and biochemical assays were used to understand the effect of these variants on SEPHS1 function. We found that a variant at residue Trp352 results in local structural changes of the C-terminal region of SEPHS1 that decrease the overall thermal stability of the enzyme. In contrast, variants of a solvent-exposed residue Arg371 do not impact enzyme stability and folding but could modulate direct protein-protein interactions of SEPSH1 with cellular factors in promoting cell proliferation and development. In neuronal SH-SY5Y cells, we assessed the impact of SEPHS1 variants on cell proliferation and ROS production and investigated the mRNA expression levels of genes encoding stress-related selenoproteins. Our findings provided evidence that the identified SEPHS1 variants enhance cell proliferation by modulating ROS homeostasis. Our study supports the hypothesis that SEPHS1 plays a critical role during human development and provides a basis for further investigation into the molecular mechanisms employed by SEPHS1. Furthermore, our data suggest that variants in SEPHS1 are associated with a neurodevelopmental disorder.
Assuntos
Deficiência Intelectual , Anormalidades Musculoesqueléticas , Transtornos do Neurodesenvolvimento , Animais , Criança , Humanos , Deficiências do Desenvolvimento/genética , Éxons , Deficiência Intelectual/genética , Mamíferos/genética , Hipotonia Muscular/genética , Anormalidades Musculoesqueléticas/genética , Neuroblastoma/genética , Transtornos do Neurodesenvolvimento/genética , Espécies Reativas de OxigênioRESUMO
Infertility, affecting â¼10% of men, is predominantly caused by primary spermatogenic failure (SPGF). We screened likely pathogenic and pathogenic (LP/P) variants in 638 candidate genes for male infertility in 521 individuals presenting idiopathic SPGF and 323 normozoospermic men in the ESTAND cohort. Molecular diagnosis was reached for 64 men with SPGF (12%), with findings in 39 genes (6%). The yield did not differ significantly between the subgroups with azoospermia (20/185, 11%), oligozoospermia (18/181, 10%), and primary cryptorchidism with SPGF (26/155, 17%). Notably, 19 of 64 LP/P variants (30%) identified in 28 subjects represented recurrent findings in this study and/or with other male infertility cohorts. NR5A1 was the most frequently affected gene, with seven LP/P variants in six SPGF-affected men and two normozoospermic men. The link to SPGF was validated for recently proposed candidate genes ACTRT1, ASZ1, GLUD2, GREB1L, LEO1, RBM5, ROS1, and TGIF2LY. Heterozygous truncating variants in BNC1, reported in female infertility, emerged as plausible causes of severe oligozoospermia. Data suggested that several infertile men may present congenital conditions with less pronounced or pleiotropic phenotypes affecting the development and function of the reproductive system. Genes regulating the hypothalamic-pituitary-gonadal axis were affected in >30% of subjects with LP/P variants. Six individuals had more than one LP/P variant, including five with two findings from the gene panel. A 4-fold increased prevalence of cancer was observed in men with genetic infertility compared to the general male population (8% vs. 2%; p = 4.4 × 10-3). Expanding genetic testing in andrology will contribute to the multidisciplinary management of SPGF.
Assuntos
Infertilidade Masculina , Humanos , Masculino , Infertilidade Masculina/genética , Adulto , Sequenciamento do Exoma , Fator Esteroidogênico 1/genética , Azoospermia/genética , Oligospermia/genética , Mutação , Espermatogênese/genética , Estudos de CoortesRESUMO
Developmental and functional defects in the lymphatic system are responsible for primary lymphoedema (PL). PL is a chronic debilitating disease caused by increased accumulation of interstitial fluid, predisposing to inflammation, infections and fibrosis. There is no cure, only symptomatic treatment is available. Thirty-two genes or loci have been linked to PL, and another 22 are suggested, including Hepatocyte Growth Factor (HGF). We searched for HGF variants in 770 index patients from the Brussels PL cohort. We identified ten variants predicted to cause HGF loss-of-function (six nonsense, two frameshifts, and two splice-site changes; 1.3% of our cohort), and 14 missense variants predicted to be pathogenic in 17 families (2.21%). We studied co-segregation within families, mRNA stability for non-sense variants, and in vitro functional effects of the missense variants. Analyses of the mRNA of patient cells revealed degradation of the nonsense mutant allele. Reduced protein secretion was detected for nine of the 14 missense variants expressed in COS-7 cells. Stimulation of lymphatic endothelial cells with these 14 HGF variant proteins resulted in decreased activation of the downstream targets AKT and ERK1/2 for three of them. Clinically, HGF-associated PL was diverse, but predominantly bilateral in the lower limbs with onset varying from early childhood to adulthood. Finally, aggregation study in a second independent cohort underscored that rare likely pathogenic variants in HGF explain about 2% of PL. Therefore, HGF signalling seems crucial for lymphatic development and/or maintenance in human beings and HGF should be included in diagnostic genetic screens for PL.
Assuntos
Fator de Crescimento de Hepatócito , Linfedema , Humanos , Fator de Crescimento de Hepatócito/genética , Fator de Crescimento de Hepatócito/metabolismo , Masculino , Feminino , Criança , Adulto , Linfedema/genética , Linfedema/patologia , Adolescente , Pessoa de Meia-Idade , Animais , Mutação de Sentido Incorreto/genética , Mutação com Perda de Função , Idade de Início , Pré-Escolar , Células COS , Chlorocebus aethiops , Células Endoteliais/metabolismo , Células Endoteliais/patologia , Adulto JovemRESUMO
Chronic pancreatitis (CP) is an etiologically and genetically heterogeneous inflammatory syndrome characterised by progressive damage to the exocrine and endocrine components of the pancreas [ 1]. The multigenic paradigm of CP has sparked research in recent years [ 2]. We aimed to expand the current knowledge of genetic susceptibility of pancreatitis in patients of Indian origin. By employing whole-exome sequencing in an Indian hospital cohort, we dissect the genetic landscape associated with CP or recurrent acute pancreatitis (RAP). Notably, all patients had at least one genetic variant identified in a pancreatitis-risk gene, and most had a co-occurrence of a second variant in an additional risk gene. Based on the presence of both acinar and ductal gene variants in individual patients, we propose a two-hit hypothesis where variants in proteins expressed in both acinar and ductal cells are critical for RAP/CP development.
RESUMO
Twin and genomic studies indicate that genes play an important role in the development of cognitive ability. However, data limitations have made it difficult to pinpoint specific genes with a large impact. By examining the full gene sequences of >300 000 individuals, Chen et al. find eight such genes.
RESUMO
Despite advances in clinical genetic testing, including the introduction of exome sequencing (ES), more than 50% of individuals with a suspected Mendelian condition lack a precise molecular diagnosis. Clinical evaluation is increasingly undertaken by specialists outside of clinical genetics, often occurring in a tiered fashion and typically ending after ES. The current diagnostic rate reflects multiple factors, including technical limitations, incomplete understanding of variant pathogenicity, missing genotype-phenotype associations, complex gene-environment interactions, and reporting differences between clinical labs. Maintaining a clear understanding of the rapidly evolving landscape of diagnostic tests beyond ES, and their limitations, presents a challenge for non-genetics professionals. Newer tests, such as short-read genome or RNA sequencing, can be challenging to order, and emerging technologies, such as optical genome mapping and long-read DNA sequencing, are not available clinically. Furthermore, there is no clear guidance on the next best steps after inconclusive evaluation. Here, we review why a clinical genetic evaluation may be negative, discuss questions to be asked in this setting, and provide a framework for further investigation, including the advantages and disadvantages of new approaches that are nascent in the clinical sphere. We present a guide for the next best steps after inconclusive molecular testing based upon phenotype and prior evaluation, including when to consider referral to research consortia focused on elucidating the underlying cause of rare unsolved genetic disorders.
Assuntos
Exoma , Testes Genéticos , Humanos , Exoma/genética , Análise de Sequência de DNA , Fenótipo , Sequenciamento do Exoma , Doenças RarasRESUMO
Genome-wide association studies (GWASs) have established the contribution of common and low-frequency variants to metabolic blood measurements in the UK Biobank (UKB). To complement existing GWAS findings, we assessed the contribution of rare protein-coding variants in relation to 355 metabolic blood measurements-including 325 predominantly lipid-related nuclear magnetic resonance (NMR)-derived blood metabolite measurements (Nightingale Health Plc) and 30 clinical blood biomarkers-using 412,393 exome sequences from four genetically diverse ancestries in the UKB. Gene-level collapsing analyses were conducted to evaluate a diverse range of rare-variant architectures for the metabolic blood measurements. Altogether, we identified significant associations (p < 1 × 10-8) for 205 distinct genes that involved 1,968 significant relationships for the Nightingale blood metabolite measurements and 331 for the clinical blood biomarkers. These include associations for rare non-synonymous variants in PLIN1 and CREB3L3 with lipid metabolite measurements and SYT7 with creatinine, among others, which may not only provide insights into novel biology but also deepen our understanding of established disease mechanisms. Of the study-wide significant clinical biomarker associations, 40% were not previously detected on analyzing coding variants in a GWAS in the same cohort, reinforcing the importance of studying rare variation to fully understand the genetic architecture of metabolic blood measurements.
Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Bancos de Espécimes Biológicos , Biomarcadores , Lipídeos , Reino Unido , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Short-read genome sequencing (GS) holds the promise of becoming the primary diagnostic approach for the assessment of autism spectrum disorder (ASD) and fetal structural anomalies (FSAs). However, few studies have comprehensively evaluated its performance against current standard-of-care diagnostic tests: karyotype, chromosomal microarray (CMA), and exome sequencing (ES). To assess the clinical utility of GS, we compared its diagnostic yield against these three tests in 1,612 quartet families including an individual with ASD and in 295 prenatal families. Our GS analytic framework identified a diagnostic variant in 7.8% of ASD probands, almost 2-fold more than CMA (4.3%) and 3-fold more than ES (2.7%). However, when we systematically captured copy-number variants (CNVs) from the exome data, the diagnostic yield of ES (7.4%) was brought much closer to, but did not surpass, GS. Similarly, we estimated that GS could achieve an overall diagnostic yield of 46.1% in unselected FSAs, representing a 17.2% increased yield over karyotype, 14.1% over CMA, and 4.1% over ES with CNV calling or 36.1% increase without CNV discovery. Overall, GS provided an added diagnostic yield of 0.4% and 0.8% beyond the combination of all three standard-of-care tests in ASD and FSAs, respectively. This corresponded to nine GS unique diagnostic variants, including sequence variants in exons not captured by ES, structural variants (SVs) inaccessible to existing standard-of-care tests, and SVs where the resolution of GS changed variant classification. Overall, this large-scale evaluation demonstrated that GS significantly outperforms each individual standard-of-care test while also outperforming the combination of all three tests, thus warranting consideration as the first-tier diagnostic approach for the assessment of ASD and FSAs.
Assuntos
Transtorno do Espectro Autista , Feminino , Gravidez , Humanos , Transtorno do Espectro Autista/diagnóstico , Transtorno do Espectro Autista/genética , Primeiro Trimestre da Gravidez , Ultrassonografia Pré-Natal , Mapeamento Cromossômico , ExomaRESUMO
Allelic series are of candidate therapeutic interest because of the existence of a dose-response relationship between the functionality of a gene and the degree or severity of a phenotype. We define an allelic series as a collection of variants in which increasingly deleterious mutations lead to increasingly large phenotypic effects, and we have developed a gene-based rare-variant association test specifically targeted to identifying genes containing allelic series. Building on the well-known burden test and sequence kernel association test (SKAT), we specify a variety of association models covering different genetic architectures and integrate these into a Coding-Variant Allelic-Series Test (COAST). Through extensive simulations, we confirm that COAST maintains the type I error and improves the power when the pattern of coding-variant effect sizes increases monotonically with mutational severity. We applied COAST to identify allelic-series genes for four circulating-lipid traits and five cell-count traits among 145,735 subjects with available whole-exome sequencing data from the UK Biobank. Compared with optimal SKAT (SKAT-O), COAST identified 29% more Bonferroni-significant associations with circulating-lipid traits, on average, and 82% more with cell-count traits. All of the gene-trait associations identified by COAST have corroborating evidence either from rare-variant associations in the full cohort (Genebass, n = 400,000) or from common-variant associations in the GWAS Catalog. In addition to detecting many gene-trait associations present in Genebass by using only a fraction (36.9%) of the sample, COAST detects associations, such as that between ANGPTL4 and triglycerides, that are absent from Genebass but that have clear common-variant support.
Assuntos
Variação Genética , Lipídeos , Simulação por Computador , Estudos de Associação Genética , Fenótipo , Estudo de Associação Genômica AmplaRESUMO
The blood-brain barrier (BBB) is an essential gatekeeper for the central nervous system and incidence of neurodevelopmental disorders (NDDs) is higher in infants with a history of intracerebral hemorrhage (ICH). We discovered a rare disease trait in thirteen individuals, including four fetuses, from eight unrelated families associated with homozygous loss-of-function variant alleles of ESAM which encodes an endothelial cell adhesion molecule. The c.115del (p.Arg39Glyfs∗33) variant, identified in six individuals from four independent families of Southeastern Anatolia, severely impaired the in vitro tubulogenic process of endothelial colony-forming cells, recapitulating previous evidence in null mice, and caused lack of ESAM expression in the capillary endothelial cells of damaged brain. Affected individuals with bi-allelic ESAM variants showed profound global developmental delay/unspecified intellectual disability, epilepsy, absent or severely delayed speech, varying degrees of spasticity, ventriculomegaly, and ICH/cerebral calcifications, the latter being also observed in the fetuses. Phenotypic traits observed in individuals with bi-allelic ESAM variants overlap very closely with other known conditions characterized by endothelial dysfunction due to mutation of genes encoding tight junction molecules. Our findings emphasize the role of brain endothelial dysfunction in NDDs and contribute to the expansion of an emerging group of diseases that we propose to rename as "tightjunctionopathies."