RESUMO
Speech and language disorders are known to have a substantial genetic contribution. Although frequently examined as components of other conditions, research on the genetic basis of linguistic differences as separate phenotypic subgroups has been limited so far. Here, we performed an in-depth characterization of speech and language disorders in 52 143 individuals, reconstructing clinical histories using a large-scale data-mining approach of the electronic medical records from an entire large paediatric healthcare network. The reported frequency of these disorders was the highest between 2 and 5 years old and spanned a spectrum of 26 broad speech and language diagnoses. We used natural language processing to assess the degree to which clinical diagnoses in full-text notes were reflected in ICD-10 diagnosis codes. We found that aphasia and speech apraxia could be retrieved easily through ICD-10 diagnosis codes, whereas stuttering as a speech phenotype was coded in only 12% of individuals through appropriate ICD-10 codes. We found significant comorbidity of speech and language disorders in neurodevelopmental conditions (30.31%) and, to a lesser degree, with epilepsies (6.07%) and movement disorders (2.05%). The most common genetic disorders retrievable in our analysis of electronic medical records were STXBP1 (n = 21), PTEN (n = 20) and CACNA1A (n = 18). When assessing associations of genetic diagnoses with specific linguistic phenotypes, we observed associations of STXBP1 and aphasia (P = 8.57 × 10-7, 95% confidence interval = 18.62-130.39) and MYO7A with speech and language development delay attributable to hearing loss (P = 1.24 × 10-5, 95% confidence interval = 17.46-infinity). Finally, in a sub-cohort of 726 individuals with whole-exome sequencing data, we identified an enrichment of rare variants in neuronal receptor pathways, in addition to associations of UQCRC1 and KIF17 with expressive aphasia, MROH8 and BCHE with poor speech, and USP37, SLC22A9 and UMODL1 with aphasia. In summary, our study outlines the landscape of paediatric speech and language disorders, confirming the phenotypic complexity of linguistic traits and novel genotype-phenotype associations. Subgroups of paediatric speech and language disorders differ significantly with respect to the composition of monogenic aetiologies.
RESUMO
PURPOSE: An early genetic diagnosis can guide the time-sensitive treatment of individuals with genetic epilepsies. However, most genetic diagnoses occur long after disease onset. We aimed to identify early clinical features suggestive of genetic diagnoses in individuals with epilepsy through large-scale analysis of full-text electronic medical records. METHODS: We extracted 89 million time-stamped standardized clinical annotations using Natural Language Processing from 4,572,783 clinical notes from 32,112 individuals with childhood epilepsy, including 1925 individuals with known or presumed genetic epilepsies. We applied these features to train random forest models to predict SCN1A-related disorders and any genetic diagnosis. RESULTS: We identified 47,774 age-dependent associations of clinical features with genetic etiologies a median of 3.6 years before molecular diagnosis. Across all 710 genetic etiologies identified in our cohort, neurodevelopmental differences between 6 to 9 months increased the likelihood of a later molecular diagnosis 5-fold (P < .0001, 95% CI = 3.55-7.42). A later diagnosis of SCN1A-related disorders (area under the curve [AUC] = 0.91) or an overall positive genetic diagnosis (AUC = 0.82) could be reliably predicted using random forest models. CONCLUSION: Clinical features predictive of genetic epilepsies precede molecular diagnoses by up to several years in conditions with known precision treatments. An earlier diagnosis facilitated by automated electronic medical records analysis has the potential for earlier targeted therapeutic strategies in the genetic epilepsies.
RESUMO
More than 100 genetic etiologies have been identified in developmental and epileptic encephalopathies (DEEs), but correlating genetic findings with clinical features at scale has remained a hurdle because of a lack of frameworks for analyzing heterogenous clinical data. Here, we analyzed 31,742 Human Phenotype Ontology (HPO) terms in 846 individuals with existing whole-exome trio data and assessed associated clinical features and phenotypic relatedness by using HPO-based semantic similarity analysis for individuals with de novo variants in the same gene. Gene-specific phenotypic signatures included associations of SCN1A with "complex febrile seizures" (HP: 0011172; p = 2.1 × 10-5) and "focal clonic seizures" (HP: 0002266; p = 8.9 × 10-6), STXBP1 with "absent speech" (HP: 0001344; p = 1.3 × 10-11), and SLC6A1 with "EEG with generalized slow activity" (HP: 0010845; p = 0.018). Of 41 genes with de novo variants in two or more individuals, 11 genes showed significant phenotypic similarity, including SCN1A (n = 16, p < 0.0001), STXBP1 (n = 14, p = 0.0021), and KCNB1 (n = 6, p = 0.011). Including genetic and phenotypic data of control subjects increased phenotypic similarity for all genetic etiologies, whereas the probability of observing de novo variants decreased, emphasizing the conceptual differences between semantic similarity analysis and approaches based on the expected number of de novo events. We demonstrate that HPO-based phenotype analysis captures unique profiles for distinct genetic etiologies, reflecting the breadth of the phenotypic spectrum in genetic epilepsies. Semantic similarity can be used to generate statistical evidence for disease causation analogous to the traditional approach of primarily defining disease entities through similar clinical features.
Assuntos
Proteínas da Membrana Plasmática de Transporte de GABA/genética , Proteínas Munc18/genética , Canal de Sódio Disparado por Voltagem NAV1.1/genética , Convulsões/genética , Espasmos Infantis/genética , Distúrbios da Fala/genética , Pré-Escolar , Estudos de Coortes , Feminino , Expressão Gênica , Ontologia Genética , Humanos , Masculino , Mutação , Fenótipo , Convulsões/classificação , Convulsões/diagnóstico , Convulsões/fisiopatologia , Semântica , Canais de Potássio Shab/genética , Espasmos Infantis/classificação , Espasmos Infantis/diagnóstico , Espasmos Infantis/fisiopatologia , Distúrbios da Fala/classificação , Distúrbios da Fala/diagnóstico , Distúrbios da Fala/fisiopatologia , Terminologia como Assunto , Sequenciamento do ExomaRESUMO
Disease-causing variants in STXBP1 are among the most common genetic causes of neurodevelopmental disorders. However, the phenotypic spectrum in STXBP1-related disorders is wide and clear correlations between variant type and clinical features have not been observed so far. Here, we harmonized clinical data across 534 individuals with STXBP1-related disorders and analysed 19â973 derived phenotypic terms, including phenotypes of 253 individuals previously unreported in the scientific literature. The overall phenotypic landscape in STXBP1-related disorders is characterized by neurodevelopmental abnormalities in 95% and seizures in 89% of individuals, including focal-onset seizures as the most common seizure type (47%). More than 88% of individuals with STXBP1-related disorders have seizure onset in the first year of life, including neonatal seizure onset in 47%. Individuals with protein-truncating variants and deletions in STXBP1 (n = 261) were almost twice as likely to present with West syndrome and were more phenotypically similar than expected by chance. Five genetic hotspots with recurrent variants were identified in more than 10 individuals, including p.Arg406Cys/His (n = 40), p.Arg292Cys/His/Leu/Pro (n = 30), p.Arg551Cys/Gly/His/Leu (n = 24), p.Pro139Leu (n = 12), and p.Arg190Trp (n = 11). None of the recurrent variants were significantly associated with distinct electroclinical syndromes, single phenotypic features, or showed overall clinical similarity, indicating that the baseline variability in STXBP1-related disorders is too high for discrete phenotypic subgroups to emerge. We then reconstructed the seizure history in 62 individuals with STXBP1-related disorders in detail, retrospectively assigning seizure type and seizure frequency monthly across 4433 time intervals, and retrieved 251 anti-seizure medication prescriptions from the electronic medical records. We demonstrate a dynamic pattern of seizure control and complex interplay with response to specific medications particularly in the first year of life when seizures in STXBP1-related disorders are the most prominent. Adrenocorticotropic hormone and phenobarbital were more likely to initially reduce seizure frequency in infantile spasms and focal seizures compared to other treatment options, while the ketogenic diet was most effective in maintaining seizure freedom. In summary, we demonstrate how the multidimensional spectrum of phenotypic features in STXBP1-related disorders can be assessed using a computational phenotype framework to facilitate the development of future precision-medicine approaches.
Assuntos
Epilepsia , Espasmos Infantis , Eletroencefalografia , Epilepsia/genética , Humanos , Lactente , Proteínas Munc18/genética , Estudos Retrospectivos , Convulsões/genética , Espasmos Infantis/tratamento farmacológico , Espasmos Infantis/genéticaRESUMO
The Human Phenotype Ontology (HPO, https://hpo.jax.org) was launched in 2008 to provide a comprehensive logical standard to describe and computationally analyze phenotypic abnormalities found in human disease. The HPO is now a worldwide standard for phenotype exchange. The HPO has grown steadily since its inception due to considerable contributions from clinical experts and researchers from a diverse range of disciplines. Here, we present recent major extensions of the HPO for neurology, nephrology, immunology, pulmonology, newborn screening, and other areas. For example, the seizure subontology now reflects the International League Against Epilepsy (ILAE) guidelines and these enhancements have already shown clinical validity. We present new efforts to harmonize computational definitions of phenotypic abnormalities across the HPO and multiple phenotype ontologies used for animal models of disease. These efforts will benefit software such as Exomiser by improving the accuracy and scope of cross-species phenotype matching. The computational modeling strategy used by the HPO to define disease entities and phenotypic features and distinguish between them is explained in detail.We also report on recent efforts to translate the HPO into indigenous languages. Finally, we summarize recent advances in the use of HPO in electronic health record systems.
Assuntos
Ontologias Biológicas , Biologia Computacional/métodos , Bases de Dados Factuais , Doença/genética , Genoma , Fenótipo , Software , Animais , Modelos Animais de Doenças , Genótipo , Humanos , Recém-Nascido , Cooperação Internacional , Internet , Triagem Neonatal/métodos , Farmacogenética/métodos , Terminologia como AssuntoRESUMO
Making a specific diagnosis in neurodevelopmental disorders is traditionally based on recognizing clinical features of a distinct syndrome, which guides testing of its possible genetic etiologies. Scalable frameworks for genomic diagnostics, however, have struggled to integrate meaningful measurements of clinical phenotypic features. While standardization has enabled generation and interpretation of genomic data for clinical diagnostics at unprecedented scale, making the equivalent breakthrough for clinical data has proven challenging. However, increasingly clinical features are being recorded using controlled dictionaries with machine readable formats such as the Human Phenotype Ontology (HPO), which greatly facilitates their use in the diagnostic space. Improving the tractability of large-scale clinical information will present new opportunities to inform genomic research and diagnostics from a clinical perspective. Here, we describe novel approaches for computational phenotyping to harmonize clinical features, improve data translation through revising domain-specific dictionaries, quantify phenotypic features, and determine clinical relatedness. We demonstrate how these concepts can be applied to longitudinal phenotypic information, which represents a critical element of developmental disorders and pediatric conditions. Finally, we expand our discussion to clinical data derived from electronic medical records, a largely untapped resource of deep clinical information with distinct strengths and weaknesses.
Assuntos
Registros Eletrônicos de Saúde , Genômica , Criança , Humanos , FenótipoRESUMO
The developmental and epileptic encephalopathies (DEEs) are heterogeneous disorders with a strong genetic contribution, but the underlying genetic etiology remains unknown in a significant proportion of individuals. To explore whether statistical support for genetic etiologies can be generated on the basis of phenotypic features, we analyzed whole-exome sequencing data and phenotypic similarities by using Human Phenotype Ontology (HPO) in 314 individuals with DEEs. We identified a de novo c.508C>T (p.Arg170Trp) variant in AP2M1 in two individuals with a phenotypic similarity that was higher than expected by chance (p = 0.003) and a phenotype related to epilepsy with myoclonic-atonic seizures. We subsequently found the same de novo variant in two individuals with neurodevelopmental disorders and generalized epilepsy in a cohort of 2,310 individuals who underwent diagnostic whole-exome sequencing. AP2M1 encodes the µ-subunit of the adaptor protein complex 2 (AP-2), which is involved in clathrin-mediated endocytosis (CME) and synaptic vesicle recycling. Modeling of protein dynamics indicated that the p.Arg170Trp variant impairs the conformational activation and thermodynamic entropy of the AP-2 complex. Functional complementation of both the µ-subunit carrying the p.Arg170Trp variant in human cells and astrocytes derived from AP-2µ conditional knockout mice revealed a significant impairment of CME of transferrin. In contrast, stability, expression levels, membrane recruitment, and localization were not impaired, suggesting a functional alteration of the AP-2 complex as the underlying disease mechanism. We establish a recurrent pathogenic variant in AP2M1 as a cause of DEEs with distinct phenotypic features, and we implicate dysfunction of the early steps of endocytosis as a disease mechanism in epilepsy.
Assuntos
Complexo 2 de Proteínas Adaptadoras/genética , Subunidades mu do Complexo de Proteínas Adaptadoras/genética , Encefalopatias/etiologia , Clatrina/metabolismo , Endocitose , Epilepsia/etiologia , Mutação de Sentido Incorreto , Transtornos do Neurodesenvolvimento/etiologia , Adolescente , Animais , Encefalopatias/patologia , Criança , Pré-Escolar , Clatrina/genética , Epilepsia/patologia , Feminino , Humanos , Lactente , Camundongos , Camundongos Knockout , Transtornos do Neurodesenvolvimento/patologia , Sequenciamento do ExomaRESUMO
PURPOSE: Pathogenic variants in SCN2A cause a wide range of neurodevelopmental phenotypes. Reports of genotype-phenotype correlations are often anecdotal, and the available phenotypic data have not been systematically analyzed. METHODS: We extracted phenotypic information from primary descriptions of SCN2A-related disorders in the literature between 2001 and 2019, which we coded in Human Phenotype Ontology (HPO) terms. With higher-level phenotype terms inferred by the HPO structure, we assessed the frequencies of clinical features and investigated the association of these features with variant classes and locations within the NaV1.2 protein. RESULTS: We identified 413 unrelated individuals and derived a total of 10,860 HPO terms with 562 unique terms. Protein-truncating variants were associated with autism and behavioral abnormalities. Missense variants were associated with neonatal onset, epileptic spasms, and seizures, regardless of type. Phenotypic similarity was identified in 8/62 recurrent SCN2A variants. Three independent principal components accounted for 33% of the phenotypic variance, allowing for separation of gain-of-function versus loss-of-function variants with good performance. CONCLUSION: Our work shows that translating clinical features into a computable format using a standardized language allows for quantitative phenotype analysis, mapping the phenotypic landscape of SCN2A-related disorders in unprecedented detail and revealing genotype-phenotype correlations along a multidimensional spectrum.
Assuntos
Canal de Sódio Disparado por Voltagem NAV1.2 , Espasmos Infantis , Estudos de Associação Genética , Humanos , Recém-Nascido , Canal de Sódio Disparado por Voltagem NAV1.2/genética , Fenótipo , ConvulsõesRESUMO
OBJECTIVE: The clinical features of epilepsy determine how it is defined, which in turn guides management. Therefore, consideration of the fundamental clinical entities that comprise an epilepsy is essential in the study of causes, trajectories, and treatment responses. The Human Phenotype Ontology (HPO) is used widely in clinical and research genetics for concise communication and modeling of clinical features, allowing extracted data to be harmonized using logical inference. We sought to redesign the HPO seizure subontology to improve its consistency with current epileptological concepts, supporting the use of large clinical data sets in high-throughput clinical and research genomics. METHODS: We created a new HPO seizure subontology based on the 2017 International League Against Epilepsy (ILAE) Operational Classification of Seizure Types, and integrated concepts of status epilepticus, febrile, reflex, and neonatal seizures at different levels of detail. We compared the HPO seizure subontology prior to, and following, our revision, according to the information that could be inferred about the seizures of 791 individuals from three independent cohorts: 2 previously published and 150 newly recruited individuals. Each cohort's data were provided in a different format and harmonized using the two versions of the HPO. RESULTS: The new seizure subontology increased the number of descriptive concepts for seizures 5-fold. The number of seizure descriptors that could be annotated to the cohort increased by 40% and the total amount of information about individuals' seizures increased by 38%. The most important qualitative difference was the relationship of focal to bilateral tonic-clonic seizure to generalized-onset and focal-onset seizures. SIGNIFICANCE: We have generated a detailed contemporary conceptual map for harmonization of clinical seizure data, implemented in the official 2020-12-07 HPO release and freely available at hpo.jax.org. This will help to overcome the phenotypic bottleneck in genomics, facilitate reuse of valuable data, and ultimately improve diagnostics and precision treatment of the epilepsies.
Assuntos
Modelos Neurológicos , Convulsões/fisiopatologia , Big Data , Estudos de Coortes , Interpretação Estatística de Dados , Epilepsias Parciais/classificação , Epilepsias Parciais/fisiopatologia , Epilepsia , Epilepsia Generalizada/classificação , Epilepsia Generalizada/fisiopatologia , Epilepsia Tônico-Clônica/classificação , Epilepsia Tônico-Clônica/fisiopatologia , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Fenótipo , Convulsões/classificação , Convulsões/genéticaRESUMO
PURPOSE: Childhood epilepsies have a strong genetic contribution, but the disease trajectory for many genetic etiologies remains unknown. Electronic medical record (EMR) data potentially allow for the analysis of longitudinal clinical information but this has not yet been explored. METHODS: We analyzed provider-entered neurological diagnoses made at 62,104 patient encounters from 658 individuals with known or presumed genetic epilepsies. To harmonize clinical terminology, we mapped clinical descriptors to Human Phenotype Ontology (HPO) terms and inferred higher-level phenotypic concepts. We then binned the resulting 286,085 HPO terms to 100 3-month time intervals and assessed gene-phenotype associations at each interval. RESULTS: We analyzed a median follow-up of 6.9 years per patient and a cumulative 3251 patient years. Correcting for multiple testing, we identified significant associations between "Status epilepticus" with SCN1A at 1.0 years, "Severe intellectual disability" with PURA at 9.75 years, and "Infantile spasms" and "Epileptic spasms" with STXBP1 at 0.5 years. The identified associations reflect known clinical features of these conditions, and manual chart review excluded provider bias. CONCLUSION: Some aspects of the longitudinal disease histories can be reconstructed through EMR data and reveal significant gene-phenotype associations, even within closely related conditions. Gene-specific EMR footprints may enable outcome studies and clinical decision support.
Assuntos
Epilepsia , Deficiência Intelectual , Espasmos Infantis , Criança , Registros Eletrônicos de Saúde , Epilepsia/diagnóstico , Epilepsia/genética , Humanos , FenótipoRESUMO
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
RESUMO
OBJECTIVE: To define the phenotypic spectrum of phosphatidylinositol glycan class A protein (PIGA)-related congenital disorder of glycosylation (PIGA-CDG) and evaluate genotype-phenotype correlations. METHODS: Our cohort encompasses 40 affected males with a pathogenic PIGA variant. We performed a detailed phenotypic assessment, and in addition, we reviewed the available clinical data of 36 previously published cases and assessed the variant pathogenicity using bioinformatical approaches. RESULTS: Most individuals had hypotonia, moderate to profound global developmental delay, and intractable seizures. We found that PIGA-CDG spans from a pure neurological phenotype at the mild end to a Fryns syndrome-like phenotype. We found a high frequency of cardiac anomalies including structural anomalies and cardiomyopathy, and a high frequency of spontaneous death, especially in childhood. Comparative bioinformatical analysis of common variants, found in the healthy population, and pathogenic variants, identified in affected individuals, revealed a profound physiochemical dissimilarity of the substituted amino acids in variant constrained regions of the protein. SIGNIFICANCE: Our comprehensive analysis of the largest cohort of published and novel PIGA patients broadens the spectrum of PIGA-CDG. Our genotype-phenotype correlation facilitates the estimation on pathogenicity of variants with unknown clinical significance and prognosis for individuals with pathogenic variants in PIGA.
Assuntos
Variação Genética/genética , Hérnia Diafragmática/diagnóstico por imagem , Hérnia Diafragmática/genética , Deformidades Congênitas dos Membros/diagnóstico por imagem , Deformidades Congênitas dos Membros/genética , Proteínas de Membrana/genética , Adulto , Sequência de Aminoácidos , Criança , Estudos de Coortes , Eletroencefalografia/métodos , Fácies , Hérnia Diafragmática/fisiopatologia , Humanos , Recém-Nascido , Deformidades Congênitas dos Membros/fisiopatologia , Imageamento por Ressonância Magnética/métodos , MasculinoRESUMO
The insights gained from big data and omics approaches have transformed the field of childhood genetic epilepsy. With an increasing number of individuals receiving genetic testing for seizures, we are provided with an opportunity to identify clinically relevant subgroups and extract meaningful observations from this large-scale clinical data. However, the volume of data from electronic medical records and omics (e.g., genomics, transcriptomics) is so vast that standardized methods, such as the Human Phenotype Ontology, are necessary for reliable and comprehensive characterization. Here, we explore the integration of clinical and omics data, highlighting how these approaches pave the way for discovery in childhood epilepsies.
Assuntos
Big Data , Epilepsia , Genômica , Humanos , Epilepsia/genética , Epilepsia/diagnóstico , Criança , Registros Eletrônicos de Saúde , Testes GenéticosRESUMO
Speech and language disorders are known to have a substantial genetic contribution. Although frequently examined as components of other conditions, research on the genetic basis of linguistic differences as separate phenotypic subgroups has been limited so far. Here, we performed an in-depth characterization of speech and language disorders in 52,143 individuals, reconstructing clinical histories using a large-scale data mining approach of the Electronic Medical Records (EMR) from an entire large paediatric healthcare network. The reported frequency of these disorders was the highest between 2 and 5 years old and spanned a spectrum of twenty-six broad speech and language diagnoses. We used Natural Language Processing to assess to which degree clinical diagnosis in full-text notes were reflected in ICD-10 diagnosis codes. We found that aphasia and speech apraxia could be easily retrieved through ICD-10 diagnosis codes, while stuttering as a speech phenotype was only coded in 12% of individuals through appropriate ICD-10 codes. We found significant comorbidity of speech and language disorders in neurodevelopmental conditions (30.31%) and to a lesser degree with epilepsies (6.07%) and movement disorders (2.05%). The most common genetic disorders retrievable in our EMR analysis were STXBP1 (n=21), PTEN (n=20), and CACNA1A (n=18). When assessing associations of genetic diagnoses with specific linguistic phenotypes, we observed associations of STXBP1 and aphasia (P=8.57 × 10-7, CI=18.62-130.39) and MYO7A with speech and language development delay due to hearing loss (P=1.24 × 10-5, CI=17.46-Inf). Finally, in a sub-cohort of 726 individuals with whole exome sequencing data, we identified an enrichment of rare variants in synaptic protein and neuronal receptor pathways and associations of UQCRC1 with expressive aphasia and WASHC4 with abnormality of speech or vocalization. In summary, our study outlines the landscape of paediatric speech and language disorders, confirming the phenotypic complexity of linguistic traits and novel genotype-phenotype associations. Subgroups of paediatric speech and language disorders differ significantly with respect to the composition of monogenic aetiologies.
RESUMO
EEG plays an integral part in the diagnosis and management of children with genetic epilepsies. Nevertheless, how quantitative EEG features differ between genetic epilepsies and neurological outcomes remains largely unknown. Here, we aimed to identify quantitative EEG biomarkers in children with epilepsy and a genetic diagnosis in STXBP1, SCN1A, or SYNGAP1, and to assess how quantitative EEG features associate with neurological outcomes in genetic epilepsies more broadly. We analyzed individuals with pathogenic variants in STXBP1 (95 EEGs, n=20), SCN1A (154 EEGs, n=68), and SYNGAP1 (46 EEGs, n=21) and a control cohort of individuals without epilepsy or known cerebral disease (847 EEGs, n=806). After removing artifacts and epochs with excess noise or altered state from EEGs, we extracted spectral features. We validated our preprocessing pipeline by comparing automatically-detected posterior dominant rhythm (PDR) to annotations from clinical EEG reports. Next, as a coarse measure of pathological slowing, we compared the alpha-delta bandpower ratio between controls and the different genetic epilepsies. We then trained random forest models to predict a diagnosis of STXBP1, SCN1A, and SYNGAP1. Finally, to understand how EEG features vary with neurological outcomes, we trained random forest models to predict seizure frequency and motor function. There was strong agreement between the automatically-calculated PDR and clinical EEG reports (R 2=0.75). Individuals with STXBP1-related epilepsy have a significantly lower alpha-delta ratio than controls (P<0.001) across all age groups. Additionally, individuals with a missense variant in STXBP1 have a significantly lower alpha-delta ratio than those with a protein-truncating variant in toddlers (P<0.001), children (P=0.02), and adults (P<0.001). Models accurately predicted a diagnosis of STXBP1 (AUC=0.91), SYNGAP1 (AUC=0.82), and SCN1A (AUC=0.86) against controls and from each other in a three-class model (accuracy=0.74). From these models, we isolated highly correlated biomarkers for these respective genetic disorders, including alpha-theta ratio in frontal, occipital, and parietal electrodes with STXBP1, SYNGAP1, and SCN1A, respectively. Models were unable to predict seizure frequency (AUC=0.53). Random forest models predicted motor scores significantly better than age-based null models (P<0.001), suggesting spectral features contain information pertinent to gross motor function. In summary, we demonstrate that STXBP1-, SYNGAP1-, and SCN1A-related epilepsies have distinct quantitative EEG signatures. Furthermore, EEG spectral features are predictive of some functional outcome measures in patients with genetic epilepsies. Large-scale retrospective quantitative analysis of clinical EEG has the potential to discover novel biomarkers and to quantify and track individuals' disease progression across development.
RESUMO
Copy number variants (CNV) are established risk factors for neurodevelopmental disorders with seizures or epilepsy. With the hypothesis that seizure disorders share genetic risk factors, we pooled CNV data from 10,590 individuals with seizure disorders, 16,109 individuals with clinically validated epilepsy, and 492,324 population controls and identified 25 genome-wide significant loci, 22 of which are novel for seizure disorders, such as deletions at 1p36.33, 1q44, 2p21-p16.3, 3q29, 8p23.3-p23.2, 9p24.3, 10q26.3, 15q11.2, 15q12-q13.1, 16p12.2, 17q21.31, duplications at 2q13, 9q34.3, 16p13.3, 17q12, 19p13.3, 20q13.33, and reciprocal CNVs at 16p11.2, and 22q11.21. Using genetic data from additional 248,751 individuals with 23 neuropsychiatric phenotypes, we explored the pleiotropy of these 25 loci. Finally, in a subset of individuals with epilepsy and detailed clinical data available, we performed phenome-wide association analyses between individual CNVs and clinical annotations categorized through the Human Phenotype Ontology (HPO). For six CNVs, we identified 19 significant associations with specific HPO terms and generated, for all CNVs, phenotype signatures across 17 clinical categories relevant for epileptologists. This is the most comprehensive investigation of CNVs in epilepsy and related seizure disorders, with potential implications for clinical practice.
Assuntos
Variações do Número de Cópias de DNA , Epilepsia , Humanos , Fenótipo , Epilepsia/genética , Estudo de Associação Genômica Ampla , ConvulsõesRESUMO
BACKGROUND: The epilepsies are highly heritable conditions that commonly follow complex inheritance. While monogenic causes have been identified in rare familial epilepsies, most familial epilepsies remain unsolved. We aimed to determine (1) whether common genetic variation contributes to familial epilepsy risk, and (2) whether that genetic risk is enriched in familial compared with non-familial (sporadic) epilepsies. METHODS: Using common variants derived from the largest epilepsy genome-wide association study, we calculated polygenic risk scores (PRS) for patients with familial epilepsy (n = 1,818 from 1,181 families), their unaffected relatives (n = 771), sporadic patients (n = 1,182), and population controls (n = 15,929). We also calculated separate PRS for genetic generalised epilepsy (GGE) and focal epilepsy. Statistical analyses used mixed-effects regression models to account for familial relatedness, sex, and ancestry. FINDINGS: Patients with familial epilepsies had higher epilepsy PRS compared to population controls (OR 1·20, padj = 5×10-9), sporadic patients (OR 1·11, padj = 0.008), and their own unaffected relatives (OR 1·12, padj = 0.01). The top 1% of the PRS distribution was enriched 3.8-fold for individuals with familial epilepsy when compared to the lowest decile (padj = 5×10-11). Familial PRS enrichment was consistent across epilepsy type; overall, polygenic risk was greatest for the GGE clinical group. There was no significant PRS difference in familial cases with established rare variant genetic etiologies compared to unsolved familial cases. INTERPRETATION: The aggregate effects of common genetic variants, measured as polygenic risk scores, play an important role in explaining why some families develop epilepsy, why specific family members are affected while their relatives are not, and why families manifest specific epilepsy types. Polygenic risk contributes to the complex inheritance of the epilepsies, including in individuals with a known genetic etiology. FUNDING: National Health and Medical Research Council of Australia, National Institutes of Health, American Academy of Neurology, Thomas B and Jeannette E Laws McCabe Fund, Mirowski Family Foundation.
Assuntos
Epilepsia Generalizada , Epilepsia , Síndromes Epilépticas , Epilepsia/genética , Epilepsia Generalizada/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Herança Multifatorial/genéticaRESUMO
BACKGROUND: The developmental and epileptic encephalopathies (DEEs) are the most severe group of epilepsies which co-present with developmental delay and intellectual disability (ID). DEEs usually occur in people without a family history of epilepsy and have emerged as primarily monogenic, with damaging rare mutations found in 50% of patients. Little is known about the genetic architecture of patients with DEEs in whom no pathogenic variant is identified. Polygenic risk scoring (PRS) is a method that measures a person's common genetic burden for a trait or condition. Here, we used PRS to test whether genetic burden for epilepsy is relevant in individuals with DEEs, and other forms of epilepsy with ID. METHODS: Genetic data on 2,759 cases with DEEs, or epilepsy with ID presumed to have a monogenic basis, and 447,760 population-matched controls were analysed. We compared PRS for 'all epilepsy', 'focal epilepsy', and 'genetic generalised epilepsy' (GGE) between cases and controls. We performed pairwise comparisons between cases stratified for identifiable rare deleterious genetic variants and controls. FINDINGS: Cases of presumed monogenic severe epilepsy had an increased PRS for 'all epilepsy' (p<0.0001), 'focal epilepsy' (p<0.0001), and 'GGE' (p=0.0002) relative to controls, which explain between 0.08% and 3.3% of phenotypic variance. PRS was increased in cases both with and without an identified deleterious variant of major effect, and there was no significant difference in PRS between the two groups. INTERPRETATION: We provide evidence that common genetic variation contributes to the aetiology of DEEs and other forms of epilepsy with ID, even when there is a known pathogenic variant of major effect. These results provide insight into the genetic underpinnings of the severe epilepsies and warrant a shift in our understanding of the aetiology of the DEEs as complex, rather than monogenic, disorders. FUNDING: Science foundation Ireland, Human Genome Research Institute; National Heart, Lung, and Blood Institute; German Research Foundation.
Assuntos
Epilepsia Generalizada , Deficiência Intelectual , Epilepsia Generalizada/diagnóstico , Epilepsia Generalizada/genética , Variação Genética , Humanos , Herança Multifatorial , Mutação , FenótipoRESUMO
While genetic studies of epilepsies can be performed in thousands of individuals, phenotyping remains a manual, non-scalable task. A particular challenge is capturing the evolution of complex phenotypes with age. Here, we present a novel approach, applying phenotypic similarity analysis to a total of 3251 patient-years of longitudinal electronic medical record data from a previously reported cohort of 658 individuals with genetic epilepsies. After mapping clinical data to the Human Phenotype Ontology, we determined the phenotypic similarity of individuals sharing each genetic etiology within each 3-month age interval from birth up to a maximum age of 25 years. 140 of 600 (23%) of all 27 genes and 3-month age intervals with sufficient data for calculation of phenotypic similarity were significantly higher than expect by chance. 11 of 27 genetic etiologies had significant overall phenotypic similarity trajectories. These do not simply reflect strong statistical associations with single phenotypic features but appear to emerge from complex clinical constellations of features that may not be strongly associated individually. As an attempt to reconstruct the cognitive framework of syndrome recognition in clinical practice, longitudinal phenotypic similarity analysis extends the traditional phenotyping approach by utilizing data from electronic medical records at a scale that is far beyond the capabilities of manual phenotyping. Delineation of how the phenotypic homogeneity of genetic epilepsies varies with age could improve the phenotypic classification of these disorders, the accuracy of prognostic counseling, and by providing historical control data, the design and interpretation of precision clinical trials in rare diseases.
Assuntos
Heterogeneidade Genética , Testes Genéticos/estatística & dados numéricos , Fenótipo , Espasmos Infantis/genética , Criança , Pré-Escolar , Feminino , Humanos , Lactente , Masculino , Locos de Características Quantitativas , Espasmos Infantis/diagnósticoRESUMO
Febrile infection-related epilepsy syndrome (FIRES) is a devastating epilepsy characterized by new-onset refractory status epilepticus with a prior febrile infection. We performed exome sequencing in 50 individuals with FIRES, including 27 patient-parent trios and 23 single probands, none of whom had pathogenic variants in established genes for epilepsies or neurodevelopmental disorders. We also performed HLA sequencing in 29 individuals with FIRES and 529 controls, which failed to identify prominent HLA alleles. The genetic architecture of FIRES is substantially different from other developmental and epileptic encephalopathies, and the underlying etiology remains elusive, requiring novel approaches to identify the underlying causative factors.