RESUMEN
Recent advances in AI-based methods have revolutionized the field of structural biology. Concomitantly, high-throughput sequencing and functional genomics have generated genetic variants at an unprecedented scale. However, efficient tools and resources are needed to link disparate data types-to 'map' variants onto protein structures, to better understand how the variation causes disease, and thereby design therapeutics. Here we present the Genomics 2 Proteins portal ( https://g2p.broadinstitute.org/ ): a human proteome-wide resource that maps 20,076,998 genetic variants onto 42,413 protein sequences and 77,923 structures, with a comprehensive set of structural and functional features. Additionally, the Genomics 2 Proteins portal allows users to interactively upload protein residue-wise annotations (for example, variants and scores) as well as the protein structure beyond databases to establish the connection between genomics to proteins. The portal serves as an easy-to-use discovery tool for researchers and scientists to hypothesize the structure-function relationship between natural or synthetic variations and their molecular phenotypes.
Asunto(s)
Bases de Datos de Proteínas , Genómica , Humanos , Genómica/métodos , Proteínas/genética , Proteínas/química , Proteoma/genética , Conformación Proteica , Programas Informáticos , Pruebas Genéticas/métodos , Variación Genética , Secuencia de AminoácidosRESUMEN
The large-scale experimental measures of variant functional assays submitted to MaveDB have the potential to provide key information for resolving variants of uncertain significance, but the reporting of results relative to assayed sequence hinders their downstream utility. The Atlas of Variant Effects Alliance mapped multiplexed assays of variant effect data to human reference sequences, creating a robust set of machine-readable homology mappings. This method processed approximately 2.5 million protein and genomic variants in MaveDB, successfully mapping 98.61% of examined variants and disseminating data to resources such as the UCSC Genome Browser and Ensembl Variant Effect Predictor.
RESUMEN
Activating point mutations in the MET tyrosine kinase domain (TKD) are oncogenic in a subset of papillary renal cell carcinomas. Here, using comprehensive genomic profiling among >600,000 patients, we identify activating MET TKD point mutations as putative oncogenic driver across diverse cancers, with a frequency of â¼0.5%. The most common mutations in the MET TKD defined as oncogenic or likely oncogenic according to OncoKB resulted in amino acid substitutions at positions H1094, L1195, F1200, D1228, Y1230, M1250, and others. Preclinical modeling of these alterations confirmed their oncogenic potential and also demonstrated differential patterns of sensitivity to type I and type II MET inhibitors. Two patients with metastatic lung adenocarcinoma harboring MET TKD mutations (H1094Y, F1200I) and no other known oncogenic drivers achieved confirmed partial responses to a type I MET inhibitor. Activating MET TKD mutations occur in multiple malignancies and may confer clinical sensitivity to currently available MET inhibitors. Significance: The identification of targetable genomic subsets of cancer has revolutionized precision oncology and offers patients treatments with more selective and effective agents. Here, we demonstrate that activating, oncogenic MET tyrosine kinase domain mutations are found across a diversity of cancer types and are responsive to MET tyrosine kinase inhibitors.
Asunto(s)
Neoplasias Pulmonares , Mutación Puntual , Inhibidores de Proteínas Quinasas , Proteínas Proto-Oncogénicas c-met , Humanos , Proteínas Proto-Oncogénicas c-met/genética , Proteínas Proto-Oncogénicas c-met/antagonistas & inhibidores , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/tratamiento farmacológico , Neoplasias Pulmonares/patología , Inhibidores de Proteínas Quinasas/uso terapéutico , Inhibidores de Proteínas Quinasas/farmacología , Animales , Ratones , Línea Celular TumoralRESUMEN
Human genetic studies have revealed rare missense and protein-truncating variants in GRIN2A, encoding for the GluN2A subunit of the NMDA receptors, that confer significant risk for schizophrenia (SCZ). Mutations in GRIN2A are also associated with epilepsy and developmental delay/intellectual disability (DD/ID). However, it remains enigmatic how alterations to the same protein can result in diverse clinical phenotypes. Here, we performed functional characterization of human GluN1/GluN2A heteromeric NMDA receptors that contain SCZ-linked GluN2A variants, and compared them to NMDA receptors with GluN2A variants associated with epilepsy or DD/ID. Our findings demonstrate that SCZ-associated GRIN2A variants were predominantly loss-of-function (LoF), whereas epilepsy and DD/ID-associated variants resulted in both gain- and loss-of-function phenotypes. We additionally show that M653I and S809R, LoF GRIN2A variants associated with DD/ID, exert a dominant-negative effect when co-expressed with a wild-type GluN2A, whereas E58Ter and Y698C, SCZ-linked LoF variants, and A727T, an epilepsy-linked LoF variant, do not. These data offer a potential mechanism by which SCZ/epilepsy and DD/ID-linked variants can cause different effects on receptor function and therefore result in divergent pathological outcomes.
Asunto(s)
Epilepsia , Trastornos del Neurodesarrollo , Esquizofrenia , Humanos , Epilepsia/genética , Mutación , Trastornos del Neurodesarrollo/genética , Receptores de N-Metil-D-Aspartato/genética , Receptores de N-Metil-D-Aspartato/metabolismo , Esquizofrenia/genéticaRESUMEN
Recent advances in AI-based methods have revolutionized the field of structural biology. Concomitantly, high-throughput sequencing and functional genomics technologies have enabled the detection and generation of variants at an unprecedented scale. However, efficient tools and resources are needed to link these two disparate data types - to "map" variants onto protein structures, to better understand how the variation causes disease and thereby design therapeutics. Here we present the Genomics 2 Proteins Portal (G2P; g2p.broadinstitute.org/): a human proteome-wide resource that maps 19,996,443 genetic variants onto 42,413 protein sequences and 77,923 structures, with a comprehensive set of structural and functional features. Additionally, the G2P portal generalizes the capability of linking genomics to proteins beyond databases by allowing users to interactively upload protein residue-wise annotations (variants, scores, etc.) as well as the protein structure to establish the connection. The portal serves as an easy-to-use discovery tool for researchers and scientists to hypothesize the structure-function relationship between natural or synthetic variations and their molecular phenotype.
RESUMEN
BACKGROUND: The Fontan procedure is the final stage of a three-stage palliation process in patients born with a univentricular heart as part of hypoplastic left heart syndrome (HLHS) or other pathologies with a univentricular heart. As essential as this procedure has proven to be for such cases, the Fontan physiology diminishes cardiac output and expands systemic venous pressure, which then leads to venous congestion that can be complicated by protein-losing enteropathy (PLE). This retrospective study aimed to identify the predictors of such complications in all patients who underwent completion of the Fontan procedure at our center (Sheikh Khalifa Medical City/SKMC) in the past eight years. METHODS: This study examined the medical records of patients who underwent completion of Fontan repair at our center since the inauguration of the cardiac surgery program of SKMC in the United Arab Emirates (UAE) - 01 Jan 2012 to 31 Dec 2020. Exclusion criteria included the absence of any of the undermentioned data in patient files. Patients were divided into two groups: those who developed PLE and those who did not. For each group, the following data were collected: demographics data (current age and age at completion of Fontan), clinical and laboratory data (oxygen saturation, serum albumin), echocardiographic data (classification of original cardiac diagnosis, degree of atrio-ventricular valve regurgitation, ventricular functions), hemodynamic data (mean pressures of superior vena cava and pulmonary arteries before Fontan completion), operative data (type of initial palliation, type of Fontan, presence of fenestrations and its size) and the need for any cardiac intervention prior to Fontan completion, such as atrio-ventricular valve repair, peripheral pulmonary stenting and arch balloon dilatation. RESULTS: Of the 48 included patients,13 (25%) developed PLE. Multivariate regression analysis proved that the best predictors of PLE were superior vena cava mean pressure (P = 0.012) and the degree of atrio-ventricular valve regurgitation (P = 0.013). An oxygen saturation <83% prior to Fontan completion was 92% sensitive in predicting PLE after Fontan completion. CONCLUSION: This is a single-center study of the predictors of PLE after Fontan procedure and, as expected from similar studies, SVC pressure higher than 11 mmHg and moderate-to-severe atrio-ventricular valve regurgitation were predictors of Fontan failure. The higher prevalence of PLE in our cohort, as well as lower cut-offs of SVC pressure that can predict complications, may be related to the predominance of hypoplastic left heart in the operated patients, which has been the main referral center for cardiac surgeries in UAE in the last decade.
RESUMEN
Within recent years, there has been a growing number of genes associated with amyotrophic lateral sclerosis (ALS), resulting in an increasing number of novel variants, particularly missense variants, many of which are of unknown clinical significance. Here, we leverage the sequencing efforts of the ALS Knowledge Portal (3864 individuals with ALS and 7839 controls) and Project MinE ALS Sequencing Consortium (4366 individuals with ALS and 1832 controls) to perform proteomic and transcriptomic characterization of missense variants in 24 ALS-associated genes. The two sequencing datasets were interrogated for missense variants in the 24 genes, and variants were annotated with gnomAD minor allele frequencies, ClinVar pathogenicity classifications, protein sequence features including Uniprot functional site annotations, and PhosphoSitePlus post-translational modification site annotations, structural features from AlphaFold predicted monomeric 3D structures, and transcriptomic expression levels from Genotype-Tissue Expression. We then applied missense variant enrichment and gene-burden testing following binning of variation based on the selected proteomic and transcriptomic features to identify those most relevant to pathogenicity in ALS-associated genes. Using predicted human protein structures from AlphaFold, we determined that missense variants carried by individuals with ALS were significantly enriched in ß-sheets and α-helices, as well as in core, buried or moderately buried regions. At the same time, we identified that hydrophobic amino acid residues, compositionally biased protein regions and regions of interest are predominantly enriched in missense variants carried by individuals with ALS. Assessment of expression level based on transcriptomics also revealed enrichment of variants of high and medium expression across all tissues and within the brain. We further explored enriched features of interest using burden analyses and identified individual genes were indeed driving certain enrichment signals. A case study is presented for SOD1 to demonstrate proof-of-concept of how enriched features may aid in defining variant pathogenicity. Our results present proteomic and transcriptomic features that are important indicators of missense variant pathogenicity in ALS and are distinct from features associated with neurodevelopmental disorders.
Asunto(s)
Esclerosis Amiotrófica Lateral , Humanos , Esclerosis Amiotrófica Lateral/genética , Transcriptoma/genética , Proteómica , Mutación Missense/genética , Pruebas GenéticasRESUMEN
Neurodevelopmental disorders (NDDs), including severe paediatric epilepsy, autism and intellectual disabilities are heterogeneous conditions in which clinical genetic testing can often identify a pathogenic variant. For many of them, genetic therapies will be tested in this or the coming years in clinical trials. In contrast to first-generation symptomatic treatments, the new disease-modifying precision medicines require a genetic test-informed diagnosis before a patient can be enrolled in a clinical trial. However, even in 2022, most identified genetic variants in NDD genes are 'variants of uncertain significance'. To safely enrol patients in precision medicine clinical trials, it is important to increase our knowledge about which regions in NDD-associated proteins can 'tolerate' missense variants and which ones are 'essential' and will cause a NDD when mutated. In addition, knowledge about functionally indispensable regions in the 3D structure context of proteins can also provide insights into the molecular mechanisms of disease variants. We developed a novel consensus approach that overlays evolutionary, and population based genomic scores to identify 3D essential sites (Essential3D) on protein structures. After extensive benchmarking of AlphaFold predicted and experimentally solved protein structures, we generated the currently largest expert curated protein structure set for 242 NDDs and identified 14 377 Essential3D sites across 189 gene disorders associated proteins. We demonstrate that the consensus annotation of Essential3D sites improves prioritization of disease mutations over single annotations. The identified Essential3D sites were enriched for functional features such as intermembrane regions or active sites and discovered key inter-molecule interactions in protein complexes that were otherwise not annotated. Using the currently largest autism, developmental disorders, and epilepsies exome sequencing studies including >360 000 NDD patients and population controls, we found that missense variants at Essential3D sites are 8-fold enriched in patients. In summary, we developed a comprehensive protein structure set for 242 NDDs and identified 14 377 Essential3D sites in these. All data are available at https://es-ndd.broadinstitute.org for interactive visual inspection to enhance variant interpretation and development of mechanistic hypotheses for 242 NDDs genes. The provided resources will enhance clinical variant interpretation and in silico drug target development for NDD-associated genes and encoded proteins.
Asunto(s)
Discapacidad Intelectual , Trastornos del Neurodesarrollo , Humanos , Niño , Trastornos del Neurodesarrollo/genética , Pruebas Genéticas , Mutación/genética , Discapacidad Intelectual/genética , Mutación MissenseRESUMEN
PPM1D encodes a serine/threonine phosphatase that regulates numerous pathways including the DNA damage response and p53. Activating mutations and amplification of PPM1D are found across numerous cancer types. GSK2830371 is a potent and selective allosteric inhibitor of PPM1D, but its mechanism of binding and inhibition of catalytic activity are unknown. Here we use computational, biochemical and functional genetic studies to elucidate the molecular basis of GSK2830371 activity. These data confirm that GSK2830371 binds an allosteric site of PPM1D with high affinity. By further incorporating data from hydrogen deuterium exchange mass spectrometry and sedimentation velocity analytical ultracentrifugation, we demonstrate that PPM1D exists in an equilibrium between two conformations that are defined by the movement of the flap domain, which is required for substrate recognition. A hinge region was identified that is critical for switching between the two conformations and was directly implicated in the high-affinity binding of GSK2830371 to PPM1D. We propose that the two conformations represent active and inactive forms of the protein reflected by the position of the flap, and that binding of GSK2830371 shifts the equilibrium to the inactive form. Finally, we found that C-terminal truncating mutations proximal to residue 400 result in destabilization of the protein via loss of a stabilizing N- and C-terminal interaction, consistent with the observation from human genetic data that nearly all PPM1D mutations in cancer are truncating and occur distal to residue 400. Taken together, our findings elucidate the mechanism by which binding of a small molecule to an allosteric site of PPM1D inhibits its activity and provides insights into the biology of PPM1D.
Asunto(s)
Neoplasias , Proteína Fosfatasa 2C , Sitio Alostérico , Aminopiridinas/farmacología , Dipéptidos/farmacología , Humanos , Mutación , Neoplasias/tratamiento farmacológico , Neoplasias/enzimología , Neoplasias/genética , Conformación Proteica , Proteína Fosfatasa 2C/antagonistas & inhibidores , Proteína Fosfatasa 2C/química , Proteína Fosfatasa 2C/genética , Proteína Fosfatasa 2C/metabolismo , Serina/genética , Serina/metabolismo , Relación Estructura-ActividadRESUMEN
The contact topology of a protein determines important aspects of the folding process. The topological measure of contact order has been shown to be predictive of the rate of folding. Circuit topology is emerging as another fundamental descriptor of biomolecular structure, with predicted effects on the folding rate. We analyze the residue-based circuit topological environments of 21 K mutations labeled as pathogenic or benign. Multiple statistical lines of reasoning support the conclusion that the number of contacts in two specific circuit topological arrangements, namely inverse parallel and cross relations, with contacts involving the mutated residue have discriminatory value in determining the pathogenicity of human variants. We investigate how results vary with residue type and according to whether the gene is essential. We further explore the relationship to a number of structural features and find that circuit topology provides nonredundant information on protein structures and pathogenicity of mutations. Results may have implications for the polymer physics of protein folding and suggest that "local" topological information, including residue-based circuit topology and residue contact order, could be useful in improving state-of-the-art machine learning algorithms for pathogenicity prediction.
Asunto(s)
Mutación Missense , Pliegue de Proteína , Algoritmos , Humanos , Proteínas/química , VirulenciaRESUMEN
All proteomes contain both proteins and polypeptide segments that don't form a defined three-dimensional structure yet are biologically active-called intrinsically disordered proteins and regions (IDPs and IDRs). Most of these IDPs/IDRs lack useful functional annotation limiting our understanding of their importance for organism fitness. Here we characterized IDRs using protein sequence annotations of functional sites and regions available in the UniProt knowledgebase ("UniProt features": active site, ligand-binding pocket, regions mediating protein-protein interactions, etc.). By measuring the statistical enrichment of twenty-five UniProt features in 981 IDRs of 561 human proteins, we identified eight features that are commonly located in IDRs. We then collected the genetic variant data from the general population and patient-based databases and evaluated the prevalence of population and pathogenic variations in IDPs/IDRs. We observed that some IDRs tolerate 2 to 12-times more single amino acid-substituting missense mutations than synonymous changes in the general population. However, we also found that 37% of all germline pathogenic mutations are located in disordered regions of 96 proteins. Based on the observed-to-expected frequency of mutations, we categorized 34 IDRs in 20 proteins (DDX3X, KIT, RB1, etc.) as intolerant to mutation. Finally, using statistical analysis and a machine learning approach, we demonstrate that mutation-intolerant IDRs carry a distinct signature of functional features. Our study presents a novel approach to assign functional importance to IDRs by leveraging the wealth of available genetic data, which will aid in a deeper understating of the role of IDRs in biological processes and disease mechanisms.
Asunto(s)
Proteínas Intrínsecamente Desordenadas , Secuencia de Aminoácidos , Variación Genética/genética , Humanos , Proteínas Intrínsecamente Desordenadas/química , Conformación Proteica , Proteoma/genéticaRESUMEN
PURPOSE: Pathogenic variants in GABRB3 have been associated with a spectrum of phenotypes from severe developmental disorders and epileptic encephalopathies to milder epilepsy syndromes and mild intellectual disability (ID). In this study, we analyzed a large cohort of individuals with GABRB3 variants to deepen the phenotypic understanding and investigate genotype-phenotype correlations. METHODS: Through an international collaboration, we analyzed electro-clinical data of unpublished individuals with variants in GABRB3, and we reviewed previously published cases. All missense variants were mapped onto the 3-dimensional structure of the GABRB3 subunit, and clinical phenotypes associated with the different key structural domains were investigated. RESULTS: We characterized 71 individuals with GABRB3 variants, including 22 novel subjects, expressing a wide spectrum of phenotypes. Interestingly, phenotypes correlated with structural locations of the variants. Generalized epilepsy, with a median age at onset of 12 months, and mild-to-moderate ID were associated with variants in the extracellular domain. Focal epilepsy with earlier onset (median: age 4 months) and severe ID were associated with variants in both the pore-lining helical transmembrane domain and the extracellular domain. CONCLUSION: These genotype-phenotype correlations will aid the genetic counseling and treatment of individuals affected by GABRB3-related disorders. Future studies may reveal whether functional differences underlie the phenotypic differences.
Asunto(s)
Epilepsia , Discapacidad Intelectual , Epilepsia/genética , Estudios de Asociación Genética , Humanos , Discapacidad Intelectual/genética , Mutación , Fenotipo , Receptores de GABA-A/genéticaRESUMEN
PRICKLE2 encodes a member of a highly conserved family of proteins that are involved in the non-canonical Wnt and planar cell polarity signaling pathway. Prickle2 localizes to the post-synaptic density, and interacts with post-synaptic density protein 95 and the NMDA receptor. Loss-of-function variants in prickle2 orthologs cause seizures in flies and mice but evidence for the role of PRICKLE2 in human disease is conflicting. Our goal is to provide further evidence for the role of this gene in humans and define the phenotypic spectrum of PRICKLE2-related disorders. We report a cohort of six subjects from four unrelated families with heterozygous rare PRICKLE2 variants (NM_198859.4). Subjects were identified through an international collaboration. Detailed phenotypic and genetic assessment of the subjects were carried out and in addition, we assessed the variant pathogenicity using bioinformatic approaches. We identified two missense variants (c.122 C > T; p.(Pro41Leu), c.680 C > G; p.(Thr227Arg)), one nonsense variant (c.214 C > T; p.(Arg72*) and one frameshift variant (c.1286_1287delGT; p.(Ser429Thrfs*56)). While the p.(Ser429Thrfs*56) variant segregated with disease in a family with three affected females, the three remaining variants occurred de novo. Subjects shared a mild phenotype characterized by global developmental delay, behavioral difficulties ± epilepsy, autistic features, and attention deficit hyperactive disorder. Computational analysis of the missense variants suggest that the altered amino acid residues are likely to be located in protein regions important for function. This paper demonstrates that PRICKLE2 is involved in human neuronal development and that pathogenic variants in PRICKLE2 cause neurodevelopmental delay, behavioral difficulties and epilepsy in humans.
Asunto(s)
Discapacidades del Desarrollo/genética , Proteínas con Dominio LIM/genética , Proteínas de la Membrana/genética , Adolescente , Adulto , Anciano , Niño , Codón sin Sentido , Discapacidades del Desarrollo/patología , Femenino , Mutación del Sistema de Lectura , Humanos , Proteínas con Dominio LIM/química , Masculino , Proteínas de la Membrana/química , Mutación Missense , Fenotipo , Dominios ProteicosRESUMEN
Background: Oculocutaneous albinism (OCA) is a Mendelian disorder characterized by hypopigmentation of the skin, hair, and eyes, hypoplastic fovea, and low vision, known to be caused by mutations in the Tyrosinase (TYR) gene. Among the known TYR variants, some reduce but do not completely eliminate tyrosinase activity, allowing residual production of melanin and resulting in a contradictory assignment as either pathogenic or benign, preventing a precise clinical diagnostic.Materials and Methods: In the present work, we performed Whole Exome Sequencing and subsequent Sanger sequencing in a young male clinically diagnosed with OCA.Results: Whole-exome sequencing analysis revealed the identification of two variants in trans in TYR. The first, corresponds to a known pathogenic variant G47D, while the second S192Y, was considered a polymorphism due to its relatively high frequency in the European population.Conclusion: The lack of other pathogenic variants in TYR, the reported reduced enzymatic activity (ca. 40% respect to wt) for S192Y, together with the structural in-silico analysis strongly suggest that both reported variants are jointly disease-causing and that S192Y should be considered as likely pathogenic, especially when it is found in trans with a null variant.
Asunto(s)
Albinismo Oculocutáneo/genética , Monofenol Monooxigenasa/genética , Mutación Missense/genética , Polimorfismo de Nucleótido Simple/genética , Adolescente , Albinismo Oculocutáneo/diagnóstico , Secuencia de Aminoácidos , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad , Genotipo , Humanos , Masculino , Datos de Secuencia Molecular , Linaje , Secuenciación del ExomaRESUMEN
Genetic variation of the 16p11.2 deletion locus containing the KCTD13 gene and of CUL3 is linked with autism. This genetic connection suggested that substrates of a CUL3-KCTD13 ubiquitin ligase may be involved in disease pathogenesis. Comparison of Kctd13 mutant (Kctd13 -/- ) and wild-type neuronal ubiquitylomes identified adenylosuccinate synthetase (ADSS), an enzyme that catalyzes the first step in adenosine monophosphate (AMP) synthesis, as a KCTD13 ligase substrate. In Kctd13 -/- neurons, there were increased levels of succinyl-adenosine (S-Ado), a metabolite downstream of ADSS. Notably, S-Ado levels are elevated in adenylosuccinate lyase deficiency, a metabolic disorder with autism and epilepsy phenotypes. The increased S-Ado levels in Kctd13 -/- neurons were decreased by treatment with an ADSS inhibitor. Lastly, functional analysis of human KCTD13 variants suggests that KCTD13 variation may alter ubiquitination of ADSS. These data suggest that succinyl-AMP metabolites accumulate in Kctd13 -/- neurons, and this observation may have implications for our understanding of 16p11.2 deletion syndrome.
RESUMEN
OBJECTIVE: We aimed to characterize the phenotypic spectrum and functional consequences associated with variants in the gene GABRB2, coding for the γ-aminobutyric acid type A (GABAA ) receptor subunit ß2. METHODS: We recruited and systematically evaluated 25 individuals with variants in GABRB2, 17 of whom are newly described and 8 previously reported with additional clinical data. Functional analysis was performed using a Xenopus laevis oocyte model system. RESULTS: Our cohort of 25 individuals from 22 families with variants in GABRB2 demonstrated a range of epilepsy phenotypes from genetic generalized epilepsy to developmental and epileptic encephalopathy. Fifty-eight percent of individuals had pharmacoresistant epilepsy; response to medications targeting the GABAergic pathway was inconsistent. Developmental disability (present in 84%) ranged from mild intellectual disability to severe global disability; movement disorders (present in 44%) included choreoathetosis, dystonia, and ataxia. Disease-associated variants cluster in the extracellular N-terminus and transmembrane domains 1-3, with more severe phenotypes seen in association with variants in transmembrane domains 1 and 2 and the allosteric binding site between transmembrane domains 2 and 3. Functional analysis of 4 variants in transmembrane domains 1 or 2 (p.Ile246Thr, p.Pro252Leu, p.Ile288Ser, p.Val282Ala) revealed strongly reduced amplitudes of GABA-evoked anionic currents. INTERPRETATION: GABRB2-related epilepsy ranges broadly in severity from genetic generalized epilepsy to developmental and epileptic encephalopathies. Developmental disability and movement disorder are key features. The phenotypic spectrum is comparable to other GABAA receptor-encoding genes. Phenotypic severity varies by protein domain. Experimental evidence supports loss of GABAergic inhibition as the mechanism underlying GABRB2-associated neurodevelopmental disorders. ANN NEUROL 2021;89:573-586.
Asunto(s)
Epilepsia/fisiopatología , Trastornos del Movimiento/fisiopatología , Trastornos del Neurodesarrollo/fisiopatología , Receptores de GABA-A/genética , Adolescente , Adulto , Animales , Ataxia/genética , Ataxia/fisiopatología , Atetosis/genética , Atetosis/fisiopatología , Niño , Preescolar , Corea/genética , Corea/fisiopatología , Estudios de Cohortes , Discapacidades del Desarrollo/genética , Discapacidades del Desarrollo/fisiopatología , Epilepsia Refractaria/genética , Epilepsia Refractaria/fisiopatología , Distonía/genética , Distonía/fisiopatología , Epilepsia/genética , Femenino , Genotipo , Humanos , Discapacidad Intelectual/genética , Discapacidad Intelectual/fisiopatología , Masculino , Persona de Mediana Edad , Trastornos del Movimiento/genética , Mutación Missense , Trastornos del Neurodesarrollo/genética , Oocitos , Técnicas de Placa-Clamp , Fenotipo , Dominios Proteicos/genética , Xenopus laevis , Adulto JovenRESUMEN
Advances in gene discovery have identified genetic variants in the solute carrier family 6 member 1 gene as a monogenic cause of neurodevelopmental disorders, including epilepsy with myoclonic atonic seizures, autism spectrum disorder and intellectual disability. The solute carrier family 6 member 1 gene encodes for the GABA transporter protein type 1, which is responsible for the reuptake of the neurotransmitter GABA, the primary inhibitory neurotransmitter in the central nervous system, from the extracellular space. GABAergic inhibition is essential to counterbalance neuronal excitation, and when significantly disrupted, it negatively impacts brain development leading to developmental differences and seizures. Aggregation of patient variants and observed clinical manifestations expand understanding of the genotypic and phenotypic spectrum of this disorder. Here, we assess genetic and phenotypic features in 116 individuals with solute carrier family 6 member 1 variants, the vast majority of which are likely to lead to GABA transporter protein type 1 loss-of-function. The knowledge acquired will guide therapeutic decisions and the development of targeted therapies that selectively enhance transporter function and may improve symptoms. We analysed the longitudinal and cell type-specific expression of solute carrier family 6 member 1 in humans and localization of patient and control missense variants in a novel GABA transporter protein type 1 protein structure model. In this update, we discuss the progress made in understanding and treating solute carrier family 6 member 1-related disorders thus far, through the concerted efforts of clinicians, scientists and family support groups.
RESUMEN
Interpretation of the colossal number of genetic variants identified from sequencing applications is one of the major bottlenecks in clinical genetics, with the inference of the effect of amino acid-substituting missense variations on protein structure and function being especially challenging. Here we characterize the three-dimensional (3D) amino acid positions affected in pathogenic and population variants from 1,330 disease-associated genes using over 14,000 experimentally solved human protein structures. By measuring the statistical burden of variations (i.e., point mutations) from all genes on 40 3D protein features, accounting for the structural, chemical, and functional context of the variations' positions, we identify features that are generally associated with pathogenic and population missense variants. We then perform the same amino acid-level analysis individually for 24 protein functional classes, which reveals unique characteristics of the positions of the altered amino acids: We observe up to 46% divergence of the class-specific features from the general characteristics obtained by the analysis on all genes, which is consistent with the structural diversity of essential regions across different protein classes. We demonstrate that the function-specific 3D features of the variants match the readouts of mutagenesis experiments for BRCA1 and PTEN, and positively correlate with an independent set of clinically interpreted pathogenic and benign missense variants. Finally, we make our results available through a web server to foster accessibility and downstream research. Our findings represent a crucial step toward translational genetics, from highlighting the impact of mutations on protein structure to rationalizing the variants' pathogenicity in terms of the perturbed molecular mechanisms.
Asunto(s)
Mutación Missense/genética , Proteínas/química , Proteínas/genética , Secuencia de Aminoácidos , Proteína BRCA1/química , Proteína BRCA1/genética , Biología Computacional/métodos , Humanos , Aprendizaje Automático , Modelos Moleculares , Mutación Missense/fisiología , Fosfohidrolasa PTEN/química , Fosfohidrolasa PTEN/genética , Conformación Proteica , Proteínas/fisiologíaRESUMEN
Malfunctions of voltage-gated sodium and calcium channels (encoded by SCNxA and CACNA1x family genes, respectively) have been associated with severe neurologic, psychiatric, cardiac, and other diseases. Altered channel activity is frequently grouped into gain or loss of ion channel function (GOF or LOF, respectively) that often corresponds not only to clinical disease manifestations but also to differences in drug response. Experimental studies of channel function are therefore important, but laborious and usually focus only on a few variants at a time. On the basis of known gene-disease mechanisms of 19 different diseases, we inferred LOF (n = 518) and GOF (n = 309) likely pathogenic variants from the disease phenotypes of variant carriers. By training a machine learning model on sequence- and structure-based features, we predicted LOF or GOF effects [area under the receiver operating characteristics curve (ROC) = 0.85] of likely pathogenic missense variants. Our LOF versus GOF prediction corresponded to molecular LOF versus GOF effects for 87 functionally tested variants in SCN1/2/8A and CACNA1I (ROC = 0.73) and was validated in exome-wide data from 21,703 cases and 128,957 controls. We showed respective regional clustering of inferred LOF and GOF nucleotide variants across the alignment of the entire gene family, suggesting shared pathomechanisms in the SCNxA/CACNA1x family genes.
Asunto(s)
Canales de Calcio , Preparaciones Farmacéuticas , Mutación Missense/genética , Fenotipo , SodioRESUMEN
Human genome sequencing efforts have greatly expanded, and a plethora of missense variants identified both in patients and in the general population is now publicly accessible. Interpretation of the molecular-level effect of missense variants, however, remains challenging and requires a particular investigation of amino acid substitutions in the context of protein structure and function. Answers to questions like 'Is a variant perturbing a site involved in key macromolecular interactions and/or cellular signaling?', or 'Is a variant changing an amino acid located at the protein core or part of a cluster of known pathogenic mutations in 3D?' are crucial. Motivated by these needs, we developed MISCAST (missense variant to protein structure analysis web suite; http://miscast.broadinstitute.org/). MISCAST is an interactive and user-friendly web server to visualize and analyze missense variants in protein sequence and structure space. Additionally, a comprehensive set of protein structural and functional features have been aggregated in MISCAST from multiple databases, and displayed on structures alongside the variants to provide users with the biological context of the variant location in an integrated platform. We further made the annotated data and protein structures readily downloadable from MISCAST to foster advanced offline analysis of missense variants by a wide biological community.