ABSTRACT
PURPOSE: Biallelic INPP4A variants have recently been associated with severe neurodevelopmental disease in single case reports. Here, we expand and elucidate the clinical-genetic spectrum and provide a pathomechanistic explanation for genotype-phenotype correlations. METHODS: Clinical and genomic investigations of 30 individuals were undertaken alongside molecular and in silico modelling and translation reinitiation studies. RESULTS: We characterize a clinically variable disorder with cardinal features including global developmental delay, severe-profound intellectual disability, microcephaly, limb weakness, cerebellar signs and short stature. A more severe presentation associated with biallelic INPP4A variants downstream of exon 4 has additional features of (ponto)cerebellar hypoplasia, reduced cerebral volume, peripheral spasticity, contractures, intractable seizures and cortical visual impairment. Our studies identify the likely pathomechanism of this genotype-phenotype correlation entailing translational reinitiation in exon 4 resulting in an N-terminal truncated INPP4A protein retaining partial functionality, associated with less severe disease. We also identified identical reinitiation site conservation in Inpp4a-/- mouse models displaying similar genotype-phenotype correlation. Additionally, we show fibroblasts from a single affected individual exhibit disrupted endocytic trafficking pathways, indicating the potential biological basis of the condition. CONCLUSION: Our studies comprehensively characterise INPP4A-related neurodevelopmental disorder and suggest genotype-specific clinical assessment guidelines. We propose the potential mechanistic basis of observed genotype-phenotype correlations entails exon 4 translation reinitiation.
ABSTRACT
Recent studies have identified both recessive and dominant forms of mitochondrial disease that result from ATAD3A variants. The recessive form includes subjects with biallelic deletions mediated by non-allelic homologous recombination. We report five unrelated neonates with a lethal metabolic disorder characterized by cardiomyopathy, corneal opacities, encephalopathy, hypotonia, and seizures in whom a monoallelic reciprocal duplication at the ATAD3 locus was identified. Analysis of the breakpoint junction fragment indicated that these 67 kb heterozygous duplications were likely mediated by non-allelic homologous recombination at regions of high sequence identity in ATAD3A exon 11 and ATAD3C exon 7. At the recombinant junction, the duplication allele produces a fusion gene derived from ATAD3A and ATAD3C, the protein product of which lacks key functional residues. Analysis of fibroblasts derived from two affected individuals shows that the fusion gene product is expressed and stable. These cells display perturbed cholesterol and mitochondrial DNA organization similar to that observed for individuals with severe ATAD3A deficiency. We hypothesize that the fusion protein acts through a dominant-negative mechanism to cause this fatal mitochondrial disorder. Our data delineate a molecular diagnosis for this disorder, extend the clinical spectrum associated with structural variation at the ATAD3 locus, and identify a third mutational mechanism for ATAD3 gene cluster variants. These results further affirm structural variant mutagenesis mechanisms in sporadic disease traits, emphasize the importance of copy number analysis in molecular genomic diagnosis, and highlight some of the challenges of detecting and interpreting clinically relevant rare gene rearrangements from next-generation sequencing data.
Subject(s)
ATPases Associated with Diverse Cellular Activities/genetics , Cholesterol/metabolism , Gene Duplication , Homologous Recombination , Membrane Proteins/genetics , Mitochondria/pathology , Mitochondrial Diseases/pathology , Mitochondrial Proteins/genetics , ATPases Associated with Diverse Cellular Activities/chemistry , Amino Acid Sequence , Brain Diseases/etiology , Brain Diseases/metabolism , Brain Diseases/pathology , Cardiomyopathies/etiology , Cardiomyopathies/metabolism , Cardiomyopathies/pathology , Corneal Opacity/etiology , Corneal Opacity/metabolism , Corneal Opacity/pathology , DNA Copy Number Variations , Female , Gene Rearrangement , Humans , Infant , Infant, Newborn , Male , Membrane Proteins/chemistry , Mitochondria/genetics , Mitochondria/metabolism , Mitochondrial Diseases/genetics , Mitochondrial Diseases/metabolism , Mitochondrial Proteins/chemistry , Muscle Hypotonia/etiology , Muscle Hypotonia/metabolism , Muscle Hypotonia/pathology , Mutation , Protein Conformation , Seizures/etiology , Seizures/metabolism , Seizures/pathology , Sequence HomologyABSTRACT
PURPOSE: We previously defined biallelic HYAL2 variants causing a novel disorder in 2 families, involving orofacial clefting, facial dysmorphism, congenital heart disease, and ocular abnormalities, with Hyal2 knockout mice displaying similar phenotypes. In this study, we better define the phenotype and pathologic disease mechanism. METHODS: Clinical and genomic investigations were undertaken alongside molecular studies, including immunoblotting and immunofluorescence analyses of variant/wild-type human HYAL2 expressed in mouse fibroblasts, and in silico modeling of putative pathogenic variants. RESULTS: Ten newly identified individuals with this condition were investigated, and they were associated with 9 novel pathogenic variants. Clinical studies defined genotype-phenotype correlations and confirmed a recognizable craniofacial phenotype in addition to myopia, cleft lip/palate, and congenital cardiac anomalies as the most consistent manifestations of the condition. In silico modeling of missense variants identified likely deleterious effects on protein folding. Consistent with this, functional studies indicated that these variants cause protein instability and a concomitant cell surface absence of HYAL2 protein. CONCLUSION: These studies confirm an association between HYAL2 alterations and syndromic cleft lip/palate, provide experimental evidence for the pathogenicity of missense alleles, enable further insights into the pathomolecular basis of the disease, and delineate the core and variable clinical outcomes of the condition.
Subject(s)
Cleft Lip , Cleft Palate , Alleles , Animals , Cell Adhesion Molecules/genetics , Cleft Lip/genetics , Cleft Palate/genetics , GPI-Linked Proteins/genetics , Genetic Association Studies , Humans , Hyaluronoglucosaminidase/genetics , Mice , PhenotypeABSTRACT
BACKGROUND: Pathogenicity predictors are integral to genomic variant interpretation but, despite their widespread usage, an independent validation of performance using a clinically relevant dataset has not been undertaken. METHODS: We derive two validation datasets: an 'open' dataset containing variants extracted from publicly available databases, similar to those commonly applied in previous benchmarking exercises, and a 'clinically representative' dataset containing variants identified through research/diagnostic exome and panel sequencing. Using these datasets, we evaluate the performance of three recent meta-predictors, REVEL, GAVIN and ClinPred, and compare their performance against two commonly used in silico tools, SIFT and PolyPhen-2. RESULTS: Although the newer meta-predictors outperform the older tools, the performance of all pathogenicity predictors is substantially lower in the clinically representative dataset. Using our clinically relevant dataset, REVEL performed best with an area under the receiver operating characteristic curve of 0.82. Using a concordance-based approach based on a consensus of multiple tools reduces the performance due to both discordance between tools and false concordance where tools make common misclassification. Analysis of tool feature usage may give an insight into the tool performance and misclassification. CONCLUSION: Our results support the adoption of meta-predictors over traditional in silico tools, but do not support a consensus-based approach as in current practice.
Subject(s)
Computational Biology/methods , Genetic Variation/genetics , Genomics/methods , Exome/genetics , Humans , ROC CurveABSTRACT
OBJECTIVE: Rare genetic disorders resulting in prenatal or neonatal death are genetically heterogeneous, but testing is often limited by the availability of fetal DNA, leaving couples without a potential prenatal test for future pregnancies. We describe our novel strategy of exome sequencing parental DNA samples to diagnose recessive monogenic disorders in an audit of the first 50 couples referred. METHOD: Exome sequencing was carried out in a consecutive series of 50 couples who had 1 or more pregnancies affected with a lethal or prenatal-onset disorder. In all cases, there was insufficient DNA for exome sequencing of the affected fetus. Heterozygous rare variants (MAF < 0.001) in the same gene in both parents were selected for analysis. Likely, disease-causing variants were tested in fetal DNA to confirm co-segregation. RESULTS: Parental exome analysis identified heterozygous pathogenic (or likely pathogenic) variants in 24 different genes in 26/50 couples (52%). Where 2 or more fetuses were affected, a genetic diagnosis was obtained in 18/29 cases (62%). In most cases, the clinical features were typical of the disorder, but in others, they result from a hypomorphic variant or represent the most severe form of a variable phenotypic spectrum. CONCLUSION: We conclude that exome sequencing of parental samples is a powerful strategy with high clinical utility for the genetic diagnosis of lethal or prenatal-onset recessive disorders. © 2017 The Authors Prenatal Diagnosis published by John Wiley & Sons Ltd.
Subject(s)
Congenital Abnormalities/genetics , Exome Sequencing , Genetic Diseases, Inborn/diagnosis , Parents , Prenatal Diagnosis/methods , Female , Genes, Recessive , Humans , Male , PregnancyABSTRACT
BACKGROUND: Genetic variants that severely alter protein products (e.g. nonsense, frameshift) are often associated with disease. For some genes, these predicted loss-of-function variants (pLoFs) are observed throughout the gene, whilst in others, they occur only at specific locations. We hypothesised that, for genes linked with monogenic diseases that display incomplete penetrance, pLoF variants present in apparently unaffected individuals may be limited to regions where pLoFs are tolerated. To test this, we investigated whether pLoF location could explain instances of incomplete penetrance of variants expected to be pathogenic for Mendelian conditions. METHODS: We used exome sequence data in 454,773 individuals in the UK Biobank (UKB) to investigate the locations of pLoFs in a population cohort. We counted numbers of unique pLoF, missense, and synonymous variants in UKB in each quintile of the coding sequence (CDS) of all protein-coding genes and clustered the variants using Gaussian mixture models. We limited the analyses to genes with ≥ 5 variants of each type (16,473 genes). We compared the locations of pLoFs in UKB with all theoretically possible pLoFs in a transcript, and pathogenic pLoFs from ClinVar, and performed simulations to estimate the false-positive rate of non-uniformly distributed variants. RESULTS: For most genes, all variant classes fell into clusters representing broadly uniform variant distributions, but genes in which haploinsufficiency causes developmental disorders were less likely to have uniform pLoF distribution than other genes (P < 2.2 × 10-6). We identified a number of genes, including ARID1B and GATA6, where pLoF variants in the first quarter of the CDS were rescued by the presence of an alternative translation start site and should not be reported as pathogenic. For other genes, such as ODC1, pLoFs were located approximately uniformly across the gene, but pathogenic pLoFs were clustered only at the end, consistent with a gain-of-function disease mechanism. CONCLUSIONS: Our results suggest the potential benefits of localised constraint metrics and that the location of pLoF variants should be considered when interpreting variants.
Subject(s)
Loss of Function Mutation , Penetrance , Humans , Genetic Diseases, Inborn/genetics , Genetic Predisposition to Disease , Exome , Cluster Analysis , Exome SequencingABSTRACT
BACKGROUND: The widespread clinical application of genome-wide sequencing has resulted in many new diagnoses for rare genetic conditions, but testing regularly identifies variants of uncertain significance (VUS). The remarkable rise in the amount of genomic data has been paralleled by a rise in the number of protein structures that are now publicly available, which may have clinical utility for the interpretation of missense and in-frame insertions or deletions. METHODS: Within a UK National Health Service genomic medicine diagnostic laboratory, we investigated the number of VUS over a 5-year period that were evaluated using protein structural analysis and how often this analysis aided variant classification. RESULTS: We found 99 novel missense and in-frame variants across 67 genes that were initially classified as VUS by our diagnostic laboratory using standard variant classification guidelines and for which further analysis of protein structure was requested. Evidence from protein structural analysis was used in the re-assessment of 64 variants, of which 47 were subsequently reclassified as pathogenic or likely pathogenic and 17 remained as VUS. We identified several case studies where protein structural analysis aided variant interpretation by predicting disease mechanisms that were consistent with the observed phenotypes, including loss-of-function through thermodynamic destabilisation or disruption of ligand binding, and gain-of-function through de-repression or escape from proteasomal degradation. CONCLUSIONS: We have shown that using in silico protein structural analysis can aid classification of VUS and give insights into the mechanisms of pathogenicity. Based on our experience, we propose a generic evidence-based workflow for incorporating protein structural information into diagnostic practice to facilitate variant classification.
Subject(s)
Genomics , State Medicine , Genetic Testing/methods , PhenotypeABSTRACT
Mitochondria membrane protein-associated neurodegeneration (MPAN) neurodegenerative disorder is typically associated with biallelic C19orf12 variants. Here we describe a new and review candidate previous monoallelic de novo C19orf12 variants to define loss of function mutations located in the putative non-membrane spanning C19orf12 isoform as the potential basis of monoallelic MPAN.
Subject(s)
Iron Metabolism Disorders/genetics , Membrane Proteins/genetics , Mitochondrial Membranes/metabolism , Mitochondrial Proteins/genetics , Neuroaxonal Dystrophies/genetics , Amish , Humans , Iron Metabolism Disorders/diagnosis , Iron Metabolism Disorders/pathology , Iron Metabolism Disorders/physiopathology , Loss of Function Mutation , Magnetic Resonance Imaging , Neuroaxonal Dystrophies/diagnosis , Neuroaxonal Dystrophies/pathology , Neuroaxonal Dystrophies/physiopathology , Pedigree , Protein IsoformsABSTRACT
BACKGROUND: ATPase family AAA-domain containing protein 3A (ATAD3A) is a nuclear-encoded mitochondrial membrane-anchored protein involved in diverse processes including mitochondrial dynamics, mitochondrial DNA organization, and cholesterol metabolism. Biallelic deletions (null), recessive missense variants (hypomorph), and heterozygous missense variants or duplications (antimorph) in ATAD3A lead to neurological syndromes in humans. METHODS: To expand the mutational spectrum of ATAD3A variants and to provide functional interpretation of missense alleles in trans to deletion alleles, we performed exome sequencing for identification of single nucleotide variants (SNVs) and copy number variants (CNVs) in ATAD3A in individuals with neurological and mitochondrial phenotypes. A Drosophila Atad3a Gal4 knockin-null allele was generated using CRISPR-Cas9 genome editing technology to aid the interpretation of variants. RESULTS: We report 13 individuals from 8 unrelated families with biallelic ATAD3A variants. The variants included four missense variants inherited in trans to loss-of-function alleles (p.(Leu77Val), p.(Phe50Leu), p.(Arg170Trp), p.(Gly236Val)), a homozygous missense variant p.(Arg327Pro), and a heterozygous non-frameshift indel p.(Lys568del). Affected individuals exhibited findings previously associated with ATAD3A pathogenic variation, including developmental delay, hypotonia, congenital cataracts, hypertrophic cardiomyopathy, and cerebellar atrophy. Drosophila studies indicated that Phe50Leu, Gly236Val, Arg327Pro, and Lys568del are severe loss-of-function alleles leading to early developmental lethality. Further, we showed that Phe50Leu, Gly236Val, and Arg327Pro cause neurogenesis defects. On the contrary, Leu77Val and Arg170Trp are partial loss-of-function alleles that cause progressive locomotion defects and whose expression leads to an increase in autophagy and mitophagy in adult muscles. CONCLUSION: Our findings expand the allelic spectrum of ATAD3A variants and exemplify the use of a functional assay in Drosophila to aid variant interpretation.
Subject(s)
ATPases Associated with Diverse Cellular Activities/genetics , Genetic Variation , Membrane Proteins/genetics , Mitochondria/genetics , Mitochondrial Proteins/genetics , Neurons/metabolism , Adolescent , Alleles , Amino Acid Sequence , Animals , Autophagy/genetics , Computer Simulation , Drosophila/ultrastructure , Female , Humans , Infant , Infant, Newborn , Locomotion , Male , Mitophagy/genetics , Mutation, Missense/genetics , Neurogenesis/genetics , Pedigree , Phenotype , Polymorphism, Single Nucleotide/genetics , Young AdultABSTRACT
Despite the rapid expansion in recent years of databases reporting either benign or pathogenic genetic variations, the interpretation of novel missense variants remains challenging, particularly for clinical or genetic testing laboratories where functional analysis is often unfeasible. Previous studies have shown that thermodynamic analysis of protein structure in silico can discriminate between groups of benign and pathogenic missense variants. However, although structures exist for many human diseaseâassociated proteins, such analysis remains largely unexploited in clinical laboratories. Here, we analyzed the predicted effect of 338 known missense variants on the structure of menin, the MEN1 gene product. Results provided strong discrimination between pathogenic and benign variants, with a threshold of >4 kcal/mol for the predicted change in stability, providing a strong indicator of pathogenicity. Subsequent analysis of seven novel missense variants identified during clinical testing of patients with MEN1 showed that all seven were predicted to destabilize menin by >4 kcal/mol. We conclude that structural analysis provides a useful tool in understanding the effect of missense variants in MEN1 and that integration of proteomic with genomic data could potentially contribute to the classification of novel variants in this disease.
ABSTRACT
INTRODUCTION: Spermatogenesis is a highly complex process involving several thousand genes, only a minority of which have been studied in infertile men. In a previous study, we identified a number of Copy Number Variants (CNVs) by high-resolution array-Comparative Genomic Hybridization (a-CGH) analysis of the X chromosome, including 16 patient-specific X chromosome-linked gains. Of these, five gains (DUP1A, DUP5, DUP20, DUP26 and DUP40) were selected for further analysis to evaluate their clinical significance. MATERIALS AND METHODS: The copy number state of the five selected loci was analyzed by quantitative-PCR on a total of 276 idiopathic infertile patients and 327 controls in a conventional case-control setting (199 subjects belonged to the previous a-CGH study). For one interesting locus (intersecting DUP1A) additional 338 subjects were analyzed. RESULTS AND DISCUSSION: All gains were confirmed as patient-specific and the difference in duplication load between patients and controls is significant (p = 1.65 × 10(-4)). Two of the CNVs are private variants, whereas 3 are found recurrently in patients and none of the controls. These CNVs include, or are in close proximity to, genes with testis-specific expression. DUP1A, mapping to the PAR1, is found at the highest frequency (1.4%) that was significantly different from controls (0%) (p = 0.047 after Bonferroni correction). Two mechanisms are proposed by which DUP1A may cause spermatogenic failure: i) by affecting the correct regulation of a gene with potential role in spermatogenesis; ii) by disturbing recombination between PAR1 regions during meiosis. This study allowed the identification of novel spermatogenesis candidate genes linked to the 5 CNVs and the discovery of the first recurrent, X-linked gain with potential clinical relevance.