RESUMO
Gene Expression Omnibus (GEO) is a database repository hosting a substantial proportion of publicly available high throughput gene expression data. Gene expression analysis is a powerful tool to gain insight into the mechanisms and processes underlying the biological and phenotypic differences between sample groups. Despite the wide availability of gene expression datasets, their access, analysis, and integration are not trivial and require specific expertise and programming proficiency. We developed the GEOexplorer webserver to allow scientists to access, integrate and analyse gene expression datasets without requiring programming proficiency. Via its user-friendly graphic interface, users can easily apply GEOexplorer to perform interactive and reproducible gene expression analysis of microarray and RNA-seq datasets, while producing a wealth of interactive visualisations to facilitate data exploration and interpretation, and generating a range of publication ready figures. The webserver allows users to search and retrieve datasets from GEO as well as to upload user-generated data and combine and harmonise two datasets to perform joint analyses. GEOexplorer, available at https://geoexplorer.rosalind.kcl.ac.uk, provides a solution for performing interactive and reproducible analyses of microarray and RNA-seq gene expression data, empowering life scientists to perform exploratory data analysis and differential gene expression analysis on-the-fly without informatics proficiency.
Assuntos
Bases de Dados Genéticas , Perfilação da Expressão Gênica , Análise em Microsséries , RNA-Seq , SoftwareRESUMO
OBJECTIVE: Neurofilament heavy-chain gene (NEFH) variants are associated with multiple neurodegenerative diseases, however, their relationship with ALS has not been robustly explored. Still, NEFH is commonly included in genetic screening panels worldwide. We therefore aimed to determine if NEFH variants modify ALS risk. METHODS: Genetic data of 11,130 people with ALS and 7,416 controls from the literature and Project MinE were analysed. We performed meta-analyses of published case-control studies reporting NEFH variants, and variant analysis of NEFH in Project MinE whole-genome sequencing data. RESULTS: Fixed-effects meta-analysis found that rare (MAF <1%) missense variants in the tail domain of NEFH increase ALS risk (OR 4.55, 95% CI 2.13-9.71, p < 0.0001). In Project MinE, ultrarare NEFH variants increased ALS risk (OR 1.37 95% CI 1.14-1.63, p = 0.0007), with rod domain variants (mostly intronic) appearing to drive the association (OR 1.45 95% CI 1.18-1.77, pMadsen-Browning = 0.0007, pSKAT-O = 0.003). While in the tail domain, ultrarare (MAF <0.1%) pathogenic missense variants were also associated with higher risk of ALS (OR 1.94, 95% CI 0.86-4.37, pMadsen-Browning = 0.039), supporting the meta-analysis results. Finally, several tail in-frame deletions were also found to affect disease risk, however, both protective and pathogenic deletions were found in this domain, highlighting an intricate architecture that requires further investigation. INTERPRETATION: We showed that NEFH tail missense and in-frame deletion variants, and intronic rod variants are risk factors for ALS. However, they are not variants of large effect, and their functional impact needs to be clarified in further studies. Therefore, their inclusion in routine genetic screening panels should be reconsidered.
Assuntos
Esclerose Lateral Amiotrófica , Proteínas de Neurofilamentos , Humanos , Esclerose Lateral Amiotrófica/genética , Esclerose Lateral Amiotrófica/epidemiologia , Predisposição Genética para Doença/genética , Mutação , Mutação de Sentido Incorreto , Proteínas de Neurofilamentos/genética , Domínios Proteicos/genéticaRESUMO
Objective: Variants in the superoxide dismutase (SOD1) gene are among the most common genetic causes of amyotrophic lateral sclerosis. Reflecting the wide spectrum of putatively deleterious variants that have been reported to date, it has become clear that SOD1-linked ALS presents a highly variable age at symptom onset and disease duration.Methods: Here we describe an open access web tool for comparative phenotype analysis in ALS: https://sod1-als-browser.rosalind.kcl.ac.uk/. The tool contains a built-in dataset of clinical information from 1383 people with ALS harboring a SOD1 variant resulting in one of 162 unique amino acid sequence alterations and from a non-SOD1 comparator ALS cohort of 13,469 individuals. We present two examples of analyses possible with this tool, testing how the ALS phenotype relates to SOD1 variants that alter amino acid residue hydrophobicity and to distinct variants at the 94th residue of SOD1, where six are sampled.Results and conclusions: The tool provides immediate access to the datasets and enables bespoke analysis of phenotypic trends associated with different protein variants, including the option for users to upload their own datasets for integration with the server data. The tool can be used to study SOD1-ALS and provides an analytical framework to study the differences between other user-uploaded ALS groups and our large reference database of SOD1 and non-SOD1 ALS. The tool is designed to be useful for clinicians and researchers, including those without programming expertise, and is highly flexible in the analyses that can be conducted.
RESUMO
Mutations in the superoxide dismutase 1 (SOD1) gene are the second most common known cause of ALS. SOD1 variants express high phenotypic variability and over 200 have been reported in people with ALS. It was previously proposed that variants can be broadly classified in two groups, 'wild-type like' (WTL) and 'metal binding region' (MBR) variants, based on their structural location and biophysical properties. MBR variants, but not WTL variants, were associated with a reduction of SOD1 enzymatic activity. In this study we used molecular dynamics and large clinical datasets to characterise the differences in the structural and dynamic behaviour of WTL and MBR variants with respect to the wild-type SOD1, and how such differences influence the ALS clinical phenotype. Our study identified marked structural differences, some of which are observed in both variant groups, while others are group specific. Moreover, collecting clinical data of approximately 500 SOD1 ALS patients carrying variants, we showed that the survival time of patients carrying an MBR variant is generally longer (â¼6 years median difference, p < 0.001) with respect to patients with a WTL variant. In conclusion, our study highlighted key differences in the dynamic behaviour between WTL and MBR SOD1 variants, and between variants and wild-type SOD1 at an atomic and molecular level, that could be further investigated to explain the associated phenotypic variability. Our results support the hypothesis of a decoupling between mechanisms of onset and progression of SOD1 ALS, and an involvement of loss-of-function of SOD1 with the disease progression.
RESUMO
BACKGROUND: Genetic penetrance is the probability of a phenotype when harbouring a particular pathogenic variant. Accurate penetrance estimates are important across biomedical fields including genetic counselling, disease research, and gene therapy. However, existing approaches for penetrance estimation require, for instance, large family pedigrees or availability of large databases of people affected and not affected by a disease. METHODS: We present a method for penetrance estimation in autosomal dominant phenotypes. It examines the distribution of a variant among people affected (cases) and unaffected (controls) by a phenotype within population-scale data and can be operated using cases only by considering family disease history. It is validated through simulation studies and candidate variant-disease case studies. RESULTS: Our method yields penetrance estimates which align with those obtained via existing approaches in the Parkinson's disease LRRK2 gene and pulmonary arterial hypertension BMPR2 gene case studies. In the amyotrophic lateral sclerosis case studies, examining penetrance for variants in the SOD1 and C9orf72 genes, we make novel penetrance estimates which correspond closely to understanding of the disease. CONCLUSIONS: The present approach broadens the spectrum of traits for which reliable penetrance estimates can be obtained. It has substantial utility for facilitating the characterisation of disease risks associated with rare variants with an autosomal dominant inheritance pattern. The yielded estimates avoid any kinship-specific effects and can circumvent ascertainment biases common when sampling rare variants among control populations.
Assuntos
Esclerose Lateral Amiotrófica , Humanos , Penetrância , Linhagem , Esclerose Lateral Amiotrófica/genética , Fenótipo , Características da FamíliaRESUMO
We describe the characterisation of a variable number tandem repeat (VNTR) domain within intron 1 of the amyotrophic lateral sclerosis (ALS) risk gene CFAP410 (Cilia and flagella associated protein 410) (previously known as C21orf2), providing insight into how this domain could support differential gene expression and thus be a modulator of ALS progression or risk. We demonstrated the VNTR was functional in a reporter gene assay in the HEK293 cell line, exhibiting both the properties of an activator domain and a transcriptional start site, and that the differential expression was directed by distinct repeat number in the VNTR. These properties embedded in the VNTR demonstrated the potential for this VNTR to modulate CFAP410 expression. We extrapolated these findings in silico by utilisation of tagging SNPs for the two most common VNTR alleles to establish a correlation with endogenous gene expression. Consistent with in vitro data, CFAP410 isoform expression was found to be variable in the brain. Furthermore, although the number of matched controls was low, there was evidence for one specific isoform being correlated with lower expression in those with ALS. To address if the genotype of the VNTR was associated with ALS risk, we characterised the variation of the CFAP410 VNTR in ALS cases and matched controls by PCR analysis of the VNTR length, defining eight alleles of the VNTR. No significant difference was observed between cases and controls, we noted, however, the cohort was unlikely to contain sufficient power to enable any firm conclusion to be drawn from this analysis. This data demonstrated that the VNTR domain has the potential to modulate CFAP410 expression as a regulatory element that could play a role in its tissue-specific and stimulus-inducible regulation that could impact the mechanism by which CFAP410 is involved in ALS.
RESUMO
Superoxide dismutase (SOD1) gene variants may cause amyotrophic lateral sclerosis, some of which are associated with a distinct phenotype. Most studies assess limited variants or sample sizes. In this international, retrospective observational study, we compare phenotypic and demographic characteristics between people with SOD1-ALS and people with ALS and no recorded SOD1 variant. We investigate which variants are associated with age at symptom onset and time from onset to death or censoring using Cox proportional-hazards regression. The SOD1-ALS dataset reports age of onset for 1122 and disease duration for 883 people; the comparator population includes 10,214 and 9010 people respectively. Eight variants are associated with younger age of onset and distinct survival trajectories; a further eight associated with younger onset only and one with distinct survival only. Here we show that onset and survival are decoupled in SOD1-ALS. Future research should characterise rarer variants and molecular mechanisms causing the observed variability.