RESUMO
BACKGROUND: The U.K. 100,000 Genomes Project is in the process of investigating the role of genome sequencing in patients with undiagnosed rare diseases after usual care and the alignment of this research with health care implementation in the U.K. National Health Service. Other parts of this project focus on patients with cancer and infection. METHODS: We conducted a pilot study involving 4660 participants from 2183 families, among whom 161 disorders covering a broad spectrum of rare diseases were present. We collected data on clinical features with the use of Human Phenotype Ontology terms, undertook genome sequencing, applied automated variant prioritization on the basis of applied virtual gene panels and phenotypes, and identified novel pathogenic variants through research analysis. RESULTS: Diagnostic yields varied among family structures and were highest in family trios (both parents and a proband) and families with larger pedigrees. Diagnostic yields were much higher for disorders likely to have a monogenic cause (35%) than for disorders likely to have a complex cause (11%). Diagnostic yields for intellectual disability, hearing disorders, and vision disorders ranged from 40 to 55%. We made genetic diagnoses in 25% of the probands. A total of 14% of the diagnoses were made by means of the combination of research and automated approaches, which was critical for cases in which we found etiologic noncoding, structural, and mitochondrial genome variants and coding variants poorly covered by exome sequencing. Cohortwide burden testing across 57,000 genomes enabled the discovery of three new disease genes and 19 new associations. Of the genetic diagnoses that we made, 25% had immediate ramifications for clinical decision making for the patients or their relatives. CONCLUSIONS: Our pilot study of genome sequencing in a national health care system showed an increase in diagnostic yield across a range of rare diseases. (Funded by the National Institute for Health Research and others.).
Assuntos
Genoma Humano , Doenças Raras/genética , Adolescente , Adulto , Criança , Pré-Escolar , Características da Família , Feminino , Variação Genética , Humanos , Masculino , Pessoa de Meia-Idade , Projetos Piloto , Reação em Cadeia da Polimerase , Doenças Raras/diagnóstico , Sensibilidade e Especificidade , Medicina Estatal , Reino Unido , Sequenciamento Completo do Genoma , Adulto JovemRESUMO
Improvements in functional genomic annotation have led to a critical mass of neurogenetic discoveries. This is exemplified in hereditary ataxia, a heterogeneous group of disorders characterised by incoordination from cerebellar dysfunction. Associated pathogenic variants in more than 300 genes have been described, leading to a detailed genetic classification partitioned by age-of-onset. Despite these advances, up to 75% of patients with ataxia remain molecularly undiagnosed even following whole genome sequencing, as exemplified in the 100 000 Genomes Project. This study aimed to understand whether we can improve our knowledge of the genetic architecture of hereditary ataxia by leveraging functional genomic annotations, and as a result, generate insights and strategies that raise the diagnostic yield. To achieve these aims, we used publicly-available multi-omics data to generate 294 genic features, capturing information relating to a gene's structure, genetic variation, tissue-specific, cell-type-specific and temporal expression, as well as protein products of a gene. We studied these features across genes typically causing childhood-onset, adult-onset or both types of disease first individually, then collectively. This led to the generation of testable hypotheses which we investigated using whole genome sequencing data from up to 2182 individuals presenting with ataxia and 6658 non-neurological probands recruited in the 100 000 Genomes Project. Using this approach, we demonstrated a high short tandem repeat (STR) density within childhood-onset genes suggesting that we may be missing pathogenic repeat expansions within this cohort. This was verified in both childhood- and adult-onset ataxia patients from the 100 000 Genomes Project who were unexpectedly found to have a trend for higher repeat sizes even at naturally-occurring STRs within known ataxia genes, implying a role for STRs in pathogenesis. Using unsupervised analysis, we found significant similarities in genomic annotation across the gene panels, which suggested adult- and childhood-onset patients should be screened using a common diagnostic gene set. We tested this within the 100 000 Genomes Project by assessing the burden of pathogenic variants among childhood-onset genes in adult-onset patients and vice versa. This demonstrated a significantly higher burden of rare, potentially pathogenic variants in conventional childhood-onset genes among individuals with adult-onset ataxia. Our analysis has implications for the current clinical practice in genetic testing for hereditary ataxia. We suggest that the diagnostic rate for hereditary ataxia could be increased by removing the age-of-onset partition, and through a modified screening for repeat expansions in naturally-occurring STRs within known ataxia-associated genes, in effect treating these regions as candidate pathogenic loci.
Assuntos
Ataxia Cerebelar , Degenerações Espinocerebelares , Adulto , Humanos , Degenerações Espinocerebelares/genética , Ataxia Cerebelar/diagnóstico , Ataxia Cerebelar/genética , Ataxia/diagnóstico , Ataxia/genética , Genômica , Testes GenéticosRESUMO
CAG repeat expansions in exon 1 of the AR gene on the X chromosome cause spinal and bulbar muscular atrophy, a male-specific progressive neuromuscular disorder associated with a variety of extra-neurological symptoms. The disease has a reported male prevalence of approximately 1:30 000 or less, but the AR repeat expansion frequency is unknown. We established a pipeline, which combines the use of the ExpansionHunter tool and visual validation, to detect AR CAG expansion on whole-genome sequencing data, benchmarked it to fragment PCR sizing, and applied it to 74 277 unrelated individuals from four large cohorts. Our pipeline showed sensitivity of 100% [95% confidence interval (CI) 90.8-100%], specificity of 99% (95% CI 94.2-99.7%), and a positive predictive value of 97.4% (95% CI 84.4-99.6%). We found the mutation frequency to be 1:3182 (95% CI 1:2309-1:4386, n = 117 734) X chromosomes-10 times more frequent than the reported disease prevalence. Modelling using the novel mutation frequency led to estimate disease prevalence of 1:6887 males, more than four times more frequent than the reported disease prevalence. This discrepancy is possibly due to underdiagnosis of this neuromuscular condition, reduced penetrance, and/or pleomorphic clinical manifestations.
Assuntos
Atrofia Muscular Espinal , Receptores Androgênicos , Humanos , Masculino , Receptores Androgênicos/genética , Atrofia Muscular Espinal/genética , Atrofia Muscular , Reação em Cadeia da Polimerase , Expansão das Repetições de Trinucleotídeos/genéticaRESUMO
SUMMARY: We describe a novel computational method for genotyping repeats using sequence graphs. This method addresses the long-standing need to accurately genotype medically important loci containing repeats adjacent to other variants or imperfect DNA repeats such as polyalanine repeats. Here we introduce a new version of our repeat genotyping software, ExpansionHunter, that uses this method to perform targeted genotyping of a broad class of such loci. AVAILABILITY AND IMPLEMENTATION: ExpansionHunter is implemented in C++ and is available under the Apache License Version 2.0. The source code, documentation, and Linux/macOS binaries are available at https://github.com/Illumina/ExpansionHunter/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Repetições de Microssatélites , Software , GenótipoRESUMO
PURPOSE: CLAPO syndrome is a rare vascular disorder characterized by capillary malformation of the lower lip, lymphatic malformation predominant on the face and neck, asymmetry, and partial/generalized overgrowth. Here we tested the hypothesis that, although the genetic cause is not known, the tissue distribution of the clinical manifestations in CLAPO seems to follow a pattern of somatic mosaicism. METHODS: We clinically evaluated a cohort of 13 patients with CLAPO and screened 20 DNA blood/tissue samples from 9 patients using high-throughput, deep sequencing. RESULTS: We identified five activating mutations in the PIK3CA gene in affected tissues from 6 of the 9 patients studied; one of the variants (NM_006218.2:c.248T>C; p.Phe83Ser) has not been previously described in developmental disorders. CONCLUSION: We describe for the first time the presence of somatic activating PIK3CA mutations in patients with CLAPO. We also report an update of the phenotype and natural history of the syndrome.
Assuntos
Malformações Arteriovenosas/genética , Malformações Arteriovenosas/fisiopatologia , Classe I de Fosfatidilinositol 3-Quinases/genética , Doenças Linfáticas/genética , Doenças Linfáticas/fisiopatologia , Adolescente , Adulto , Criança , Classe I de Fosfatidilinositol 3-Quinases/fisiologia , Feminino , Estudos de Associação Genética/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Masculino , Mutação , Fosfatidilinositol 3-Quinases/genética , Estudos RetrospectivosRESUMO
There is epidemiological evidence that patients with certain Central Nervous System (CNS) disorders have a lower than expected probability of developing some types of Cancer. We tested here the hypothesis that this inverse comorbidity is driven by molecular processes common to CNS disorders and Cancers, and that are deregulated in opposite directions. We conducted transcriptomic meta-analyses of three CNS disorders (Alzheimer's disease, Parkinson's disease and Schizophrenia) and three Cancer types (Lung, Prostate, Colorectal) previously described with inverse comorbidities. A significant overlap was observed between the genes upregulated in CNS disorders and downregulated in Cancers, as well as between the genes downregulated in CNS disorders and upregulated in Cancers. We also observed expression deregulations in opposite directions at the level of pathways. Our analysis points to specific genes and pathways, the upregulation of which could increase the incidence of CNS disorders and simultaneously lower the risk of developing Cancer, while the downregulation of another set of genes and pathways could contribute to a decrease in the incidence of CNS disorders while increasing the Cancer risk. These results reinforce the previously proposed involvement of the PIN1 gene, Wnt and P53 pathways, and reveal potential new candidates, in particular related with protein degradation processes.
Assuntos
Doença de Alzheimer/genética , Comorbidade , Neoplasias/genética , Doença de Parkinson/genética , Esquizofrenia/genética , Doença de Alzheimer/epidemiologia , Doença de Alzheimer/patologia , Sistema Nervoso Central/patologia , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Humanos , Peptidilprolil Isomerase de Interação com NIMA , Neoplasias/epidemiologia , Neoplasias/patologia , Doença de Parkinson/epidemiologia , Doença de Parkinson/patologia , Peptidilprolil Isomerase/genética , Esquizofrenia/epidemiologia , Esquizofrenia/patologia , Transdução de SinaisAssuntos
Cardiomiopatia Hipertrófica/genética , Doença de Depósito de Glicogênio Tipo IIb/genética , Proteína 2 de Membrana Associada ao Lisossomo/genética , Feminino , Variação Genética , Doença de Depósito de Glicogênio Tipo IIb/diagnóstico , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Padrões de Herança , Mutação com Perda de FunçãoRESUMO
Chimeric RNAs that comprise two or more different transcripts have been identified in many cancers and among the Expressed Sequence Tags (ESTs) isolated from different organisms; they might represent functional proteins and produce different disease phenotypes. The ChiTaRS database of Chimeric Transcripts and RNA-Sequencing data (http://chitars.bioinfo.cnio.es/) collects more than 16 000 chimeric RNAs from humans, mice and fruit flies, 233 chimeras confirmed by RNA-seq reads and â¼2000 cancer breakpoints. The database indicates the expression and tissue specificity of these chimeras, as confirmed by RNA-seq data, and it includes mass spectrometry results for some human entries at their junctions. Moreover, the database has advanced features to analyze junction consistency and to rank chimeras based on the evidence of repeated junction sites. Finally, 'Junction Search' screens through the RNA-seq reads found at the chimeras' junction sites to identify putative junctions in novel sequences entered by users. Thus, ChiTaRS is an extensive catalog of human, mouse and fruit fly chimeras that will extend our understanding of the evolution of chimeric transcripts in eukaryotes and can be advantageous in the analysis of human cancer breakpoints.
Assuntos
Bases de Dados Genéticas , Proteínas Mutantes Quiméricas/genética , RNA/química , Animais , Pontos de Quebra do Cromossomo , Gráficos por Computador , Drosophila/genética , Fusão Gênica , Humanos , Internet , Camundongos , Proteínas Mutantes Quiméricas/metabolismo , Neoplasias/genética , RNA/metabolismo , Análise de Sequência de RNARESUMO
Overgrowth syndromes (OGS) are a group of disorders in which all parameters of growth and physical development are above the mean for age and sex. We evaluated a series of 270 families from the Spanish Overgrowth Syndrome Registry with no known OGS. We identified one de novo deletion and three missense mutations in RNF125 in six patients from four families with overgrowth, macrocephaly, intellectual disability, mild hydrocephaly, hypoglycemia, and inflammatory diseases resembling Sjögren syndrome. RNF125 encodes an E3 ubiquitin ligase and is a novel gene of OGS. Our studies of the RNF125 pathway point to upregulation of RIG-I-IPS1-MDA5 and/or disruption of the PI3K-AKT and interferon signaling pathways as the putative final effectors.
Assuntos
Transtornos do Crescimento/genética , Mutação , Ubiquitina-Proteína Ligases/genética , Feminino , Transtornos do Crescimento/epidemiologia , Humanos , Masculino , Linhagem , Sistema de Registros , Espanha/epidemiologia , SíndromeRESUMO
The phosphorylation and dephosphorylation of the carboxyl-terminal domain (CTD) of the largest RNA polymerase II (RNAPII) subunit is a critical regulatory checkpoint for transcription and mRNA processing. This CTD is unique to eukaryotic organisms and it contains multiple tandem-repeats with the consensus sequence Tyr(1) -Ser(2) -Pro(3) -Thr(4) -Ser(5) -Pro(6) -Ser(7) . Traditionally, CTD phosphatases that use metal-ion-independent (cysteine-based) and metal-ion-assisted (aspartate-based) catalytic mechanisms have been considered to belong to two independent groups. However, using structural comparisons we have identified a common structural scaffold in these two groups of CTD phosphatases. This common scaffold accommodates different catalytic processes with the same substrate specificity, in this case phospho-serine/threonine residues flanked by prolines. Furthermore, this scaffold provides a structural connection between two groups of protein tyrosine phosphatases (PTPs): Cys-based (classes I, II, and III) and Asp-based (class IV) PTPs. Redundancy in catalytic mechanisms is not infrequent and may arise in specific biological settings. To better understand the activity of the CTD phosphatases, we combined our structural analyses with data on CTD phosphatase expression in different human and mouse tissues. The results suggest that aspartate- and cysteine-based CTD-dephosphorylation acts in concert during cellular stress, when high levels of reactive oxygen species can inhibit the nucleophilic function of the catalytic cysteine, as occurs in mental and neurodegenerative disorders like schizophrenia, Alzheimer's and Parkinson's diseases. Moreover, these findings have significant implications for the study of the RNAPII-CTD dephosphorylation in eukaryotes.
Assuntos
Evolução Molecular , Fosfoproteínas Fosfatases/química , Fosfoproteínas Fosfatases/metabolismo , RNA Polimerase II/metabolismo , Sequência de Aminoácidos , Animais , Catálise , Biologia Computacional , Bases de Dados de Proteínas , Humanos , Camundongos , Dados de Sequência Molecular , Peptidilprolil Isomerase de Interação com NIMA , Peptidilprolil Isomerase/química , Peptidilprolil Isomerase/metabolismo , Fosfoproteínas Fosfatases/classificação , Fosfoproteínas Fosfatases/genética , Fosforilação , Schizosaccharomyces/enzimologia , Especificidade da EspécieRESUMO
GC-rich tandem repeat expansions (TREs) are often associated with DNA methylation, gene silencing and folate-sensitive fragile sites, and underlie several congenital and late-onset disorders. Through a combination of DNA-methylation profiling and tandem repeat genotyping, we identified 24 methylated TREs and investigated their effects on human traits using phenome-wide association studies in 168,641 individuals from the UK Biobank, identifying 156 significant TRE-trait associations involving 17 different TREs. Of these, a GCC expansion in the promoter of AFF3 was associated with a 2.4-fold reduced probability of completing secondary education, an effect size comparable to several recurrent pathogenic microdeletions. In a cohort of 6,371 probands with neurodevelopmental problems of suspected genetic etiology, we observed a significant enrichment of AFF3 expansions compared with controls. With a population prevalence that is at least fivefold higher than the TRE that causes fragile X syndrome, AFF3 expansions represent a major cause of neurodevelopmental delay.
RESUMO
Repeat expansion disorders (REDs) are a devastating group of predominantly neurological diseases. Together they are common, affecting 1 in 3,000 people worldwide with population-specific differences. However, prevalence estimates of REDs are hampered by heterogeneous clinical presentation, variable geographic distributions and technological limitations leading to underascertainment. Here, leveraging whole-genome sequencing data from 82,176 individuals from different populations, we found an overall disease allele frequency of REDs of 1 in 283 individuals. Modeling disease prevalence using genetic data, age at onset and survival, we show that the expected number of people with REDs would be two to three times higher than currently reported figures, indicating underdiagnosis and/or incomplete penetrance. While some REDs are population specific, for example, Huntington disease-like 2 in Africans, most REDs are represented in all broad genetic ancestries (that is, Europeans, Africans, Americans, East Asians and South Asians), challenging the notion that some REDs are found only in specific populations. These results have worldwide implications for local and global health communities in the diagnosis and counseling of REDs.
RESUMO
Spinocerebellar ataxias form a clinically and genetically heterogeneous group of neurodegenerative disorders characterized by progressive cerebellar ataxia. Their prevalence varies among populations and ethnicities. Spinocerebellar ataxia 36 is caused by a GGCCTG repeat expansion in the first intron of the NOP56 gene and is characterized by late-onset ataxia, sensorineural hearing loss and upper and lower motor neuron signs, including tongue fasciculations. Spinocerebellar ataxia 36 has been described mainly in East Asian and Western European patients and was thought to be absent in the British population. Leveraging novel bioinformatic tools to detect repeat expansions from whole-genome sequencing, we analyse the NOP56 repeat in 1257 British patients with hereditary ataxia and in 7506 unrelated controls. We identify pathogenic repeat expansions in five families (seven patients), representing the first cohort of White British descent patients with spinocerebellar ataxia 36. Employing in silico approaches using whole-genome sequencing data, we found an 87â kb shared haplotype in among the affected individuals from five families around the NOP56 repeat region, although this block was also shared between several controls, suggesting that the repeat arises on a permissive haplotype. Clinically, the patients presented with slowly progressive cerebellar ataxia with a low rate of hearing loss and variable rates of motor neuron impairment. Our findings show that the NOP56 expansion causes ataxia in the British population and that spinocerebellar ataxia 36 can be suspected in patients with a late-onset, slowly progressive ataxia, even without the findings of hearing loss and tongue fasciculation.
RESUMO
Repeat expansion disorders (REDs) are a devastating group of predominantly neurological diseases. Together they are common, affecting 1 in 3,000 people worldwide with population-specific differences. However, prevalence estimates of REDs are hampered by heterogeneous clinical presentation, variable geographic distributions, and technological limitations leading to under-ascertainment. Here, leveraging whole genome sequencing data from 82,176 individuals from different populations we found an overall carrier frequency of REDs of 1 in 340 individuals. Modelling disease prevalence using genetic data, age at onset and survival, we show that REDs are up to 3-fold more prevalent than currently reported figures. While some REDs are population-specific, e.g. Huntington's disease type 2, most REDs are represented in all broad genetic ancestries, including Africans and Asians, challenging the notion that some REDs are found only in European populations. These results have worldwide implications for local and global health communities in the diagnosis and management of REDs both at local and global levels.
RESUMO
GC-rich tandem repeat expansions (TREs) are often associated with DNA methylation, gene silencing and folate-sensitive fragile sites and underlie several congenital and late-onset disorders. Through a combination of DNA methylation profiling and tandem repeat genotyping, we identified 24 methylated TREs and investigated their effects on human traits using PheWAS in 168,641 individuals from the UK Biobank, identifying 156 significant TRE:trait associations involving 17 different TREs. Of these, a GCC expansion in the promoter of AFF3 was linked with a 2.4-fold reduced probability of completing secondary education, an effect size comparable to several recurrent pathogenic microdeletions. In a cohort of 6,371 probands with neurodevelopmental problems of suspected genetic etiology, we observed a significant enrichment of AFF3 expansions compared to controls. With a population prevalence that is at least 5-fold higher than the TRE that causes fragile X syndrome, AFF3 expansions represent a significant cause of neurodevelopmental delay.
Assuntos
Sequenciamento do Exoma , Exoma , Doenças Genéticas Inatas/diagnóstico , Doenças Genéticas Inatas/genética , Testes Genéticos , Criança , Feminino , Testes Genéticos/métodos , Impressão Genômica , Genótipo , Humanos , Masculino , Mosaicismo , Mutação , Pais , Fenótipo , Proteínas/genética , Sequenciamento do Exoma/métodosRESUMO
BACKGROUND: Repeat expansion disorders affect about 1 in 3000 individuals and are clinically heterogeneous diseases caused by expansions of short tandem DNA repeats. Genetic testing is often locus-specific, resulting in underdiagnosis of people who have atypical clinical presentations, especially in paediatric patients without a previous positive family history. Whole genome sequencing is increasingly used as a first-line test for other rare genetic disorders, and we aimed to assess its performance in the diagnosis of patients with neurological repeat expansion disorders. METHODS: We retrospectively assessed the diagnostic accuracy of whole genome sequencing to detect the most common repeat expansion loci associated with neurological outcomes (AR, ATN1, ATXN1, ATXN2, ATXN3, ATXN7, C9orf72, CACNA1A, DMPK, FMR1, FXN, HTT, and TBP) using samples obtained within the National Health Service in England from patients who were suspected of having neurological disorders; previous PCR test results were used as the reference standard. The clinical accuracy of whole genome sequencing to detect repeat expansions was prospectively examined in previously genetically tested and undiagnosed patients recruited in 2013-17 to the 100â000 Genomes Project in the UK, who were suspected of having a genetic neurological disorder (familial or early-onset forms of ataxia, neuropathy, spastic paraplegia, dementia, motor neuron disease, parkinsonian movement disorders, intellectual disability, or neuromuscular disorders). If a repeat expansion call was made using whole genome sequencing, PCR was used to confirm the result. FINDINGS: The diagnostic accuracy of whole genome sequencing to detect repeat expansions was evaluated against 793 PCR tests previously performed within the NHS from 404 patients. Whole genome sequencing correctly classified 215 of 221 expanded alleles and 1316 of 1321 non-expanded alleles, showing 97·3% sensitivity (95% CI 94·2-99·0) and 99·6% specificity (99·1-99·9) across the 13 disease-associated loci when compared with PCR test results. In samples from 11â631 patients in the 100â000 Genomes Project, whole genome sequencing identified 81 repeat expansions, which were also tested by PCR: 68 were confirmed as repeat expansions in the full pathogenic range, 11 were non-pathogenic intermediate expansions or premutations, and two were non-expanded repeats (16% false discovery rate). INTERPRETATION: In our study, whole genome sequencing for the detection of repeat expansions showed high sensitivity and specificity, and it led to identification of neurological repeat expansion disorders in previously undiagnosed patients. These findings support implementation of whole genome sequencing in clinical laboratories for diagnosis of patients who have a neurological presentation consistent with a repeat expansion disorder. FUNDING: Medical Research Council, Department of Health and Social Care, National Health Service England, National Institute for Health Research, and Illumina.
Assuntos
Expansão das Repetições de DNA , Medicina Estatal , Criança , Proteína do X Frágil da Deficiência Intelectual/genética , Humanos , Estudos Prospectivos , Estudos Retrospectivos , Reino Unido , Sequenciamento Completo do Genoma/métodosRESUMO
BACKGROUND: Expansions of short tandem repeats are the cause of many neurogenetic disorders including familial amyotrophic lateral sclerosis, Huntington disease, and many others. Multiple methods have been recently developed that can identify repeat expansions in whole genome or exome sequencing data. Despite the widely recognized need for visual assessment of variant calls in clinical settings, current computational tools lack the ability to produce such visualizations for repeat expansions. Expanded repeats are difficult to visualize because they correspond to large insertions relative to the reference genome and involve many misaligning and ambiguously aligning reads. RESULTS: We implemented REViewer, a computational method for visualization of sequencing data in genomic regions containing long repeat expansions and FlipBook, a companion image viewer designed for manual curation of large collections of REViewer images. To generate a read pileup, REViewer reconstructs local haplotype sequences and distributes reads to these haplotypes in a way that is most consistent with the fragment lengths and evenness of read coverage. To create appropriate training materials for onboarding new users, we performed a concordance study involving 12 scientists involved in short tandem repeat research. We used the results of this study to create a user guide that describes the basic principles of using REViewer as well as a guide to the typical features of read pileups that correspond to low confidence repeat genotype calls. Additionally, we demonstrated that REViewer can be used to annotate clinically relevant repeat interruptions by comparing visual assessment results of 44 FMR1 repeat alleles with the results of triplet repeat primed PCR. For 38 of these alleles, the results of visual assessment were consistent with triplet repeat primed PCR. CONCLUSIONS: Read pileup plots generated by REViewer offer an intuitive way to visualize sequencing data in regions containing long repeat expansions. Laboratories can use REViewer and FlipBook to assess the quality of repeat genotype calls as well as to visually detect interruptions or other imperfections in the repeat sequence and the surrounding flanking regions. REViewer and FlipBook are available under open-source licenses at https://github.com/illumina/REViewer and https://github.com/broadinstitute/flipbook respectively.
Assuntos
Esclerose Lateral Amiotrófica , Sequências de Repetição em Tandem , Alelos , Esclerose Lateral Amiotrófica/genética , Exoma , Proteína do X Frágil da Deficiência Intelectual/genética , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , HumanosRESUMO
Idiopathic pulmonary fibrosis (IPF) is a fatal disorder characterised by progressive, destructive lung scarring. Despite substantial progress, the genetic determinants of this disease remain incompletely defined. Using whole genome and whole exome sequencing data from 752 individuals with sporadic IPF and 119,055 UK Biobank controls, we performed a variant-level exome-wide association study (ExWAS) and gene-level collapsing analyses. Our variant-level analysis revealed a novel association between a rare missense variant in SPDL1 and IPF (NM_017785.5:g.169588475 G > A p.Arg20Gln; p = 2.4 × 10-7, odds ratio = 2.87, 95% confidence interval: 2.03-4.07). This signal was independently replicated in the FinnGen cohort, which contains 1028 cases and 196,986 controls (combined p = 2.2 × 10-20), firmly associating this variant as an IPF risk allele. SPDL1 encodes Spindly, a protein involved in mitotic checkpoint signalling during cell division that has not been previously described in fibrosis. To the best of our knowledge, these results highlight a novel mechanism underlying IPF, providing the potential for new therapeutic discoveries in a disease of great unmet need.
Assuntos
Proteínas de Ciclo Celular/genética , Fibrose Pulmonar Idiopática/genética , Mutação de Sentido Incorreto , Idoso , Estudos de Casos e Controles , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Fibrose Pulmonar Idiopática/diagnóstico , Masculino , Fenótipo , Sequenciamento do ExomaRESUMO
OBJECTIVE: To determine whether whole genome sequencing can be used to define the molecular basis of suspected mitochondrial disease. DESIGN: Cohort study. SETTING: National Health Service, England, including secondary and tertiary care. PARTICIPANTS: 345 patients with suspected mitochondrial disorders recruited to the 100 000 Genomes Project in England between 2015 and 2018. INTERVENTION: Short read whole genome sequencing was performed. Nuclear variants were prioritised on the basis of gene panels chosen according to phenotypes, ClinVar pathogenic/likely pathogenic variants, and the top 10 prioritised variants from Exomiser. Mitochondrial DNA variants were called using an in-house pipeline and compared with a list of pathogenic variants. Copy number variants and short tandem repeats for 13 neurological disorders were also analysed. American College of Medical Genetics guidelines were followed for classification of variants. MAIN OUTCOME MEASURE: Definite or probable genetic diagnosis. RESULTS: A definite or probable genetic diagnosis was identified in 98/319 (31%) families, with an additional 6 (2%) possible diagnoses. Fourteen of the diagnoses (4% of the 319 families) explained only part of the clinical features. A total of 95 different genes were implicated. Of 104 families given a diagnosis, 39 (38%) had a mitochondrial diagnosis and 65 (63%) had a non-mitochondrial diagnosis. CONCLUSION: Whole genome sequencing is a useful diagnostic test in patients with suspected mitochondrial disorders, yielding a diagnosis in a further 31% after exclusion of common causes. Most diagnoses were non-mitochondrial disorders and included developmental disorders with intellectual disability, epileptic encephalopathies, other metabolic disorders, cardiomyopathies, and leukodystrophies. These would have been missed if a targeted approach was taken, and some have specific treatments.