RESUMEN
Many common variants have been associated with hematological traits, but identification of causal genes and pathways has proven challenging. We performed a genome-wide association analysis in the UK Biobank and INTERVAL studies, testing 29.5 million genetic variants for association with 36 red cell, white cell, and platelet properties in 173,480 European-ancestry participants. This effort yielded hundreds of low frequency (<5%) and rare (<1%) variants with a strong impact on blood cell phenotypes. Our data highlight general properties of the allelic architecture of complex traits, including the proportion of the heritable component of each blood trait explained by the polygenic signal across different genome regulatory domains. Finally, through Mendelian randomization, we provide evidence of shared genetic pathways linking blood cell indices with complex pathologies, including autoimmune diseases, schizophrenia, and coronary heart disease and evidence suggesting previously reported population associations between blood cell indices and cardiovascular disease may be non-causal.
Asunto(s)
Variación Genética , Estudio de Asociación del Genoma Completo , Células Madre Hematopoyéticas/metabolismo , Enfermedades del Sistema Inmune/genética , Alelos , Diferenciación Celular , Predisposición Genética a la Enfermedad , Células Madre Hematopoyéticas/patología , Humanos , Enfermedades del Sistema Inmune/patología , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Población Blanca/genéticaRESUMEN
Despite significant progress in unraveling the genetic causes of neurodevelopmental disorders (NDDs), a substantial proportion of individuals with NDDs remain without a genetic diagnosis after microarray and/or exome sequencing. Here, we aimed to assess the power of short-read genome sequencing (GS), complemented with long-read GS, to identify causal variants in participants with NDD from the National Institute for Health and Care Research (NIHR) BioResource project. Short-read GS was conducted on 692 individuals (489 affected and 203 unaffected relatives) from 465 families. Additionally, long-read GS was performed on five affected individuals who had structural variants (SVs) in technically challenging regions, had complex SVs, or required distal variant phasing. Causal variants were identified in 36% of affected individuals (177/489), and a further 23% (112/489) had a variant of uncertain significance after multiple rounds of re-analysis. Among all reported variants, 88% (333/380) were coding nuclear SNVs or insertions and deletions (indels), and the remainder were SVs, non-coding variants, and mitochondrial variants. Furthermore, long-read GS facilitated the resolution of challenging SVs and invalidated variants of difficult interpretation from short-read GS. This study demonstrates the value of short-read GS, complemented with long-read GS, in investigating the genetic causes of NDDs. GS provides a comprehensive and unbiased method of identifying all types of variants throughout the nuclear and mitochondrial genomes in individuals with NDD.
Asunto(s)
Genoma Humano , Trastornos del Neurodesarrollo , Humanos , Genoma Humano/genética , Mapeo Cromosómico , Secuencia de Bases , Mutación INDEL , Trastornos del Neurodesarrollo/genéticaRESUMEN
Clinical validity assessments of gene-disease associations underpin analysis and reporting in diagnostic genomics, and yet wide variability exists in practice, particularly in use of these assessments for virtual gene panel design and maintenance. Harmonization efforts are hampered by the lack of agreed terminology, agreed gene curation standards, and platforms that can be used to identify and resolve discrepancies at scale. We undertook a systematic comparison of the content of 80 virtual gene panels used in two healthcare systems by multiple diagnostic providers in the United Kingdom and Australia. The process was enabled by a shared curation platform, PanelApp, and resulted in the identification and review of 2,144 discordant gene ratings, demonstrating the utility of sharing structured gene-disease validity assessments and collaborative discordance resolution in establishing national and international consensus.
Asunto(s)
Consenso , Curaduría de Datos/normas , Enfermedades Genéticas Congénitas/genética , Genómica/normas , Anotación de Secuencia Molecular/normas , Australia , Biomarcadores/metabolismo , Curaduría de Datos/métodos , Atención a la Salud , Expresión Génica , Ontología de Genes , Enfermedades Genéticas Congénitas/diagnóstico , Enfermedades Genéticas Congénitas/patología , Genómica/métodos , Humanos , Aplicaciones Móviles/provisión & distribución , Terminología como Asunto , Reino UnidoRESUMEN
PURPOSE: Several groups and resources provide information that pertains to the validity of gene-disease relationships used in genomic medicine and research; however, universal standards and terminologies to define the evidence base for the role of a gene in disease and a single harmonized resource were lacking. To tackle this issue, the Gene Curation Coalition (GenCC) was formed. METHODS: The GenCC drafted harmonized definitions for differing levels of gene-disease validity on the basis of existing resources, and performed a modified Delphi survey with 3 rounds to narrow the list of terms. The GenCC also developed a unified database to display curated gene-disease validity assertions from its members. RESULTS: On the basis of 241 survey responses from the genetics community, a consensus term set was chosen for grading gene-disease validity and database submissions. As of December 2021, the database contained 15,241 gene-disease assertions on 4569 unique genes from 12 submitters. When comparing submissions to the database from distinct sources, conflicts in assertions of gene-disease validity ranged from 5.3% to 13.4%. CONCLUSION: Terminology standardization, sharing of gene-disease validity classifications, and resolution of curation conflicts will facilitate collaborations across international curation efforts and in turn, improve consistency in genetic testing and variant interpretation.
Asunto(s)
Bases de Datos Genéticas , Genómica , Pruebas Genéticas , Variación Genética , HumanosRESUMEN
Rationale: Recently, rare heterozygous mutations in GDF2 were identified in patients with pulmonary arterial hypertension (PAH). GDF2 encodes the circulating BMP (bone morphogenetic protein) type 9, which is a ligand for the BMP2 receptor.Objectives: Here we determined the functional impact of GDF2 mutations and characterized plasma BMP9 and BMP10 levels in patients with idiopathic PAH.Methods: Missense BMP9 mutant proteins were expressed in vitro and the impact on BMP9 protein processing and secretion, endothelial signaling, and functional activity was assessed. Plasma BMP9 and BMP10 levels and activity were assayed in patients with PAH with GDF2 variants and in control subjects. Levels were also measured in a larger cohort of control subjects (n = 120) and patients with idiopathic PAH (n = 260).Measurements and Main Results: We identified a novel rare variation at the GDF2 and BMP10 loci, including copy number variation. In vitro, BMP9 missense proteins demonstrated impaired cellular processing and secretion. Patients with PAH who carried these mutations exhibited reduced plasma levels of BMP9 and reduced BMP activity. Unexpectedly, plasma BMP10 levels were also markedly reduced in these individuals. Although overall BMP9 and BMP10 levels did not differ between patients with PAH and control subjects, BMP10 levels were lower in PAH females. A subset of patients with PAH had markedly reduced plasma levels of BMP9 and BMP10 in the absence of GDF2 mutations.Conclusions: Our findings demonstrate that GDF2 mutations result in BMP9 loss of function and are likely causal. These mutations lead to reduced circulating levels of both BMP9 and BMP10. These findings support therapeutic strategies to enhance BMP9 or BMP10 signaling in PAH.
Asunto(s)
Proteínas Morfogenéticas Óseas/genética , Factor 2 de Diferenciación de Crecimiento/genética , Hipertensión Arterial Pulmonar/genética , Adulto , Proteínas Morfogenéticas Óseas/metabolismo , Estudios de Casos y Controles , Variaciones en el Número de Copia de ADN , Femenino , Factor 2 de Diferenciación de Crecimiento/metabolismo , Heterocigoto , Humanos , Masculino , Persona de Mediana Edad , Mutación Missense , Transporte de Proteínas , Hipertensión Arterial Pulmonar/metabolismo , Factores SexualesRESUMEN
Inherited bleeding, thrombotic, and platelet disorders (BPDs) are diseases that affect â¼300 individuals per million births. With the exception of hemophilia and von Willebrand disease patients, a molecular analysis for patients with a BPD is often unavailable. Many specialized tests are usually required to reach a putative diagnosis and they are typically performed in a step-wise manner to control costs. This approach causes delays and a conclusive molecular diagnosis is often never reached, which can compromise treatment and impede rapid identification of affected relatives. To address this unmet diagnostic need, we designed a high-throughput sequencing platform targeting 63 genes relevant for BPDs. The platform can call single nucleotide variants, short insertions/deletions, and large copy number variants (though not inversions) which are subjected to automated filtering for diagnostic prioritization, resulting in an average of 5.34 candidate variants per individual. We sequenced 159 and 137 samples, respectively, from cases with and without previously known causal variants. Among the latter group, 61 cases had clinical and laboratory phenotypes indicative of a particular molecular etiology, whereas the remainder had an a priori highly uncertain etiology. All previously detected variants were recapitulated and, when the etiology was suspected but unknown or uncertain, a molecular diagnosis was reached in 56 of 61 and only 8 of 76 cases, respectively. The latter category highlights the need for further research into novel causes of BPDs. The ThromboGenomics platform thus provides an affordable DNA-based test to diagnose patients suspected of having a known inherited BPD.
Asunto(s)
Trastornos de las Plaquetas Sanguíneas/genética , Predisposición Genética a la Enfermedad , Hemorragia/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Trombosis/genética , Estudios de Casos y Controles , Variaciones en el Número de Copia de ADN , Femenino , Estudios de Asociación Genética/métodos , Humanos , Masculino , Mutación , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN/métodosRESUMEN
The HUGO Gene Nomenclature Committee situated at the European Bioinformatics Institute assigns unique symbols and names to human genes. Since 2011, the data within our database has expanded largely owing to an increase in naming pseudogenes and non-coding RNA genes, and we now have >33,500 approved symbols. Our gene families and groups have also increased to nearly 500, with â¼45% of our gene entries associated to at least one family or group. We have also redesigned the HUGO Gene Nomenclature Committee website http://www.genenames.org creating a constant look and feel across the site and improving usability and readability for our users. The site provides a public access portal to our database with no restrictions imposed on access or the use of the data. Within this article, we review our online resources and data with particular emphasis on the updates to our website.
Asunto(s)
Bases de Datos Genéticas , Genes , Terminología como Asunto , Humanos , Internet , Proteínas/genéticaRESUMEN
The HUGO Gene Nomenclature Committee (HGNC) assigns approved gene symbols to human loci. There are currently over 33,000 approved gene symbols, the majority of which represent protein-coding genes, but we also name other locus types such as non-coding RNAs, pseudogenes and phenotypic loci. Where relevant, the HGNC organise these genes into gene families and groups. The HGNC website http://www.genenames.org/ is an online repository of HGNC-approved gene nomenclature and associated resources for human genes, and includes links to genomic, proteomic and phenotypic information. In addition to this, we also have dedicated gene family web pages and are currently expanding and generating more of these pages using data curated by the HGNC and from information derived from external resources that focus on particular gene families. Here, we review our current online resources with a particular focus on our gene family data, using it to highlight our new Gene Symbol Report and gene family data downloads.
Asunto(s)
Bases de Datos Genéticas , Sitios Genéticos/genética , Familia de Multigenes/genética , Proteínas/genética , Terminología como Asunto , Variación Genética , Genómica/métodos , Humanos , Proteínas/clasificación , Proteínas/metabolismo , Proteómica/métodos , Navegador WebRESUMEN
BACKGROUND: Repeat expansion disorders affect about 1 in 3000 individuals and are clinically heterogeneous diseases caused by expansions of short tandem DNA repeats. Genetic testing is often locus-specific, resulting in underdiagnosis of people who have atypical clinical presentations, especially in paediatric patients without a previous positive family history. Whole genome sequencing is increasingly used as a first-line test for other rare genetic disorders, and we aimed to assess its performance in the diagnosis of patients with neurological repeat expansion disorders. METHODS: We retrospectively assessed the diagnostic accuracy of whole genome sequencing to detect the most common repeat expansion loci associated with neurological outcomes (AR, ATN1, ATXN1, ATXN2, ATXN3, ATXN7, C9orf72, CACNA1A, DMPK, FMR1, FXN, HTT, and TBP) using samples obtained within the National Health Service in England from patients who were suspected of having neurological disorders; previous PCR test results were used as the reference standard. The clinical accuracy of whole genome sequencing to detect repeat expansions was prospectively examined in previously genetically tested and undiagnosed patients recruited in 2013-17 to the 100â000 Genomes Project in the UK, who were suspected of having a genetic neurological disorder (familial or early-onset forms of ataxia, neuropathy, spastic paraplegia, dementia, motor neuron disease, parkinsonian movement disorders, intellectual disability, or neuromuscular disorders). If a repeat expansion call was made using whole genome sequencing, PCR was used to confirm the result. FINDINGS: The diagnostic accuracy of whole genome sequencing to detect repeat expansions was evaluated against 793 PCR tests previously performed within the NHS from 404 patients. Whole genome sequencing correctly classified 215 of 221 expanded alleles and 1316 of 1321 non-expanded alleles, showing 97·3% sensitivity (95% CI 94·2-99·0) and 99·6% specificity (99·1-99·9) across the 13 disease-associated loci when compared with PCR test results. In samples from 11â631 patients in the 100â000 Genomes Project, whole genome sequencing identified 81 repeat expansions, which were also tested by PCR: 68 were confirmed as repeat expansions in the full pathogenic range, 11 were non-pathogenic intermediate expansions or premutations, and two were non-expanded repeats (16% false discovery rate). INTERPRETATION: In our study, whole genome sequencing for the detection of repeat expansions showed high sensitivity and specificity, and it led to identification of neurological repeat expansion disorders in previously undiagnosed patients. These findings support implementation of whole genome sequencing in clinical laboratories for diagnosis of patients who have a neurological presentation consistent with a repeat expansion disorder. FUNDING: Medical Research Council, Department of Health and Social Care, National Health Service England, National Institute for Health Research, and Illumina.
Asunto(s)
Expansión de las Repeticiones de ADN , Medicina Estatal , Niño , Proteína de la Discapacidad Intelectual del Síndrome del Cromosoma X Frágil/genética , Humanos , Estudios Prospectivos , Estudios Retrospectivos , Reino Unido , Secuenciación Completa del Genoma/métodosRESUMEN
Pulmonary arterial hypertension (PAH) is a rare disorder with a poor prognosis. Deleterious variation within components of the transforming growth factor-ß pathway, particularly the bone morphogenetic protein type 2 receptor (BMPR2), underlies most heritable forms of PAH. To identify the missing heritability we perform whole-genome sequencing in 1038 PAH index cases and 6385 PAH-negative control subjects. Case-control analyses reveal significant overrepresentation of rare variants in ATP13A3, AQP1 and SOX17, and provide independent validation of a critical role for GDF2 in PAH. We demonstrate familial segregation of mutations in SOX17 and AQP1 with PAH. Mutations in GDF2, encoding a BMPR2 ligand, lead to reduced secretion from transfected cells. In addition, we identify pathogenic mutations in the majority of previously reported PAH genes, and provide evidence for further putative genes. Taken together these findings contribute new insights into the molecular basis of PAH and indicate unexplored pathways for therapeutic intervention.