Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
País de afiliación
Intervalo de año de publicación
1.
Int J Mol Sci ; 22(22)2021 Nov 12.
Artículo en Inglés | MEDLINE | ID: mdl-34830104

RESUMEN

Epidermolysis bullosa is a group of genetic skin conditions characterized by abnormal skin (and mucosal) fragility caused by pathogenic variants in various genes. The disease severity ranges from early childhood mortality in the most severe types to occasional acral blistering in the mildest types. The subtype and severity of EB is linked to the gene involved and the specific variants in that gene, which also determine its mode of inheritance. Current treatment is mainly focused on symptomatic relief such as wound care and blister prevention, because truly curative treatment options are still at the preclinical stage. Given the current level of understanding, the broad spectrum of genes and variants underlying EB makes it impossible to develop a single treatment strategy for all patients. It is likely that many different variant-specific treatment strategies will be needed to ultimately treat all patients. Antisense-oligonucleotide (ASO)-mediated exon skipping aims to counteract pathogenic sequence variants by restoring the open reading frame through the removal of the mutant exon from the pre-messenger RNA. This should lead to the restored production of the protein absent in the affected skin and, consequently, improvement of the phenotype. Several preclinical studies have demonstrated that exon skipping can restore protein production in vitro, in skin equivalents, and in skin grafts derived from EB-patient skin cells, indicating that ASO-mediated exon skipping could be a viable strategy as a topical or systemic treatment. The potential value of exon skipping for EB is supported by a study showing reduced phenotypic severity in patients who carry variants that result in natural exon skipping. In this article, we review the substantial progress made on exon skipping for EB in the past 15 years and highlight the opportunities and current challenges of this RNA-based therapy approach. In addition, we present a prioritization strategy for the development of exon skipping based on genomic information of all EB-involved genes.


Asunto(s)
Epidermólisis Ampollosa , Exones , Fibroblastos/inmunología , Mutación , Oligonucleótidos Antisentido , Piel/inmunología , Epidermólisis Ampollosa/genética , Epidermólisis Ampollosa/inmunología , Epidermólisis Ampollosa/terapia , Humanos , Oligonucleótidos Antisentido/genética , Oligonucleótidos Antisentido/uso terapéutico
2.
Bioinformatics ; 35(6): 1076-1078, 2019 03 15.
Artículo en Inglés | MEDLINE | ID: mdl-30165396

RESUMEN

MOTIVATION: The volume and complexity of biological data increases rapidly. Many clinical professionals and biomedical researchers without a bioinformatics background are generating big '-omics' data, but do not always have the tools to manage, process or publicly share these data. RESULTS: Here we present MOLGENIS Research, an open-source web-application to collect, manage, analyze, visualize and share large and complex biomedical datasets, without the need for advanced bioinformatics skills. AVAILABILITY AND IMPLEMENTATION: MOLGENIS Research is freely available (open source software). It can be installed from source code (see http://github.com/molgenis), downloaded as a precompiled WAR file (for your own server), setup inside a Docker container (see http://molgenis.github.io), or requested as a Software-as-a-Service subscription. For a public demo instance and complete installation instructions see http://molgenis.org/research.


Asunto(s)
Biología Computacional , Programas Informáticos , Algoritmos , Genoma , Genómica
3.
Hum Mutat ; 39(3): 333-344, 2018 03.
Artículo en Inglés | MEDLINE | ID: mdl-29266534

RESUMEN

Microvillus inclusion disease (MVID) is a rare but fatal autosomal recessive congenital diarrheal disorder caused by MYO5B mutations. In 2013, we launched an open-access registry for MVID patients and their MYO5B mutations (www.mvid-central.org). Since then, additional unique MYO5B mutations have been identified in MVID patients, but also in non-MVID patients. Animal models have been generated that formally prove the causality between MYO5B and MVID. Importantly, mutations in two other genes, STXBP2 and STX3, have since been associated with variants of MVID, shedding new light on the pathogenesis of this congenital diarrheal disorder. Here, we review these additional genes and their mutations. Furthermore, we discuss recent data from cell studies that indicate that the three genes are functionally linked and, therefore, may constitute a common disease mechanism that unifies a subset of phenotypically linked congenital diarrheal disorders. We present new data based on patient material to support this. To congregate existing and future information on MVID geno-/phenotypes, we have updated and expanded the MVID registry to include all currently known MVID-associated gene mutations, their demonstrated or predicted functional consequences, and associated clinical information.


Asunto(s)
Diarrea/congénito , Diarrea/genética , Predisposición Genética a la Enfermedad , Proteínas Munc18/genética , Mutación/genética , Miosina Tipo V/genética , Proteínas Qa-SNARE/genética , Animales , Humanos
4.
Bioinformatics ; 32(14): 2176-83, 2016 07 15.
Artículo en Inglés | MEDLINE | ID: mdl-27153686

RESUMEN

MOTIVATION: While the size and number of biobanks, patient registries and other data collections are increasing, biomedical researchers still often need to pool data for statistical power, a task that requires time-intensive retrospective integration. RESULTS: To address this challenge, we developed MOLGENIS/connect, a semi-automatic system to find, match and pool data from different sources. The system shortlists relevant source attributes from thousands of candidates using ontology-based query expansion to overcome variations in terminology. Then it generates algorithms that transform source attributes to a common target DataSchema. These include unit conversion, categorical value matching and complex conversion patterns (e.g. calculation of BMI). In comparison to human-experts, MOLGENIS/connect was able to auto-generate 27% of the algorithms perfectly, with an additional 46% needing only minor editing, representing a reduction in the human effort and expertise needed to pool data. AVAILABILITY AND IMPLEMENTATION: Source code, binaries and documentation are available as open-source under LGPLv3 from http://github.com/molgenis/molgenis and www.molgenis.org/connect CONTACT: : m.a.swertz@rug.nl SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Bancos de Muestras Biológicas , Biología Computacional/métodos , Fenotipo , Programas Informáticos , Algoritmos , Ontologías Biológicas , Humanos
6.
Nucleic Acids Res ; 42(Database issue): D794-801, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24217915

RESUMEN

Interactions between proteins are highly conserved across species. As a result, the molecular basis of multiple diseases affecting humans can be studied in model organisms that offer many alternative experimental opportunities. One such organism-Caenorhabditis elegans-has been used to produce much molecular quantitative genetics and systems biology data over the past decade. We present WormQTL(HD) (Human Disease), a database that quantitatively and systematically links expression Quantitative Trait Loci (eQTL) findings in C. elegans to gene-disease associations in man. WormQTL(HD), available online at http://www.wormqtl-hd.org, is a user-friendly set of tools to reveal functionally coherent, evolutionary conserved gene networks. These can be used to predict novel gene-to-gene associations and the functions of genes underlying the disease of interest. We created a new database that links C. elegans eQTL data sets to human diseases (34 337 gene-disease associations from OMIM, DGA, GWAS Central and NHGRI GWAS Catalogue) based on overlapping sets of orthologous genes associated to phenotypes in these two species. We utilized QTL results, high-throughput molecular phenotypes, classical phenotypes and genotype data covering different developmental stages and environments from WormQTL database. All software is available as open source, built on MOLGENIS and xQTL workbench.


Asunto(s)
Caenorhabditis elegans/genética , Bases de Datos Genéticas , Modelos Animales de Enfermedad , Enfermedad/genética , Variación Genética , Sitios de Carácter Cuantitativo , Animales , Expresión Génica , Genoma de los Helmintos , Genómica , Humanos , Internet , Presión Osmótica , Fenotipo
7.
Hum Mutat ; 36(7): 712-9, 2015 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-25871441

RESUMEN

Next-generation sequencing in clinical diagnostics is providing valuable genomic variant data, which can be used to support healthcare decisions. In silico tools to predict pathogenicity are crucial to assess such variants and we have evaluated a new tool, Combined Annotation Dependent Depletion (CADD), and its classification of gene variants in Lynch syndrome by using a set of 2,210 DNA mismatch repair gene variants. These had already been classified by experts from InSiGHT's Variant Interpretation Committee. Overall, we found CADD scores do predict pathogenicity (Spearman's ρ = 0.595, P < 0.001). However, we discovered 31 major discrepancies between the InSiGHT classification and the CADD scores; these were explained in favor of the expert classification using population allele frequencies, cosegregation analyses, disease association studies, or a second-tier test. Of 751 variants that could not be clinically classified by InSiGHT, CADD indicated that 47 variants were worth further study to confirm their putative pathogenicity. We demonstrate CADD is valuable in prioritizing variants in clinically relevant genes for further assessment by expert classification teams.


Asunto(s)
Biología Computacional , Reparación de la Incompatibilidad de ADN , Variación Genética , Modelos Moleculares , Neoplasias Colorrectales Hereditarias sin Poliposis/genética , Estudios de Asociación Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Programas Informáticos
8.
Nucleic Acids Res ; 41(Database issue): D738-43, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23180786

RESUMEN

Here, we present WormQTL (http://www.wormqtl.org), an easily accessible database enabling search, comparative analysis and meta-analysis of all data on variation in Caenorhabditis spp. Over the past decade, Caenorhabditis elegans has become instrumental for molecular quantitative genetics and the systems biology of natural variation. These efforts have resulted in a valuable amount of phenotypic, high-throughput molecular and genotypic data across different developmental worm stages and environments in hundreds of C. elegans strains. WormQTL provides a workbench of analysis tools for genotype-phenotype linkage and association mapping based on but not limited to R/qtl (http://www.rqtl.org). All data can be uploaded and downloaded using simple delimited text or Excel formats and are accessible via a public web user interface for biologists and R statistic and web service interfaces for bioinformaticians, based on open source MOLGENIS and xQTL workbench software. WormQTL welcomes data submissions from other worm researchers.


Asunto(s)
Caenorhabditis/genética , Bases de Datos Genéticas , Sitios de Carácter Cuantitativo , Animales , Caenorhabditis elegans/genética , Expresión Génica , Estudios de Asociación Genética , Variación Genética , Internet
9.
Brief Bioinform ; 13(2): 135-42, 2012 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-22396485

RESUMEN

During a meeting of the SYSGENET working group 'Bioinformatics', currently available software tools and databases for systems genetics in mice were reviewed and the needs for future developments discussed. The group evaluated interoperability and performed initial feasibility studies. To aid future compatibility of software and exchange of already developed software modules, a strong recommendation was made by the group to integrate HAPPY and R/qtl analysis toolboxes, GeneNetwork and XGAP database platforms, and TIQS and xQTL processing platforms. R should be used as the principal computer language for QTL data analysis in all platforms and a 'cloud' should be used for software dissemination to the community. Furthermore, the working group recommended that all data models and software source code should be made visible in public repositories to allow a coordinated effort on the use of common data structures and file formats.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Factuales , Algoritmos , Animales , Redes Reguladoras de Genes , Ratones/genética , Sitios de Carácter Cuantitativo , Programas Informáticos
10.
Hum Mutat ; 34(12): 1597-605, 2013 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-24014347

RESUMEN

Microvillus inclusion disease (MVID) is one of the most severe congenital intestinal disorders and is characterized by neonatal secretory diarrhea and the inability to absorb nutrients from the intestinal lumen. MVID is associated with patient-, family-, and ancestry-unique mutations in the MYO5B gene, encoding the actin-based motor protein myosin Vb. Here, we review the MYO5B gene and all currently known MYO5B mutations and for the first time methodologically categorize these with regard to functional protein domains and recurrence in MYO7A associated with Usher syndrome and other myosins. We also review animal models for MVID and the latest data on functional studies related to the myosin Vb protein. To congregate existing and future information on MVID geno-/phenotypes and facilitate its quick and easy sharing among clinicians and researchers, we have constructed an online MOLGENIS-based international patient registry (www.MVID-central.org). This easily accessible database currently contains detailed information of 137 MVID patients together with reported clinical/phenotypic details and 41 unique MYO5B mutations, of which several unpublished. The future expansion and prospective nature of this registry is expected to improve disease diagnosis, prognosis, and genetic counseling.


Asunto(s)
Síndromes de Malabsorción/genética , Microvellosidades/patología , Mucolipidosis/genética , Mutación , Cadenas Pesadas de Miosina/genética , Miosina Tipo V/genética , Sistemas en Línea , Sistema de Registros , Animales , Modelos Animales de Enfermedad , Enterocitos/metabolismo , Enterocitos/patología , Humanos , Síndromes de Malabsorción/diagnóstico , Síndromes de Malabsorción/metabolismo , Microvellosidades/genética , Microvellosidades/metabolismo , Mucolipidosis/diagnóstico , Mucolipidosis/metabolismo , Cadenas Pesadas de Miosina/química , Cadenas Pesadas de Miosina/metabolismo , Miosina Tipo V/química , Miosina Tipo V/metabolismo , Miosinas/genética
11.
Bioinformatics ; 28(7): 1042-4, 2012 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-22308096

RESUMEN

SUMMARY: xQTL workbench is a scalable web platform for the mapping of quantitative trait loci (QTLs) at multiple levels: for example gene expression (eQTL), protein abundance (pQTL), metabolite abundance (mQTL) and phenotype (phQTL) data. Popular QTL mapping methods for model organism and human populations are accessible via the web user interface. Large calculations scale easily on to multi-core computers, clusters and Cloud. All data involved can be uploaded and queried online: markers, genotypes, microarrays, NGS, LC-MS, GC-MS, NMR, etc. When new data types come available, xQTL workbench is quickly customized using the Molgenis software generator. AVAILABILITY: xQTL workbench runs on all common platforms, including Linux, Mac OS X and Windows. An online demo system, installation guide, tutorials, software and source code are available under the LGPL3 license from http://www.xqtl.org. CONTACT: m.a.swertz@rug.nl.


Asunto(s)
Mapeo Cromosómico/métodos , Internet , Sitios de Carácter Cuantitativo , Programas Informáticos , Algoritmos , Biología Computacional/métodos , Humanos , Interfaz Usuario-Computador
12.
Front Genet ; 13: 824510, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35299955

RESUMEN

Background: In the molecular genetic diagnostics of Mendelian disorders, solutions are needed for the major challenge of dealing with the large number of variants of uncertain significance (VUSs) identified using next-generation sequencing (NGS). Recently, promising approaches using constraint metrics to calculate case excess scores (CE), etiological fractions (EF), and gnomAD-derived constraint scores have been reported that estimate the likelihood of rare variants in specific genes or regions that are pathogenic. Our objective is to study the usability of these constraint data into variant interpretation in a diagnostic setting, using our cardiomyopathy cohort. Methods and Results: Patients (N = 2002) referred for clinical genetic diagnostics underwent NGS testing of 55-61 genes associated with cardiomyopathies. Previously classified likely pathogenic (LP) and pathogenic (P) variants were used to validate the use of data from CE, EF, and gnomAD constraint analyses for (re)classification of associated variant types in specific cardiomyopathy subtype-related genes. The classifications corroborated in 94% (354/378) of cases. Next, we reclassified 23 unique VUSs to LP, increasing the diagnostic yield by 1.2%. In addition, 106 unique VUSs (5.3% of patients) were prioritized for co-segregation or functional analyses. Conclusions: Our analysis confirms that the use of constraint metrics data can improve variant interpretation, and we, therefore, recommend using constraint scores on other cohorts and disorders and its inclusion in variant interpretation protocols.

13.
Orphanet J Rare Dis ; 17(1): 436, 2022 12 14.
Artículo en Inglés | MEDLINE | ID: mdl-36517834

RESUMEN

INTRODUCTION: Rare disease patient data are typically sensitive, present in multiple registries controlled by different custodians, and non-interoperable. Making these data Findable, Accessible, Interoperable, and Reusable (FAIR) for humans and machines at source enables federated discovery and analysis across data custodians. This facilitates accurate diagnosis, optimal clinical management, and personalised treatments. In Europe, twenty-four European Reference Networks (ERNs) work on rare disease registries in different clinical domains. The process and the implementation choices for making data FAIR ('FAIRification') differ among ERN registries. For example, registries use different software systems and are subject to different legal regulations. To support the ERNs in making informed decisions and to harmonise FAIRification, the FAIRification steward team was established to work as liaisons between ERNs and researchers from the European Joint Programme on Rare Diseases. RESULTS: The FAIRification steward team inventoried the FAIRification challenges of the ERN registries and proposed solutions collectively with involved stakeholders to address them. Ninety-eight FAIRification challenges from 24 ERNs' registries were collected and categorised into "training" (31), "community" (9), "modelling" (12), "implementation" (26), and "legal" (20). After curating and aggregating highly similar challenges, 41 unique FAIRification challenges remained. The two categories with the most challenges were "training" (15) and "implementation" (9), followed by "community" (7), and then "modelling" (5) and "legal" (5). To address all challenges, eleven types of solutions were proposed. Among them, the provision of guidelines and the organisation of training activities resolved the "training" challenges, which ranged from less-technical "coffee-rounds" to technical workshops, from informal FAIR Games to formal hackathons. Obtaining implementation support from technical experts was the solution type for tackling the "implementation" challenges. CONCLUSION: This work shows that a dedicated team of FAIR data stewards is an asset for harmonising the various processes of making data FAIR in a large organisation with multiple stakeholders. Additionally, multi-levelled training activities are required to accommodate the diverse needs of the ERNs. Finally, the lessons learned from the experience of the FAIRification steward team described in this paper may help to increase FAIR awareness and provide insights into FAIRification challenges and solutions of rare disease registries.


Asunto(s)
Enfermedades Raras , Programas Informáticos , Humanos , Europa (Continente) , Enfermedades Raras/terapia , Sistema de Registros
14.
J Biomed Semantics ; 13(1): 9, 2022 03 15.
Artículo en Inglés | MEDLINE | ID: mdl-35292119

RESUMEN

BACKGROUND: The European Platform on Rare Disease Registration (EU RD Platform) aims to address the fragmentation of European rare disease (RD) patient data, scattered among hundreds of independent and non-coordinating registries, by establishing standards for integration and interoperability. The first practical output of this effort was a set of 16 Common Data Elements (CDEs) that should be implemented by all RD registries. Interoperability, however, requires decisions beyond data elements - including data models, formats, and semantics. Within the European Joint Programme on Rare Diseases (EJP RD), we aim to further the goals of the EU RD Platform by generating reusable RD semantic model templates that follow the FAIR Data Principles. RESULTS: Through a team-based iterative approach, we created semantically grounded models to represent each of the CDEs, using the SemanticScience Integrated Ontology as the core framework for representing the entities and their relationships. Within that framework, we mapped the concepts represented in the CDEs, and their possible values, into domain ontologies such as the Orphanet Rare Disease Ontology, Human Phenotype Ontology and National Cancer Institute Thesaurus. Finally, we created an exemplar, reusable ETL pipeline that we will be deploying over these non-coordinating data repositories to assist them in creating model-compliant FAIR data without requiring site-specific coding nor expertise in Linked Data or FAIR. CONCLUSIONS: Within the EJP RD project, we determined that creating reusable, expert-designed templates reduced or eliminated the requirement for our participating biomedical domain experts and rare disease data hosts to understand OWL semantics. This enabled them to publish highly expressive FAIR data using tools and approaches that were already familiar to them.


Asunto(s)
Elementos de Datos Comunes , Enfermedades Raras , Humanos , Sistema de Registros , Semántica , Flujo de Trabajo
15.
Sci Data ; 9(1): 169, 2022 04 13.
Artículo en Inglés | MEDLINE | ID: mdl-35418585

RESUMEN

The genomes of thousands of individuals are profiled within Dutch healthcare and research each year. However, this valuable genomic data, associated clinical data and consent are captured in different ways and stored across many systems and organizations. This makes it difficult to discover rare disease patients, reuse data for personalized medicine and establish research cohorts based on specific parameters. FAIR Genomes aims to enable NGS data reuse by developing metadata standards for the data descriptions needed to FAIRify genomic data while also addressing ELSI issues. We developed a semantic schema of essential data elements harmonized with international FAIR initiatives. The FAIR Genomes schema v1.1 contains 110 elements in 9 modules. It reuses common ontologies such as NCIT, DUO and EDAM, only introducing new terms when necessary. The schema is represented by a YAML file that can be transformed into templates for data entry software (EDC) and programmatic interfaces (JSON, RDF) to ease genomic data sharing in research and healthcare. The schema, documentation and MOLGENIS reference implementation are available at https://fairgenomes.org .


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , Metadatos , Atención a la Salud , Genómica , Humanos , Programas Informáticos
16.
Sci Rep ; 11(1): 10606, 2021 05 19.
Artículo en Inglés | MEDLINE | ID: mdl-34012022

RESUMEN

Allele specific expression (ASE) concerns divergent expression quantity of alternative alleles and is measured by RNA sequencing. Multiple studies show that ASE plays a role in hereditary diseases by modulating penetrance or phenotype severity. However, genome diagnostics is based on DNA sequencing and therefore neglects gene expression regulation such as ASE. To take advantage of ASE in absence of RNA sequencing, it must be predicted using only DNA variation. We have constructed ASE models from BIOS (n = 3432) and GTEx (n = 369) that predict ASE using DNA features. These models are highly reproducible and comprise many different feature types, highlighting the complex regulation that underlies ASE. We applied the BIOS-trained model to population variants in three genes in which ASE plays a clinically relevant role: BRCA2, RET and NF1. This resulted in predicted ASE effects for 27 variants, of which 10 were known pathogenic variants. We demonstrated that ASE can be predicted from DNA features using machine learning. Future efforts may improve sensitivity and translate these models into a new type of genome diagnostic tool that prioritizes candidate pathogenic variants or regulators thereof for follow-up validation by RNA sequencing. All used code and machine learning models are available at GitHub and Zenodo.


Asunto(s)
Alelos , Regulación de la Expresión Génica , Aprendizaje Automático , Análisis de Secuencia de ADN , Sesgo , Estudios de Factibilidad , Genoma Humano , Humanos , Modelos Genéticos , Polimorfismo de Nucleótido Simple/genética , Curva ROC
17.
Front Pediatr ; 9: 600556, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34136434

RESUMEN

Background: Genetic disorders are a substantial cause of infant morbidity and mortality and are frequently suspected in neonatal intensive care units. Non-specific clinical presentation or limitations to physical examination can result in a plethora of genetic testing techniques, without clear strategies on test ordering. Here, we review our 2-years experiences of rapid genetic testing of NICU patients in order to provide such recommendations. Methods: We retrospectively included all patients admitted to the NICU who received clinical genetic consultation and genetic testing in our University hospital. We documented reasons for referral for genetic consultation, presenting phenotypes, differential diagnoses, genetic testing requested and their outcomes, as well as the consequences of each (rapid) genetic diagnostic approach. We calculated diagnostic yield and turnaround times (TATs). Results: Of 171 included infants that received genetic consultation 140 underwent genetic testing. As a result of testing as first tier, 13/14 patients received a genetic diagnosis from QF-PCR; 14/115 from SNP-array; 12/89 from NGS testing, of whom 4/46 were diagnosed with a small gene panel and 8/43 with a large OMIM-morbid based gene panel. Subsequent secondary or tertiary analysis and/or additional testing resulted in five more diagnoses. TATs ranged from 1 day (QF-PCR) to a median of 14 for NGS and SNP-array testing, with increasing TAT in particular when many consecutive tests were performed. Incidental findings were detected in 5/140 tested patients (3.6%). Conclusion: We recommend implementing a broad NGS gene panel in combination with CNV calling as the first tier of genetic testing for NICU patients given the often unspecific phenotypes of ill infants and the high yield of this large panel.

18.
Adv Genet (Hoboken) ; 1(1): e10023, 2020 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-36619248

RESUMEN

Despite an explosive growth of next-generation sequencing data, genome diagnostics only provides a molecular diagnosis to a minority of patients. Software tools that prioritize genes based on patient symptoms using known gene-disease associations may complement variant filtering and interpretation to increase chances of success. However, many of these tools cannot be used in practice because they are embedded within variant prioritization algorithms, or exist as remote services that cannot be relied upon or are unacceptable because of legal/ethical barriers. In addition, many tools are not designed for command-line usage, closed-source, abandoned, or unavailable. We present Variant Interpretation using Biomedical literature Evidence (VIBE), a tool to prioritize disease genes based on Human Phenotype Ontology codes. VIBE is a locally installed executable that ensures operational availability and is built upon DisGeNET-RDF, a comprehensive knowledge platform containing gene-disease associations mostly from literature and variant-disease associations mostly from curated source databases. VIBE's command-line interface and output are designed for easy incorporation into bioinformatic pipelines that annotate and prioritize variants for further clinical interpretation. We evaluate VIBE in a benchmark based on 305 patient cases alongside seven other tools. Our results demonstrate that VIBE offers consistent performance with few cases missed, but we also find high complementarity among all tested tools. VIBE is a powerful, free, open source and locally installable solution for prioritizing genes based on patient symptoms. Project source code, documentation, benchmark and executables are available at https://github.com/molgenis/vibe.

19.
Genome Med ; 12(1): 75, 2020 08 24.
Artículo en Inglés | MEDLINE | ID: mdl-32831124

RESUMEN

Exome sequencing is now mainstream in clinical practice. However, identification of pathogenic Mendelian variants remains time-consuming, in part, because the limited accuracy of current computational prediction methods requires manual classification by experts. Here we introduce CAPICE, a new machine-learning-based method for prioritizing pathogenic variants, including SNVs and short InDels. CAPICE outperforms the best general (CADD, GAVIN) and consequence-type-specific (REVEL, ClinPred) computational prediction methods, for both rare and ultra-rare variants. CAPICE is easily added to diagnostic pipelines as pre-computed score file or command-line software, or using online MOLGENIS web service with API. Download CAPICE for free and open-source (LGPLv3) at https://github.com/molgenis/capice .


Asunto(s)
Biología Computacional/métodos , Exoma , Variación Genética , Programas Informáticos , Frecuencia de los Genes , Estudios de Asociación Genética/métodos , Humanos , Mutación INDEL , Aprendizaje Automático , Técnicas de Diagnóstico Molecular , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple , Curva ROC , Reproducibilidad de los Resultados
20.
Nat Commun ; 10(1): 2837, 2019 06 28.
Artículo en Inglés | MEDLINE | ID: mdl-31253775

RESUMEN

The diagnostic yield of exome and genome sequencing remains low (8-70%), due to incomplete knowledge on the genes that cause disease. To improve this, we use RNA-seq data from 31,499 samples to predict which genes cause specific disease phenotypes, and develop GeneNetwork Assisted Diagnostic Optimization (GADO). We show that this unbiased method, which does not rely upon specific knowledge on individual genes, is effective in both identifying previously unknown disease gene associations, and flagging genes that have previously been incorrectly implicated in disease. GADO can be run on www.genenetwork.nl by supplying HPO-terms and a list of genes that contain candidate variants. Finally, applying GADO to a cohort of 61 patients for whom exome-sequencing analysis had not resulted in a genetic diagnosis, yields likely causative genes for ten cases.


Asunto(s)
Regulación de la Expresión Génica/fisiología , Predisposición Genética a la Enfermedad , Análisis de Secuencia de ARN/métodos , Transcriptoma , Bases de Datos de Ácidos Nucleicos , Humanos , Modelos Genéticos , Análisis de Componente Principal , Programas Informáticos , Interfaz Usuario-Computador
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA