RESUMO
Rare disease patients are more likely to receive a rapid molecular diagnosis nowadays thanks to the wide adoption of next-generation sequencing. However, many cases remain undiagnosed even after exome or genome analysis, because the methods used missed the molecular cause in a known gene, or a novel causative gene could not be identified and/or confirmed. To address these challenges, the RD-Connect Genome-Phenome Analysis Platform (GPAP) facilitates the collation, discovery, sharing, and analysis of standardized genome-phenome data within a collaborative environment. Authorized clinicians and researchers submit pseudonymised phenotypic profiles encoded using the Human Phenotype Ontology, and raw genomic data which is processed through a standardized pipeline. After an optional embargo period, the data are shared with other platform users, with the objective that similar cases in the system and queries from peers may help diagnose the case. Additionally, the platform enables bidirectional discovery of similar cases in other databases from the Matchmaker Exchange network. To facilitate genome-phenome analysis and interpretation by clinical researchers, the RD-Connect GPAP provides a powerful user-friendly interface and leverages tens of information sources. As a result, the resource has already helped diagnose hundreds of rare disease patients and discover new disease causing genes.
Assuntos
Genômica , Doenças Raras , Exoma , Estudos de Associação Genética , Genômica/métodos , Humanos , Fenótipo , Doenças Raras/diagnóstico , Doenças Raras/genéticaRESUMO
Rare diseases (RD) have a prevalence of not more than 1/2000 persons in the European population, and are characterised by the difficulty experienced in obtaining a correct and timely diagnosis. According to Orphanet, 72.5% of RD have a genetic origin although 35% of them do not yet have an identified causative gene. A significant proportion of patients suspected to have a genetic RD receive an inconclusive exome/genome sequencing. Working towards the International Rare Diseases Research Consortium (IRDiRC)'s goal for 2027 to ensure that all people living with a RD receive a diagnosis within one year of coming to medical attention, the Solve-RD project aims to identify the molecular causes underlying undiagnosed RD. As part of this strategy, we developed a phenotypic similarity-based variant prioritization methodology comparing submitted cases with other submitted cases and with known RD in Orphanet. Three complementary approaches based on phenotypic similarity calculations using the Human Phenotype Ontology (HPO), the Orphanet Rare Diseases Ontology (ORDO) and the HPO-ORDO Ontological Module (HOOM) were developed; genomic data reanalysis was performed by the RD-Connect Genome-Phenome Analysis Platform (GPAP). The methodology was tested in 4 exemplary cases discussed with experts from European Reference Networks. Variants of interest (pathogenic or likely pathogenic) were detected in 8.8% of the 725 cases clustered by similarity calculations. Diagnostic hypotheses were validated in 42.1% of them and needed further exploration in another 10.9%. Based on the promising results, we are devising an automated standardized phenotypic-based re-analysis pipeline to be applied to the entire unsolved cases cohort.
Assuntos
Genômica , Doenças Raras , Humanos , Doenças Raras/diagnóstico , Doenças Raras/epidemiologia , Doenças Raras/genética , Fenótipo , Mapeamento CromossômicoRESUMO
The Solve-RD project brings together clinicians, scientists, and patient representatives from 51 institutes spanning 15 countries to collaborate on genetically diagnosing ("solving") rare diseases (RDs). The project aims to significantly increase the diagnostic success rate by co-analyzing data from thousands of RD cases, including phenotypes, pedigrees, exome/genome sequencing, and multiomics data. Here we report on the data infrastructure devised and created to support this co-analysis. This infrastructure enables users to store, find, connect, and analyze data and metadata in a collaborative manner. Pseudonymized phenotypic and raw experimental data are submitted to the RD-Connect Genome-Phenome Analysis Platform and processed through standardized pipelines. Resulting files and novel produced omics data are sent to the European Genome-Phenome Archive, which adds unique file identifiers and provides long-term storage and controlled access services. MOLGENIS "RD3" and Café Variome "Discovery Nexus" connect data and metadata and offer discovery services, and secure cloud-based "Sandboxes" support multiparty data analysis. This successfully deployed and useful infrastructure design provides a blueprint for other projects that need to analyze large amounts of heterogeneous data.
Assuntos
Doenças Raras , Doenças Raras/genética , Humanos , Bases de Dados Genéticas , Fenótipo , Metadados , Biologia Computacional/métodos , Genômica/métodosRESUMO
The Solve-RD project objectives include solving undiagnosed rare diseases (RD) through collaborative research on shared genome-phenome datasets. The RD-Connect Genome-Phenome Analysis Platform (GPAP), for data collation and analysis, and the European Genome-Phenome Archive (EGA), for file storage, are two key components of the Solve-RD infrastructure. Clinical researchers can identify candidate genetic variants within the RD-Connect GPAP and, thanks to the developments presented here as part of joint ELIXIR activities, are able to remotely visualize the corresponding alignments stored at the EGA. The Global Alliance for Genomics and Health (GA4GH) htsget streaming application programming interface (API) is used to retrieve alignment slices, which are rendered by an integrated genome viewer (IGV) instance embedded in the GPAP. As a result, it is no longer necessary for over 11,000 datasets to download large alignment files to visualize them locally. This work highlights the advantages, from both the user and infrastructure perspectives, of implementing interoperability standards for establishing federated genomics data networks.
RESUMO
Many patients experiencing a rare disease remain undiagnosed even after genomic testing. Reanalysis of existing genomic data has shown to increase diagnostic yield, although there are few systematic and comprehensive reanalysis efforts that enable collaborative interpretation and future reinterpretation. The Undiagnosed Rare Disease Program of Catalonia project collated previously inconclusive good quality genomic data (panels, exomes, and genomes) and standardized phenotypic profiles from 323 families (543 individuals) with a neurologic rare disease. The data were reanalyzed systematically to identify relatedness, runs of homozygosity, consanguinity, single-nucleotide variants, insertions and deletions, and copy number variants. Data were shared and collaboratively interpreted within the consortium through a customized Genome-Phenome Analysis Platform, which also enables future data reinterpretation. Reanalysis of existing genomic data provided a diagnosis for 20.7% of the patients, including 1.8% diagnosed after the generation of additional genomic data to identify a second pathogenic heterozygous variant. Diagnostic rate was significantly higher for family-based exome/genome reanalysis compared with singleton panels. Most new diagnoses were attributable to recent gene-disease associations (50.8%), additional or improved bioinformatic analysis (19.7%), and standardized phenotyping data integrated within the Undiagnosed Rare Disease Program of Catalonia Genome-Phenome Analysis Platform functionalities (18%).
Assuntos
Genômica , Doenças Raras , Biologia Computacional , Exoma , Humanos , Doenças Raras/diagnóstico , Doenças Raras/genética , Sequenciamento do ExomaRESUMO
Reanalysis of inconclusive exome/genome sequencing data increases the diagnosis yield of patients with rare diseases. However, the cost and efforts required for reanalysis prevent its routine implementation in research and clinical environments. The Solve-RD project aims to reveal the molecular causes underlying undiagnosed rare diseases. One of the goals is to implement innovative approaches to reanalyse the exomes and genomes from thousands of well-studied undiagnosed cases. The raw genomic data is submitted to Solve-RD through the RD-Connect Genome-Phenome Analysis Platform (GPAP) together with standardised phenotypic and pedigree data. We have developed a programmatic workflow to reanalyse genome-phenome data. It uses the RD-Connect GPAP's Application Programming Interface (API) and relies on the big-data technologies upon which the system is built. We have applied the workflow to prioritise rare known pathogenic variants from 4411 undiagnosed cases. The queries returned an average of 1.45 variants per case, which first were evaluated in bulk by a panel of disease experts and afterwards specifically by the submitter of each case. A total of 120 index cases (21.2% of prioritised cases, 2.7% of all exome/genome-negative samples) have already been solved, with others being under investigation. The implementation of solutions as the one described here provide the technical framework to enable periodic case-level data re-evaluation in clinical settings, as recommended by the American College of Medical Genetics.
Assuntos
Testes Genéticos/métodos , Genômica/métodos , Doenças Raras/genética , Software , Testes Genéticos/normas , Genômica/normas , Humanos , Linhagem , Doenças Raras/diagnóstico , Sensibilidade e EspecificidadeRESUMO
Autozygosity is associated with an increased risk of genetic rare disease, thus being a relevant factor for clinical genetic studies. More than 2400 exome sequencing data sets were analyzed and screened for autozygosity on the basis of detection of >1 Mbp runs of homozygosity (ROHs). A model was built to predict if an individual is likely to be a consanguineous offspring (accuracy, 98%), and probability of consanguinity ranges were established according to the total ROH size. Application of the model resulted in the reclassification of the consanguinity status of 12% of the patients. The analysis of a subset of 79 consanguineous cases with the Rare Disease (RD)-Connect Genome-Phenome Analysis Platform, combining variant filtering and homozygosity mapping, enabled a 50% reduction in the number of candidate variants and the identification of homozygous pathogenic variants in 41 patients, with an overall diagnostic yield of 52%. The newly defined consanguinity ranges provide, for the first time, specific ROH thresholds to estimate inbreeding within a pedigree on disparate exome sequencing data, enabling confirmation or (re)classification of consanguineous status, hence increasing the efficiency of molecular diagnosis and reporting on secondary consanguinity findings, as recommended by American College of Medical Genetics and Genomics guidelines.
Assuntos
Sequenciamento do Exoma/métodos , Homozigoto , Técnicas de Diagnóstico Molecular/métodos , Doenças Raras/diagnóstico , Doenças Raras/genética , Consanguinidade , Exoma , Genoma Humano , Humanos , Modelos Genéticos , Linhagem , Polimorfismo de Nucleotídeo Único , Doenças Raras/epidemiologia , Doenças Raras/etnologiaRESUMO
BACKGROUND: Rare diseases are individually rare but globally affect around 6% of the population, and in over 70% of cases are genetically determined. Their rarity translates into a delayed diagnosis, with 25% of patients waiting 5 to 30 years for one. It is essential to raise awareness of patients and clinicians of existing gene and variant-specific therapeutics at the time of diagnosis to avoid that treatment delays add up to the diagnostic odyssey of rare diseases' patients and their families. AIMS: This paper aims to provide guidance and give detailed instructions on how to write homogeneous systematic reviews of rare diseases' treatments in a manner that allows the capture of the results in a computer-accessible form. The published results need to comply with the FAIR guiding principles for scientific data management and stewardship to facilitate the extraction of datasets that are easily transposable into machine-actionable information. The ultimate purpose is the creation of a database of rare disease treatments ("Treatabolome") at gene and variant levels as part of the H2020 research project Solve-RD. RESULTS: Each systematic review follows a written protocol to address one or more rare diseases in which the authors are experts. The bibliographic search strategy requires detailed documentation to allow its replication. Data capture forms should be built to facilitate the filling of a data capture spreadsheet and to record the application of the inclusion and exclusion criteria to each search result. A PRISMA flowchart is required to provide an overview of the processes of search and selection of papers. A separate table condenses the data collected during the Systematic Review, appraised according to their level of evidence. CONCLUSIONS: This paper provides a template that includes the instructions for writing FAIR-compliant systematic reviews of rare diseases' treatments that enables the assembly of a Treatabolome database that complement existing diagnostic and management support tools with treatment awareness data.
Assuntos
Gerenciamento de Dados , Doenças Raras , Humanos , Doenças Raras/genética , Doenças Raras/terapia , Projetos de Pesquisa , Revisões Sistemáticas como Assunto , RedaçãoRESUMO
The COVID-19 pandemic has posed and is continuously posing enormous societal and health challenges worldwide. The research community has mobilized to develop novel projects to find a cure or a vaccine, as well as to contribute to mass testing, which has been a critical measure to contain the infection in several countries. Through this article, we share our experiences and learnings as a group of volunteers at the Centre for Genomic Regulation (CRG) in Barcelona, Spain. As members of the ORFEU project, an initiative by the Government of Catalonia to achieve mass testing of people at risk and contain the epidemic in Spain, we share our motivations, challenges and the key lessons learnt, which we feel will help better prepare the global society to address similar situations in the future.