Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
Add more filters










Publication year range
1.
JAMIA Open ; 7(1): ooae002, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38283884

ABSTRACT

Objectives: To provide a real-world example on how and to what extent Health Level Seven Fast Healthcare Interoperability Resources (FHIR) implements the Findable, Accessible, Interoperable, and Reusable (FAIR) guiding principles for scientific data. Additionally, presents a list of FAIR implementation choices for supporting future FAIR implementations that use FHIR. Materials and methods: A case study was conducted on the Medical Information Mart for Intensive Care-IV Emergency Department (MIMIC-ED) dataset, a deidentified clinical dataset converted into FHIR. The FAIRness of this dataset was assessed using a set of common FAIR assessment indicators. Results: The FHIR distribution of MIMIC-ED, comprising an implementation guide and demo data, was more FAIR compared to the non-FHIR distribution. The FAIRness score increased from 60 to 82 out of 95 points, a relative improvement of 37%. The most notable improvements were observed in interoperability, with a score increase from 5 to 19 out of 19 points, and reusability, with a score increase from 8 to 14 out of 24 points. A total of 14 FAIR implementation choices were identified. Discussion: Our work examined how and to what extent the FHIR standard contributes to FAIR data. Challenges arose from interpreting the FAIR assessment indicators. This study stands out for providing a real-world example of a dataset that was made more FAIR using FHIR. Conclusion: To the best of our knowledge, this is the first study that formally assessed the conformance of a FHIR dataset to the FAIR principles. FHIR improved the accessibility, interoperability, and reusability of MIMIC-ED. Future research should focus on implementing FHIR in research data infrastructures.

2.
J Biomed Semantics ; 14(1): 19, 2023 Dec 05.
Article in English | MEDLINE | ID: mdl-38053130

ABSTRACT

INTRODUCTION: Healthcare data and the knowledge gleaned from it play a key role in improving the health of current and future patients. These knowledge sources are regularly represented as 'linked' resources based on the Resource Description Framework (RDF). Making resources 'linkable' to facilitate their interoperability is especially important in the rare-disease domain, where health resources are scattered and scarce. However, to benefit from using RDF, resources need to be of good quality. Based on existing metrics, we aim to assess the quality of RDF resources related to rare diseases and provide recommendations for their improvement. METHODS: Sixteen resources of relevance for the rare-disease domain were selected: two schemas, three metadatasets, and eleven ontologies. These resources were tested on six objective metrics regarding resolvability, parsability, and consistency. Any URI that failed the test based on any of the six metrics was recorded as an error. The error count and percentage of each tested resource were recorded. The assessment results were represented in RDF, using the Data Quality Vocabulary schema. RESULTS: For three out of the six metrics, the assessment revealed quality issues. Eleven resources have non-resolvable URIs with proportion to all URIs ranging from 0.1% (6/6,712) in the Anatomical Therapeutic Chemical Classification to 13.7% (17/124) in the WikiPathways Ontology; seven resources have undefined URIs; and two resources have incorrectly used properties of the 'owl:ObjectProperty' type. Individual errors were examined to generate suggestions for the development of high-quality RDF resources, including the tested resources. CONCLUSION: We assessed the resolvability, parsability, and consistency of RDF resources in the rare-disease domain, and determined the extent of these types of errors that potentially affect interoperability. The qualitative investigation on these errors reveals how they can be avoided. All findings serve as valuable input for the development of a guideline for creating high-quality RDF resources, thereby enhancing the interoperability of biomedical resources.


Subject(s)
Knowledge Bases , Rare Diseases , Humans
3.
BMC Med Inform Decis Mak ; 23(Suppl 1): 90, 2023 05 10.
Article in English | MEDLINE | ID: mdl-37165363

ABSTRACT

INTRODUCTION: The Semantic Web community provides a common Resource Description Framework (RDF) that allows representation of resources such that they can be linked. To maximize the potential of linked data - machine-actionable interlinked resources on the Web - a certain level of quality of RDF resources should be established, particularly in the biomedical domain in which concepts are complex and high-quality biomedical ontologies are in high demand. However, it is unclear which quality metrics for RDF resources exist that can be automated, which is required given the multitude of RDF resources. Therefore, we aim to determine these metrics and demonstrate an automated approach to assess such metrics of RDF resources. METHODS: An initial set of metrics are identified through literature, standards, and existing tooling. Of these, metrics are selected that fulfil these criteria: (1) objective; (2) automatable; and (3) foundational. Selected metrics are represented in RDF and semantically aligned to existing standards. These metrics are then implemented in an open-source tool. To demonstrate the tool, eight commonly used RDF resources were assessed, including data models in the healthcare domain (HL7 RIM, HL7 FHIR, CDISC CDASH), ontologies (DCT, SIO, FOAF, ORDO), and a metadata profile (GRDDL). RESULTS: Six objective metrics are identified in 3 categories: Resolvability (1), Parsability (1), and Consistency (4), and represented in RDF. The tool demonstrates that these metrics can be automated, and application in the healthcare domain shows non-resolvable URIs (ranging from 0.3% to 97%) among all eight resources and undefined URIs in HL7 RIM, and FHIR. In the tested resources no errors were found for parsability and the other three consistency metrics for correct usage of classes and properties. CONCLUSION: We extracted six objective and automatable metrics from literature, as the foundational quality requirements of RDF resources to maximize the potential of linked data. Automated tooling to assess resources has shown to be effective to identify quality issues that must be avoided. This approach can be expanded to incorporate more automatable metrics so as to reflect additional quality dimensions with the assessment tool implementing more metrics.


Subject(s)
Biological Ontologies , Humans , Delivery of Health Care
4.
Orphanet J Rare Dis ; 17(1): 436, 2022 12 14.
Article in English | MEDLINE | ID: mdl-36517834

ABSTRACT

INTRODUCTION: Rare disease patient data are typically sensitive, present in multiple registries controlled by different custodians, and non-interoperable. Making these data Findable, Accessible, Interoperable, and Reusable (FAIR) for humans and machines at source enables federated discovery and analysis across data custodians. This facilitates accurate diagnosis, optimal clinical management, and personalised treatments. In Europe, twenty-four European Reference Networks (ERNs) work on rare disease registries in different clinical domains. The process and the implementation choices for making data FAIR ('FAIRification') differ among ERN registries. For example, registries use different software systems and are subject to different legal regulations. To support the ERNs in making informed decisions and to harmonise FAIRification, the FAIRification steward team was established to work as liaisons between ERNs and researchers from the European Joint Programme on Rare Diseases. RESULTS: The FAIRification steward team inventoried the FAIRification challenges of the ERN registries and proposed solutions collectively with involved stakeholders to address them. Ninety-eight FAIRification challenges from 24 ERNs' registries were collected and categorised into "training" (31), "community" (9), "modelling" (12), "implementation" (26), and "legal" (20). After curating and aggregating highly similar challenges, 41 unique FAIRification challenges remained. The two categories with the most challenges were "training" (15) and "implementation" (9), followed by "community" (7), and then "modelling" (5) and "legal" (5). To address all challenges, eleven types of solutions were proposed. Among them, the provision of guidelines and the organisation of training activities resolved the "training" challenges, which ranged from less-technical "coffee-rounds" to technical workshops, from informal FAIR Games to formal hackathons. Obtaining implementation support from technical experts was the solution type for tackling the "implementation" challenges. CONCLUSION: This work shows that a dedicated team of FAIR data stewards is an asset for harmonising the various processes of making data FAIR in a large organisation with multiple stakeholders. Additionally, multi-levelled training activities are required to accommodate the diverse needs of the ERNs. Finally, the lessons learned from the experience of the FAIRification steward team described in this paper may help to increase FAIR awareness and provide insights into FAIRification challenges and solutions of rare disease registries.


Subject(s)
Rare Diseases , Software , Humans , Europe , Rare Diseases/therapy , Registries
5.
J Biomed Semantics ; 13(1): 19, 2022 07 15.
Article in English | MEDLINE | ID: mdl-35841031

ABSTRACT

BACKGROUND: Ontology matching should contribute to the interoperability aspect of FAIR data (Findable, Accessible, Interoperable, and Reusable). Multiple data sources can use different ontologies for annotating their data and, thus, creating the need for dynamic ontology matching services. In this experimental study, we assessed the performance of ontology matching systems in the context of a real-life application from the rare disease domain. Additionally, we present a method for analyzing top-level classes to improve precision. RESULTS: We included three ontologies (NCIt, SNOMED CT, ORDO) and three matching systems (AgreementMakerLight 2.0, FCA-Map, LogMap 2.0). We evaluated the performance of the matching systems against reference alignments from BioPortal and the Unified Medical Language System Metathesaurus (UMLS). Then, we analyzed the top-level ancestors of matched classes, to detect incorrect mappings without consulting a reference alignment. To detect such incorrect mappings, we manually matched semantically equivalent top-level classes of ontology pairs. AgreementMakerLight 2.0, FCA-Map, and LogMap 2.0 had F1-scores of 0.55, 0.46, 0.55 for BioPortal and 0.66, 0.53, 0.58 for the UMLS respectively. Using vote-based consensus alignments increased performance across the board. Evaluation with manually created top-level hierarchy mappings revealed that on average 90% of the mappings' classes belonged to top-level classes that matched. CONCLUSIONS: Our findings show that the included ontology matching systems automatically produced mappings that were modestly accurate according to our evaluation. The hierarchical analysis of mappings seems promising when no reference alignments are available. All in all, the systems show potential to be implemented as part of an ontology matching service for querying FAIR data. Future research should focus on developing methods for the evaluation of mappings used in such mapping services, leading to their implementation in a FAIR data ecosystem.


Subject(s)
Biological Ontologies , Ecosystem , Consensus , Information Storage and Retrieval , Systematized Nomenclature of Medicine , Unified Medical Language System
6.
BMC Med ; 20(1): 173, 2022 05 04.
Article in English | MEDLINE | ID: mdl-35505341

ABSTRACT

BACKGROUND: Necrotising soft tissue infections (NSTIs) are rapidly progressing bacterial infections usually caused by either several pathogens in unison (polymicrobial infections) or Streptococcus pyogenes (mono-microbial infection). These infections are rare and are associated with high mortality rates. However, the underlying pathogenic mechanisms in this heterogeneous group remain elusive. METHODS: In this study, we built interactomes at both the population and individual levels consisting of host-pathogen interactions inferred from dual RNA-Seq gene transcriptomic profiles of the biopsies from NSTI patients. RESULTS: NSTI type-specific responses in the host were uncovered. The S. pyogenes mono-microbial subnetwork was enriched with host genes annotated with involved in cytokine production and regulation of response to stress. The polymicrobial network consisted of several significant associations between different species (S. pyogenes, Porphyromonas asaccharolytica and Escherichia coli) and host genes. The host genes associated with S. pyogenes in this subnetwork were characterised by cellular response to cytokines. We further found several virulence factors including hyaluronan synthase, Sic1, Isp, SagF, SagG, ScfAB-operon, Fba and genes upstream and downstream of EndoS along with bacterial housekeeping genes interacting with the human stress and immune response in various subnetworks between host and pathogen. CONCLUSIONS: At the population level, we found aetiology-dependent responses showing the potential modes of entry and immune evasion strategies employed by S. pyogenes, congruent with general cellular processes such as differentiation and proliferation. After stratifying the patients based on the subject-specific networks to study the patient-specific response, we observed different patient groups with different collagens, cytoskeleton and actin monomers in association with virulence factors, immunogenic proteins and housekeeping genes which we utilised to postulate differing modes of entry and immune evasion for different bacteria in relationship to the patients' phenotype.


Subject(s)
Coinfection , Soft Tissue Infections , Streptococcal Infections , Coinfection/genetics , Humans , Soft Tissue Infections/genetics , Soft Tissue Infections/microbiology , Streptococcal Infections/genetics , Streptococcal Infections/microbiology , Streptococcus pyogenes/genetics , Virulence Factors/genetics
7.
J Biomed Semantics ; 13(1): 9, 2022 03 15.
Article in English | MEDLINE | ID: mdl-35292119

ABSTRACT

BACKGROUND: The European Platform on Rare Disease Registration (EU RD Platform) aims to address the fragmentation of European rare disease (RD) patient data, scattered among hundreds of independent and non-coordinating registries, by establishing standards for integration and interoperability. The first practical output of this effort was a set of 16 Common Data Elements (CDEs) that should be implemented by all RD registries. Interoperability, however, requires decisions beyond data elements - including data models, formats, and semantics. Within the European Joint Programme on Rare Diseases (EJP RD), we aim to further the goals of the EU RD Platform by generating reusable RD semantic model templates that follow the FAIR Data Principles. RESULTS: Through a team-based iterative approach, we created semantically grounded models to represent each of the CDEs, using the SemanticScience Integrated Ontology as the core framework for representing the entities and their relationships. Within that framework, we mapped the concepts represented in the CDEs, and their possible values, into domain ontologies such as the Orphanet Rare Disease Ontology, Human Phenotype Ontology and National Cancer Institute Thesaurus. Finally, we created an exemplar, reusable ETL pipeline that we will be deploying over these non-coordinating data repositories to assist them in creating model-compliant FAIR data without requiring site-specific coding nor expertise in Linked Data or FAIR. CONCLUSIONS: Within the EJP RD project, we determined that creating reusable, expert-designed templates reduced or eliminated the requirement for our participating biomedical domain experts and rare disease data hosts to understand OWL semantics. This enabled them to publish highly expressive FAIR data using tools and approaches that were already familiar to them.


Subject(s)
Common Data Elements , Rare Diseases , Humans , Registries , Semantics , Workflow
8.
Stud Health Technol Inform ; 287: 104-108, 2021 Nov 18.
Article in English | MEDLINE | ID: mdl-34795091

ABSTRACT

Ontologies listed in the OBO Foundry are often regarded as reliable choices to be reused but ontology interoperability of them remains unknown. This study evaluated the resolvability of URIs and consistency of axioms in the OBO Foundry library, BFO ontology, and CIDO ontology. All had nonresolvable URIs, but the OBO library and the CIDO had additional interoperability issues regarding the use of incorrect prefixes, mixing up with ontologies, and inconsistency in the use of property. These detected issues reflected the real-world common problems that were not significant from human beings' point of view but hindered the machine-processability of ontologies. The assessment performed in this study was automated and enables scale-up against more metrics over more ontologies, which remains future work.


Subject(s)
Biological Ontologies , Humans , Pilot Projects
9.
Front Microbiol ; 11: 1679, 2020.
Article in English | MEDLINE | ID: mdl-32765473

ABSTRACT

Mycoplasma hyopneumoniae (M. hyopneumoniae) causes enzootic pneumonia in pigs but it is still largely unknown which host-pathogen interactions enable persistent infection and cause disease. In this study, we analyzed the host and bacterial transcriptomes during infection using RNA sequencing. Comparison of the transcriptome of lung lesion tissue from infected pigs with lung tissue from non-infected animals, identified 424 differentially expressed genes (FDR < 0.01 and fold change > 1.5LOG2). These genes were part of the following major pathways of the immune system: interleukin signaling (type 4, 10, 13, and 18), regulation of Toll-like receptors by endogenous ligand and activation of C3 and C5 in the complement system. Besides analyzing the lung transcriptome, a sampling protocol was developed to obtain enough bacterial mRNA from infected lung tissue for RNA sequencing. This was done by flushing infected lobes in the lung, and subsequently enriching for bacterial RNA. On average, 2.2 million bacterial reads were obtained per biological replicate to analyze the bacterial in vivo transcriptome. We compared the in vivo bacterial transcriptome with the transcriptome of bacteria grown in vitro and identified 22 up-regulated and 30 down-regulated genes (FDR < 0.01 and fold change > 2LOG2). Six out of seven genes in the operon encoding the mycoplasma specific F1-like ATPase (MHP_RS02445-MHP_RS02475) and all genes in the operon MHP_RS01965-MHP_RS01990 with functions related to nucleotide metabolism, spermidine transport and glycerol-3-phoshate transport were up-regulated in vivo. Down-regulated in vivo were genes related to glycerol uptake, cilium adhesion (P102), cell division and myo-inositol metabolism. In addition to providing a novel method to isolate bacterial mRNA from infected lung, this study provided insights into changes in gene expression during infection, which could help development of novel treatment strategies against enzootic pneumonia caused by M. hyopneumoniae.

10.
BMC Genomics ; 20(1): 1028, 2019 Dec 30.
Article in English | MEDLINE | ID: mdl-31888466

ABSTRACT

BACKGROUND: The mammalian intestine is a complex biological system that exhibits functional plasticity in its response to diverse stimuli to maintain homeostasis. To improve our understanding of this plasticity, we performed a high-level data integration of 14 whole-genome transcriptomics datasets from samples of intestinal mouse mucosa. We used the tool Centrality based Pathway Analysis (CePa), along with information from the Reactome database. RESULTS: The results show an integrated response of the mouse intestinal mucosa to challenges with agents introduced orally that were expected to perturb homeostasis. We observed that a common set of pathways respond to different stimuli, of which the most reactive was the Regulation of Complement Cascade pathway. Altered expression of the Regulation of Complement Cascade pathway was verified in mouse organoids challenged with different stimuli in vitro. CONCLUSIONS: Results of the integrated transcriptomics analysis and data driven experiment suggest an important role of epithelial production of complement and host complement defence factors in the maintenance of homeostasis.


Subject(s)
Complement System Proteins/immunology , Homeostasis , Intestinal Mucosa/immunology , Intestinal Mucosa/metabolism , Transcriptome , Animals , Complement Activation , Computational Biology/methods , Gene Expression Profiling , Mice , Models, Biological , Molecular Sequence Annotation , Signal Transduction
11.
PLoS One ; 12(11): e0188282, 2017.
Article in English | MEDLINE | ID: mdl-29149221

ABSTRACT

Dietary protein sources can have profound effects on host-microbe interactions in the gut that are critically important for immune resilience. However more knowledge is needed to assess the impact of different protein sources on gut and animal health. Thirty-six wildtype male C57BL/6J mice of 35 d age (n = 6/group; mean ± SEM body weight 21.9 ± 0.25 g) were randomly assigned to groups fed for four weeks with semi synthetic diets prepared with one of the following protein sources containing (300 g/kg as fed basis): soybean meal (SBM), casein, partially delactosed whey powder, spray dried plasma protein, wheat gluten meal and yellow meal worm. At the end of the experiment, mice were sacrificed to collect ileal tissue to acquire gene expression data, and mammalian (mechanistic) target of rapamycin (mTOR) activity, ileal digesta to study changes in microbiota and serum to measure cytokines and chemokines. By genome-wide transcriptome analysis, we identified fourteen high level regulatory genes that are strongly affected in SBM-fed mice compared to the other experimental groups. They mostly related to the mTOR pathway. In addition, an increased (P < 0.05) concentration of granulocyte colony-stimulating factor was observed in serum of SBM-fed mice compared to other dietary groups. Moreover, by 16S rRNA sequencing, we observed that SBM-fed mice had higher (P < 0.05) abundances of Bacteroidales family S24-7, compared to the other dietary groups. We showed that measurements of genome-wide expression and microbiota composition in the mouse ileum reveal divergent responses to diets containing different protein sources, in particular for a diet based on SBM.


Subject(s)
Gastrointestinal Microbiome/genetics , Gene Regulatory Networks , Genes, Regulator , Ileum/microbiology , TOR Serine-Threonine Kinases/genetics , Transcriptome , Animals , Bacteria/classification , Bacteria/genetics , Bacteria/metabolism , Blood Proteins/administration & dosage , Blood Proteins/metabolism , Caseins/administration & dosage , Caseins/metabolism , Cytokines/genetics , Cytokines/immunology , Dietary Proteins/administration & dosage , Dietary Proteins/metabolism , Food, Formulated , Gastrointestinal Microbiome/immunology , Glutens/administration & dosage , Glutens/metabolism , Granulocyte Colony-Stimulating Factor/genetics , Granulocyte Colony-Stimulating Factor/immunology , Ileum/metabolism , Male , Mice , Mice, Inbred C57BL , Glycine max/chemistry , Glycine max/metabolism , TOR Serine-Threonine Kinases/metabolism , Whey Proteins/administration & dosage , Whey Proteins/metabolism
12.
Front Physiol ; 8: 388, 2017.
Article in English | MEDLINE | ID: mdl-28659815

ABSTRACT

The genotype and external phenotype of organisms are linked by so-called internal phenotypes which are influenced by environmental conditions. In this study, we used five existing -omics datasets representing five different layers of internal phenotypes, which were simultaneously measured in dietarily perturbed mice. We performed 10 pair-wise correlation analyses verified with a null model built from randomized data. Subsequently, the inferred networks were merged and literature mined for co-occurrences of identified linked nodes. Densely connected internal phenotypes emerged. Forty-five nodes have links with all other data-types and we denote them "connectivity hubs." In literature, we found proof of 6% of the 577 connections, suggesting a biological meaning for the observed correlations. The observed connectivities between metabolite and cytokines hubs showed higher numbers of literature hits as compared to the number of literature hits on the connectivities between the microbiota and gene expression internal phenotypes. We conclude that multi-level integrated networks may help to generate hypotheses and to design experiments aiming to further close the gap between genotype and phenotype. We describe and/or hypothesize on the biological relevance of four identified multi-level connectivity hubs.

13.
F1000Res ; 5: 2414, 2016.
Article in English | MEDLINE | ID: mdl-27990265

ABSTRACT

Biological pathways are increasingly available in the BioPAX format which uses an RDF model for data storage. One can retrieve the information in this data model in the scripting language R using the package rBiopaxParser, which converts the BioPAX format to one readable in R. It also has a function to build a regulatory network from the pathway information. Here we describe an extension of this function. The new function allows the user to build graphs of entire pathways, including regulated as well as non-regulated elements, and therefore provides a maximum of information. This function is available as part of the rBiopaxParser distribution from Bioconductor.

14.
BMC Genomics ; 16: 556, 2015 Jul 29.
Article in English | MEDLINE | ID: mdl-26220188

ABSTRACT

BACKGROUND: Evidence is accumulating that perturbation of early life microbial colonization of the gut induces long-lasting adverse health effects in individuals. Understanding the mechanisms behind these effects will facilitate modulation of intestinal health. The objective of this study was to identify biological processes involved in these long lasting effects and the (molecular) factors that regulate them. We used an antibiotic and the same antibiotic in combination with stress on piglets as an early life perturbation. Then we used host gene expression data from the gut (jejunum) tissue and community-scale analysis of gut microbiota from the same location of the gut, at three different time-points to gauge the reaction to the perturbation. We analysed the data by a new combination of existing tools. First, we analysed the data in two dimensions, treatment and time, with quadratic regression analysis. Then we applied network-based data integration approaches to find correlations between host gene expression and the resident microbial species. RESULTS: The use of a new combination of data analysis tools allowed us to identify significant long-lasting differences in jejunal gene expression patterns resulting from the early life perturbations. In addition, we were able to identify potential key gene regulators (hubs) for these long-lasting effects. Furthermore, data integration also showed that there are a handful of bacterial groups that were associated with temporal changes in gene expression. CONCLUSION: The applied systems-biology approach allowed us to take the first steps in unravelling biological processes involved in long lasting effects in the gut due to early life perturbations. The observed data are consistent with the hypothesis that these long lasting effects are due to differences in the programming of the gut immune system as induced by the temporary early life changes in the composition and/or diversity of microbiota in the gut.


Subject(s)
Bacteria/genetics , Intestines/microbiology , Swine/microbiology , Animals , Animals, Newborn , Bacteria/isolation & purification , Gene Regulatory Networks , Host-Pathogen Interactions/genetics , Microbiota , Oligonucleotide Array Sequence Analysis , RNA, Ribosomal, 16S/chemistry , RNA, Ribosomal, 16S/genetics , Transcriptome
SELECTION OF CITATIONS
SEARCH DETAIL
...