RESUMEN
The Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema was released in 2022 and approved by ISO as a standard for sharing clinical and genomic information about an individual, including phenotypic descriptions, numerical measurements, genetic information, diagnoses, and treatments. A phenopacket can be used as an input file for software that supports phenotype-driven genomic diagnostics and for algorithms that facilitate patient classification and stratification for identifying new diseases and treatments. There has been a great need for a collection of phenopackets to test software pipelines and algorithms. Here, we present Phenopacket Store. Phenopacket Store v.0.1.19 includes 6,668 phenopackets representing 475 Mendelian and chromosomal diseases associated with 423 genes and 3,834 unique pathogenic alleles curated from 959 different publications. This represents the first large-scale collection of case-level, standardized phenotypic information derived from case reports in the literature with detailed descriptions of the clinical data and will be useful for many purposes, including the development and testing of software for prioritizing genes and diseases in diagnostic genomics, machine learning analysis of clinical phenotype data, patient stratification, and genotype-phenotype correlations. This corpus also provides best-practice examples for curating literature-derived data using the GA4GH Phenopacket Schema.
RESUMEN
INTRODUCTION: For an interoperable Intelligent Tutoring System (ITS), we used resources from Fast Healthcare Interoperability Resources (FHIR) and mapped learning content with Unified Medical Language System (UMLS) codes to enhance healthcare education. This study addresses the need to enhance the interoperability and effectiveness of ITS in healthcare education. STATE OF THE ART: The current state of the art in ITS involves advanced personalized learning and adaptability techniques, integrating technologies such as machine learning to personalize the learning experience and to create systems that dynamically respond to individual learner needs. However, existing ITS architectures face challenges related to interoperability and integration with healthcare systems. CONCEPT: Our system maps learning content with UMLS codes, each scored for similarity, ensuring consistency and extensibility. FHIR is used to standardize the exchange of medical information and learning content. IMPLEMENTATION: Implemented as a microservice architecture, the system uses a recommender to request FHIR resources, provide questions, and measure learner progress. LESSONS LEARNED: Using international standards, our ITS ensures reproducibility and extensibility, enhancing interoperability and integration with existing platforms.
Asunto(s)
Interoperabilidad de la Información en Salud , Estándar HL7 , Unified Medical Language System , Humanos , Aprendizaje Automático , Instrucción por Computador/métodosRESUMEN
In the international classifications ICD-10-WHO and ICD-11-WHO, many sex-specific diseases have incomplete coding. It is possible to further enhance semantic interoperability using SNOMED CT additionally to ICD. Part of the analysis of semantic interoperability of diagnoses in the ICD are Sexual Dysfunctions, Postpartum Depression, Sexual Assault, Premenstrual Tension Syndrome and Premenstrual Dysphoric Disorder, Female Genital Mutilation and Cutting, Gender Incongruence and Disorders of Breast. Labeling biases have been identified in all diagnoses, either in SNOMED CT or ICD. For mental disorders associated with pregnancy, gender incongruence and sexual violence the use of the GPS of SNOMED CT can help enhance semantic interoperability additionally to ICD.
Asunto(s)
Clasificación Internacional de Enfermedades , Systematized Nomenclature of Medicine , Humanos , Femenino , Masculino , Sexismo , SemánticaRESUMEN
Our novel Intelligent Tutoring System (ITS) architecture integrates HL7 Fast Healthcare Interoperability Resources (FHIR) for data exchange and Unified Medical Language System (UMLS) codes for content mapping.
Asunto(s)
Estándar HL7 , Unified Medical Language System , Interoperabilidad de la Información en Salud , Integración de Sistemas , HumanosRESUMEN
This paper explores the critical role of Interoperability (IOP) in the integration of Artificial Intelligence (AI) for clinical applications. As AI gains prominence in medical analytics, its application in clinical practice faces challenges due to the lack of standardization in the medical sector. IOP, the ability of systems to exchange information seamlessly, emerges as a fundamental solution. Our paper discusses the indispensable nature of IOP throughout the Data Life Cycle, demonstrating how interoperable data can facilitate AI applications. The benefits of IOP encompass streamlined data entry for healthcare professionals, efficient data processing, enabling the sharing of data and algorithms for replication, and potentially increasing the significance of results obtained by medical data analytics via AI. Despite the challenges of IOP, its successful implementation promises substantial benefits for integrating AI into clinical practice, which could ultimately enhance patient outcomes and healthcare quality.
Asunto(s)
Inteligencia Artificial , Humanos , Interoperabilidad de la Información en Salud , Registros Electrónicos de Salud , Integración de SistemasRESUMEN
This work presents the Fast Healthcare Interoperability Resources (FHIR®) specification of the NFDI4Health Metadata schema based on FHIR Version 4: We created 16 profiles to facilitate the integration of clinical, epidemiological, and public health study data. Despite challenges arising from the extensive MDS as well as missing concepts in semantic standards, it marks a significant advance in applying information technology standards to health research.
Asunto(s)
Interoperabilidad de la Información en Salud , Estándar HL7 , Metadatos , Humanos , Registros Electrónicos de Salud , Estudios Epidemiológicos , Salud Pública , Investigación BiomédicaRESUMEN
Congenital heart disease (CHD) represents a significant challenge in prenatal care due to low prenatal detection rates. Artificial Intelligence (AI) offers promising avenues for precise CHD prediction. In this study we conducted a systematic review according to the PRISMA guidelines, investigating the landscape of AI applications in prenatal CHD detection. Through searches on PubMed, Embase, and Web of Science, 621 articles were screened, yielding 28 relevant studies for analysis. Deep Learning (DL) emerged as the predominant AI approach. Data types were limited to ultrasound and MRI sequences mainly. This comprehensive analysis provides valuable insights for future research and clinical practice in CHD detection using AI applications.
Asunto(s)
Inteligencia Artificial , Cardiopatías Congénitas , Cardiopatías Congénitas/diagnóstico por imagen , Humanos , Femenino , Embarazo , Diagnóstico Prenatal , Imagen por Resonancia Magnética , Ultrasonografía Prenatal , Aprendizaje ProfundoRESUMEN
This paper explores key success factors for the development and implementation of a Common Data Model (CDM) for Rare Diseases (RDs) focusing on the European context. Several challenges hinder RD care and research in diagnosis, treatment, and research, including data fragmentation, lack of standardisation, and Interoperability (IOP) issues within healthcare information systems. We identify key issues and recommendations for an RD-CDM, drawing on international guidelines and existing infrastructure, to address organisational, consensus, interoperability, usage, and secondary use challenges. Based on these, we analyse the importance of balancing the scope and IOP of a CDM to cater to the unique requirements of RDs while ensuring effective data exchange and usage across systems. In conclusion, a well-designed RD-CDM can bridge gaps in RD care and research, enhance patient care and facilitate international collaborations.
Asunto(s)
Elementos de Datos Comunes , Enfermedades Raras , Humanos , Registros Electrónicos de Salud , Europa (Continente) , Interoperabilidad de la Información en Salud , Enfermedades Raras/terapiaRESUMEN
The German initiative "National Research Data Infrastructure for Personal Health Data" (NFDI4Health) focuses on research data management in health research. It aims to foster and develop harmonized informatics standards for public health, epidemiological studies, and clinical trials, facilitating access to relevant data and metadata standards. This publication lists syntactic and semantic data standards of potential use for NFDI4Health and beyond, based on interdisciplinary meetings and workshops, mappings of study questionnaires and the NFDI4Health metadata schema, and literature search. Included are 7 syntactic, 32 semantic and 9 combined syntactic and semantic standards. In addition, 101 ISO Standards from ISO/TC 215 Health Informatics and ISO/TC 276 Biotechnology could be identified as being potentially relevant. The work emphasizes the utilization of standards for epidemiological and health research data ensuring interoperability as well as the compatibility to NFDI4Health, its use cases, and to (inter-)national efforts within these sectors. The goal is to foster collaborative and inter-sectoral work in health research and initiate a debate around the potential of using common standards.
Asunto(s)
Interoperabilidad de la Información en Salud , Humanos , Metadatos , Alemania , Registros de Salud Personal , Manejo de DatosRESUMEN
The Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema was released in 2022 and approved by ISO as a standard for sharing clinical and genomic information about an individual, including phenotypic descriptions, numerical measurements, genetic information, diagnoses, and treatments. A phenopacket can be used as an input file for software that supports phenotype-driven genomic diagnostics and for algorithms that facilitate patient classification and stratification for identifying new diseases and treatments. There has been a great need for a collection of phenopackets to test software pipelines and algorithms. Here, we present phenopacket-store. Version 0.1.12 of phenopacket-store includes 4916 phenopackets representing 277 Mendelian and chromosomal diseases associated with 236 genes, and 2872 unique pathogenic alleles curated from 605 different publications. This represents the first large-scale collection of case-level, standardized phenotypic information derived from case reports in the literature with detailed descriptions of the clinical data and will be useful for many purposes, including the development and testing of software for prioritizing genes and diseases in diagnostic genomics, machine learning analysis of clinical phenotype data, patient stratification, and genotype-phenotype correlations. This corpus also provides best-practice examples for curating literature-derived data using the GA4GH Phenopacket Schema.
RESUMEN
Health data are extremely important in today's data-driven world. Through automation, healthcare processes can be optimized, and clinical decisions can be supported. For any reuse of data, the quality, validity, and trustworthiness of data are essential, and it is the only way to guarantee that data can be reused sensibly. Specific requirements for the description and coding of reusable data are defined in the FAIR guiding principles for data stewardship. Various national research associations and infrastructure projects in the German healthcare sector have already clearly positioned themselves on the FAIR principles: both the infrastructures of the Medical Informatics Initiative and the University Medicine Network operate explicitly on the basis of the FAIR principles, as do the National Research Data Infrastructure for Personal Health Data and the German Center for Diabetes Research.To ensure that a resource complies with the FAIR principles, the degree of FAIRness should first be determined (so-called FAIR assessment), followed by the prioritization for improvement steps (so-called FAIRification). Since 2016, a set of tools and guidelines have been developed for both steps, based on the different, domain-specific interpretations of the FAIR principles.Neighboring European countries have also invested in the development of a national framework for semantic interoperability in the context of the FAIR (Findable, Accessible, Interoperable, Reusable) principles. Concepts for comprehensive data enrichment were developed to simplify data analysis, for example, in the European Health Data Space or via the Observational Health Data Sciences and Informatics network. With the support of the European Open Science Cloud, among others, structured FAIRification measures have already been taken for German health datasets.
Asunto(s)
Registros Electrónicos de Salud , Humanos , Alemania , Internacionalidad , Programas Nacionales de SaludRESUMEN
The interoperability Working Group of the Medical Informatics Initiative (MII) is the platform for the coordination of overarching procedures, data structures, and interfaces between the data integration centers (DIC) of the university hospitals and national and international interoperability committees. The goal is the joint content-related and technical design of a distributed infrastructure for the secondary use of healthcare data that can be used via the Research Data Portal for Health. Important general conditions are data privacy and IT security for the use of health data in biomedical research. To this end, suitable methods are used in dedicated task forces to enable procedural, syntactic, and semantic interoperability for data use projects. The MII core dataset was developed as several modules with corresponding information models and implemented using the HL7® FHIR® standard to enable content-related and technical specifications for the interoperable provision of healthcare data through the DIC. International terminologies and consented metadata are used to describe these data in more detail. The overall architecture, including overarching interfaces, implements the methodological and legal requirements for a distributed data use infrastructure, for example, by providing pseudonymized data or by federated analyses. With these results of the Interoperability Working Group, the MII is presenting a future-oriented solution for the exchange and use of healthcare data, the applicability of which goes beyond the purpose of research and can play an essential role in the digital transformation of the healthcare system.
Asunto(s)
Interoperabilidad de la Información en Salud , Humanos , Conjuntos de Datos como Asunto , Registros Electrónicos de Salud , Alemania , Interoperabilidad de la Información en Salud/normas , Informática Médica , Registro Médico Coordinado/métodos , Integración de SistemasRESUMEN
Adhering to FAIR principles (findability, accessibility, interoperability, reusability) ensures sustainability and reliable exchange of data and metadata. Research communities need common infrastructures and information models to collect, store, manage and work with data and metadata. The German initiative NFDI4Health created a metadata schema and an infrastructure integrating existing platforms based on different information models and standards. To ensure system compatibility and enhance data integration possibilities, we mapped the Investigation-Study-Assay (ISA) model to Fast Healthcare Interoperability Resources (FHIR). We present the mapping in FHIR logical models, a resulting FHIR resources' network and challenges that we encountered. Challenges mainly related to ISA's genericness, and to different structures and datatypes used in ISA and FHIR. Mapping ISA to FHIR is feasible but requires further analyses of example data and adaptations to better specify target FHIR elements, and enable possible automatized conversions from ISA to FHIR.
Asunto(s)
Medicamentos Genéricos , Instituciones de Salud , Humanos , Metadatos , Atención a la SaludRESUMEN
The COVID-19 pandemic has led to tremendous investment in clinical studies to generate much-needed knowledge on the prevention, diagnosis, treatment and long-term effects of the disease. Case report forms, comprised of questions and answers (variables), are commonly used to collect data in clinical trials. Maximizing the value of study data depends on data quality and on the ability to easily pool and share data from several sources. ISARIC, in collaboration with the WHO, has created a case report form that is available for use by the scientific community to collect COVID-19 trial data. One of such research initiatives collecting and analyzing multi-country and multi-cohort COVID-19 study data is the Horizon 2020 project ORCHESTRA. Following the ISO/TS 21564:2019 standard, a mapping between five ORCHESTRA studies' variables and the ISARIC Freestanding Follow-Up Survey elements was created. Measures of correspondence of shared semantic domain of 0 (perfect match), 1 (fully inclusive match), 2 (partial match), 4 (transformation required) or 4* (not present in ORCHESTRA) as compared to the target code system, ORCHESTRA study variables, were assigned to each of the elements in the ISARIC FUP case report form (CRF) which was considered the source code system. Of the ISARIC FUP CRF's variables, around 34% were found to show an exact match with corresponding variables in ORCHESTRA studies and about 33% showed a non-inclusive overlap. Matching variables provided information on patient demographics, COVID-19 testing, hospital admission and symptoms. More in-depth details are covered in ORCHESTRA variables with regards to treatment and comorbidities. ORCHESTRA's Long-Term Sequelae and Fragile population studies' CRFs include 32 and 27 variables respectively which were evaluated as a perfect match to variables in the ISARIC FUP CRF. Our study serves as an example of the kind of maps between case report form variables from different research projects needed to link ongoing COVID-19 research efforts and facilitate collaboration and data sharing. To enable data aggregation across two data systems, the information they contain needs to be connected through a map to determine compatibility and transformation needs. Combining data from various clinical studies can increase the power of analytical insights.
Asunto(s)
Prueba de COVID-19 , COVID-19 , Humanos , Estudios de Seguimiento , Pandemias , Semántica , COVID-19/epidemiología , FatigaRESUMEN
The COVID-19 pandemic has made it clear: sharing and exchanging data among research institutions is crucial in order to efficiently respond to global health threats. This can be facilitated by defining health data models based on interoperability standards. In Germany, a national effort is in progress to create common data models using international healthcare IT standards. In this context, collaborative work on a data set module for microbiology is of particular importance as the WHO has declared antimicrobial resistance one of the top global public health threats that humanity is facing. In this article, we describe how we developed a common model for microbiology data in an interdisciplinary collaborative effort and how we make use of the standard HL7 FHIR and terminologies such as SNOMED CT or LOINC to ensure syntactic and semantic interoperability. The use of international healthcare standards qualifies our data model to be adopted beyond the environment where it was first developed and used at an international level.
Asunto(s)
COVID-19 , Humanos , Pandemias , Alemania , Instituciones de Salud , HumanidadesRESUMEN
Background: The COVID-19 pandemic has spurred large-scale, interinstitutional research efforts. To enable these efforts, researchers must agree on data set definitions that not only cover all elements relevant to the respective medical specialty but also are syntactically and semantically interoperable. Therefore, the German Corona Consensus (GECCO) data set was developed as a harmonized, interoperable collection of the most relevant data elements for COVID-19-related patient research. As the GECCO data set is a compact core data set comprising data across all medical fields, the focused research within particular medical domains demands the definition of extension modules that include data elements that are the most relevant to the research performed in those individual medical specialties. Objective: We aimed to (1) specify a workflow for the development of interoperable data set definitions that involves close collaboration between medical experts and information scientists and (2) apply the workflow to develop data set definitions that include data elements that are the most relevant to COVID-19-related patient research regarding immunization, pediatrics, and cardiology. Methods: We developed a workflow to create data set definitions that were (1) content-wise as relevant as possible to a specific field of study and (2) universally usable across computer systems, institutions, and countries (ie, interoperable). We then gathered medical experts from 3 specialties-infectious diseases (with a focus on immunization), pediatrics, and cardiology-to select data elements that were the most relevant to COVID-19-related patient research in the respective specialty. We mapped the data elements to international standardized vocabularies and created data exchange specifications, using Health Level Seven International (HL7) Fast Healthcare Interoperability Resources (FHIR). All steps were performed in close interdisciplinary collaboration with medical domain experts and medical information specialists. Profiles and vocabulary mappings were syntactically and semantically validated in a 2-stage process. Results: We created GECCO extension modules for the immunization, pediatrics, and cardiology domains according to pandemic-related requests. The data elements included in each module were selected, according to the developed consensus-based workflow, by medical experts from these specialties to ensure that the contents aligned with their research needs. We defined data set specifications for 48 immunization, 150 pediatrics, and 52 cardiology data elements that complement the GECCO core data set. We created and published implementation guides, example implementations, and data set annotations for each extension module. Conclusions: The GECCO extension modules, which contain data elements that are the most relevant to COVID-19-related patient research on infectious diseases (with a focus on immunization), pediatrics, and cardiology, were defined in an interdisciplinary, iterative, consensus-based workflow that may serve as a blueprint for developing further data set definitions. The GECCO extension modules provide standardized and harmonized definitions of specialty-related data sets that can help enable interinstitutional and cross-country COVID-19 research in these specialties.
RESUMEN
BACKGROUND: Resources are increasingly spent on artificial intelligence (AI) solutions for medical applications aiming to improve diagnosis, treatment, and prevention of diseases. While the need for transparency and reduction of bias in data and algorithm development has been addressed in past studies, little is known about the knowledge and perception of bias among AI developers. OBJECTIVE: This study's objective was to survey AI specialists in health care to investigate developers' perceptions of bias in AI algorithms for health care applications and their awareness and use of preventative measures. METHODS: A web-based survey was provided in both German and English language, comprising a maximum of 41 questions using branching logic within the REDCap web application. Only the results of participants with experience in the field of medical AI applications and complete questionnaires were included for analysis. Demographic data, technical expertise, and perceptions of fairness, as well as knowledge of biases in AI, were analyzed, and variations among gender, age, and work environment were assessed. RESULTS: A total of 151 AI specialists completed the web-based survey. The median age was 30 (IQR 26-39) years, and 67% (101/151) of respondents were male. One-third rated their AI development projects as fair (47/151, 31%) or moderately fair (51/151, 34%), 12% (18/151) reported their AI to be barely fair, and 1% (2/151) not fair at all. One participant identifying as diverse rated AI developments as barely fair, and among the 2 undefined gender participants, AI developments were rated as barely fair or moderately fair, respectively. Reasons for biases selected by respondents were lack of fair data (90/132, 68%), guidelines or recommendations (65/132, 49%), or knowledge (60/132, 45%). Half of the respondents worked with image data (83/151, 55%) from 1 center only (76/151, 50%), and 35% (53/151) worked with national data exclusively. CONCLUSIONS: This study shows that the perception of biases in AI overall is moderately fair. Gender minorities did not once rate their AI development as fair or very fair. Therefore, further studies need to focus on minorities and women and their perceptions of AI. The results highlight the need to strengthen knowledge about bias in AI and provide guidelines on preventing biases in AI health care applications.
Asunto(s)
Algoritmos , Inteligencia Artificial , Humanos , Femenino , Masculino , Adulto , Sesgo , Atención a la Salud , InternetRESUMEN
The need to harness large amounts of data, possibly within a short period of time, became apparent during the Covid-19 pandemic outbreak. In 2022, the Corona Data Exchange Platform (CODEX), which had been developed within the German Network University Medicine (NUM), was extended by a number of common components, including a section on FAIR science. The FAIR principles enable research networks to evaluate how well they comply with current standards in open and reproducible science. To be more transparent, but also to guide scientists on how to improve data and software reusability, we disseminated an online survey within the NUM. Here we present the outcomes and lessons learnt.
Asunto(s)
COVID-19 , Medicina , Humanos , COVID-19/epidemiología , Universidades , Pandemias , Programas InformáticosRESUMEN
Several European health data research initiatives aim to make health data FAIR for research and healthcare, and supply their national communities with coordinated data models, infrastructures, and tools. We present a first map of the Swiss Personalized Healthcare Network dataset to Fast Healthcare Interoperability Resources (FHIR®). All concepts could be mapped using 22 FHIR resources and three datatypes. Deeper analyses will follow before creating a FHIR specification, to potentially enable data conversion and exchange between research networks.