RESUMO
The interoperability Working Group of the Medical Informatics Initiative (MII) is the platform for the coordination of overarching procedures, data structures, and interfaces between the data integration centers (DIC) of the university hospitals and national and international interoperability committees. The goal is the joint content-related and technical design of a distributed infrastructure for the secondary use of healthcare data that can be used via the Research Data Portal for Health. Important general conditions are data privacy and IT security for the use of health data in biomedical research. To this end, suitable methods are used in dedicated task forces to enable procedural, syntactic, and semantic interoperability for data use projects. The MII core dataset was developed as several modules with corresponding information models and implemented using the HL7® FHIR® standard to enable content-related and technical specifications for the interoperable provision of healthcare data through the DIC. International terminologies and consented metadata are used to describe these data in more detail. The overall architecture, including overarching interfaces, implements the methodological and legal requirements for a distributed data use infrastructure, for example, by providing pseudonymized data or by federated analyses. With these results of the Interoperability Working Group, the MII is presenting a future-oriented solution for the exchange and use of healthcare data, the applicability of which goes beyond the purpose of research and can play an essential role in the digital transformation of the healthcare system.
Assuntos
Interoperabilidade da Informação em Saúde , Humanos , Conjuntos de Dados como Assunto , Registros Eletrônicos de Saúde , Alemanha , Interoperabilidade da Informação em Saúde/normas , Informática Médica , Registro Médico Coordenado/métodos , Integração de SistemasRESUMO
BACKGROUND: The Federal Ministry of Education and Research of Germany (BMBF) funds a network of university medicines (NUM) to support COVID-19 and pandemic research at national level. The "COVID-19 Data Exchange Platform" (CODEX) as part of NUM establishes a harmonised infrastructure that supports research use of COVID-19 datasets. The broad consent (BC) of the Medical Informatics Initiative (MII) is agreed by all German federal states and forms the legal base for data processing. All 34 participating university hospitals (NUM sites) work upon a harmonised infrastructural as well as legal basis for their data protection-compliant collection and transfer of their research dataset to the central CODEX platform. Each NUM site ensures that the exchanged consent information conforms to the already-balloted HL7 FHIR consent profiles and the interoperability concept of the MII Task Force "Consent Implementation" (TFCI). The Independent Trusted Third-Party (TTP) of the University Medicine Greifswald supports data protection-compliant data processing and provides the consent management solutions gICS. METHODS: Based on a stakeholder dialogue a required set of FHIR-functionalities was identified and technically specified supported by official FHIR experts. Next, a "TTP-FHIR Gateway" for the HL7 FHIR-compliant exchange of consent information using gICS was implemented. A last step included external integration tests and the development of a pre-configured consent template for the BC for the NUM sites. RESULTS: A FHIR-compliant gICS-release and a corresponding consent template for the BC were provided to all NUM sites in June 2021. All FHIR functionalities comply with the already-balloted FHIR consent profiles of the HL7 Working Group Consent Management. The consent template simplifies the technical BC rollout and the corresponding implementation of the TFCI interoperability concept at the NUM sites. CONCLUSIONS: This article shows that a HL7 FHIR-compliant and interoperable nationwide exchange of consent information could be built using of the consent management software gICS and the provided TTP-FHIR Gateway. The initial functional scope of the solution covers the requirements identified in the NUM-CODEX setting. The semantic correctness of these functionalities was validated by project-partners from the Ludwig-Maximilian University in Munich. The production rollout of the solution package to all NUM sites has started successfully.
Assuntos
COVID-19 , Registros Eletrônicos de Saúde , Humanos , Software , Consentimento Livre e EsclarecidoRESUMO
BACKGROUND: The aim of the German Medical Informatics Initiative is to establish a national infrastructure for integrating and sharing health data. To this, Data Integration Centers are set up at university medical centers, which address data harmonization, information security and data protection. To capture patient consent, a common informed consent template has been developed. It consists of different modules addressing permissions for using data and biosamples. On the technical level, a common digital representation of information from signed consent templates is needed. As the partners in the initiative are free to adopt different solutions for managing consent information (e.g. IHE BPPC or HL7 FHIR Consent Resources), we had to develop an interoperability layer. METHODS: First, we compiled an overview of data items required to reflect the information from the MII consent template as well as patient preferences and derived permissions. Next, we created entity-relationship diagrams to formally describe the conceptual data model underlying relevant items. We then compared this data model to conceptual models describing representations of consent information using different interoperability standards. We used the result of this comparison to derive an interoperable representation that can be mapped to common standards. RESULTS: The digital representation needs to capture the following information: (1) version of the consent, (2) consent status for each module, and (3) period of validity of the status. We found that there is no generally accepted solution to represent status information in a manner interoperable with all relevant standards. Hence, we developed a pragmatic solution, comprising codes which describe combinations of modules with a basic set of status labels. We propose to maintain these codes in a public registry called ART-DECOR. We present concrete technical implementations of our approach using HL7 FHIR and IHE BPPC which are also compatible with the open-source consent management software gICS. CONCLUSIONS: The proposed digital representation is (1) generic enough to capture relevant information from a wide range of consent documents and data use regulations and (2) interoperable with common technical standards. We plan to extend our model to include more fine-grained status codes and rules for automated access control.
Assuntos
Segurança Computacional , Consentimento Livre e Esclarecido , Informática Médica , Alemanha , Humanos , SoftwareRESUMO
INTRODUCTION: To support research projects that require medical data from multiple sites is one of the goals of the German Medical Informatics Initiative (MII). The data integration centers (DIC) at university medical centers in Germany provide patient data via FHIR® in compliance with the MII core data set (CDS). Requirements for data protection and other legal bases for processing prefer decentralized processing of the relevant data in the DICs and the subsequent exchange of aggregated results for cross-site evaluation. METHODS: Requirements from clinical experts were obtained in the context of the MII use case INTERPOLAR. A software architecture was then developed, modeled using 3LGM2, finally implemented and published in a github repository. RESULTS: With the CDS tool chain, we have created software components for decentralized processing on the basis of the MII CDS. The CDS tool chain requires access to a local FHIR endpoint and then transfers the data to an SQL database. This is accessed by the DataProcessor component, which performs calculations with the help of rules (input repo) and writes the results back to the database. The CDS tool chain also has a frontend module (REDCap), which is used to display the output data and calculated results, and allows verification, evaluation, comments and other responses. This feedback is also persisted in the database and is available for further use, analysis or data sharing in the future. DISCUSSION: Other solutions are conceivable. Our solution utilizes the advantages of an SQL database. This enables flexible and direct processing of the stored data using established analysis methods. Due to the modularization, adjustments can be made so that it can be used in other projects. We are planning further developments to support pseudonymization and data sharing. Initial experience is being gathered. An evaluation is pending and planned.
Assuntos
Software , Alemanha , Registros Eletrônicos de Saúde , Humanos , Informática Médica , Segurança Computacional , Conjuntos de Dados como AssuntoRESUMO
The Data Integration Centers (DICs), all part of the German Medical Informatics Initiative (MII), prepare routine care data captured in university hospitals to enable its reuse in clinical research. Tackling this challenging task requires them to maintain multiple data stores, implement the necessary transformation processes, and provide the required terminology services, all while also addressing the use case specific needs researchers might have. An MII wide application of the standardized profiles defined in the IHE QRPH domain might therefore be able to drastically reduce the overhead at any one DIC. The MII DIC reference model built in 3LGM2, a method to describe complex information system architectures, serves as a starting point to evaluate whether such an application is possible. We first extend the IHE modeling capabilities of 3LGM2 to also support the five profiles from the QRPH domain that our experts evaluated as relevant in the MII DIC context. We then expand the DIC reference model by some IHE QRPH actors and transactions, showing that their application could be beneficial in the MII DIC context, provided they surpass their trial status.
Assuntos
Aplicações da Informática Médica , Informática Médica , Humanos , Integração de SistemasRESUMO
OBJECTIVES: The TMF (Technology, Methods, and Infrastructure for Networked Medical Research) Data Protection Guide (TMF-DP) makes path-breaking recommendations on the subject of data protection in research projects. It includes comprehensive requirements for applications such as patient lists, pseudonymization services, and consent management services. Nevertheless, it lacks a structured, categorized list of requirements for simplified application in research projects and systematic evaluation. The 3LGM2IHE ("Three-layer Graphbased meta model - Integrating the Healthcare Enterprise [IHE] " ) project is funded by the German Research Foundation (DFG). 3LGM2IHE aims to define modeling paradigms and implement modeling tools for planning health care information systems. In addition, one of the goals is to create and publish 3LGM2 information system architecture design patterns (short "design patterns") for the community as design models in terms of a framework. A structured list of data protection-related requirements based on the TMF-DP is a precondition to integrate functions (3LGM2 Domain Layer) and building blocks (3LGM2 Logical Tool Layer) in 3LGM2 design patterns. METHODS: In order to structure the continuous text of the TMF-DP, requirement types were defined in a first step. In a second step, dependencies and delineations of the definitions were identified. In a third step, the requirements from the TMF-DP were systematically extracted. Based on the identified lists of requirements, a fourth step included the comparison of the identified requirements with exemplary open source tools as provided by the "Independent Trusted Third Party of the University Medicine Greifswald" (TTP tools). RESULTS: As a result, four lists of requirements were created, which contain requirements for the "patient list", the "pseudonymization service", and the "consent management", as well as cross-component requirements from the TMF-DP chapter 6 in a structured form. Further to requirements (1), possible variants (2) of implementations (to fulfill a single requirement) and recommendations (3) were identified. A comparison of the requirements lists with the functional scopes of the open source tools E-PIX (record linkage), gPAS (pseudonym management), and gICS (consent management) has shown that these fulfill more than 80% of the requirements. CONCLUSIONS: A structured set of data protection-related requirements facilitates a systematic evaluation of implementations with respect to the fulfillment of the TMF-DP guidelines. These re-usable lists provide a decision aid for the selection of suitable tools for new research projects. As a result, these lists form the basis for the development of data protection-related 3LGM2 design patterns as part of the 3LGM2IHE project.
Assuntos
Pesquisa Biomédica , Atenção à Saúde , Humanos , Segurança ComputacionalRESUMO
In the EU project FAIR4Health, a ETL pipeline for the FAIRification of structured health data as well as an agent-based, distributed query platform for the analysis of research hypotheses and the training of machine learning models were developed. The system has been successfully tested in two clinical use cases with patient data from five university hospitals. Currently, the solution is also being considered for use in other hospitals. However, configuring the system and deploying it in the local IT architecture is non-trivial and meets with understandable concerns about security. This paper presents a model for describing the information architecture based on a formal approach, the 3LGM metamodel. The model was evaluated by the developers. As a result, the clear separation of tasks and the software components that implement them as well as the rich description of interactions via interfaces were positively emphasized.
Assuntos
Aprendizado de Máquina , Software , HumanosRESUMO
The academic research environment is characterized by self-developed, innovative, customized solutions, which are often free to use for third parties with open-source code and open licenses. On the other hand, they are maintained only to a very limited extent after the end of project funding. The ToolPool Gesundheitsforschung addresses the problem of finding ready to use solutions by building a registry of proven and supported tools, services, concepts and consulting offers. The goal is to provide an up-to-date selection of "relevant" solutions for a given domain that are immediately usable and that are actually used by third parties, rather than aiming at a complete list of all solutions which belong to that domain. Proof of relevance and usage must be provided, for example, by concrete application scenarios, experience reports by uninvolved third parties, references in publications or workshops held. Quality assurance is carried out for new entries by an agreed list of admission criteria, for existing entries at least once a year by a special task force. Currently, 79 solutions are represented, this number is to be significantly expanded by involving of new editors from current national funding initiatives in Germany.
Assuntos
Software , Estudos Epidemiológicos , Alemanha , Sistema de RegistrosRESUMO
BACKGROUND: Clinical trials, epidemiological studies, clinical registries, and other prospective research projects, together with patient care services, are main sources of data in the medical research domain. They serve often as a basis for secondary research in evidence-based medicine, prediction models for disease, and its progression. This data are often neither sufficiently described nor accessible. Related models are often not accessible as a functional program tool for interested users from the health care and biomedical domains. OBJECTIVE: The interdisciplinary project Leipzig Health Atlas (LHA) was developed to close this gap. LHA is an online platform that serves as a sustainable archive providing medical data, metadata, models, and novel phenotypes from clinical trials, epidemiological studies, and other medical research projects. METHODS: Data, models, and phenotypes are described by semantically rich metadata. The platform prefers to share data and models presented in original publications but is also open for nonpublished data. LHA provides and associates unique permanent identifiers for each dataset and model. Hence, the platform can be used to share prepared, quality-assured datasets and models while they are referenced in publications. All managed data, models, and phenotypes in LHA follow the FAIR principles, with public availability or restricted access for specific user groups. RESULTS: The LHA platform is in productive mode (https://www.health-atlas.de/). It is already used by a variety of clinical trial and research groups and is becoming increasingly popular also in the biomedical community. LHA is an integral part of the forthcoming initiative building a national research data infrastructure for health in Germany.
Assuntos
Estudos Prospectivos , AlemanhaRESUMO
IHE has defined more than 200 integration profiles in order to improve the interoperability of application systems in healthcare. These profiles describe how standards should be used in particular use cases. These profiles are very helpful but their correct use is challenging, if the user is not familiar to the specifications. Therefore, inexperienced modelers of information systems quickly lose track of existing IHE profiles. In addition, the users of these profiles are often not aware of rules that are defined within these profiles and of dependencies that exist between the profiles. There are also modelers that do not notice the differences between the implemented actors, because they do not know the optional capabilities of some actors. The aim of this paper is therefore to describe a concept how modelers of information systems can be supported in the selection and use of IHE profiles and how this concept was prototypically implemented in the "Three-layer Graph-based meta model" modeling tool (3LGM2 Tool). The described modeling process consists of the following steps that can be looped: defining the use case, choosing suitable integration profiles, choosing actors and their options and assigning them to application systems, checking for required actor groupings and modeling transactions. Most of these steps were implemented in the 3LGM2 Tool. Further implementation effort and evaluation of our approach by inexperienced users is needed. But after that our tool should be a valuable tool for modelers planning healthcare information system architectures, in particular those based on IHE.
Assuntos
Atenção à Saúde , Integração de Sistemas , Sistemas de InformaçãoRESUMO
Sharing data is of great importance for research in medical sciences. It is the basis for reproducibility and reuse of already generated outcomes in new projects and in new contexts. FAIR data principles are the basics for sharing data. The Leipzig Health Atlas (LHA) platform follows these principles and provides data, describing metadata, and models that have been implemented in novel software tools and are available as demonstrators. LHA reuses and extends three different major components that have been previously developed by other projects. The SEEK management platform is the foundation providing a repository for archiving, presenting and secure sharing a wide range of publication results, such as published reports, (bio)medical data as well as interactive models and tools. The LHA Data Portal manages study metadata and data allowing to search for data of interest. Finally, PhenoMan is an ontological framework for phenotype modelling. This paper describes the interrelation of these three components. In particular, we use the PhenoMan to, firstly, model and represent phenotypes within the LHA platform. Then, secondly, the ontological phenotype representation can be used to generate search queries that are executed by the LHA Data Portal. The PhenoMan generates the queries in a novel domain specific query language (SDQL), which is specific for data management systems based on CDISC ODM standard, such as the LHA Data Portal. Our approach was successfully applied to represent phenotypes in the Leipzig Health Atlas with the possibility to execute corresponding queries within the LHA Data Portal.
Assuntos
Metadados , Software , Arquivos , Fenótipo , Reprodutibilidade dos TestesRESUMO
Healthcare and medical research in Germany are heading to more interconnected systems. New initiatives are funded by the German government to encourage the development of Integrated Research and Treatment Centers (IFB). Within an IFB new organizational structures and infrastructures for interdisciplinary, translational and trans-sectoral working relationship between existing rigid separated sectors are intended and needed. This paper describes how an IT-infrastructure of an IFB could look like, what major challenges have to be solved and what methods can be used to plan such a complex IT-infrastructure in the field of healthcare. By means of project management, system analyses, process models, 3LGM2-models and resource plans an appropriate concept with different views is created. This concept supports the information management in its enterprise architecture planning activities and implies a first step of implementing a connected healthcare and medical research platform.
Assuntos
Ensaios Clínicos como Assunto , Prestação Integrada de Cuidados de Saúde/organização & administração , Sistemas de Informação Hospitalar , Pesquisa Biomédica/organização & administração , Alemanha , HumanosRESUMO
Despite their young age, the FAIR principles are recognised as important guidelines for research data management. Their generic design, however, leaves much room for interpretation in domain-specific application. Based on practical experience in the operation of a data repository, this article addresses problems in FAIR provisioning of medical data for research purposes in the use case of the Leipzig Health Atlas project and shows necessary future developments.
Assuntos
Bases de Dados FactuaisRESUMO
The web portal Medfloss.org lists over 360 medical free/libre and open source software (MEDFLOSS) projects. These projects are described with the help of a self-developed nomenclature. Due to inconsistencies, the nomenclature shall be replaced by HITO, the Health IT Ontology. HITO is developed iteratively based on different use cases. This paper aims to describe methods and results of the second HITO use case in which HITO is extended to improve the description, retrieval and comparisons of MEDFLOSS projects on Medfloss.org. We use a mixed-methods approach to add concepts and relationships to describe MEDFLOSS precisely. The resulting HITO version stresses functional descriptions based on features and supported enterprise functions, rather than just describing technical characteristics. However, describing a larger number of MEDFLOSS projects requires the commitment of the community.
Assuntos
Software , SemânticaRESUMO
As hospital information systems are complex and the requirements for interoperability grow with the increasing networking in healthcare, careful planning becomes more and more necessary. The use of standards as described in IHE profiles, for example, are an important prerequisite for enabling interoperability. Enterprise Architecture Planning (EAP) methods should support this, but none of the currently available EAP methods offers the option of using IHE profiles. The 3LGM2IHE project wants to close this gap and implement the support of IHE profiles in the 3LGM2 tool. This paper describes how requirements for this tool were determined and presents the results.
Assuntos
Sistemas de Informação Hospitalar , Software , Integração de Sistemas , Atenção à SaúdeRESUMO
Phenotyping means the determination of clinical relevant phenotypes, e.g. by classification or calculation based on EHR data. Within the German Medical Informatics Initiative, the SMITH consortium is working on the implementation of a phenotyping pipeline. to extract, structure and normalize information from the EHR data of the hospital information systems of the participating sites; to automatically apply complex algorithms and models and to enrich the data within the research data warehouses of the distributed data integration centers with the computed results. Here we present the overall picture and essential building blocks and workflows of this concept.
Assuntos
Registros Eletrônicos de Saúde , Informática Médica , Algoritmos , FenótipoRESUMO
Secondary use of electronic health record (EHR) data requires a detailed description of metadata, especially when data collection and data re-use are organizationally and technically far apart. This paper describes the concept of the SMITH consortium that includes conventions, processes, and tools for describing and managing metadata using common standards for semantic interoperability. It deals in particular with the chain of processing steps of data from existing information systems and provides an overview of the planned use of metadata, medical terminologies, and semantic services in the consortium.
Assuntos
Registros Eletrônicos de Saúde , Metadados , Coleta de Dados , Alemanha , Sistemas de Informação , SemânticaRESUMO
The digitization of health records and cross-institutional data sharing is a necessary precondition to improve clinical research and patient care. The SMITH project unites several university hospitals and medical faculties in order to provide medical informatics solutions for health data integration and cross-institutional communication. In this paper, we focus on requirements elicitation and management for extracting clinical data from heterogeneous subsystems and data integration based on eHealth standards such as HL7 FHIR and IHE profiles.
Assuntos
Instalações de Saúde , Disseminação de Informação , Armazenamento e Recuperação da Informação , Sistemas Computadorizados de Registros Médicos , TelemedicinaRESUMO
Medical research is an active field in which a wide range of information is collected, collated, combined and analyzed. Essential results are reported in publications, but it is often problematic to have the data (raw and processed), algorithms and tools associated with the publication available. The Leipzig Health Atlas (LHA) project has therefore set itself the goal of providing a repository for this purpose and enabling controlled access to it via a web-based portal. A data sharing concept in accordance to FAIR and OAIS is the basis for the processing and provision of data in the LHA. An IT architecture has been designed for this purpose. The paper presents essential aspects of the data sharing concept, the IT architecture and the methods used.
Assuntos
Algoritmos , Estatística como Assunto , Humanos , PesquisaRESUMO
BACKGROUND: Medical plaintext documents contain important facts about patients, but they are rarely available for structured queries. The provision of structured information from natural language texts in addition to the existing structured data can significantly speed up the search for fulfilled inclusion criteria and thus improve the recruitment rate. OBJECTIVES: This work is aimed at supporting clinical trial recruitment with text mining techniques to identify suitable subjects in hospitals. METHOD: Based on the inclusion/exclusion criteria of 5 sample studies and a text corpus consisting of 212 doctor's letters and medical follow-up documentation from a university cancer center, a prototype was developed and technically evaluated using NLP procedures (UIMA) for the extraction of facts from medical free texts. RESULTS: It was found that although the extracted entities are not always correct (precision between 23% and 96%), they provide a decisive indication as to which patient file should be read preferentially. CONCLUSION: The prototype presented here demonstrates the technical feasibility. In order to find available, lucrative phenotypes, an in-depth evaluation is required.