Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 28
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Int J Cancer ; 149(5): 1150-1165, 2021 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-33997972

RESUMO

Quantification of DNA methylation in neoplastic cells is crucial both from mechanistic and diagnostic perspectives. However, such measurements are prone to different experimental biases. Polymerase chain reaction (PCR) bias results in an unequal recovery of methylated and unmethylated alleles at the sample preparation step. Post-PCR biases get introduced additionally by the readout processes. Correcting the biases is more practicable than optimising experimental conditions, as demonstrated previously. However, utilisation of our earlier developed algorithm strongly necessitates automation. Here, we present two R packages: rBiasCorrection, the core algorithms to correct biases; and BiasCorrector, its web-based graphical user interface frontend. The software detects and analyses experimental biases in calibration DNA samples at a single base resolution by using cubic polynomial and hyperbolic regression. The correction coefficients from the best regression type are employed to compensate for the bias. Three common technologies-bisulphite pyrosequencing, next-generation sequencing and oligonucleotide microarrays-were used to comprehensively test BiasCorrector. We demonstrate the accuracy of BiasCorrector's performance and reveal technology-specific PCR- and post-PCR biases. BiasCorrector effectively eliminates biases regardless of their nature, locus, the number of interrogated methylation sites and the detection method, thus representing a user-friendly tool for producing accurate epigenetic results.


Assuntos
Algoritmos , Metilação de DNA , Neoplasias/genética , Reação em Cadeia da Polimerase/normas , Análise de Sequência de DNA/normas , Software , Viés , Ilhas de CpG , Humanos , Tecnologia
2.
J Biomed Inform ; 100: 103314, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31629921

RESUMO

Searching for patient cohorts in electronic patient data often requires the definition of temporal constraints between the selection criteria. However, beyond a certain degree of temporal complexity, the non-graphical, form-based approaches implemented in current translational research platforms may be limited when modeling such constraints. In our opinion, there is a need for an easily accessible and implementable, fully graphical method for creating temporal queries. We aim to respond to this challenge with a new graphical notation. Based on Allen's time interval algebra, it allows for modeling temporal queries by arranging simple horizontal bars depicting symbolic time intervals. To make our approach applicable to complex temporal patterns, we apply two extensions: with duration intervals, we enable the inference about relative temporal distances between patient events, and with time interval modifiers, we support counting and excluding patient events, as well as constraining numeric values. We describe how to generate database queries from this notation. We provide a prototypical implementation, consisting of a temporal query modeling frontend and an experimental backend that connects to an i2b2 system. We evaluate our modeling approach on the MIMIC-III database to demonstrate that it can be used for modeling typical temporal phenotyping queries.


Assuntos
Gráficos por Computador , Simulação por Computador , Algoritmos , Bases de Dados Factuais , Humanos , Armazenamento e Recuperação da Informação , Tempo
3.
Artigo em Alemão | MEDLINE | ID: mdl-26753865

RESUMO

BACKGROUND: Medical research projects often require more biological material than can be supplied by a single biobank. For this reason, a multitude of strategies support locating potential research partners with matching material without requiring centralization of sample storage. OBJECTIVES: Classification of different strategies for biobank networks, in particular for locating suitable samples. Description of an IT infrastructure combining these strategies. MATERIALS AND METHODS: Existing strategies can be classified according to three criteria: (a) granularity of sample data: coarse bank-level data (catalogue) vs. fine-granular sample-level data, (b) location of sample data: central (central search service) vs. decentral storage (federated search services), and (c) level of automation: automatic (query-based, federated search service) vs. semi-automatic (inquiry-based, decentral search). All mentioned search services require data integration. Metadata help to overcome semantic heterogeneity. RESULTS: The "Common Service IT" in BBMRI-ERIC (Biobanking and BioMolecular Resources Research Infrastructure) unites a catalogue, the decentral search and metadata in an integrated platform. As a result, researchers receive versatile tools to search suitable biomaterial, while biobanks retain a high degree of data sovereignty. CONCLUSIONS: Despite their differences, the presented strategies for biobank networks do not rule each other out but can complement and even benefit from each other.


Assuntos
Bancos de Espécimes Biológicos/organização & administração , Pesquisa Biomédica/organização & administração , Sistemas de Gerenciamento de Base de Dados/organização & administração , Bases de Dados Factuais , Relações Interinstitucionais , Sistema de Registros , Europa (Continente) , Previsões , Disseminação de Informação/métodos , Armazenamento e Recuperação da Informação/métodos , Modelos Organizacionais , Manejo de Espécimes/métodos
4.
Langenbecks Arch Surg ; 398(2): 251-8, 2013 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-23292500

RESUMO

PURPOSE: Research projects and clinical trials strongly rely on high-quality biospecimens which are provided by biobanks. Since differences in sample processing and storage can strongly affect the outcome of such studies, standardization between biobanks is necessary to guarantee reliable results of large, multicenter studies. The German Cancer Aid Foundation (Deutsche Krebshilfe e.V.) has therefore initiated the priority program "tumor tissue banks" in 2010 by funding four biobank networks focusing on central nervous system tumors, melanomas, breast carcinomas, and colorectal carcinomas. The latter one, the North German Tumor Bank of Colorectal Cancer (ColoNet) is managed by surgeons, pathologists, gastroenterologists, oncologists, scientists, and medical computer scientists. METHODS AND RESULTS: The ColoNet consortium has developed and harmonized standard operating procedures concerning all biobanking aspects. Crucial steps for quality assurance have been implemented and resulted in certification according to DIN EN ISO 9001. A further achievement is the construction of a web-based database for exploring available samples. In addition, common scientific projects have been initiated. Thus, ColoNet's repository will be used for research projects in order to improve early diagnosis, therapy, follow-up, and prognosis of colorectal cancer patients. Apart from the routine sample storage at -170 °C, the tumor banks' unique characteristic is the participation of outpatient clinics and private practices to further expand the sample and clinical data collection. CONCLUSION: The first 2 years of funding by the German Cancer Aid Foundation have already led to a closer scientific connection between the participating institutions and to a substantial collection of biospecimens obtained under highly standardized conditions.


Assuntos
Neoplasias Colorretais/patologia , Bancos de Tecidos/organização & administração , Pesquisa Biomédica , Neoplasias Colorretais/epidemiologia , Alemanha/epidemiologia , Humanos
5.
Stud Health Technol Inform ; 180: 559-63, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22874253

RESUMO

This paper presents a biobanking IT framework, comprising a set of integrated biobanking information technology components. It provides adaptable and scalable IT support for varying biobanking scenarios, workflows and projects, while avoiding redundancy in data and technology. Feasibility of this approach is illustrated by implementations for four different biobanking projects at Erlangen University Hospital and with cooperating partners in Münster and Lübeck.


Assuntos
Pesquisa Biomédica/métodos , Sistemas de Gerenciamento de Base de Dados , Registros Eletrônicos de Saúde , Armazenamento e Recuperação da Informação/métodos , Interface Usuário-Computador , Alemanha , Registros de Saúde Pessoal
6.
Stud Health Technol Inform ; 294: 674-678, 2022 May 25.
Artigo em Inglês | MEDLINE | ID: mdl-35612174

RESUMO

COVID-19 has challenged the healthcare systems worldwide. To quickly identify successful diagnostic and therapeutic approaches large data sharing approaches are inevitable. Though organizational clinical data are abundant, many of them are available only in isolated silos and largely inaccessible to external researchers. To overcome and tackle this challenge the university medicine network (comprising all 36 German university hospitals) has been founded in April 2020 to coordinate COVID-19 action plans, diagnostic and therapeutic strategies and collaborative research activities. 13 projects were initiated from which the CODEX project, aiming at the development of a Germany-wide Covid-19 Data Exchange Platform, is presented in this publication. We illustrate the conceptual design, the stepwise development and deployment, first results and the current status.


Assuntos
COVID-19 , Atenção à Saúde , Alemanha , Hospitais Universitários , Humanos , Disseminação de Informação
7.
Stud Health Technol Inform ; 169: 502-6, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-21893800

RESUMO

In an ongoing effort to share heterogeneous electronic medical record (EMR) data in an i2b2 instance between the University Hospitals Münster and Erlangen for joint cancer research projects, an ontology based system for the mapping of EMR data to a set of common data elements has been developed. The system translates the mappings into local SQL scripts, which are then used to extract, transform and load the facts data from each EMR into the i2b2 database. By using Semantic Web standards, it is the authors' goal to reuse the laboriously compiled "mapping knowledge" in future projects, such as a comprehensive cancer ontology or even a hospital-wide clinical ontology.


Assuntos
Registros Eletrônicos de Saúde , Sistemas de Informação Hospitalar , Armazenamento e Recuperação da Informação/métodos , Algoritmos , Redes de Comunicação de Computadores , Humanos , Informática Médica/métodos , Linguagens de Programação , Semântica , Software , Integração de Sistemas , Vocabulário Controlado
8.
Stud Health Technol Inform ; 169: 892-6, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-21893875

RESUMO

This paper presents the concept of an integrated IT infrastructure framework established at the comprehensive cancer center at the University Hospital Erlangen. The framework is based on the single source concept where data from the electronic medical record are reused for clinical and translational research projects. The applicability of the approach is illustrated by two case studies from colon cancer and prostate cancer research projects.


Assuntos
Institutos de Câncer , Informática Médica/métodos , Oncologia/métodos , Pesquisa Translacional Biomédica/métodos , Algoritmos , Alemanha , Sistemas de Informação Hospitalar , Humanos , Sistemas Computadorizados de Registros Médicos , Desenvolvimento de Programas , Avaliação de Programas e Projetos de Saúde , Projetos de Pesquisa , Software
9.
Appl Clin Inform ; 12(1): 57-64, 2021 01.
Artigo em Inglês | MEDLINE | ID: mdl-33506478

RESUMO

BACKGROUND: The identification of patient cohorts for recruiting patients into clinical trials requires an evaluation of study-specific inclusion and exclusion criteria. These criteria are specified depending on corresponding clinical facts. Some of these facts may not be present in the clinical source systems and need to be calculated either in advance or at cohort query runtime (so-called feasibility query). OBJECTIVES: We use the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) as the repository for our clinical data. However, Atlas, the graphical user interface of OMOP, does not offer the functionality to perform calculations on facts data. Therefore, we were in search for a different approach. The objective of this study is to investigate whether the Arden Syntax can be used for feasibility queries on the OMOP CDM to enable on-the-fly calculations at query runtime, to eliminate the need to precalculate data elements that are involved with researchers' criteria specification. METHODS: We implemented a service that reads the facts from the OMOP repository and provides it in a form which an Arden Syntax Medical Logic Module (MLM) can process. Then, we implemented an MLM that applies the eligibility criteria to every patient data set and outputs the list of eligible cases (i.e., performs the feasibility query). RESULTS: The study resulted in an MLM-based feasibility query that identifies cases of overventilation as an example of how an on-the-fly calculation can be realized. The algorithm is split into two MLMs to provide the reusability of the approach. CONCLUSION: We found that MLMs are a suitable technology for feasibility queries on the OMOP CDM. Our method of performing on-the-fly calculations can be employed with any OMOP instance and without touching existing infrastructure like the Extract, Transform and Load pipeline. Therefore, we think that it is a well-suited method to perform on-the-fly calculations on OMOP.


Assuntos
Algoritmos , Bases de Dados Factuais , Estudos de Coortes
10.
JMIR Med Inform ; 9(4): e25645, 2021 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-33792554

RESUMO

BACKGROUND: The harmonization and standardization of digital medical information for research purposes is a challenging and ongoing collaborative effort. Current research data repositories typically require extensive efforts in harmonizing and transforming original clinical data. The Fast Healthcare Interoperability Resources (FHIR) format was designed primarily to represent clinical processes; therefore, it closely resembles the clinical data model and is more widely available across modern electronic health records. However, no common standardized data format is directly suitable for statistical analyses, and data need to be preprocessed before statistical analysis. OBJECTIVE: This study aimed to elucidate how FHIR data can be queried directly with a preprocessing service and be used for statistical analyses. METHODS: We propose that the binary JavaScript Object Notation format of the PostgreSQL (PSQL) open source database is suitable for not only storing FHIR data, but also extending it with preprocessing and filtering services, which directly transform data stored in FHIR format into prepared data subsets for statistical analysis. We specified an interface for this preprocessor, implemented and deployed it at University Hospital Erlangen-Nürnberg, generated 3 sample data sets, and analyzed the available data. RESULTS: We imported real-world patient data from 2016 to 2018 into a standard PSQL database, generating a dataset of approximately 35.5 million FHIR resources, including "Patient," "Encounter," "Condition" (diagnoses specified using International Classification of Diseases codes), "Procedure," and "Observation" (laboratory test results). We then integrated the developed preprocessing service with the PSQL database and the locally installed web-based KETOS analysis platform. Advanced statistical analyses were feasible using the developed framework using 3 clinically relevant scenarios (data-driven establishment of hemoglobin reference intervals, assessment of anemia prevalence in patients with cancer, and investigation of the adverse effects of drugs). CONCLUSIONS: This study shows how the standard open source database PSQL can be used to store FHIR data and be integrated with a specifically developed preprocessing and analysis framework. This enables dataset generation with advanced medical criteria and the integration of subsequent statistical analysis. The web-based preprocessing service can be deployed locally at the hospital level, protecting patients' privacy while being integrated with existing open source data analysis tools currently being developed across Germany.

11.
Stud Health Technol Inform ; 278: 217-223, 2021 May 24.
Artigo em Inglês | MEDLINE | ID: mdl-34042897

RESUMO

Semantic interoperability is a major challenge in multi-center data sharing projects, a challenge that the German Initiative for Medical Informatics is taking up. With respect to laboratory data, enriching site-specific tests and measurements with LOINC codes appears to be a crucial step in supporting cross-institutional research. However, this effort is very time-consuming, as it requires expert knowledge of local site specifics. To ease this process, we developed a generic manual collaborative terminology mapping tool, the MIRACUM Mapper. It allows the creation of arbitrary mapping workflows involving different user roles. A mapping workflow with two user roles has been implemented at University Hospital Erlangen to support the local LOINC mapping. Additionally, the MIRACUM LabVisualizeR provides summary statistics and visualizations of analyte data. We developed a toolbox that facilitates the collaborative creation of mappings and streamlines the review as well as the validation process. The two tools are available under an open source license.


Assuntos
Logical Observation Identifiers Names and Codes , Informática Médica , Instalações de Saúde , Humanos , Disseminação de Informação , Laboratórios
12.
Appl Clin Inform ; 12(4): 826-835, 2021 08.
Artigo em Inglês | MEDLINE | ID: mdl-34433217

RESUMO

BACKGROUND: Many research initiatives aim at using data from electronic health records (EHRs) in observational studies. Participating sites of the German Medical Informatics Initiative (MII) established data integration centers to integrate EHR data within research data repositories to support local and federated analyses. To address concerns regarding possible data quality (DQ) issues of hospital routine data compared with data specifically collected for scientific purposes, we have previously presented a data quality assessment (DQA) tool providing a standardized approach to assess DQ of the research data repositories at the MIRACUM consortium's partner sites. OBJECTIVES: Major limitations of the former approach included manual interpretation of the results and hard coding of analyses, making their expansion to new data elements and databases time-consuming and error prone. We here present an enhanced version of the DQA tool by linking it to common data element definitions stored in a metadata repository (MDR), adopting the harmonized DQA framework from Kahn et al and its application within the MIRACUM consortium. METHODS: Data quality checks were consequently aligned to a harmonized DQA terminology. Database-specific information were systematically identified and represented in an MDR. Furthermore, a structured representation of logical relations between data elements was developed to model plausibility-statements in the MDR. RESULTS: The MIRACUM DQA tool was linked to data element definitions stored in a consortium-wide MDR. Additional databases used within MIRACUM were linked to the DQ checks by extending the respective data elements in the MDR with the required information. The evaluation of DQ checks was automated. An adaptable software implementation is provided with the R package DQAstats. CONCLUSION: The enhancements of the DQA tool facilitate the future integration of new data elements and make the tool scalable to other databases and data models. It has been provided to all ten MIRACUM partners and was successfully deployed and integrated into their respective data integration center infrastructure.


Assuntos
Confiabilidade dos Dados , Informática Médica , Bases de Dados Factuais , Registros Eletrônicos de Saúde , Metadados
13.
Appl Clin Inform ; 11(2): 342-349, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-32403139

RESUMO

OBJECTIVES: This study aimed to describe an alternative approach for accessing electronic medical records (EMRs) from clinical decision support (CDS) functions based on Arden Syntax Medical Logic Modules, which can be paraphrased as "map the entire record." METHODS: Based on an experimental Arden Syntax processor, we implemented a method to transform patient data from a commercial patient data management system (PDMS) to tree-structured documents termed CDS EMRs. They are encoded in a specific XML format that can be directly transformed to Arden Syntax data types by a mapper natively integrated into the processor. The internal structure of a CDS EMR reflects the tabbed view of an EMR in the graphical user interface of the PDMS. RESULTS: The study resulted in an architecture that provides CDS EMRs in the form of a network service. The approach enables uniform data access from all Medical Logic Modules and requires no mapping parameters except a case number. Measurements within a CDS EMR can be addressed with straightforward path expressions. The approach is in routine use at a German university hospital for more than 2 years. CONCLUSION: This practical approach facilitates the use of CDS functions in the clinical routine at our local hospital. It is transferrable to standard-compliant Arden Syntax processors with moderate effort. Its comprehensibility can also facilitate teaching and development. Moreover, it may lower the entry barrier for the application of the Arden Syntax standard and could therefore promote its dissemination.


Assuntos
Registros Eletrônicos de Saúde , Lógica , Fatores de Tempo
14.
Biopreserv Biobank ; 18(3): 155-164, 2020 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-32302498

RESUMO

Introduction: The Minimum Information About BIobank data Sharing (MIABIS) was initiated in 2012. MIABIS aims to create a common biobank terminology to facilitate data sharing in biobanks and sample collections. The MIABIS Core terminology consists of three components describing biobanks, sample collections, and studies, in which information on samples and sample donors is provided at aggregated form. However, there is also a need to describe samples and sample donors at an individual level to allow more elaborate queries on available biobank samples and data. Therefore the MIABIS terminology has now been extended with components describing samples and sample donors at an individual level. Materials and Methods: The components were defined according to specific scope and use cases by a large group of experts, and through several cycles of reviews, according to the new MIABIS governance model of BBMRI-ERIC (Biobanking and Biomolecular Resources Research Infrastructure-European Research Infrastructure Consortium). The guiding principles applied in developing these components included the following terms: model should consider only samples of human origin, model should be applicable to all types of samples and all sample donors, and model should describe the current status of samples stored in a given biobank. Results: A minimal set of standard attributes for defining samples and sample donors is presented here. We added an "event" component to describe attributes that are not directly describing samples or sample donors but are tightly related to them. To better utilize the generic data model, we suggest a procedure by which interoperability can be promoted, using specific MIABIS profiles. Discussion: The MIABIS sample and donor component extensions and the new generic data model complement the existing MIABIS Core 2.0 components, and substantially increase the potential usability of this terminology for better describing biobank samples and sample donors. They also support the use of individual level data about samples and sample donors to obtain accurate and detailed biobank availability queries.


Assuntos
Bancos de Espécimes Biológicos , Disseminação de Informação/métodos , Guias como Assunto , Humanos , Terminologia como Assunto
15.
Front Public Health ; 8: 594117, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33520914

RESUMO

The COVID-19 pandemic has caused strains on health systems worldwide disrupting routine hospital services for all non-COVID patients. Within this retrospective study, we analyzed inpatient hospital admissions across 18 German university hospitals during the 2020 lockdown period compared to 2018. Patients admitted to hospital between January 1 and May 31, 2020 and the corresponding periods in 2018 and 2019 were included in this study. Data derived from electronic health records were collected and analyzed using the data integration center infrastructure implemented in the university hospitals that are part of the four consortia funded by the German Medical Informatics Initiative. Admissions were grouped and counted by ICD 10 chapters and specific reasons for treatment at each site. Pooled aggregated data were centrally analyzed with descriptive statistics to compare absolute and relative differences between time periods of different years. The results illustrate how care process adoptions depended on the COVID-19 epidemiological situation and the criticality of the disease. Overall inpatient hospital admissions decreased by 35% in weeks 1 to 4 and by 30.3% in weeks 5 to 8 after the lockdown announcement compared to 2018. Even hospital admissions for critical care conditions such as malignant cancer treatments were reduced. We also noted a high reduction of emergency admissions such as myocardial infarction (38.7%), whereas the reduction in stroke admissions was smaller (19.6%). In contrast, we observed a considerable reduction in admissions for non-critical clinical situations, such as hysterectomies for benign tumors (78.8%) and hip replacements due to arthrosis (82.4%). In summary, our study shows that the university hospital admission rates in Germany were substantially reduced following the national COVID-19 lockdown. These included critical care or emergency conditions in which deferral is expected to impair clinical outcomes. Future studies are needed to delineate how appropriate medical care of critically ill patients can be maintained during a pandemic.


Assuntos
COVID-19/epidemiologia , Serviço Hospitalar de Emergência/estatística & dados numéricos , Hospitalização/estatística & dados numéricos , Hospitais Universitários/estatística & dados numéricos , Pandemias/estatística & dados numéricos , Admissão do Paciente/estatística & dados numéricos , Quarentena/estatística & dados numéricos , Serviço Hospitalar de Emergência/tendências , Previsões , Alemanha/epidemiologia , Hospitalização/tendências , Hospitais Universitários/tendências , Humanos , Admissão do Paciente/tendências , Quarentena/tendências , Estudos Retrospectivos , SARS-CoV-2
16.
Appl Clin Inform ; 10(4): 679-692, 2019 08.
Artigo em Inglês | MEDLINE | ID: mdl-31509880

RESUMO

BACKGROUND: High-quality clinical data and biological specimens are key for medical research and personalized medicine. The Biobanking and Biomolecular Resources Research Infrastructure-European Research Infrastructure Consortium (BBMRI-ERIC) aims to facilitate access to such biological resources. The accompanying ADOPT BBMRI-ERIC project kick-started BBMRI-ERIC by collecting colorectal cancer data from European biobanks. OBJECTIVES: To transform these data into a common representation, a uniform approach for data integration and harmonization had to be developed. This article describes the design and the implementation of a toolset for this task. METHODS: Based on the semantics of a metadata repository, we developed a lexical bag-of-words matcher, capable of semiautomatically mapping local biobank terms to the central ADOPT BBMRI-ERIC terminology. Its algorithm supports fuzzy matching, utilization of synonyms, and sentiment tagging. To process the anonymized instance data based on these mappings, we also developed a data transformation application. RESULTS: The implementation was used to process the data from 10 European biobanks. The lexical matcher automatically and correctly mapped 78.48% of the 1,492 local biobank terms, and human experts were able to complete the remaining mappings. We used the expert-curated mappings to successfully process 147,608 data records from 3,415 patients. CONCLUSION: A generic harmonization approach was created and successfully used for cross-institutional data harmonization across 10 European biobanks. The software tools were made available as open source.


Assuntos
Bancos de Espécimes Biológicos/normas , Neoplasias Colorretais , Europa (Continente) , Humanos , Padrões de Referência
17.
Stud Health Technol Inform ; 267: 247-253, 2019 Sep 03.
Artigo em Inglês | MEDLINE | ID: mdl-31483279

RESUMO

INTRODUCTION: Data quality (DQ) is an important prerequisite for secondary use of electronic health record (EHR) data in clinical research, particularly with regards to progressing towards a learning health system, one of the MIRACUM consortium's goals. Following the successful integration of the i2b2 research data repository in MIRACUM, we present a standardized and generic DQ framework. STATE OF THE ART: Already established DQ evaluation methods do not cover all of MIRACUM's requirements. CONCEPT: A data quality analysis plan was developed to assess common data quality dimensions for demographic-, condition-, procedure- and department-related variables of MIRACUM's research data repository. IMPLEMENTATION: A data quality analysis (DQA) tool was developed using R scripts packaged in a Docker image with all the necessary dependencies and R libraries for easy distribution. It integrates with the i2b2 data repository at each MIRACUM site, executes an analysis on the data and generates a DQ report. LESSONS LEARNED: Our DQA tool brings the analysis to the data and thus meets the MIRACUM data protection requirements. It evaluates established DQ dimensions of data repositories in a standardized and easily distributable way. This analysis allowed us to reveal and revise inconsistencies in earlier versions of the ETL jobs. The framework is portable, easy to deploy across different sites and even further adaptable to other database schemes. CONCLUSION: The presented framework provides the first step towards a unified, standardized and harmonized EHR DQ assessment in MIRACUM. DQ issues can now be systematically identified by individual hospitals to subsequently implement site- or consortium-wide feedback loops to increase data quality.


Assuntos
Confiabilidade dos Dados , Registros Eletrônicos de Saúde , Bases de Dados Factuais
18.
PLoS One ; 14(10): e0223010, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31581246

RESUMO

BACKGROUND AND OBJECTIVE: To take full advantage of decision support, machine learning, and patient-level prediction models, it is important that models are not only created, but also deployed in a clinical setting. The KETOS platform demonstrated in this work implements a tool for researchers allowing them to perform statistical analyses and deploy resulting models in a secure environment. METHODS: The proposed system uses Docker virtualization to provide researchers with reproducible data analysis and development environments, accessible via Jupyter Notebook, to perform statistical analysis and develop, train and deploy models based on standardized input data. The platform is built in a modular fashion and interfaces with web services using the Health Level 7 (HL7) Fast Healthcare Interoperability Resources (FHIR) standard to access patient data. In our prototypical implementation we use an OMOP common data model (OMOP-CDM) database. The architecture supports the entire research lifecycle from creating a data analysis environment, retrieving data, and training to final deployment in a hospital setting. RESULTS: We evaluated the platform by establishing and deploying an analysis and end user application for hemoglobin reference intervals within the University Hospital Erlangen. To demonstrate the potential of the system to deploy arbitrary models, we loaded a colorectal cancer dataset into an OMOP database and built machine learning models to predict patient outcomes and made them available via a web service. We demonstrated both the integration with FHIR as well as an example end user application. Finally, we integrated the platform with the open source DataSHIELD architecture to allow for distributed privacy preserving data analysis and training across networks of hospitals. CONCLUSION: The KETOS platform takes a novel approach to data analysis, training and deploying decision support models in a hospital or healthcare setting. It does so in a secure and privacy-preserving manner, combining the flexibility of Docker virtualization with the advantages of standardized vocabularies, a widely applied database schema (OMOP-CDM), and a standardized way to exchange medical data (FHIR).


Assuntos
Sistemas de Apoio a Decisões Clínicas , Interoperabilidade da Informação em Saúde , Internet , Aprendizado de Máquina , Modelos Teóricos , Neoplasias Colorretais/terapia , Hemoglobinas/metabolismo , Humanos , Privacidade , Valores de Referência , Resultado do Tratamento , Interface Usuário-Computador
20.
Artigo em Inglês | MEDLINE | ID: mdl-30147028

RESUMO

Clinical trials are the foundation of evidence-based medicine and their computerized support has been a recurring theme in medical informatics. One challenging aspect is the representation of eligibility criteria in a machine-readable format to automate the identification of suitable participants. In this study, we investigate the capabilities for expressing trial eligibility criteria via the search functionality specified in HL7 FHIR, an emerging standard for exchanging healthcare information electronically which also defines a set of operations for searching for health record data. Using a randomly sampled subset of 303 eligibility criteria from ClinicalTrials.gov yielded a 34 % success rate in representing them using the FHIR search semantics. While limitations are present, the FHIR search semantics are a viable tool for supporting preliminary trial eligibility assessment.


Assuntos
Ensaios Clínicos como Assunto , Registros Eletrônicos de Saúde , Semântica , Atenção à Saúde , Informática Médica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA