RESUMO
BACKGROUND: The National Institute of Health and Social Care Research (NIHR) Health Informatics Collaborative (HIC) for Hearing Health has been established in the UK to curate routinely collected hearing health data to address research questions. This study defines priority research areas, outlines its aims, governance structure and demonstrates how hearing health data have been integrated into a common data model using pure tone audiometry (PTA) as a case study. METHODS: After identifying key research aims in hearing health, the governance structure for the NIHR HIC for Hearing Health is described. The Observational Medical Outcomes Partnership (OMOP) was chosen as our common data model to provide a case study example. RESULTS: The NIHR HIC Hearing Health theme have developed a data architecture outlying the flow of data from all of the various siloed electronic patient record systems to allow the effective linkage of data from electronic patient record systems to research systems. Using PTAs as an example, OMOPification of hearing health data successfully collated a rich breadth of datapoints across multiple centres. CONCLUSION: This study identified priority research areas where routinely collected hearing health data could be useful. It demonstrates integration and standardisation of such data into a common data model from multiple centres. By describing the process of data sharing across the HIC, we hope to invite more centres to contribute and utilise data to address research questions in hearing health. This national initiative has the power to transform UK hearing research and hearing care using routinely collected clinical data.
Assuntos
Informática Médica , Humanos , Reino Unido , Registros Eletrônicos de Saúde , Pesquisa Biomédica , Audiometria de Tons PurosRESUMO
BACKGROUND: As more health care organizations transition to using electronic health record (EHR) systems, it is important for these organizations to maximize the secondary use of their data to support service improvement and clinical research. These organizations will find it challenging to have systems capable of harnessing the unstructured data fields in the record (clinical notes, letters, etc) and more practically have such systems interact with all of the hospital data systems (legacy and current). OBJECTIVE: We describe the deployment of the EHR interfacing information extraction and retrieval platform CogStack at University College London Hospitals (UCLH). METHODS: At UCLH, we have deployed the CogStack platform, an information retrieval platform with natural language processing capabilities. The platform addresses the problem of data ingestion and harmonization from multiple data sources using the Apache NiFi module for managing complex data flows. The platform also facilitates the extraction of structured data from free-text records through use of the MedCAT natural language processing library. Finally, data science tools are made available to support data scientists and the development of downstream applications dependent upon data ingested and analyzed by CogStack. RESULTS: The platform has been deployed at the hospital, and in particular, it has facilitated a number of research and service evaluation projects. To date, we have processed over 30 million records, and the insights produced from CogStack have informed a number of clinical research use cases at the hospital. CONCLUSIONS: The CogStack platform can be configured to handle the data ingestion and harmonization challenges faced by a hospital. More importantly, the platform enables the hospital to unlock important clinical information from the unstructured portion of the record using natural language processing technology.
RESUMO
Background: To determine the impact of the COVID-19 pandemic on the population with chronic Hepatitis B virus (HBV) infection under hospital follow-up in the UK, we quantified the coverage and frequency of measurements of biomarkers used for routine surveillance (alanine transferase [ALT] and HBV viral load). Methods: We used anonymized electronic health record data from the National Institute for Health Research (NIHR) Health Informatics Collaborative (HIC) pipeline representing five UK National Health Service (NHS) Trusts. Results: We report significant reductions in surveillance of both biomarkers during the pandemic compared to pre-COVID-19 years, both in terms of the proportion of patients who had ≥1 measurement annually, and the mean number of measurements per patient. Conclusions: These results demonstrate the real-time utility of HIC data in monitoring health-care provision, and support interventions to provide catch-up services to minimise the impact of the pandemic. Further investigation is required to determine whether these disruptions will be associated with increased rates of adverse chronic HBV outcomes.
RESUMO
Clinical trials often fail to recruit an adequate number of appropriate patients. Identifying eligible trial participants is resource-intensive when relying on manual review of clinical notes, particularly in critical care settings where the time window is short. Automated review of electronic health records (EHR) may help, but much of the information is in free text rather than a computable form. We applied natural language processing (NLP) to free text EHR data using the CogStack platform to simulate recruitment into the LeoPARDS study, a clinical trial aiming to reduce organ dysfunction in septic shock. We applied an algorithm to identify eligible patients using a moving 1-hour time window, and compared patients identified by our approach with those actually screened and recruited for the trial, for the time period that data were available. We manually reviewed records of a random sample of patients identified by the algorithm but not screened in the original trial. Our method identified 376 patients, including 34 patients with EHR data available who were actually recruited to LeoPARDS in our centre. The sensitivity of CogStack for identifying patients screened was 90% (95% CI 85%, 93%). Of the 203 patients identified by both manual screening and CogStack, the index date matched in 95 (47%) and CogStack was earlier in 94 (47%). In conclusion, analysis of EHR data using NLP could effectively replicate recruitment in a critical care trial, and identify some eligible patients at an earlier stage, potentially improving trial recruitment if implemented in real time.
Assuntos
Ensaios Clínicos como Assunto , Mineração de Dados/métodos , Registros Eletrônicos de Saúde , Processamento de Linguagem Natural , Seleção de Pacientes , Adulto , Simulação por Computador , Cuidados Críticos , Feminino , Humanos , MasculinoRESUMO
OBJECTIVE: The National Institute for Health Research (NIHR) Health Informatics Collaborative (HIC) is a programme of infrastructure development across NIHR Biomedical Research Centres. The aim of the NIHR HIC is to improve the quality and availability of routinely collected data for collaborative, cross-centre research. This is demonstrated through research collaborations in selected therapeutic areas, one of which is viral hepatitis. DESIGN: The collaboration in viral hepatitis identified a rich set of datapoints, including information on clinical assessment, antiviral treatment, laboratory test results and health outcomes. Clinical data from different centres were standardised and combined to produce a research-ready dataset; this was used to generate insights regarding disease prevalence and treatment response. RESULTS: A comprehensive database has been developed for potential viral hepatitis research interests, with a corresponding data dictionary for researchers across the centres. An initial cohort of 960 patients with chronic hepatitis B infections and 1404 patients with chronic hepatitis C infections has been collected. CONCLUSION: For the first time, large prospective cohorts are being formed within National Health Service (NHS) secondary care services that will allow research questions to be rapidly addressed using real-world data. Interactions with industry partners will help to shape future research and will inform patient-stratified clinical practice. An emphasis on NHS-wide systems interoperability, and the increased utilisation of structured data solutions for electronic patient records, is improving access to data for research, service improvement and the reduction of clinical data gaps.