Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 10 de 10
1.
Br J Surg ; 111(1)2024 Jan 03.
Article En | MEDLINE | ID: mdl-38198154

BACKGROUND: Cancer multidisciplinary team (MDT) meetings are under intense pressure to reform given the rapidly rising incidence of cancer and national mandates for protocolized streaming of cases. The aim of this study was to validate a natural language processing (NLP)-based web platform to automate evidence-based MDT decisions for skin cancer with basal cell carcinoma as a use case. METHODS: A novel and validated NLP information extraction model was used to extract perioperative tumour and surgical factors from histopathology reports. A web application with a bespoke application programming interface used data from this model to provide an automated clinical decision support system, mapped to national guidelines and generating a patient letter to communicate ongoing management. Performance was assessed against retrospectively derived recommendations by two independent and blinded expert clinicians. RESULTS: There were 893 patients (1045 lesions) used to internally validate the model. High accuracy was observed when compared against human predictions, with an overall value of 0.92. Across all classifiers the virtual skin MDT was highly specific (0.96), while sensitivity was lower (0.72). CONCLUSION: This study demonstrates the feasibility of a fully automated, virtual, web-based service model to host the skin MDT with good system performance. This platform could be used to support clinical decision-making during MDTs as 'human in the loop' approach to aid protocolized streaming. Future prospective studies are needed to validate the model in tumour types where guidelines are more complex.


Natural Language Processing , Skin Neoplasms , Humans , Retrospective Studies , Skin Neoplasms/surgery , Patient Care Team , Internet
2.
Epilepsia ; 64(11): 3099-3108, 2023 11.
Article En | MEDLINE | ID: mdl-37643892

OBJECTIVE: This study was undertaken to develop a novel pathway linking genetic data with routinely collected data for people with epilepsy, and to analyze the influence of rare, deleterious genetic variants on epilepsy outcomes. METHODS: We linked whole-exome sequencing (WES) data with routinely collected primary and secondary care data and natural language processing (NLP)-derived seizure frequency information for people with epilepsy within the Secure Anonymised Information Linkage Databank. The study participants were adults who had consented to participate in the Swansea Neurology Biobank, Wales, between 2016 and 2018. DNA sequencing was carried out as part of the Epi25 collaboration. For each individual, we calculated the total number and cumulative burden of rare and predicted deleterious genetic variants and the total of rare and deleterious variants in epilepsy and drug metabolism genes. We compared these measures with the following outcomes: (1) no unscheduled hospital admissions versus unscheduled admissions for epilepsy, (2) antiseizure medication (ASM) monotherapy versus polytherapy, and (3) at least 1 year of seizure freedom versus <1 year of seizure freedom. RESULTS: We linked genetic data for 107 individuals with epilepsy (52% female) to electronic health records. Twenty-six percent had unscheduled hospital admissions, and 70% were prescribed ASM polytherapy. Seizure frequency information was linked for 100 individuals, and 10 were seizure-free. There was no significant difference between the outcome groups in terms of the exome-wide and gene-based burden of rare and deleterious genetic variants. SIGNIFICANCE: We successfully uploaded, annotated, and linked genetic sequence data and NLP-derived seizure frequency data to anonymized health care records in this proof-of-concept study. We did not detect a genetic influence on real-world epilepsy outcomes, but our study was limited by a small sample size. Future studies will require larger (WES) data to establish genetic variant contribution to epilepsy outcomes.


Epilepsy , Adult , Humans , Female , Male , Exome Sequencing , Epilepsy/drug therapy , Epilepsy/genetics , Seizures/drug therapy , Delivery of Health Care , Information Storage and Retrieval , Anticonvulsants/therapeutic use
4.
Front Surg ; 9: 870494, 2022.
Article En | MEDLINE | ID: mdl-36439548

Introduction: Routinely collected healthcare data are a powerful research resource, but often lack detailed disease-specific information that is collected in clinical free text such as histopathology reports. We aim to use natural Language Processing (NLP) techniques to extract detailed clinical and pathological information from histopathology reports to enrich routinely collected data. Methods: We used the general architecture for text engineering (GATE) framework to build an NLP information extraction system using rule-based techniques. During validation, we deployed our rule-based NLP pipeline on 200 previously unseen, de-identified and pseudonymised basal cell carcinoma (BCC) histopathological reports from Swansea Bay University Health Board, Wales, UK. The results of our algorithm were compared with gold standard human annotation by two independent and blinded expert clinicians involved in skin cancer care. Results: We identified 11,224 items of information with a mean precision, recall, and F1 score of 86.0% (95% CI: 75.1-96.9), 84.2% (95% CI: 72.8-96.1), and 84.5% (95% CI: 73.0-95.1), respectively. The difference between clinician annotator F1 scores was 7.9% in comparison with 15.5% between the NLP pipeline and the gold standard corpus. Cohen's Kappa score on annotated tokens was 0.85. Conclusion: Using an NLP rule-based approach for named entity recognition in BCC, we have been able to develop and validate a pipeline with a potential application in improving the quality of cancer registry data, supporting service planning, and enhancing the quality of routinely collected data for research.

5.
Seizure ; 94: 39-42, 2022 Jan.
Article En | MEDLINE | ID: mdl-34864250

PURPOSE: The COVID-19 pandemic has increased mortality worldwide and those with chronic conditions may have been disproportionally affected. However, it is unknown whether the pandemic has changed mortality rates for people with epilepsy. We aimed to compare mortality rates in people with epilepsy in Wales during the pandemic with pre-pandemic rates. METHODS: We performed a retrospective study using individual-level linked population-scale anonymised electronic health records. We identified deaths in people with epilepsy (DPWE), i.e. those with a diagnosis of epilepsy, and deaths associated with epilepsy (DAE), where epilepsy was recorded as a cause of death on death certificates. We compared death rates in 2020 with average rates in 2015-2019 using Poisson models to calculate death rate ratios. RESULTS: There were 188 DAE and 628 DPWE in Wales in 2020 (death rates: 7.7/100,000/year and 25.7/100,000/year). The average rates for DAE and DPWE from 2015 to 2019 were 5.8/100,000/year and 23.8/100,000/year, respectively. Death rate ratios (2020 compared to 2015-2019) for DAE were 1.34 (95%CI 1.14-1.57, p<0.001) and for DPWE were 1.08 (0.99-1.17, p = 0.09). The death rate ratios for non-COVID deaths (deaths without COVID mentioned on death certificates) for DAE were 1.17 (0.99-1.39, p = 0.06) and for DPWE were 0.96 (0.87-1.05, p = 0.37). CONCLUSIONS: The significant increase in DAE in Wales during 2020 could be explained by the direct effect of COVID-19 infection. Non-COVID-19 deaths have not increased significantly but further work is needed to assess the longer-term impact.


COVID-19 , Epilepsy , Cause of Death , Epilepsy/epidemiology , Humans , Pandemics , Retrospective Studies , SARS-CoV-2 , Wales/epidemiology
6.
Front Digit Health ; 3: 598916, 2021.
Article En | MEDLINE | ID: mdl-34713086

Across various domains, such as health and social care, law, news, and social media, there are increasing quantities of unstructured texts being produced. These potential data sources often contain rich information that could be used for domain-specific and research purposes. However, the unstructured nature of free-text data poses a significant challenge for its utilisation due to the necessity of substantial manual intervention from domain-experts to label embedded information. Annotation tools can assist with this process by providing functionality that enables the accurate capture and transformation of unstructured texts into structured annotations, which can be used individually, or as part of larger Natural Language Processing (NLP) pipelines. We present Markup (https://www.getmarkup.com/) an open-source, web-based annotation tool that is undergoing continued development for use across all domains. Markup incorporates NLP and Active Learning (AL) technologies to enable rapid and accurate annotation using custom user configurations, predictive annotation suggestions, and automated mapping suggestions to both domain-specific ontologies, such as the Unified Medical Language System (UMLS), and custom, user-defined ontologies. We demonstrate a real-world use case of how Markup has been used in a healthcare setting to annotate structured information from unstructured clinic letters, where captured annotations were used to build and test NLP applications.

7.
Epilepsia ; 62(7): 1604-1616, 2021 07.
Article En | MEDLINE | ID: mdl-34046890

OBJECTIVE: This study was undertaken to determine whether epilepsy and antiepileptic drugs (including enzyme-inducing and non-enzyme-inducing drugs) are associated with major cardiovascular events using population-level, routinely collected data. METHODS: Using anonymized, routinely collected, health care data in Wales, UK, we performed a retrospective matched cohort study (2003-2017) of adults with epilepsy prescribed an antiepileptic drug. Controls were matched with replacement on age, gender, deprivation quintile, and year of entry into the study. Participants were followed to the end of the study for the occurrence of a major cardiovascular event, and survival models were constructed to compare the time to a major cardiovascular event (cardiac arrest, myocardial infarction, stroke, ischemic heart disease, clinically significant arrhythmia, thromboembolism, onset of heart failure, or a cardiovascular death) for individuals in the case group versus the control group. RESULTS: There were 10 241 cases (mean age = 49.6 years, 52.2% male, mean follow-up = 6.1 years) matched to 35 145 controls. A total of 3180 (31.1%) cases received enzyme-inducing antiepileptic drugs, and 7061 (68.9%) received non-enzyme-inducing antiepileptic drugs. Cases had an increased risk of experiencing a major cardiovascular event compared to controls (adjusted hazard ratio = 1.58, 95% confidence interval [CI] = 1.51-1.63, p < .001). There was no notable difference in major cardiovascular events between those treated with enzyme-inducing antiepileptic drugs and those treated with non-enzyme-inducing antiepileptic drugs (adjusted hazard ratio = .95, 95% CI = .86-1.05, p = .300). SIGNIFICANCE: Individuals with epilepsy prescribed antiepileptic drugs are at an increased risk of major cardiovascular events compared with population controls. Being prescribed an enzyme-inducing antiepileptic drug is not associated with a greater risk of a major cardiovascular event compared to treatment with other antiepileptic drugs. Our data emphasize the importance of cardiovascular risk management in the clinical care of people with epilepsy.


Anticonvulsants/adverse effects , Anticonvulsants/therapeutic use , Cardiovascular Diseases/etiology , Epilepsy/complications , Epilepsy/drug therapy , Adolescent , Adult , Aged , Aged, 80 and over , Cardiovascular Diseases/epidemiology , Case-Control Studies , Cohort Studies , Enzyme Induction/drug effects , Epilepsy/epidemiology , Female , Humans , Male , Middle Aged , Retrospective Studies , Risk Assessment , Risk Factors , Survival Analysis , Treatment Outcome , United Kingdom/epidemiology , Wales , Young Adult
8.
Neurology ; 2021 Jan 20.
Article En | MEDLINE | ID: mdl-33472926

OBJECTIVE: To characterise trends in incidence, prevalence, and healthcare outcomes in the idiopathic intracranial hypertension (IIH) population in Wales using routinely collected healthcare data. METHODS: We used and validated primary and secondary care IIH diagnosis codes within the Secure Anonymised Information Linkage databank, to ascertain IIH cases and controls, in a retrospective cohort study between 2003 and 2017. We recorded body mass index (BMI), deprivation quintile, CSF diversion surgery and unscheduled hospital admissions in case and control cohorts. RESULTS: We analysed 35 million patient years of data. There were 1765 cases of IIH in 2017 (85% female). The prevalence and incidence of IIH in 2017 was 76/100,000 and 7.8/100,000/year, a significant increase from 2003 (corresponding figures=12/100,000 and 2.3/100,000/year) (p<0.001). IIH prevalence is associated with increasing BMI and increasing deprivation. The odds ratio for developing IIH in the least deprived quintile compared to the most deprived quintile, adjusted for gender and BMI, was 0.65 (95% CI 0.55 to 0.76). 9% of IIH cases had CSF shunts with less than 0.2% having bariatric surgery. Unscheduled hospital admissions were higher in the IIH cohort compared to controls (rate ratio=5.28, p<0.001) and in individuals with IIH and CSF shunts compared to those without shunts (rate ratio=2.02, p<0.01). CONCLUSIONS: IIH incidence and prevalence is increasing considerably, corresponding to population increases in BMI, and is associated with increased deprivation. This has important implications for healthcare professionals and policy makers given the comorbidities, complications and increased healthcare utilization associated with IIH.

9.
BMJ Open ; 9(4): e023232, 2019 04 01.
Article En | MEDLINE | ID: mdl-30940752

OBJECTIVE: Routinely collected healthcare data are a powerful research resource but often lack detailed disease-specific information that is collected in clinical free text, for example, clinic letters. We aim to use natural language processing techniques to extract detailed clinical information from epilepsy clinic letters to enrich routinely collected data. DESIGN: We used the general architecture for text engineering (GATE) framework to build an information extraction system, ExECT (extraction of epilepsy clinical text), combining rule-based and statistical techniques. We extracted nine categories of epilepsy information in addition to clinic date and date of birth across 200 clinic letters. We compared the results of our algorithm with a manual review of the letters by an epilepsy clinician. SETTING: De-identified and pseudonymised epilepsy clinic letters from a Health Board serving half a million residents in Wales, UK. RESULTS: We identified 1925 items of information with overall precision, recall and F1 score of 91.4%, 81.4% and 86.1%, respectively. Precision and recall for epilepsy-specific categories were: epilepsy diagnosis (88.1%, 89.0%), epilepsy type (89.8%, 79.8%), focal seizures (96.2%, 69.7%), generalised seizures (88.8%, 52.3%), seizure frequency (86.3%-53.6%), medication (96.1%, 94.0%), CT (55.6%, 58.8%), MRI (82.4%, 68.8%) and electroencephalogram (81.5%, 75.3%). CONCLUSIONS: We have built an automated clinical text extraction system that can accurately extract epilepsy information from free text in clinic letters. This can enhance routinely collected data for research in the UK. The information extracted with ExECT such as epilepsy type, seizure frequency and neurological investigations are often missing from routinely collected data. We propose that our algorithm can bridge this data gap enabling further epilepsy research opportunities. While many of the rules in our pipeline were tailored to extract epilepsy specific information, our methods can be applied to other diseases and also can be used in clinical practice to record patient information in a structured manner.


Epilepsy/classification , Information Storage and Retrieval , Medical Records , Natural Language Processing , Seizures/classification , Algorithms , Electroencephalography , Electronic Health Records , Epilepsy/diagnosis , Humans , Magnetic Resonance Imaging , Seizures/diagnosis , Wales
10.
Seizure ; 52: 195-198, 2017 Nov.
Article En | MEDLINE | ID: mdl-29059611

PURPOSE: Anonymised, routinely-collected healthcare data is increasingly being used for epilepsy research. We validated algorithms using general practitioner (GP) primary healthcare records to identify people with epilepsy from anonymised healthcare data within the Secure Anonymised Information Linkage (SAIL) databank in Wales, UK. METHOD: A reference population of 150 people with definite epilepsy and 150 people without epilepsy was ascertained from hospital records and linked to records contained within SAIL (containing GP records for 2.4 million people). We used three different algorithms, using combinations of GP epilepsy diagnosis and anti-epileptic drug (AED) prescription codes, to identify the reference population. RESULTS: Combining diagnosis and AED prescription codes had a sensitivity of 84% (95% ci 77-90) and specificity of 98% (95-100) in identifying people with epilepsy; diagnosis codes alone had a sensitivity of 86% (80-91) and a specificity of 97% (92-99); and AED prescription codes alone achieved a sensitivity of 92% (70-83) and a specificity of 73% (65-80). Using AED codes only was more accurate in children achieving a sensitivity of 88% (75-95) and specificity of 98% (88-100). CONCLUSION: GP epilepsy diagnosis and AED prescription codes can be confidently used to identify people with epilepsy using anonymised healthcare records in Wales, UK.


Data Collection/methods , Epilepsy/diagnosis , Epilepsy/epidemiology , Adult , Algorithms , Anticonvulsants/therapeutic use , Child , Electronic Health Records/statistics & numerical data , Epilepsy/drug therapy , Female , Humans , Male , Reproducibility of Results , Wales/epidemiology
...