Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
J Biomed Inform ; 126: 103969, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34864210

RESUMO

With clinical trials unable to detect all potential adverse reactions to drugs and medical devices prior to their release into the market, accurate post-market surveillance is critical to ensure their safety and efficacy. Electronic health records (EHR) contain rich observational patient data, making them a valuable source to actively monitor the safety of drugs and devices. While structured EHR data and spontaneous reporting systems often underreport the complexities of patient encounters and outcomes, free-text clinical notes offer greater detail about a patient's status. Previous studies have proposed machine learning methods to detect adverse events from clinical notes, but suffer from manually extracted features, reliance on costly hand-labeled data, and lack of validation on external datasets. To address these challenges, we develop a weakly-supervised machine learning framework for adverse event detection from unstructured clinical notes and evaluate it on insulin pump failure as a test case. Our model accurately detected cases of pump failure with 0.842 PR AUC on the holdout test set and 0.815 PR AUC when validated on an external dataset. Our approach allowed us to leverage a large dataset with far less hand-labeled data and can be easily transferred to additional adverse events for scalable post-market surveillance.


Assuntos
Registros Eletrônicos de Saúde , Aprendizado de Máquina , Humanos
2.
Am J Emerg Med ; 51: 388-392, 2022 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-34839182

RESUMO

BACKGROUND: The Mortality Probability Model (MPM) is used in research and quality improvement to adjust for severity of illness and can also inform triage decisions. However, a limitation for its automated use or application is that it includes the variable "intracranial mass effect" (IME), which requires human engagement with the electronic health record (EHR). We developed and tested a natural language processing (NLP) algorithm to identify IME from CT head reports. METHODS: We obtained initial CT head reports from adult patients who were admitted to the ICU from our ED between 10/2013 and 9/2016. Each head CT head report was labeled yes/no IME by at least two of five independent labelers. The reports were then randomly divided 80/20 into training and test sets. All reports were preprocessed to remove linguistic and style variability, and a dictionary was created to map similar common terms. We tested three vectorization strategies: Term Frequency-Inverse Document frequency (TF-IDF), Word2Vec, and Universal Sentence Encoder to convert the report text to a numerical vector. This vector served as the input to a classification-tree-based ensemble machine learning algorithm (XGBoost). After training, model performance was assessed in the test set using the area under the receiver operating characteristic curve (AUROC). We also divided the continuous range of scores into positive/inconclusive/negative categories for IME. RESULTS: Of the 1202 CT reports in the training set, 308 (25.6%) reports were manually labeled as "yes" for IME. Of the 355 reports in the test set, 108 (30.4%) were labeled as "yes" for IME. The TF-IDF vectorization strategy as an input for the XGBoost model had the best AUROC:-- 0.9625 (95% CI 0.9443-0.9807). TF-IDF score categories were defined and had the following likelihood ratios: "positive" (TF-IDF score > 0.5) LR = 24.59; "inconclusive" (TF-IDF 0.05-0.5) LR = 0.99; and "negative" (TF-IDF < 0.05) LR = 0.05. 82% of reports were classified as either "positive" or "negative". In the test set, only 4 of 199 (2.0%) reports with a "negative" classification were false negatives and only 8 of 93 (8.6%) reports classified as "positive" were false positives. CONCLUSION: NLP can accurately identify IME from free-text reports of head CTs in approximately 80% of records, adequate to allow automatic calculation of MPM based on EHR data for many applications.


Assuntos
Neoplasias Encefálicas/diagnóstico por imagem , Registros Eletrônicos de Saúde , Processamento de Linguagem Natural , Tomografia Computadorizada por Raios X , Área Sob a Curva , Humanos , Modelos Logísticos , Aprendizado de Máquina , Curva ROC
3.
Sci Rep ; 11(1): 9461, 2021 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-33947927

RESUMO

Efficient prediction of cancer recurrence in advance may help to recruit high risk breast cancer patients for clinical trial on-time and can guide a proper treatment plan. Several machine learning approaches have been developed for recurrence prediction in previous studies, but most of them use only structured electronic health records and only a small training dataset, with limited success in clinical application. While free-text clinic notes may offer the greatest nuance and detail about a patient's clinical status, they are largely excluded in previous predictive models due to the increase in processing complexity and need for a complex modeling framework. In this study, we developed a weak-supervision framework for breast cancer recurrence prediction in which we trained a deep learning model on a large sample of free-text clinic notes by utilizing a combination of manually curated labels and NLP-generated non-perfect recurrence labels. The model was trained jointly on manually curated data from 670 patients and NLP-curated data of 8062 patients. It was validated on manually annotated data from 224 patients with recurrence and achieved 0.94 AUROC. This weak supervision approach allowed us to learn from a larger dataset using imperfect labels and ultimately provided greater accuracy compared to a smaller hand-curated dataset, with less manual effort invested in curation.


Assuntos
Neoplasias da Mama/patologia , Recidiva Local de Neoplasia/patologia , Doença Crônica , Registros Eletrônicos de Saúde , Feminino , Humanos , Aprendizado de Máquina , Pessoa de Meia-Idade
4.
AMIA Jt Summits Transl Sci Proc ; 2020: 552-560, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32477677

RESUMO

A substantial percentage of prostate cancer cases are overdiagnosed and overtreated due to the challenge in deter- mining aggressiveness. Multi-parametric MR is a powerful imaging technique to capture distinct characteristics of prostate lesions that are informative for aggressiveness assessment. However, manual interpretation requires a high level of expertise, is time-consuming, and significant inter-observer variation exists for radiologists. We propose a completely automated approach to assessing pixel-level aggressiveness of prostate cancer in multi-parametric MRI. Our model efficiently combines traditional computer vision and deep learning algorithms, to remove reliance on manual features, prostate segmentation, and prior lesion detection and identified optimal combinations of MR pulse sequences for assessment. Using ADC and DWI, our proposed model achieves ROC-AUC of 0.86 and ROC-AUC of 0.88 for the diagnosis of aggressive and non-aggressive prostate lesions, respectively. In performing pixel-level clas- sification, our model's classifications are easily interpretable and allow clinicians to infer localized analyses of the lesion.

SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa