Using Natural Language Processing to Predict Fatal Drug Overdose From Autopsy Narrative Text: Algorithm Development and Validation Study.

Tang, Leigh Anne; Korona-Bailey, Jessica; Zaras, Dimitrios; Roberts, Allison; Mukhopadhyay, Sutapa; Espy, Stephen; Walsh, Colin G

Tang, Leigh Anne; Korona-Bailey, Jessica; Zaras, Dimitrios; Roberts, Allison; Mukhopadhyay, Sutapa; Espy, Stephen; Walsh, Colin G.

Affiliation

Tang LA; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States.
Korona-Bailey J; Office of Informatics and Analytics, Tennessee Department of Health, Nashville, TN, United States.
Zaras D; Office of Informatics and Analytics, Tennessee Department of Health, Nashville, TN, United States.
Roberts A; Office of Informatics and Analytics, Tennessee Department of Health, Nashville, TN, United States.
Mukhopadhyay S; Office of Informatics and Analytics, Tennessee Department of Health, Nashville, TN, United States.
Espy S; Office of Informatics and Analytics, Tennessee Department of Health, Nashville, TN, United States.
Walsh CG; Office of Informatics and Analytics, Tennessee Department of Health, Nashville, TN, United States.

JMIR Public Health Surveill ; 9: e45246, 2023 05 19.

Article in En | MEDLINE | ID: mdl-37204824

ABSTRACT

ABSTRACT

BACKGROUND:

Fatal drug overdose surveillance informs prevention but is often delayed because of autopsy report processing and death certificate coding. Autopsy reports contain narrative text describing scene evidence and medical history (similar to preliminary death scene investigation reports) and may serve as early data sources for identifying fatal drug overdoses. To facilitate timely fatal overdose reporting, natural language processing was applied to narrative texts from autopsies.

OBJECTIVE:

This study aimed to develop a natural language processing-based model that predicts the likelihood that an autopsy report narrative describes an accidental or undetermined fatal drug overdose.

METHODS:

Autopsy reports of all manners of death (2019-2021) were obtained from the Tennessee Office of the State Chief Medical Examiner. The text was extracted from autopsy reports (PDFs) using optical character recognition. Three common narrative text sections were identified, concatenated, and preprocessed (bag-of-words) using term frequency-inverse document frequency scoring. Logistic regression, support vector machine (SVM), random forest, and gradient boosted tree classifiers were developed and validated. Models were trained and calibrated using autopsies from 2019 to 2020 and tested using those from 2021. Model discrimination was evaluated using the area under the receiver operating characteristic, precision, recall, F1-score, and F2-score (prioritizes recall over precision). Calibration was performed using logistic regression (Platt scaling) and evaluated using the Spiegelhalter z test. Shapley additive explanations values were generated for models compatible with this method. In a post hoc subgroup analysis of the random forest classifier, model discrimination was evaluated by forensic center, race, age, sex, and education level.

RESULTS:

A total of 17,342 autopsies (n=5934, 34.22% cases) were used for model development and validation. The training set included 10,215 autopsies (n=3342, 32.72% cases), the calibration set included 538 autopsies (n=183, 34.01% cases), and the test set included 6589 autopsies (n=2409, 36.56% cases). The vocabulary set contained 4002 terms. All models showed excellent performance (area under the receiver operating characteristic ≥0.95, precision ≥0.94, recall ≥0.92, F1-score ≥0.94, and F2-score ≥0.92). The SVM and random forest classifiers achieved the highest F2-scores (0.948 and 0.947, respectively). The logistic regression and random forest were calibrated (P=.95 and P=.85, respectively), whereas the SVM and gradient boosted tree classifiers were miscalibrated (P=.03 and P<.001, respectively). "Fentanyl" and "accident" had the highest Shapley additive explanations values. Post hoc subgroup analyses revealed lower F2-scores for autopsies from forensic centers D and E. Lower F2-score were observed for the American Indian, Asian, ≤14 years, and ≥65 years subgroups, but larger sample sizes are needed to validate these findings.

CONCLUSIONS:

The random forest classifier may be suitable for identifying potential accidental and undetermined fatal overdose autopsies. Further validation studies should be conducted to ensure early detection of accidental and undetermined fatal drug overdoses across all subgroups.

Subject(s)

Drug Overdose; Natural Language Processing; Humans; Autopsy; Algorithms; Random Forest

Key words

SUDORS; State Unintentional Drug Overdose Reporting System; Tennessee; fatal drug overdose; natural language processing; surveillance

Fulltext

XML

PubMed Links

Search on Google

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Natural Language Processing / Drug Overdose Type of study: Prognostic_studies / Risk_factors_studies / Screening_studies Limits: Humans Language: En Journal: JMIR Public Health Surveill Year: 2023 Type: Article Affiliation country: United States

Fulltext

XML

PubMed Links

Search on Google