Your browser doesn't support javascript.
loading
Automatically determining cause of death from verbal autopsy narratives.
Jeblee, Serena; Gomes, Mireille; Jha, Prabhat; Rudzicz, Frank; Hirst, Graeme.
Affiliation
  • Jeblee S; Department of Computer Science, University of Toronto, Toronto, Canada. sjeblee@cs.toronto.edu.
  • Gomes M; Vector Institute for Artificial Intelligence, Toronto, Canada. sjeblee@cs.toronto.edu.
  • Jha P; Centre for Global Health Research, St.Michael's Hospital, Toronto, Canada.
  • Rudzicz F; Dalla Lana School of Public Health, University of Toronto, Toronto, Canada.
  • Hirst G; Centre for Global Health Research, St.Michael's Hospital, Toronto, Canada.
BMC Med Inform Decis Mak ; 19(1): 127, 2019 07 09.
Article in En | MEDLINE | ID: mdl-31288814
BACKGROUND: A verbal autopsy (VA) is a post-hoc written interview report of the symptoms preceding a person's death in cases where no official cause of death (CoD) was determined by a physician. Current leading automated VA coding methods primarily use structured data from VAs to assign a CoD category. We present a method to automatically determine CoD categories from VA free-text narratives alone. METHODS: After preprocessing and spelling correction, our method extracts word frequency counts from the narratives and uses them as input to four different machine learning classifiers: naïve Bayes, random forest, support vector machines, and a neural network. RESULTS: For individual CoD classification, our best classifier achieves a sensitivity of.770 for adult deaths for 15 CoD categories (as compared to the current best reported sensitivity of.57), and.662 with 48 WHO categories. When predicting the CoD distribution at the population level, our best classifier achieves.962 cause-specific mortality fraction accuracy for 15 categories and.908 for 48 categories, which is on par with leading CoD distribution estimation methods. CONCLUSIONS: Our narrative-based machine learning classifier performs as well as classifiers based on structured data at the individual level. Moreover, our method demonstrates that VA narratives provide important information that can be used by a machine learning system for automated CoD classification. Unlike the structured questionnaire-based methods, this method can be applied to any verbal autopsy dataset, regardless of the collection process or country of origin.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Natural Language Processing / Cause of Death / Diagnostic Techniques and Procedures / Narration / Machine Learning Type of study: Diagnostic_studies / Qualitative_research Limits: Adolescent / Adult / Aged / Child / Child, preschool / Female / Humans / Infant / Male / Middle aged Language: En Journal: BMC Med Inform Decis Mak Journal subject: INFORMATICA MEDICA Year: 2019 Document type: Article Affiliation country: Country of publication:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Natural Language Processing / Cause of Death / Diagnostic Techniques and Procedures / Narration / Machine Learning Type of study: Diagnostic_studies / Qualitative_research Limits: Adolescent / Adult / Aged / Child / Child, preschool / Female / Humans / Infant / Male / Middle aged Language: En Journal: BMC Med Inform Decis Mak Journal subject: INFORMATICA MEDICA Year: 2019 Document type: Article Affiliation country: Country of publication: