Your browser doesn't support javascript.
loading
Augmented intelligence with natural language processing applied to electronic health records for identifying patients with non-alcoholic fatty liver disease at risk for disease progression.
Van Vleck, Tielman T; Chan, Lili; Coca, Steven G; Craven, Catherine K; Do, Ron; Ellis, Stephen B; Kannry, Joseph L; Loos, Ruth J F; Bonis, Peter A; Cho, Judy; Nadkarni, Girish N.
  • Van Vleck TT; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, USA. Electronic address: tielman.vanvleck@mssm.edu.
  • Chan L; Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, USA.
  • Coca SG; Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, USA.
  • Craven CK; Institute for Healthcare Delivery Science, Dept. of Pop. Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, USA; Clinical Informatics Group, IT Department, Mount Sinai Health System, New York, USA.
  • Do R; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, USA.
  • Ellis SB; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, USA.
  • Kannry JL; Information Technology, Mount Sinai Medical Center, New York, USA.
  • Loos RJF; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, USA.
  • Bonis PA; Division of Gastroenterology, Tufts Medical Center, Boston, USA.
  • Cho J; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, USA; Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, USA; Division of Gastroenterology, Icahn School of Medicine at Mount Sinai, New York, USA.
  • Nadkarni GN; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, USA; Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, USA. Electronic address: girish.nadkarni@mountsinai.org.
Int J Med Inform ; 129: 334-341, 2019 09.
Article en En | MEDLINE | ID: mdl-31445275
ABSTRACT

OBJECTIVE:

Electronic health record (EHR) systems contain structured data (such as diagnostic codes) and unstructured data (clinical documentation). Clinical insights can be derived from analyzing both. The use of natural language processing (NLP) algorithms to effectively analyze unstructured data has been well demonstrated. Here we examine the utility of NLP for the identification of patients with non-alcoholic fatty liver disease, assess patterns of disease progression, and identify gaps in care related to breakdown in communication among providers. MATERIALS AND

METHODS:

All clinical notes available on the 38,575 patients enrolled in the Mount Sinai BioMe cohort were loaded into the NLP system. We compared analysis of structured and unstructured EHR data using NLP, free-text search, and diagnostic codes with validation against expert adjudication. We then used the NLP findings to measure physician impression of progression from early-stage NAFLD to NASH or cirrhosis. Similarly, we used the same NLP findings to identify mentions of NAFLD in radiology reports that did not persist into clinical notes.

RESULTS:

Out of 38,575 patients, we identified 2,281 patients with NAFLD. From the remainder, 10,653 patients with similar data density were selected as a control group. NLP outperformed ICD and text search in both sensitivity (NLP 0.93, ICD 0.28, text search 0.81) and F2 score (NLP 0.92, ICD 0.34, text search 0.81). Of 2281 NAFLD patients, 673 (29.5%) were believed to have progressed to NASH or cirrhosis. Among 176 where NAFLD was noted prior to NASH, the average progression time was 410 days. 619 (27.1%) NAFLD patients had it documented only in radiology notes and not acknowledged in other forms of clinical documentation. Of these, 170 (28.4%) were later identified as having likely developed NASH or cirrhosis after a median 1057.3 days.

DISCUSSION:

NLP-based approaches were more accurate at identifying NAFLD within the EHR than ICD/text search-based approaches. Suspected NAFLD on imaging is often not acknowledged in subsequent clinical documentation. Many such patients are later found to have more advanced liver disease. Analysis of information flows demonstrated loss of key information that could have been used to help prevent the progression of early NAFLD (NAFL) to NASH or cirrhosis.

CONCLUSION:

For identification of NAFLD, NLP performed better than alternative selection modalities. It then facilitated analysis of knowledge flow between physician and enabled the identification of breakdowns where key information was lost that could have slowed or prevented later disease progression.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Procesamiento de Lenguaje Natural / Registros Electrónicos de Salud / Enfermedad del Hígado Graso no Alcohólico Tipo de estudio: Etiology_studies / Incidence_studies / Observational_studies / Prognostic_studies / Qualitative_research / Risk_factors_studies Límite: Female / Humans / Male / Middle aged Idioma: En Año: 2019 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Procesamiento de Lenguaje Natural / Registros Electrónicos de Salud / Enfermedad del Hígado Graso no Alcohólico Tipo de estudio: Etiology_studies / Incidence_studies / Observational_studies / Prognostic_studies / Qualitative_research / Risk_factors_studies Límite: Female / Humans / Male / Middle aged Idioma: En Año: 2019 Tipo del documento: Article