Your browser doesn't support javascript.
loading
A natural language processing algorithm accurately classifies steatotic liver disease pathology to estimate the risk of cirrhosis.
Sherman, Marc S; Challa, Prasanna K; Przybyszewski, Eric M; Wilechansky, Robert M; Uche-Anya, Eugenia N; Ott, Ashley T; McGoldrick, Jessica; Goessling, Wolfram; Khalili, Hamed; Simon, Tracey G.
Affiliation
  • Sherman MS; Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
  • Challa PK; Brigham Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
  • Przybyszewski EM; Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
  • Wilechansky RM; Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
  • Uche-Anya EN; Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
  • Ott AT; Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
  • McGoldrick J; Division of Gastroenterology and Hepatology, Department of Medicine, Stanford University, Stanford, California, USA.
  • Goessling W; Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
  • Khalili H; Division of Gastroenterology, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
  • Simon TG; Brigham Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
Hepatol Commun ; 8(4)2024 Apr 01.
Article in En | MEDLINE | ID: mdl-38551386
ABSTRACT

BACKGROUND:

Histopathology remains the gold standard for diagnosing and staging metabolic dysfunction-associated steatotic liver disease (MASLD). The feasibility of studying MASLD progression in electronic medical records based on histological features is limited by the free-text nature of pathology reports. Here we introduce a natural language processing (NLP) algorithm to automatically score MASLD histology features.

METHODS:

From the Mass General Brigham health care system electronic medical record, we identified all patients (1987-2021) with steatosis on index liver biopsy after excluding excess alcohol use and other etiologies of liver disease. An NLP algorithm was constructed in Python to detect steatosis, lobular inflammation, ballooning, and fibrosis stage from pathology free-text and manually validated in >1200 pathology reports. Patients were followed from the index biopsy to incident decompensated liver disease accounting for covariates.

RESULTS:

The NLP algorithm demonstrated positive and negative predictive values from 93.5% to 100% for all histologic concepts. Among 3134 patients with biopsy-confirmed MASLD followed for 20,604 person-years, rates of the composite endpoint increased monotonically with worsening index fibrosis stage (p for linear trend <0.005). Compared to simple steatosis (incidence rate, 15.06/1000 person-years), the multivariable-adjusted HRs for cirrhosis were 1.04 (0.72-1.5) for metabolic dysfunction-associated steatohepatitis (MASH)/F0, 1.19 (0.92-1.54) for MASH/F1, 1.89 (1.41-2.52) for MASH/F2, and 4.21 (3.26-5.43) for MASH/F3.

CONCLUSIONS:

The NLP algorithm accurately scores histological features of MASLD from pathology free-text. This algorithm enabled the construction of a large and high-quality MASLD cohort across a multihospital health care system and disclosed an accelerating risk for cirrhosis based on the index MASLD fibrosis stage.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Natural Language Processing / Fatty Liver Limits: Humans Language: En Journal: Hepatol Commun Year: 2024 Document type: Article Affiliation country: Estados Unidos Country of publication: Estados Unidos

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Natural Language Processing / Fatty Liver Limits: Humans Language: En Journal: Hepatol Commun Year: 2024 Document type: Article Affiliation country: Estados Unidos Country of publication: Estados Unidos