Your browser doesn't support javascript.
loading
LATTE: Label-efficient incident phenotyping from longitudinal electronic health records.
Wen, Jun; Hou, Jue; Bonzel, Clara-Lea; Zhao, Yihan; Castro, Victor M; Gainer, Vivian S; Weisenfeld, Dana; Cai, Tianrun; Ho, Yuk-Lam; Panickan, Vidul A; Costa, Lauren; Hong, Chuan; Gaziano, J Michael; Liao, Katherine P; Lu, Junwei; Cho, Kelly; Cai, Tianxi.
Affiliation
  • Wen J; Harvard Medical School, Boston, MA, USA.
  • Hou J; VA Boston Healthcare System, Boston, MA, USA.
  • Bonzel CL; University of Minnesota, Minneapolis, MN, USA.
  • Zhao Y; Harvard Medical School, Boston, MA, USA.
  • Castro VM; VA Boston Healthcare System, Boston, MA, USA.
  • Gainer VS; Harvard University, Cambridge, MA, USA.
  • Weisenfeld D; Mass General Brigham, Boston, MA, USA.
  • Cai T; Mass General Brigham, Boston, MA, USA.
  • Ho YL; Brigham and Women's Hospital, Boston, MA, USA.
  • Panickan VA; VA Boston Healthcare System, Boston, MA, USA.
  • Costa L; Mass General Brigham, Boston, MA, USA.
  • Hong C; VA Boston Healthcare System, Boston, MA, USA.
  • Gaziano JM; Harvard Medical School, Boston, MA, USA.
  • Liao KP; VA Boston Healthcare System, Boston, MA, USA.
  • Lu J; VA Boston Healthcare System, Boston, MA, USA.
  • Cho K; Duke University, Durham, NC, USA.
  • Cai T; Harvard Medical School, Boston, MA, USA.
Patterns (N Y) ; 5(1): 100906, 2024 Jan 12.
Article in En | MEDLINE | ID: mdl-38264714
ABSTRACT
Electronic health record (EHR) data are increasingly used to support real-world evidence studies but are limited by the lack of precise timings of clinical events. Here, we propose a label-efficient incident phenotyping (LATTE) algorithm to accurately annotate the timing of clinical events from longitudinal EHR data. By leveraging the pre-trained semantic embeddings, LATTE selects predictive features and compresses their information into longitudinal visit embeddings through visit attention learning. LATTE models the sequential dependency between the target event and visit embeddings to derive the timings. To improve label efficiency, LATTE constructs longitudinal silver-standard labels from unlabeled patients to perform semi-supervised training. LATTE is evaluated on the onset of type 2 diabetes, heart failure, and relapses of multiple sclerosis. LATTE consistently achieves substantial improvements over benchmark methods while providing high prediction interpretability. The event timings are shown to help discover risk factors of heart failure among patients with rheumatoid arthritis.

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Patterns (N Y) Year: 2024 Document type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Patterns (N Y) Year: 2024 Document type: Article Affiliation country: United States