Your browser doesn't support javascript.
loading
A survey of word embeddings for clinical text.
Khattak, Faiza Khan; Jeblee, Serena; Pou-Prom, Chloé; Abdalla, Mohamed; Meaney, Christopher; Rudzicz, Frank.
Afiliação
  • Khattak FK; Department of Computer Science, University of Toronto, Toronto, Ontario, Canada; Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada; Li Ka Shing Knowledge Institute, St Michael's Hospital, Toronto, Ontario, Canada. Electronic address: faizakk@cs.toronto.edu.
  • Jeblee S; Department of Computer Science, University of Toronto, Toronto, Ontario, Canada; Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada. Electronic address: sjeblee@cs.toronto.edu.
  • Pou-Prom C; Department of Computer Science, University of Toronto, Toronto, Ontario, Canada; Li Ka Shing Knowledge Institute, St Michael's Hospital, Toronto, Ontario, Canada. Electronic address: poupromc@smh.ca.
  • Abdalla M; Department of Computer Science, University of Toronto, Toronto, Ontario, Canada; Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada. Electronic address: mohamed.abdalla@mail.utoronto.ca.
  • Meaney C; Department of Biostatistics, University of Toronto, Toronto, Ontario, Canada; Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada. Electronic address: christopher.meaney@utoronto.ca.
  • Rudzicz F; Department of Computer Science, University of Toronto, Toronto, Ontario, Canada; Vector Institute for Artificial Intelligence, Toronto, Ontario, Canada; Li Ka Shing Knowledge Institute, St Michael's Hospital, Toronto, Ontario, Canada; Surgical Safety Technologies Inc, Toronto, Ontario, Canada. Elect
J Biomed Inform ; 100S: 100057, 2019.
Article em En | MEDLINE | ID: mdl-34384583
ABSTRACT
Representing words as numerical vectors based on the contexts in which they appear has become the de facto method of analyzing text with machine learning. In this paper, we provide a guide for training these representations on clinical text data, using a survey of relevant research. Specifically, we discuss different types of word representations, clinical text corpora, available pre-trained clinical word vector embeddings, intrinsic and extrinsic evaluation, applications, and limitations of these approaches. This work can be used as a blueprint for clinicians and healthcare workers who may want to incorporate clinical text features in their own models and applications.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Qualitative_research Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Qualitative_research Idioma: En Ano de publicação: 2019 Tipo de documento: Article