Your browser doesn't support javascript.
loading
Comparison of orthogonal NLP methods for clinical phenotyping and assessment of bone scan utilization among prostate cancer patients.
Coquet, Jean; Bozkurt, Selen; Kan, Kathleen M; Ferrari, Michelle K; Blayney, Douglas W; Brooks, James D; Hernandez-Boussard, Tina.
Afiliação
  • Coquet J; Department of Medicine, Stanford University, Stanford, CA, USA.
  • Bozkurt S; Department of Medicine, Stanford University, Stanford, CA, USA; Department of Biomedical Data Science, Stanford University, Stanford, USA.
  • Kan KM; Department of Urology, Stanford University School of Medicine, Stanford, USA.
  • Ferrari MK; Department of Urology, Stanford University School of Medicine, Stanford, USA.
  • Blayney DW; Department of Medicine, Stanford University, Stanford, CA, USA; Stanford Cancer Institute, Stanford University School of Medicine, Stanford, USA.
  • Brooks JD; Department of Urology, Stanford University School of Medicine, Stanford, USA; Stanford Cancer Institute, Stanford University School of Medicine, Stanford, USA.
  • Hernandez-Boussard T; Department of Medicine, Stanford University, Stanford, CA, USA; Department of Biomedical Data Science, Stanford University, Stanford, USA; Department of Surgery, Stanford University School of Medicine, Stanford, USA. Electronic address: boussard@stanford.edu.
J Biomed Inform ; 94: 103184, 2019 06.
Article em En | MEDLINE | ID: mdl-31014980
OBJECTIVE: Clinical care guidelines recommend that newly diagnosed prostate cancer patients at high risk for metastatic spread receive a bone scan prior to treatment and that low risk patients not receive it. The objective was to develop an automated pipeline to interrogate heterogeneous data to evaluate the use of bone scans using a two different Natural Language Processing (NLP) approaches. MATERIALS AND METHODS: Our cohort was divided into risk groups based on Electronic Health Records (EHR). Information on bone scan utilization was identified in both structured data and free text from clinical notes. Our pipeline annotated sentences with a combination of a rule-based method using the ConText algorithm (a generalization of NegEx) and a Convolutional Neural Network (CNN) method using word2vec to produce word embeddings. RESULTS: A total of 5500 patients and 369,764 notes were included in the study. A total of 39% of patients were high-risk and 73% of these received a bone scan; of the 18% low risk patients, 10% received one. The accuracy of CNN model outperformed the rule-based model one (F-measure = 0.918 and 0.897 respectively). We demonstrate a combination of both models could maximize precision or recall, based on the study question. CONCLUSION: Using structured data, we accurately classified patients' cancer risk group, identified bone scan documentation with two NLP methods, and evaluated guideline adherence. Our pipeline can be used to provide concrete feedback to clinicians and guide treatment decisions.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fenótipo / Neoplasias da Próstata / Neoplasias Ósseas / Processamento de Linguagem Natural Tipo de estudo: Etiology_studies / Guideline / Prognostic_studies / Risk_factors_studies Limite: Humans / Male Idioma: En Revista: J Biomed Inform Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fenótipo / Neoplasias da Próstata / Neoplasias Ósseas / Processamento de Linguagem Natural Tipo de estudo: Etiology_studies / Guideline / Prognostic_studies / Risk_factors_studies Limite: Humans / Male Idioma: En Revista: J Biomed Inform Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos