Your browser doesn't support javascript.
loading
Web Application for the Automated Extraction of Diagnosis and Site From Pathology Reports for Keratinocyte Cancers.
Thompson, Bridie S; Hardy, Sam; Pandeya, Nirmala; Dusingize, Jean Claude; Green, Adele C; Millane, Athon; Bourke, Daniel; Grande, Ronald; Bean, Cameron D; Olsen, Catherine M; Whiteman, David C.
Afiliación
  • Thompson BS; Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.
  • Hardy S; Otso, Brisbane, Queensland, Australia.
  • Pandeya N; Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.
  • Dusingize JC; School of Public Health, University of Queensland, Brisbane, Queensland, Australia.
  • Green AC; Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.
  • Millane A; Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.
  • Bourke D; Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, United Kingdom.
  • Grande R; School of Public Health, University of Queensland, Brisbane, Queensland, Australia.
  • Bean CD; Max Kelsen, Brisbane, Queensland, Australia.
  • Olsen CM; Max Kelsen, Brisbane, Queensland, Australia.
  • Whiteman DC; Max Kelsen, Brisbane, Queensland, Australia.
JCO Clin Cancer Inform ; 4: 711-723, 2020 08.
Article en En | MEDLINE | ID: mdl-32755460
PURPOSE: Keratinocyte cancers are exceedingly common in high-risk populations, but accurate measures of incidence are seldom derived because the burden of manually reviewing pathology reports to extract relevant diagnostic information is excessive. Thus, we sought to develop supervised learning algorithms for classifying basal and squamous cell carcinomas and other diagnoses, as well as disease site, and incorporate these into a Web application capable of processing large numbers of pathology reports. METHODS: Participants in the QSkin study were recruited in 2011 and comprised men and women age 40-69 years at baseline (N = 43,794) who were randomly selected from a population register in Queensland, Australia. Histologic data were manually extracted from free-text pathology reports for participants with histologically confirmed keratinocyte cancers for whom a pathology report was available (n = 25,786 reports). This provided a training data set for the development of algorithms capable of deriving diagnosis and site from free-text pathology reports. We calculated agreement statistics between algorithm-derived classifications and 3 independent validation data sets of manually abstracted pathology reports. RESULTS: The agreement for classifications of basal cell carcinoma (κ = 0.97 and κ = 0.96) and squamous cell carcinoma (κ = 0.93 for both) was almost perfect in 2 validation data sets but was slightly lower for a third (κ = 0.82 and κ = 0.90, respectively). Agreement for total counts of specific diagnoses was also high (κ > 0.8). Similar levels of agreement between algorithm-derived and manually extracted data were observed for classifications of keratoacanthoma and intraepidermal carcinoma. CONCLUSION: Supervised learning methods were used to develop a Web application capable of accurately and rapidly classifying large numbers of pathology reports for keratinocyte cancers and related diagnoses. Such tools may provide the means to accurately measure subtype-specific skin cancer incidence.
Asunto(s)

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Neoplasias Cutáneas / Carcinoma Basocelular / Carcinoma de Células Escamosas Tipo de estudio: Diagnostic_studies / Incidence_studies / Prognostic_studies Idioma: En Revista: JCO Clin Cancer Inform Año: 2020 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Neoplasias Cutáneas / Carcinoma Basocelular / Carcinoma de Células Escamosas Tipo de estudio: Diagnostic_studies / Incidence_studies / Prognostic_studies Idioma: En Revista: JCO Clin Cancer Inform Año: 2020 Tipo del documento: Article