Your browser doesn't support javascript.
loading
Developing a Data and Analytics Platform to Enable a Breast Cancer Learning Health System at a Regional Cancer Center.
Petch, Jeremy; Kempainnen, Joel; Pettengell, Christopher; Aviv, Steven; Butler, Bill; Pond, Greg; Saha, Ashirbani; Bogach, Jessica; Allard-Coutu, Alexandria; Sztur, Peter; Ranisau, Jonathan; Levine, Mark.
Afiliación
  • Petch J; Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, Canada.
  • Kempainnen J; Institute for Health Policy Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, Canada.
  • Pettengell C; Division of Cardiology, Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada.
  • Aviv S; Population Health Research Institute, Hamilton Health Sciences, Hamilton, Canada.
  • Butler B; Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, Canada.
  • Pond G; Pentavere Research Group, Toronto, Canada.
  • Saha A; Pentavere Research Group, Toronto, Canada.
  • Bogach J; Hamilton Health Sciences, Hamilton, Canada.
  • Allard-Coutu A; Escarpment Cancer Research Institute, Hamilton Health Sciences, Hamilton, Canada.
  • Sztur P; Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, Canada.
  • Ranisau J; Escarpment Cancer Research Institute, Hamilton Health Sciences, Hamilton, Canada.
  • Levine M; Department of Oncology, Faculty of Health Sciences, McMaster University, Hamilton, Canada.
JCO Clin Cancer Inform ; 7: e2200182, 2023 03.
Article en En | MEDLINE | ID: mdl-37001040
ABSTRACT

PURPOSE:

This study documents the creation of automated, longitudinal, and prospective data and analytics platform for breast cancer at a regional cancer center. This platform combines principles of data warehousing with natural language processing (NLP) to provide the integrated, timely, meaningful, high-quality, and actionable data required to establish a learning health system.

METHODS:

Data from six hospital information systems and one external data source were integrated on a nightly basis by automated extract/transform/load jobs. Free-text clinical documentation was processed using a commercial NLP engine.

RESULTS:

The platform contains 141 data elements of 7,019 patients with newly diagnosed breast cancer who received care at our regional cancer center from January 1, 2014, to June 3, 2022. Daily updating of the database takes an average of 56 minutes. Evaluation of the tuning of NLP jobs found overall high performance, with an F1 of 1.0 for 19 variables, with a further 16 variables with an F1 of > 0.95.

CONCLUSION:

This study describes how data warehousing combined with NLP can be used to create a prospective data and analytics platform to enable a learning health system. Although upfront time investment required to create the platform was considerable, now that it has been developed, daily data processing is completed automatically in less than an hour.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Neoplasias de la Mama / Aprendizaje del Sistema de Salud Tipo de estudio: Diagnostic_studies / Observational_studies / Risk_factors_studies Límite: Female / Humans Idioma: En Revista: JCO Clin Cancer Inform Año: 2023 Tipo del documento: Article País de afiliación: Canadá

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Neoplasias de la Mama / Aprendizaje del Sistema de Salud Tipo de estudio: Diagnostic_studies / Observational_studies / Risk_factors_studies Límite: Female / Humans Idioma: En Revista: JCO Clin Cancer Inform Año: 2023 Tipo del documento: Article País de afiliación: Canadá