Your browser doesn't support javascript.
loading
Machine learning and natural language processing (NLP) approach to predict early progression to first-line treatment in real-world hormone receptor-positive (HR+)/HER2-negative advanced breast cancer patients.
Ribelles, Nuria; Jerez, Jose M; Rodriguez-Brazzarola, Pablo; Jimenez, Begoña; Diaz-Redondo, Tamara; Mesa, Hector; Marquez, Antonia; Sanchez-Muñoz, Alfonso; Pajares, Bella; Carabantes, Francisco; Bermejo, Maria J; Villar, Ester; Dominguez-Recio, Maria E; Saez, Enrique; Galvez, Laura; Godoy, Ana; Franco, Leo; Ruiz-Medina, Sofia; Lopez, Irene; Alba, Emilio.
Afiliação
  • Ribelles N; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain. Electronic address: nuria.ribelles.sspa@juntadeandalucia.es.
  • Jerez JM; University of Málaga, Department of Languages and Computer Science, E.T.S.I. Computing, Málaga, Spain.
  • Rodriguez-Brazzarola P; University of Málaga, Department of Languages and Computer Science, E.T.S.I. Computing, Málaga, Spain.
  • Jimenez B; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Diaz-Redondo T; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Mesa H; University of Málaga, Department of Languages and Computer Science, E.T.S.I. Computing, Málaga, Spain.
  • Marquez A; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Sanchez-Muñoz A; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Pajares B; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Carabantes F; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Bermejo MJ; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Villar E; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Dominguez-Recio ME; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Saez E; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Galvez L; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Godoy A; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Franco L; University of Málaga, Department of Languages and Computer Science, E.T.S.I. Computing, Málaga, Spain.
  • Ruiz-Medina S; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Lopez I; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
  • Alba E; Medical Oncology Intercenter Unit, Regional and Virgen de la Victoria University Hospitals, IBIMA, Málaga, Spain.
Eur J Cancer ; 144: 224-231, 2021 02.
Article em En | MEDLINE | ID: mdl-33373867
ABSTRACT

BACKGROUND:

CDK4/6 inhibitors plus endocrine therapies are the current standard of care in the first-line treatment of HR+/HER2-negative metastatic breast cancer, but there are no well-established clinical or molecular predictive factors for patient response. In the era of personalised oncology, new approaches for developing predictive models of response are needed. MATERIALS AND

METHODS:

Data derived from the electronic health records (EHRs) of real-world patients with HR+/HER2-negative advanced breast cancer were used to develop predictive models for early and late progression to first-line treatment. Two machine learning approaches were used a classic approach using a data set of manually extracted features from reviewed (EHR) patients, and a second approach using natural language processing (NLP) of free-text clinical notes recorded during medical visits.

RESULTS:

Of the 610 patients included, there were 473 (77.5%) progressions to first-line treatment, of which 126 (20.6%) occurred within the first 6 months. There were 152 patients (24.9%) who showed no disease progression before 28 months from the onset of first-line treatment. The best predictive model for early progression using the manually extracted dataset achieved an area under the curve (AUC) of 0.734 (95% CI 0.687-0.782). Using the NLP free-text processing approach, the best model obtained an AUC of 0.758 (95% CI 0.714-0.800). The best model to predict long responders using manually extracted data obtained an AUC of 0.669 (95% CI 0.608-0.730). With NLP free-text processing, the best model attained an AUC of 0.752 (95% CI 0.705-0.799).

CONCLUSIONS:

Using machine learning methods, we developed predictive models for early and late progression to first-line treatment of HR+/HER2-negative metastatic breast cancer, also finding that NLP-based machine learning models are slightly better than predictive models based on manually obtained data.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Processamento de Linguagem Natural / Neoplasias da Mama / Protocolos de Quimioterapia Combinada Antineoplásica / Receptores de Progesterona / Receptores de Estrogênio / Receptor ErbB-2 / Aprendizado de Máquina Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Processamento de Linguagem Natural / Neoplasias da Mama / Protocolos de Quimioterapia Combinada Antineoplásica / Receptores de Progesterona / Receptores de Estrogênio / Receptor ErbB-2 / Aprendizado de Máquina Idioma: En Ano de publicação: 2021 Tipo de documento: Article