RESUMEN
BACKGROUND: The aim of this study was to develop a machine learning (ML) based model to accurately predict pathologic complete response (pCR) to neoadjuvant chemotherapy (NAC) using pretreatment clinical and pathological characteristics of electronic medical record (EMR) data in breast cancer (BC). METHODS: The EMR data from patients diagnosed with early and locally advanced BC and who received NAC followed by curative surgery were reviewed. A total of 16 clinical and pathological characteristics was selected to develop ML model. We practiced six ML models using default settings for multivariate analysis with extracted variables. RESULTS: In total, 2065 patients were included in this analysis. Overall, 30.6% (n = 632) of patients achieved pCR. Among six ML models, the LightGBM had the highest area under the curve (AUC) for pCR prediction. After hyper-parameter tuning with Bayesian optimization, AUC was 0.810. Performance of pCR prediction models in different histology-based subtypes was compared. The AUC was highest in HR+HER2- subgroup and lowest in HR-/HER2- subgroup (HR+/HER2- 0.841, HR+/HER2+ 0.716, HR-/HER2 0.753, HR-/HER2- 0.653). CONCLUSIONS: A ML based pCR prediction model using pre-treatment clinical and pathological characteristics provided useful information to predict pCR during NAC. This prediction model would help to determine treatment strategy in patients with BC planned NAC.
Asunto(s)
Neoplasias de la Mama , Terapia Neoadyuvante , Protocolos de Quimioterapia Combinada Antineoplásica/uso terapéutico , Área Bajo la Curva , Teorema de Bayes , Neoplasias de la Mama/tratamiento farmacológico , Quimioterapia Adyuvante , Femenino , Humanos , Aprendizaje Automático , Receptor ErbB-2/metabolismo , Resultado del TratamientoRESUMEN
The achievement of the pathologic complete response (pCR) has been considered a metric for the success of neoadjuvant chemotherapy (NAC) and a powerful surrogate indicator of the risk of recurrence and long-term survival. This study aimed to develop a multimodal deep learning model that combined clinical information and pretreatment MR images for predicting pCR to NAC in patients with breast cancer. The retrospective study cohort consisted of 536 patients with invasive breast cancer who underwent pre-operative NAC. We developed a deep learning model to fuse high-dimensional MR image features and the clinical information for the pretreatment prediction of pCR to NAC in breast cancer. The proposed deep learning model trained on all datasets as clinical information, T1-weighted subtraction images, and T2-weighted images shows better performance with area under the curve (AUC) of 0.888 as compared to the model using only clinical information (AUC = 0.827, P < 0.05). Our results demonstrate that the multimodal fusion approach using deep learning with both clinical information and MR images achieve higher prediction performance compared to the deep learning model without the fusion approach. Deep learning could integrate pretreatment MR images with clinical information to improve pCR prediction performance.