RESUMO
PURPOSE: To Demonstrate the clinical validation of a machine learning (ML) model for applicator and interstitial needle prediction in gynecologic brachytherapy through a prospective clinical study in a single institution. METHODS: The study included cervical cancer patients receiving high-dose-rate brachytherapy using intracavitary (IC) or hybrid interstitial (IC/IS) applicators. For each patient, the primary radiation oncologist contoured the high-risk clinical target volume on a pre-brachytherapy MRI, indicated the approximate applicator location, and made a clinical determination of the first fraction applicator. A pre-trained ML model predicted the applicator and IC/IS needle arrangement using tumor geometry. Following the first fraction, ML and radiation oncologist predictions were compared and a replanning study determined the applicator providing optimal organ-at-risk (OAR) dosimetry. The ML-predicted applicator and needle arrangement and the clinical determination were compared to this dosimetric ground truth. RESULTS: Ten patients were accrued from December 2020 to October 2022. Compared to the dosimetrically optimal applicator, both the radiation oncologist and ML had an accuracy of 70%. ML demonstrated better identification of patients requiring IC/IS applicators and provided balanced IC and IC/IS predictions. The needle selection model achieved an average accuracy of 82.5%. ML-predicted needle arrangements matched or improved plan quality when compared to clinically selected arrangements. Overall, ML predictions led to an average total improvement of 2.0 Gy to OAR doses over three treatment fractions when compared to clinical predictions. CONCLUSION: In the context of a single institution study, the presented ML model demonstrates valuable decision-support for the applicator and needle selection process with the potential to provide improved dosimetry. Future work will include a multi-center study to assess generalizability.
Assuntos
Braquiterapia , Aprendizado de Máquina , Dosagem Radioterapêutica , Neoplasias do Colo do Útero , Humanos , Braquiterapia/instrumentação , Braquiterapia/métodos , Feminino , Neoplasias do Colo do Útero/radioterapia , Neoplasias do Colo do Útero/diagnóstico por imagem , Estudos Prospectivos , Agulhas , Planejamento da Radioterapia Assistida por Computador/métodos , Pessoa de Meia-Idade , Órgãos em Risco/efeitos da radiação , IdosoRESUMO
PURPOSE: The purpose of this analysis is to predict worsening post-treatment normal tissue toxicity in patients undergoing accelerated partial breast irradiation (APBI) therapy and to quantitatively identify which diagnostic, anatomical, and dosimetric features are contributing to these outcomes. METHODS: A retrospective study of APBI treatments was performed using 32 features pertaining to various stages of the patient's treatment journey. These features were used to inform and construct a Bayesian network (BN) based on both statistical analysis of feature distributions and relative clinical importance. The target feature for prediction was defined as a measurable worsening of telangiectasia, subcutaneous tissue induration, or fibrosis when compared against the observed baseline. Parameter learning for the network was performed using data from the 299 patients included in the ACCEL trial and predictive performance was measured. Feature importance for the BN was quantified using a novel information-theoretic approach. RESULTS: Cross-validated performance of the BN for predicting toxicity was consistently higher when compared against conventional machine learning (ML) techniques. The measured BN receiver operating characteristic area under the curve was 0.960 ± $\,{\pm}\,$ 0.013 against the best ML result of 0.942 ± $\,{\pm}\,$ 0.021 using five-fold cross-validation with separate test data across 100 trials. The volume of the clinical target volume, gross target volume, and baseline toxicity measurements were found to have the highest feature importance and mutual dependence with normal tissue toxicity in the network, representing the strongest contribution to patient outcomes. CONCLUSIONS: The BN outperformed conventional ML techniques in predicting tissue toxicity outcomes and provided deeper insight into which features are contributing to these outcomes.
Assuntos
Neoplasias da Mama , Mama , Teorema de Bayes , Mama/efeitos da radiação , Neoplasias da Mama/radioterapia , Feminino , Humanos , Aprendizado de Máquina , Curva ROC , Estudos RetrospectivosRESUMO
[This corrects the article DOI: 10.3389/fonc.2021.611437.].
RESUMO
Purpose: To develop and validate a preliminary machine learning (ML) model aiding in the selection of intracavitary (IC) versus hybrid interstitial (IS) applicators for high-dose-rate (HDR) cervical brachytherapy. Methods: From a dataset of 233 treatments using IC or IS applicators, a set of geometric features of the structure set were extracted, including the volumes of OARs (bladder, rectum, sigmoid colon) and HR-CTV, proximity of OARs to the HR-CTV, mean and maximum lateral and vertical HR-CTV extent, and offset of the HR-CTV centre-of-mass from the applicator tandem axis. Feature selection using an ANOVA F-test and mutual information removed uninformative features from this set. Twelve classification algorithms were trained and tested over 100 iterations to determine the highest performing individual models through nested 5-fold cross-validation. Three models with the highest accuracy were combined using soft voting to form the final model. This model was trained and tested over 1,000 iterations, during which the relative importance of each feature in the applicator selection process was determined. Results: Feature selection indicated that the mean and maximum lateral and vertical extent, volume, and axis offset of the HR-CTV were the most informative features and were thus provided to the ML models. Relative feature importances indicated that the HR-CTV volume and mean lateral extent were most important for applicator selection. From the comparison of the individual classification algorithms, it was found that the highest performing algorithms were tree-based ensemble methods - AdaBoost Classifier (ABC), Gradient Boosting Classifier (GBC), and Random Forest Classifier (RFC). The accuracy of the individual models was compared to the voting model for 100 iterations (ABC = 91.6 ± 3.1%, GBC = 90.4 ± 4.1%, RFC = 89.5 ± 4.0%, Voting Model = 92.2 ± 1.8%) and the voting model was found to have superior accuracy. Over the final 1,000 evaluation iterations, the final voting model demonstrated a high predictive accuracy (91.5 ± 0.9%) and F1 Score (90.6 ± 1.1%). Conclusion: The presented model demonstrates high discriminative performance, highlighting the potential for utilization in informing applicator selection prospectively following further clinical validation.