RESUMEN
BACKGROUND AND PURPOSE: A higher radiation dose to the heart is known to be associated with increased mortality in non-small cell lung cancer (NSCLC) patients. It is however unknown what the contribution of the heart dose is when other risk factors for mortality are also accounted for. MATERIALS AND METHODS: We constructed and externally validated prediction models of mortality after definitive chemoradiotherapy for NSCLC. Models were developed in 145 stage I-IIIB NSCLC patients. Clinical (performance status, age, gross tumour volume (GTV) combining primary tumour and involved lymph nodes, current smoker) and dosimetric (mean lung (MLD) and heart (MHD) dose) variables were considered. Multivariable logistic regression models predicting 12 and 24â¯month mortality were built in 5-fold cross-validation. Discrimination and calibration was assessed in 3 external validation datasets containing 878 (via distributed learning), 127 and 96 NSCLC patients. RESULTS: The best discriminating prediction models combined GTV, smoker and/or MHD: bootstrapping AUC (95% CI) of 0.74 (0.66-0.78) and 0.69 (0.55-0.74) at 12 and 24â¯months. At external validation, the 24â¯month mortality GTV-smoker-MHD model robustly showed moderate discrimination (AUCâ¯=â¯0.61-0.64 before and 0.64-0.65 after model update) with limited 0.01-0.07 improvement over a GTV-only model, and calibration slope (0.64-0.65). This model can identify patients for whom a MHD reduction may be useful (e.g. PPVâ¯=â¯77%, NPVâ¯=â¯52% (60% cut-off)). CONCLUSIONS: Tumour volume is strongly related to mortality risk in the first 2â¯years after chemoradiotherapy for NSCLC. Modelling indicates that efforts to reduce cardiac dose may be relevant for small tumours and that smoking has an important negative association with survival.
Asunto(s)
Carcinoma de Pulmón de Células no Pequeñas , Neoplasias Pulmonares , Carcinoma de Pulmón de Células no Pequeñas/terapia , Quimioradioterapia/efectos adversos , Humanos , Neoplasias Pulmonares/terapia , Factores de Riesgo , Carga TumoralRESUMEN
BACKGROUND AND PURPOSE: Access to healthcare data is indispensable for scientific progress and innovation. Sharing healthcare data is time-consuming and notoriously difficult due to privacy and regulatory concerns. The Personal Health Train (PHT) provides a privacy-by-design infrastructure connecting FAIR (Findable, Accessible, Interoperable, Reusable) data sources and allows distributed data analysis and machine learning. Patient data never leaves a healthcare institute. MATERIALS AND METHODS: Lung cancer patient-specific databases (tumor staging and post-treatment survival information) of oncology departments were translated according to a FAIR data model and stored locally in a graph database. Software was installed locally to enable deployment of distributed machine learning algorithms via a central server. Algorithms (MATLAB, code and documentation publicly available) are patient privacy-preserving as only summary statistics and regression coefficients are exchanged with the central server. A logistic regression model to predict post-treatment two-year survival was trained and evaluated by receiver operating characteristic curves (ROC), root mean square prediction error (RMSE) and calibration plots. RESULTS: In 4 months, we connected databases with 23 203 patient cases across 8 healthcare institutes in 5 countries (Amsterdam, Cardiff, Maastricht, Manchester, Nijmegen, Rome, Rotterdam, Shanghai) using the PHT. Summary statistics were computed across databases. A distributed logistic regression model predicting post-treatment two-year survival was trained on 14 810 patients treated between 1978 and 2011 and validated on 8 393 patients treated between 2012 and 2015. CONCLUSION: The PHT infrastructure demonstrably overcomes patient privacy barriers to healthcare data sharing and enables fast data analyses across multiple institutes from different countries with different regulatory regimens. This infrastructure promotes global evidence-based medicine while prioritizing patient privacy.
Asunto(s)
Neoplasias Pulmonares , Aprendizaje Automático , Algoritmos , China , Humanos , PrivacidadRESUMEN
BACKGROUND: Tumor hypoxia increases resistance to radiotherapy and systemic therapy. Our aim was to develop and validate a disease-agnostic and disease-specific CT (+FDG-PET) based radiomics hypoxia classification signature. MATERIAL AND METHODS: A total of 808 patients with imaging data were included: N = 100 training/N = 183 external validation cases for a disease-agnostic CT hypoxia classification signature, N = 76 training/N = 39 validation cases for the H&N CT signature and N = 62 training/N = 36 validation cases for the Lung CT signature. The primary gross tumor volumes (GTV) were manually defined by experts on CT. In order to dichotomize between hypoxic/well-oxygenated tumors a threshold of 20% was used for the [18F]-HX4-derived hypoxic fractions (HF). A random forest (RF)-based machine-learning classifier/regressor was trained to classify patients as hypoxia-positive/ negative based on radiomic features. RESULTS: A 11 feature "disease-agnostic CT model" reached AUC's of respectively 0.78 (95% confidence interval [CI], 0.62-0.94), 0.82 (95% CI, 0.67-0.96) and 0.78 (95% CI, 0.67-0.89) in three external validation datasets. A "disease-agnostic FDG-PET model" reached an AUC of 0.73 (0.95% CI, 0.49-0.97) in validation by combining 5 features. The highest "lung-specific CT model" reached an AUC of 0.80 (0.95% CI, 0.65-0.95) in validation with 4 CT features, while the "H&N-specific CT model" reached an AUC of 0.84 (0.95% CI, 0.64-1.00) in validation with 15 CT features. A tumor volume-alone model was unable to significantly classify patients as hypoxia-positive/ negative. A significant survival split (P = 0.037) was found between CT-classified hypoxia strata in an external H&N cohort (n = 517), while 117 significant hypoxia gene-CT signature feature associations were found in an external lung cohort (n = 80). CONCLUSION: The disease-specific radiomics signatures perform better than the disease agnostic ones. By identifying hypoxic patients our signatures have the potential to enrich interventional hypoxia-targeting trials.
Asunto(s)
Fluorodesoxiglucosa F18 , Hipoxia Tumoral , Humanos , Pulmón , Tomografía de Emisión de Positrones , Tomografía Computarizada por Rayos XRESUMEN
Prediction modelling with radiomics is a rapidly developing research topic that requires access to vast amounts of imaging data. Methods that work on decentralized data are urgently needed, because of concerns about patient privacy. Previously published computed tomography medical image sets with gross tumour volume (GTV) outlines for non-small cell lung cancer have been updated with extended follow-up. In a previous study, these were referred to as Lung1 (n = 421) and Lung2 (n = 221). The Lung1 dataset is made publicly accessible via The Cancer Imaging Archive (TCIA; https://www.cancerimagingarchive.net ). We performed a decentralized multi-centre study to develop a radiomic signature (hereafter "ZS2019") in one institution and validated the performance in an independent institution, without the need for data exchange and compared this to an analysis where all data was centralized. The performance of ZS2019 for 2-year overall survival validated in distributed radiomics was not statistically different from the centralized validation (AUC 0.61 vs 0.61; p = 0.52). Although slightly different in terms of data and methods, no statistically significant difference in performance was observed between the new signature and previous work (c-index 0.58 vs 0.65; p = 0.37). Our objective was not the development of a new signature with the best performance, but to suggest an approach for distributed radiomics. Therefore, we used a similar method as an earlier study. We foresee that the Lung1 dataset can be further re-used for testing radiomic models and investigating feature reproducibility.
Asunto(s)
Carcinoma de Pulmón de Células no Pequeñas/diagnóstico por imagen , Neoplasias Pulmonares/diagnóstico por imagen , Conjuntos de Datos como Asunto , Humanos , Tomografía Computarizada por Rayos XRESUMEN
PURPOSE: To investigate the feasibility of in vivo dosimetry using microMOSFET dosimeters in patients treated with brachytherapy using two types of gynecological applicators. METHODS AND MATERIALS: In this study, a microMOSFET was placed in an empty needle of an Utrecht Interstitial Fletcher applicator or MUPIT (Martinez Universal Perineal Interstitial Template) applicator for independent verification of treatment delivery. Measurements were performed in 10 patients, with one to three microMOSFETs per applicator and repeated for one to four fractions, resulting in 50 in vivo measurements. Phantom measurements were used to determine characteristics of the microMOSFETs. RESULTS: Phantom measurements showed a linear relationship between dose and microMOSFET threshold voltage, and a calibration coefficient (mV/cGy) was determined. Reproducibility of repeated 50 cGy irradiations was 2% (1 standard deviation). Distance and angle dependencies were measured and correction factors were determined. Subsequently, three microMOSFETs were placed in a phantom to measure a validation plan. The difference between predicted and measured dose was less than the measurement uncertainty (±9%, 2 standard deviations). In vivo measurements were corrected for distance and angle dependencies. Differences between predicted and measured dose in the patients were smaller than the measurement uncertainty for the majority of the measurements. CONCLUSIONS: In vivo dosimetry using microMOSFETs in MUPIT and Utrecht Interstitial Fletcher applicators has proved to be feasible. Reimaging should be performed after detection of differences larger than 10% between predicted and measured dose to verify the applicator configuration. Movement of the applicator relative to the target or organs at risk is undetectable with this method.
Asunto(s)
Braquiterapia/instrumentación , Neoplasias de los Genitales Femeninos/radioterapia , Dosimetría in Vivo , Dosímetros de Radiación , Braquiterapia/métodos , Calibración , Estudios de Factibilidad , Femenino , Humanos , Fantasmas de Imagen , Dosificación Radioterapéutica , Reproducibilidad de los ResultadosRESUMEN
BACKGROUND AND PURPOSE: We externally validated a previously established multivariable normal-tissue complication probability (NTCP) model for Grade ≥2 acute esophageal toxicity (AET) after intensity-modulated (chemo-)radiotherapy or volumetric-modulated arc therapy for locally advanced non-small cell lung cancer. MATERIALS AND METHODS: A total of 603 patients from five cohorts (A-E) within four different Dutch institutes were included. Using the NTCP model, containing predictors concurrent chemoradiotherapy, mean esophageal dose, gender and clinical tumor stage, the risk of Grade ≥2 AET was estimated per patient and model discrimination and (re)calibration performance were evaluated. RESULTS: Four validation cohorts (A, B, D, E) experienced higher incidence of Grade ≥2 AET compared to the training cohort (49.3-70.2% vs 35.6%; borderline significant for one cohort, highly significant for three cohorts). Cohort C experienced lower Grade ≥2 AET incidence (21.7%, pâ¯<â¯0.001). For three cohorts (A-C), discriminative performance was similar to the training cohort (area under the curve (AUC) 0.81-0.89 vs 0.84). In the two remaining cohorts (D-E) the model showed poor discriminative power (AUC 0.64 and 0.63). Reasonable calibration performance was observed in two cohorts (A-B), and recalibration further improved performance in all three cohorts with good discrimination (A-C). Recalibration for the two poorly discriminating cohorts (D-E) did not improve performance. CONCLUSIONS: The NTCP model for AET prediction was successfully validated in three out of five patient cohorts (AUC ≥0.80). The model did not perform well in two cohorts, which included patients receiving substantially different treatment. Before applying the model in clinical practice, validation of discrimination and (re)calibration performance in a local cohort is recommended.
Asunto(s)
Carcinoma de Pulmón de Células no Pequeñas/terapia , Quimioradioterapia/efectos adversos , Esófago/efectos de la radiación , Neoplasias Pulmonares/terapia , Traumatismos por Radiación/etiología , Adulto , Anciano , Área Bajo la Curva , Carcinoma de Pulmón de Células no Pequeñas/patología , Estudios de Cohortes , Femenino , Humanos , Neoplasias Pulmonares/patología , Masculino , Persona de Mediana Edad , Probabilidad , Radioterapia de Intensidad Modulada/efectos adversosRESUMEN
PURPOSE: Machine learning classification algorithms (classifiers) for prediction of treatment response are becoming more popular in radiotherapy literature. General Machine learning literature provides evidence in favor of some classifier families (random forest, support vector machine, gradient boosting) in terms of classification performance. The purpose of this study is to compare such classifiers specifically for (chemo)radiotherapy datasets and to estimate their average discriminative performance for radiation treatment outcome prediction. METHODS: We collected 12 datasets (3496 patients) from prior studies on post-(chemo)radiotherapy toxicity, survival, or tumor control with clinical, dosimetric, or blood biomarker features from multiple institutions and for different tumor sites, that is, (non-)small-cell lung cancer, head and neck cancer, and meningioma. Six common classification algorithms with built-in feature selection (decision tree, random forest, neural network, support vector machine, elastic net logistic regression, LogitBoost) were applied on each dataset using the popular open-source R package caret. The R code and documentation for the analysis are available online (https://github.com/timodeist/classifier_selection_code). All classifiers were run on each dataset in a 100-repeated nested fivefold cross-validation with hyperparameter tuning. Performance metrics (AUC, calibration slope and intercept, accuracy, Cohen's kappa, and Brier score) were computed. We ranked classifiers by AUC to determine which classifier is likely to also perform well in future studies. We simulated the benefit for potential investigators to select a certain classifier for a new dataset based on our study (pre-selection based on other datasets) or estimating the best classifier for a dataset (set-specific selection based on information from the new dataset) compared with uninformed classifier selection (random selection). RESULTS: Random forest (best in 6/12 datasets) and elastic net logistic regression (best in 4/12 datasets) showed the overall best discrimination, but there was no single best classifier across datasets. Both classifiers had a median AUC rank of 2. Preselection and set-specific selection yielded a significant average AUC improvement of 0.02 and 0.02 over random selection with an average AUC rank improvement of 0.42 and 0.66, respectively. CONCLUSION: Random forest and elastic net logistic regression yield higher discriminative performance in (chemo)radiotherapy outcome and toxicity prediction than other studied classifiers. Thus, one of these two classifiers should be the first choice for investigators when building classification models or to benchmark one's own modeling results against. Our results also show that an informed preselection of classifiers based on existing datasets can improve discrimination over random selection.
Asunto(s)
Quimioradioterapia/métodos , Aprendizaje Automático , Neoplasias/diagnóstico , Neoplasias/radioterapia , Área Bajo la Curva , Quimioradioterapia/efectos adversos , Árboles de Decisión , Humanos , Modelos Logísticos , Neoplasias/mortalidad , Redes Neurales de la Computación , Pronóstico , Programas InformáticosRESUMEN
PURPOSE: To evaluate whether inclusion of incidental radiation dose to the cardiac atria and ventricles improves the prediction of grade ≥3 radiation pneumonitis (RP) in advanced-stage non-small cell lung cancer (AS-NSCLC) patients treated with intensity modulated radiation therapy (IMRT) or volumetric modulated arc therapy (VMAT). METHODS AND MATERIALS: Using a bootstrap modeling approach, clinical parameters and dose-volume histogram (DVH) parameters of lungs and heart (assessing atria and ventricles separately and combined) were evaluated for RP prediction in 188 AS-NSCLC patients. RESULTS: After a median follow-up of 18.4 months, 26 patients (13.8%) developed RP. Only the median mean lung dose (MLD) differed between groups (15.3 Gy vs 13.7 Gy for the RP and non-RP group, respectively; P=.004). The MLD showed the highest Spearman correlation coefficient (Rs) for RP (Rs = 0.21; P<.01). Most Rs of the lung DVH parameters exceeded those of the heart DVH parameters. After predictive modeling using a bootstrap procedure, the MLD was always included in the predictive model for grade ≥3 RP, whereas the heart DVH parameters were seldom included in the model. CONCLUSION: Incidental dose to the cardiac atria and ventricles did not improve RP risk prediction in our cohort of 188 AS-NSCLC patients treated with IMRT or VMAT.