Search | VHL Search Portal

1.

Diagnostic performance of computed tomography features in detecting oropharyngeal squamous cell carcinoma extranodal extension.

Tran, Ngoc-Anh; Palotai, Miklos; Hanna, Glenn J; Schoenfeld, Jonathan D; Bay, Camden P; Rettig, Eleni M; Bunch, Paul M; Juliano, Amy F; Kelly, Hillary R; Suh, Chong Hyun; Zander, David A; Morales Pinzon, Alfredo; Kann, Benjamin H; Huang, Raymond Y; Haddad, Robert I; Guttmann, Charles R G; Guenette, Jeffrey P.

Eur Radiol ; 33(5): 3693-3703, 2023 May.

Article in English | MEDLINE | ID: mdl-36719493

ABSTRACT

OBJECTIVES: Accurate pre-treatment imaging determination of extranodal extension (ENE) could facilitate the selection of appropriate initial therapy for HPV-positive oropharyngeal squamous cell carcinoma (HPV + OPSCC). Small studies have associated 7 CT features with ENE with varied results and agreement. This article seeks to determine the replicable diagnostic performance of these CT features for ENE. METHODS: Five expert academic head/neck neuroradiologists from 5 institutions evaluated a single academic cancer center cohort of 75 consecutive HPV + OPSCC patients. In a web-based virtual laboratory for imaging research and education, the experts performed training on 7 published CT features associated with ENE and then independently identified the "single most (if any) suspicious" lymph node and presence/absence of each of the features. Inter-rater agreement was assessed using percentage agreement, Gwet's AC1, and Fleiss' kappa. Sensitivity, specificity, and positive and negative predictive values were calculated for each CT feature based on histologic ENE. RESULTS: All 5 raters identified the same node in 52 cases (69%). In 15 cases (20%), at least one rater selected a node and at least one rater did not. In 8 cases (11%), all raters selected a node, but at least one rater selected a different node. Percentage agreement and Gwet's AC1 coefficients were > 0.80 for lesion identification, matted/conglomerated nodes, and central necrosis. Fleiss' kappa was always < 0.6. CT sensitivity for histologically confirmed ENE ranged 0.18-0.94, specificity 0.41-0.88, PPV 0.26-0.36, and NPV 0.78-0.96. CONCLUSIONS: Previously described CT features appear to have poor reproducibility among expert head/neck neuroradiologists and poor predictive value for histologic ENE. KEY POINTS: • Previously described CT imaging features appear to have poor reproducibility among expert head and neck subspecialized neuroradiologists as well as poor predictive value for histologic ENE.
• Although it may still be appropriate to comment on the presence or absence of these CT features in imaging reports, the evidence indicates that caution is warranted when incorporating these features into clinical decision-making regarding the likelihood of ENE.
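
For readers less familiar with the four accuracy measures reported in this abstract, the following minimal Python sketch shows how sensitivity, specificity, PPV, and NPV follow from a 2x2 table of a CT feature against histologic ENE. The function name and the counts are hypothetical illustrations and are not taken from the study; the sketch also assumes that no denominator is zero.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Return sensitivity, specificity, PPV and NPV from 2x2 table counts.

    tp/fp/fn/tn are true/false positives/negatives of an imaging feature
    measured against the histologic reference standard.
    Assumes every denominator below is non-zero.
    """
    return {
        "sensitivity": tp / (tp + fn),  # feature present among ENE-positive cases
        "specificity": tn / (tn + fp),  # feature absent among ENE-negative cases
        "ppv": tp / (tp + fp),          # ENE-positive among feature-present cases
        "npv": tn / (tn + fn),          # ENE-negative among feature-absent cases
    }


# Hypothetical counts for illustration only (not data from the abstract).
metrics = diagnostic_metrics(tp=9, fp=21, fn=3, tn=47)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

With a low prevalence of ENE, a feature can combine moderate sensitivity with a low PPV, which is the pattern the abstract reports.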

Subject(s)

Head and Neck Neoplasms , Oropharyngeal Neoplasms , Papillomavirus Infections , Humans , Squamous Cell Carcinoma of Head and Neck/pathology , Oropharyngeal Neoplasms/diagnostic imaging , Oropharyngeal Neoplasms/pathology , Extranodal Extension , Papillomavirus Infections/complications , Reproducibility of Results , Tomography, X-Ray Computed/methods , Lymph Nodes/pathology , Head and Neck Neoplasms/pathology , Retrospective Studies , Neoplasm Staging

2.

PET/CT radiomics signature of human papilloma virus association in oropharyngeal squamous cell carcinoma.

Haider, Stefan P; Mahajan, Amit; Zeevi, Tal; Baumeister, Philipp; Reichel, Christoph; Sharaf, Kariem; Forghani, Reza; Kucukkaya, Ahmet S; Kann, Benjamin H; Judson, Benjamin L; Prasad, Manju L; Burtness, Barbara; Payabvash, Seyedmehdi.

Eur J Nucl Med Mol Imaging ; 47(13): 2978-2991, 2020 Dec.

Article in English | MEDLINE | ID: mdl-32399621

ABSTRACT

PURPOSE: To devise, validate, and externally test PET/CT radiomics signatures for human papillomavirus (HPV) association in primary tumors and metastatic cervical lymph nodes of oropharyngeal squamous cell carcinoma (OPSCC). METHODS: We analyzed 435 primary tumors (326 for training, 109 for validation) and 741 metastatic cervical lymph nodes (518 for training, 223 for validation) using FDG-PET and non-contrast CT from a multi-institutional and multi-national cohort. Utilizing 1037 radiomics features per imaging modality and per lesion, we trained, optimized, and independently validated machine-learning classifiers for prediction of HPV association in primary tumors, lymph nodes, and combined "virtual" volumes of interest (VOI). PET-based models were additionally validated in an external cohort. RESULTS: Single-modality PET and CT final models yielded similar classification performance without significant difference in independent validation; however, models combining PET and CT features outperformed single-modality PET- or CT-based models, with receiver operating characteristic area under the curve (AUC) of 0.78, and 0.77 for prediction of HPV association using primary tumor lesion features, in cross-validation and independent validation, respectively. In the external PET-only validation dataset, final models achieved an AUC of 0.83 for a virtual VOI combining primary tumor and lymph nodes, and an AUC of 0.73 for a virtual VOI combining all lymph nodes. CONCLUSION: We found that PET-based radiomics signatures yielded similar classification performance to CT-based models, with potential added value from combining PET- and CT-based radiomics for prediction of HPV status. While our results are promising, radiomics signatures may not yet substitute tissue sampling for clinical decision-making.

Subject(s)

Alphapapillomavirus , Head and Neck Neoplasms , Humans , Papillomaviridae , Positron Emission Tomography Computed Tomography , Retrospective Studies , Squamous Cell Carcinoma of Head and Neck

3.

Radiosurgery for Brain Metastases: Changing Practice Patterns and Disparities in the United States.

Kann, Benjamin H; Park, Henry S; Johnson, Skyler B; Chiang, Veronica L; Yu, James B.

J Natl Compr Canc Netw ; 15(12): 1494-1502, 2017 Dec.

Article in English | MEDLINE | ID: mdl-29223987

ABSTRACT

Background: Management of brain metastases typically includes radiotherapy (RT) with conventional fractionation and/or stereotactic radiosurgery (SRS). However, optimal indications and practice patterns for SRS remain unclear. We sought to evaluate national practice patterns for patients with metastatic disease receiving brain RT. Methods: We queried the National Cancer Data Base (NCDB) for patients diagnosed with metastatic non-small cell lung cancer, breast cancer, colorectal cancer, or melanoma from 2004 to 2014 who received upfront brain RT. Patients were divided into SRS and non-SRS cohorts. Patient and facility-level SRS predictors were analyzed with chi-square tests and logistic regression, and uptake trends were approximated with linear regression. Survival by diagnosis year was analyzed with the Kaplan-Meier method. Results: Of 75,953 patients, 12,250 (16.1%) received SRS and 63,703 (83.9%) received non-SRS. From 2004 to 2014, the proportion of patients receiving SRS annually increased (from 9.8% to 25.6%; P<.001), and the proportion of facilities using SRS annually increased (from 31.2% to 50.4%; P<.001). On multivariable analysis, nonwhite race, nonprivate insurance, and residence in lower-income or less-educated regions predicted lower SRS use (P<.05 for each). During the study period, SRS use increased disproportionally among patients with private insurance or who resided in higher-income or higher-educated regions. From 2004 to 2013, 1-year actuarial survival improved from 24.1% to 49.6% for patients selected for SRS and from 21.0% to 26.3% for non-SRS patients (P<.001). Conclusions: This NCDB analysis demonstrates steadily increasing, although modest overall, brain SRS use for patients with metastatic disease in the United States and identifies several progressively widening sociodemographic disparities in the adoption of SRS. 
Further research is needed to determine the reasons for these worsening disparities and their clinical implications on intracranial control, neurocognitive toxicities, quality of life, and survival for patients with brain metastases.

Subject(s)

Brain Neoplasms/radiotherapy , Brain Neoplasms/secondary , Adolescent , Adult , Aged , Aged, 80 and over , Brain Neoplasms/pathology , Female , Humans , Male , Middle Aged , Quality of Life , Radiosurgery/methods , Retrospective Studies , United States , Young Adult

4.

Artificial Intelligence in Oncology: Current Applications and Future Directions.

Kann, Benjamin H; Thompson, Reid; Thomas, Charles R; Dicker, Adam; Aneja, Sanjay.

Oncology (Williston Park) ; 33(2): 46-53, 2019 Feb 15.

Article in English | MEDLINE | ID: mdl-30784028

Subject(s)

Artificial Intelligence , Medical Oncology , Decision Making , Humans , Machine Learning , Neoplasms/diagnostic imaging , Neoplasms/pathology , Neural Networks, Computer

5.

Narrative review of stereotactic body radiation therapy combined with tyrosine kinase inhibitors for oligometastatic EGFR-mutated non-small cell lung cancer: present and future developments.

Zhao, Xinchen; Zhang, Shengwei; Sun, Xiaoyue; Lin, Yao; Capone, Luca; Ko, Eric C; Kann, Benjamin H; Li, Yi; Wang, Xiaoshan.

Transl Lung Cancer Res ; 13(6): 1383-1395, 2024 Jun 30.

Article in English | MEDLINE | ID: mdl-38973945

ABSTRACT

Background and Objective: A significant number of individuals diagnosed with non-small cell lung cancer (NSCLC) have distant metastases, and the concept of oligometastatic NSCLC has shown promise in achieving a cure. Stereotactic body radiation therapy (SBRT) is currently considered a viable treatment option for a limited number of tumor metastases. It has also been demonstrated that third-generation tyrosine kinase inhibitors (TKIs) are effective in extending the survival of patients with epidermal growth factor receptor (EGFR)-mutated NSCLC. Hence, the combination of SBRT with third-generation TKIs holds the potential to enhance treatment efficacy in patients with oligometastatic EGFR-mutated NSCLC. This review aimed to assess the possibility of combining SBRT with TKIs as an optimum treatment option for patients with oligometastatic EGFR-mutated NSCLC. Methods: We performed a narrative review by searching the PubMed, Web of Science, Elsevier and ClinicalTrials.gov databases for articles published in the English language from January 2009 to February 2024 and by reviewing the bibliographies of key references to identify important literature related to combining SBRT with third-generation TKIs in oligometastatic EGFR-mutated NSCLC. Key Content and Findings: This review aimed to assess the viability of combining SBRT and EGFR-TKIs in oligometastatic EGFR-mutated NSCLC. Current clinical trials suggest that the combined therapies have better progression free survival (PFS) when using SBRT as either concurrent with EGFR-TKIs or consolidated with EGFR-TKIs. Furthermore, research with third-generation EGFR-TKIs and SBRT combinations has demonstrated tolerable toxicity levels without significant additional adverse effects as compared to prior therapies. However, further clinical trials are required to establish its effectiveness. 
Conclusions: The combined approach of SBRT and TKIs can effectively impede the progression of oligometastatic NSCLC in patients harboring EGFR mutations and, most notably, can prolong progression-free survival rates. However, the feasibility of combining SBRT with third-generation TKIs in clinical trials remains unclear.

6.

Rates of Occult Invasive Disease in Patients With Biopsy-Proven Oral Cavity Squamous Cell Carcinoma in Situ.

Cooper, Dylan J; Ziemba, Yonah; Pereira, Lucio; Kann, Benjamin H; Parashar, Bhupesh; Miles, Brett A; Ghaly, Maged; Seetharamu, Nagashree; Frank, Douglas; Talcott, Wesley J.

JAMA Otolaryngol Head Neck Surg ; 150(2): 151-156, 2024 Feb 01.

Article in English | MEDLINE | ID: mdl-38175664

ABSTRACT

Importance: The likelihood that an oral cavity lesion harbors occult invasive disease after biopsy demonstrating carcinoma in situ (CIS) is unknown. While de-escalated treatment strategies may be appealing in the setting of CIS, knowing whether occult invasive disease may be present and its association with survival outcomes would lead to more informed management decisions. Objective: To evaluate rate of occult invasive disease and clinical outcomes in patients with oral cavity CIS. Design, Setting, and Participants: This was a retrospective population-based cohort study using the National Cancer Database and included adults with biopsy-proven oral cavity CIS as the first diagnosis of cancer between 2004 and 2020. Data were analyzed from October 10, 2022, to June 25, 2023. Exposures: Surgical resection vs no surgery. Main Outcomes and Measures: Analyses calculated the rate of occult invasive disease identified on resection of a biopsy-proven CIS lesion. Univariate and multivariate logistic regression with odds ratios and 95% CIs were used to identify significant demographic and clinical characteristics associated with risk of occult invasion (age, year of diagnosis, sex, race and ethnicity, oral cavity subsite, and comorbidity status). Kaplan-Meier curves for overall survival (OS) were calculated for both unresected and resected cohorts (stratified by presence of occult invasive disease). Results: A total of 1856 patients with oral cavity CIS were identified, with 122 who did not undergo surgery (median [range] age, 65 [26-90] years; 48 female individuals [39.3%] and 74 male individuals [60.7%]) and 1458 who underwent surgical resection and had documented pathology (median [range] age, 62 [21-90] years; 490 female individuals [33.6%] and 968 male individuals [66.4%]). Of the 1580 patients overall, 52 (3.3%) were Black; 39 (2.5%), Hispanic; 1365 (86.4%), White; and 124 (7.8%), other, not specified. 
Among those who proceeded with surgery with documented pathology, 408 patients (28.0%) were found to have occult invasive disease. Higher-risk features were present in 45 patients (11.0%) for final margin positivity, 16 patients (3.9%) for lymphovascular invasion, 13 patients (3.2%) for high-grade invasive disease, and 14 patients (3.4%) for nodal involvement. For those patients with occult disease, staging according to the American Joint Committee on Cancer's AJCC Cancer Staging Manual, eighth edition, was pT1 in 341 patients (83.6%), pT2 in 41 (10.0%), and pT3 or pT4 disease in 26 (6.4%). Factors associated with greater odds of occult invasive disease at resection were female sex, Black race, and alveolar ridge, vestibule, and retromolar subsite. With median 66-month follow-up, 5-year OS was 85.9% in patients who proceeded with surgical resection vs 59.7% in patients who did not undergo surgery (difference, 26.2%; 95% CI, 19.0%-33.4%). Conclusions and Relevance: This cohort study assessed the risk of concurrent occult invasion with biopsy-proven CIS of the oral cavity, demonstrating that 28.0% had invasive disease at resection. Reassuringly, even in the setting of occult invasion, high-risk disease features were rare, and 5-year OS was nearly 80% with resection. The findings support the practice of definitive resection if feasible following biopsy demonstrating oral cavity CIS.

Subject(s)

Carcinoma, Squamous Cell , Head and Neck Neoplasms , Mouth Neoplasms , Adult , Humans , Male , Female , Aged , Middle Aged , Squamous Cell Carcinoma of Head and Neck/pathology , Cohort Studies , Retrospective Studies , Neoplasm Staging , Carcinoma, Squamous Cell/pathology , Mouth Neoplasms/pathology , Biopsy , Head and Neck Neoplasms/pathology

7.

Widening the therapeutic window for central and ultra-central thoracic oligometastatic disease with stereotactic MR-guided adaptive radiation therapy (SMART).

Lee, Grace; Han, Zhaohui; Huynh, Elizabeth; Tjong, Michael C; Cagney, Daniel N; Huynh, Mai Anh; Kann, Benjamin H; Kozono, David; Leeman, Jonathan E; Singer, Lisa; Williams, Christopher L; Mak, Raymond H.

Radiother Oncol ; 190: 110034, 2024 Jan.

Article in English | MEDLINE | ID: mdl-38030080

ABSTRACT

BACKGROUND/PURPOSE: Central/ultra-central thoracic tumors are challenging to treat with stereotactic radiotherapy due to potential high-grade toxicity. Stereotactic MR-guided adaptive radiation therapy (SMART) may improve the therapeutic window through motion control with breath-hold gating and real-time MR-imaging as well as the option for daily online adaptive replanning to account for changes in target and/or organ-at-risk (OAR) location. MATERIALS/METHODS: 26 central (19 ultra-central) thoracic oligoprogressive/oligometastatic tumors treated with isotoxic (OAR constraints-driven) 5-fraction SMART (median 50 Gy, range 35-60) between 10/2019-10/2022 were reviewed. Central tumor was defined as tumor within or touching 2 cm around proximal tracheobronchial tree (PBT) or adjacent to mediastinal/pericardial pleura. Ultra-central was defined as tumor abutting the PBT, esophagus, or great vessel. Hard OAR constraints observed were ≤ 0.03 cc for PBT V40, great vessel V52.5, and esophagus V35. Local failure was defined as tumor progression/recurrence within the planning target volume. RESULTS: Tumor abutted the PBT in 31 %, esophagus in 31 %, great vessel in 65 %, and heart in 42 % of cases. 96 % of fractions were treated with a reoptimized plan, necessary to meet OAR constraints (80 %) and/or target coverage (20 %). Median follow-up was 19 months (27 months among surviving patients). Local control (LC) was 96 % at 1-year and 90 % at 2-years (total 2/26 local failure). 23 % had G2 acute toxicities (esophagitis, dysphagia, anorexia, nausea) and one (4 %) had G3 acute radiation dermatitis. There were no G4-5 acute toxicities. There was no symptomatic pneumonitis and no G2 + late toxicities. CONCLUSION: Isotoxic 5-fraction SMART resulted in high rates of LC and minimal toxicity. This approach may widen the therapeutic window for high-risk oligoprogressive/oligometastatic thoracic tumors.

Subject(s)

Lung Neoplasms , Radiation Injuries , Radiosurgery , Thoracic Neoplasms , Humans , Radiotherapy Planning, Computer-Assisted/methods , Neoplasm Recurrence, Local , Radiosurgery/methods , Thoracic Neoplasms/radiotherapy , Magnetic Resonance Imaging/methods , Lung Neoplasms/diagnostic imaging , Lung Neoplasms/radiotherapy , Lung Neoplasms/pathology

8.

Application of simultaneous uncertainty quantification and segmentation for oropharyngeal cancer use-case with Bayesian deep learning.

Sahlsten, Jaakko; Jaskari, Joel; Wahid, Kareem A; Ahmed, Sara; Glerean, Enrico; He, Renjie; Kann, Benjamin H; Mäkitie, Antti; Fuller, Clifton D; Naser, Mohamed A; Kaski, Kimmo.

Commun Med (Lond) ; 4(1): 110, 2024 Jun 08.

Article in English | MEDLINE | ID: mdl-38851837

ABSTRACT

BACKGROUND: Radiotherapy is a core treatment modality for oropharyngeal cancer (OPC), where the primary gross tumor volume (GTVp) is manually segmented with high interobserver variability. This calls for reliable and trustworthy automated tools in clinician workflow. Therefore, accurate uncertainty quantification and its downstream utilization is critical. METHODS: Here we propose uncertainty-aware deep learning for OPC GTVp segmentation, and illustrate the utility of uncertainty in multiple applications. We examine two Bayesian deep learning (BDL) models and eight uncertainty measures, and utilize a large multi-institute dataset of 292 PET/CT scans to systematically analyze our approach. RESULTS: We show that our uncertainty-based approach accurately predicts the quality of the deep learning segmentation in 86.6% of cases, identifies low performance cases for semi-automated correction, and visualizes regions of the scans where the segmentations likely fail. CONCLUSIONS: Our BDL-based analysis provides a first-step towards more widespread implementation of uncertainty quantification in OPC GTVp segmentation.

Radiotherapy is used as a treatment for people with oropharyngeal cancer. It is important to distinguish the areas where cancer is present so the radiotherapy treatment can be targeted at the cancer. Computational methods based on artificial intelligence can automate this task but need to be able to distinguish areas where it is unclear whether cancer is present. In this study we compare these computational methods that are able to highlight areas where it is unclear whether or not cancer is present. Our approach accurately predicts how well these areas are distinguished by the models. Our results could be applied to improve the computational methods used during radiotherapy treatment. This could enable more targeted treatment to be used in the future, which could result in better outcomes for people with oropharyngeal cancer.

9.

Lung sparing in MR-guided non-adaptive SBRT treatment of peripheral lung tumors.

Lee, Ho Young; Lee, Grace; Ferguson, Dianne; Hsu, Shu-Hui; Hu, Yue-Houng; Huynh, Elizabeth; Sudhyadhom, Atchar; Williams, Christopher L; Cagney, Daniel N; Fitzgerald, Kelly J; Kann, Benjamin H; Kozono, David; Leeman, Jonathan E; Mak, Raymond H; Han, Zhaohui.

Biomed Phys Eng Express ; 10(4), 2024 Jun 20.

Article in English | MEDLINE | ID: mdl-38861951

ABSTRACT

Objective. We aim to: (1) quantify the benefits of lung sparing using non-adaptive magnetic resonance guided stereotactic body radiotherapy (MRgSBRT) with advanced motion management for peripheral lung cancers compared to conventional x-ray guided SBRT (ConvSBRT); (2) establish a practical decision-making guidance metric to assist a clinician in selecting the appropriate treatment modality. Approach. Eleven patients with peripheral lung cancer who underwent breath-hold, gated MRgSBRT on an MR-guided linear accelerator (MR linac) were studied. Four-dimensional computed tomography (4DCT)-based retrospective planning using an internal target volume (ITV) was performed to simulate ConvSBRT plans, which were evaluated against the original MRgSBRT plans. Metrics analyzed included planning target volume (PTV) coverage, various lung metrics and the generalized equivalent uniform dose (gEUD). A dosimetric predictor for achievable lung metrics was derived to assist future patient triage across modalities. Main results. PTV coverage was high (median V100% > 98%) and comparable for both modalities. MRgSBRT had significantly lower lung doses as measured by V20 (median 3.2% versus 4.2%), mean lung dose (median 3.3 Gy versus 3.8 Gy) and gEUD. Breath-hold, gated MRgSBRT resulted in an average reduction of 47% in PTV volume and an average increase of 19% in lung volume. Strong correlation existed between lung metrics and the ratio of PTV to lung volumes (RPTV/Lungs) for both modalities, indicating that RPTV/Lungs may serve as a good predictor for achievable lung metrics without the need for pre-planning. A threshold value of RPTV/Lungs < 0.035 is suggested to achieve V20 < 10% using ConvSBRT. MRgSBRT should otherwise be considered if the threshold cannot be met. Significance. The benefits of lung sparing using MRgSBRT were quantified for peripheral lung tumors; RPTV/Lungs was found to be an effective predictor for achievable lung metrics across modalities.
RPTV/Lungs can assist a clinician in selecting the appropriate modality without the need for labor-intensive pre-planning, which has significant practical benefit for a busy clinic.
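
The triage rule described in this abstract (ratio of PTV to lung volume compared against 0.035) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the example volumes, and the choice of cubic centimetres as units are hypothetical, and the abstract does not specify exactly how the two volumes are to be contoured or measured. Only the 0.035 threshold comes from the abstract.

```python
# Threshold on the PTV-to-lung volume ratio quoted in the abstract
# (suggested for achieving V20 < 10% with ConvSBRT).
RATIO_THRESHOLD = 0.035


def ptv_to_lung_ratio(ptv_volume_cc, lung_volume_cc):
    """Ratio of PTV volume to lung volume (both in the same units)."""
    if lung_volume_cc <= 0:
        raise ValueError("lung volume must be positive")
    return ptv_volume_cc / lung_volume_cc


def suggest_modality(ptv_volume_cc, lung_volume_cc):
    """Hypothetical helper: apply the abstract's threshold to one patient."""
    ratio = ptv_to_lung_ratio(ptv_volume_cc, lung_volume_cc)
    if ratio < RATIO_THRESHOLD:
        return "ConvSBRT"
    return "consider MRgSBRT"


# Illustrative volumes only (not patient data from the study).
print(suggest_modality(50.0, 3000.0))   # ratio of about 0.017
print(suggest_modality(150.0, 3000.0))  # ratio of 0.05
```

Such a rule is a screening aid in the spirit of the abstract and would not replace actual treatment planning.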

Subject(s)

Four-Dimensional Computed Tomography , Lung Neoplasms , Lung , Magnetic Resonance Imaging , Radiosurgery , Radiotherapy Dosage , Radiotherapy Planning, Computer-Assisted , Humans , Radiosurgery/methods , Lung Neoplasms/radiotherapy , Lung Neoplasms/diagnostic imaging , Radiotherapy Planning, Computer-Assisted/methods , Magnetic Resonance Imaging/methods , Lung/diagnostic imaging , Retrospective Studies , Four-Dimensional Computed Tomography/methods , Male , Female , Radiotherapy, Image-Guided/methods , Breath Holding , Aged , Middle Aged , Organ Sparing Treatments/methods , Organs at Risk

10.

Edge roughness quantifies impact of physician variation on training and performance of deep learning auto-segmentation models for the esophagus.

Yan, Yujie; Kehayias, Christopher; He, John; Aerts, Hugo J W L; Fitzgerald, Kelly J; Kann, Benjamin H; Kozono, David E; Guthier, Christian V; Mak, Raymond H.

Sci Rep ; 14(1): 2536, 2024 Jan 30.

Article in English | MEDLINE | ID: mdl-38291051

ABSTRACT

Manual segmentation of tumors and organs-at-risk (OAR) in 3D imaging for radiation-therapy planning is time-consuming and subject to variation between different observers. Artificial intelligence (AI) can assist with segmentation, but challenges exist in ensuring high-quality segmentation, especially for small, variable structures, such as the esophagus. We investigated the effect of variation in segmentation quality and style of physicians for training deep-learning models for esophagus segmentation and proposed a new metric, edge roughness, for evaluating/quantifying slice-to-slice inconsistency. This study includes a real-world cohort of 394 patients who each received radiation therapy (mainly for lung cancer). Segmentation of the esophagus was performed by 8 physicians as part of routine clinical care. We evaluated manual segmentation by comparing the length and edge roughness of segmentations among physicians to analyze inconsistencies. We trained eight multiple- and individual-physician segmentation models in total, based on U-Net architectures and residual backbones. We used the volumetric Dice coefficient to measure the performance for each model. We proposed a metric, edge roughness, to quantify the shift of segmentation among adjacent slices by calculating the curvature of edges of the 2D sagittal- and coronal-view projections. The auto-segmentation model trained on multiple physicians (MD1-7) achieved the highest mean Dice of 73.7 ± 14.8%. The individual-physician model (MD7) with the highest edge roughness (mean ± SD: 0.106 ± 0.016) demonstrated significantly lower volumetric Dice for test cases compared with other individual models (MD7: 58.5 ± 15.8%, MD6: 67.1 ± 16.8%, p < 0.001). A multiple-physician model trained after removing the MD7 data resulted in fewer outliers (e.g., Dice ≤ 40%: 4 cases for MD1-6, 7 cases for MD1-7, Ntotal = 394). 
While we initially detected this pattern in a single clinician, we validated the edge roughness metric across the entire dataset. The model trained with the lowest-quantile edge roughness (MDER-Q1, Ntrain = 62) achieved significantly higher Dice (Ntest = 270) than the model trained with the highest-quantile ones (MDER-Q4, Ntrain = 62) (MDER-Q1: 67.8 ± 14.8%, MDER-Q4: 62.8 ± 15.7%, p < 0.001). This study demonstrates that there is significant variation in style and quality in manual segmentations in clinical care, and that training AI auto-segmentation algorithms from real-world, clinical datasets may result in unexpectedly under-performing algorithms with the inclusion of outliers. Importantly, this study provides a novel evaluation metric, edge roughness, to quantify physician variation in segmentation which will allow developers to filter clinical training data to optimize model performance.
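
The volumetric Dice coefficient used in this abstract to score each model can be illustrated with a minimal Python sketch. It assumes the two segmentations are given as flattened binary masks of equal length (1 for esophagus voxels, 0 otherwise); the function name and the toy masks are hypothetical, and the convention of returning 1.0 for two empty masks is an assumption, not something the abstract states. The edge roughness metric itself is not reproduced here because the abstract does not give its formula.

```python
def dice_coefficient(mask_a, mask_b):
    """Dice overlap 2|A∩B| / (|A| + |B|) for two flattened binary masks."""
    if len(mask_a) != len(mask_b):
        raise ValueError("masks must have the same number of voxels")
    # Voxels labelled as foreground in both masks.
    intersection = sum(1 for a, b in zip(mask_a, mask_b) if a and b)
    # Total foreground voxels across the two masks.
    total = sum(1 for a in mask_a if a) + sum(1 for b in mask_b if b)
    if total == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return 2.0 * intersection / total


# Toy example: one voxel overlaps out of two foreground voxels per mask.
manual = [1, 1, 0, 0]
automatic = [1, 0, 1, 0]
print(dice_coefficient(manual, automatic))  # 0.5
```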

Subject(s)

Deep Learning , Humans , Artificial Intelligence , Thorax , Algorithms , Tomography, X-Ray Computed , Image Processing, Computer-Assisted/methods

11.

Impact of ¹⁸F-FDG PET Intensity Normalization on Radiomic Features of Oropharyngeal Squamous Cell Carcinomas and Machine Learning-Generated Biomarkers.

Haider, Stefan P; Zeevi, Tal; Sharaf, Kariem; Gross, Moritz; Mahajan, Amit; Kann, Benjamin H; Judson, Benjamin L; Prasad, Manju L; Burtness, Barbara; Aboian, Mariam; Canis, Martin; Reichel, Christoph A; Baumeister, Philipp; Payabvash, Seyedmehdi.

J Nucl Med ; 65(5): 803-809, 2024 May 01.

Article in English | MEDLINE | ID: mdl-38514087

ABSTRACT

We aimed to investigate the effects of 18F-FDG PET voxel intensity normalization on radiomic features of oropharyngeal squamous cell carcinoma (OPSCC) and machine learning-generated radiomic biomarkers. Methods: We extracted 1,037 18F-FDG PET radiomic features quantifying the shape, intensity, and texture of 430 OPSCC primary tumors. The reproducibility of individual features across 3 intensity-normalized images (body-weight SUV, reference tissue activity ratio to lentiform nucleus of brain and cerebellum) and the raw PET data was assessed using an intraclass correlation coefficient (ICC). We investigated the effects of intensity normalization on the features' utility in predicting the human papillomavirus (HPV) status of OPSCCs in univariate logistic regression, receiver-operating-characteristic analysis, and extreme-gradient-boosting (XGBoost) machine-learning classifiers. Results: Of 1,037 features, a high (ICC ≥ 0.90), medium (0.90 > ICC ≥ 0.75), and low (ICC < 0.75) degree of reproducibility across normalization methods was attained in 356 (34.3%), 608 (58.6%), and 73 (7%) features, respectively. In univariate analysis, features from the PET normalized to the lentiform nucleus had the strongest association with HPV status, with 865 of 1,037 (83.4%) significant features after multiple testing corrections and a median area under the receiver-operating-characteristic curve (AUC) of 0.65 (interquartile range, 0.62-0.68). Similar tendencies were observed in XGBoost models, with the lentiform nucleus-normalized model achieving the numerically highest average AUC of 0.72 (SD, 0.07) in the cross validation within the training cohort. The model generalized well to the validation cohorts, attaining an AUC of 0.73 (95% CI, 0.60-0.85) in independent validation and 0.76 (95% CI, 0.58-0.95) in external validation. The AUCs of the XGBoost models were not significantly different. 
Conclusion: Only one third of the features demonstrated a high degree of reproducibility across intensity-normalization techniques, making uniform normalization a prerequisite for interindividual comparability of radiomic markers. The choice of normalization technique may affect the radiomic features' predictive value with respect to HPV. Our results show trends that normalization to the lentiform nucleus may improve model performance, although more evidence is needed to draw a firm conclusion.
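
The three reproducibility tiers in this abstract (high for ICC ≥ 0.90, medium for 0.90 > ICC ≥ 0.75, low for ICC < 0.75) amount to a simple binning rule. The Python sketch below applies that rule and re-derives the reported percentages from the reported feature counts. The function name is hypothetical; the cut-offs and the counts are the ones quoted in the abstract.

```python
def icc_category(icc):
    """Bin an intraclass correlation coefficient using the abstract's cut-offs."""
    if icc >= 0.90:
        return "high"
    if icc >= 0.75:
        return "medium"
    return "low"


# Feature counts per tier as reported in the abstract (1,037 features in total).
reported_counts = {"high": 356, "medium": 608, "low": 73}
total_features = sum(reported_counts.values())

# Share of features in each tier, in percent, rounded to one decimal place.
shares = {
    tier: round(100.0 * count / total_features, 1)
    for tier, count in reported_counts.items()
}
print(total_features, shares)
```

The computed shares agree with the 34.3%, 58.6%, and 7% figures given in the abstract.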

Subject(s)

Fluorodeoxyglucose F18 , Machine Learning , Oropharyngeal Neoplasms , Humans , Oropharyngeal Neoplasms/diagnostic imaging , Male , Female , Middle Aged , Positron-Emission Tomography/methods , Image Processing, Computer-Assisted/methods , Aged , Carcinoma, Squamous Cell/diagnostic imaging , Biomarkers, Tumor/metabolism , Reproducibility of Results , Radiomics

12.

Longitudinal risk prediction for pediatric glioma with temporal deep learning.

Tak, Divyanshu; Garomsa, Biniam A; Zapaishchykova, Anna; Ye, Zezhong; Vajapeyam, Sri; Mahootiha, Maryam; Climent Pardo, Juan Carlos; Smith, Ceilidh; Familiar, Ariana M; Chaunzwa, Tafadzwa; Liu, Kevin X; Prabhu, Sanjay; Bandopadhayay, Pratiti; Nabavizadeh, Ali; Mueller, Sabine; Aerts, Hugo Jwl; Haas-Kogan, Daphne; Poussaint, Tina Y; Kann, Benjamin H.

medRxiv ; 2024 Jun 28.

Article in English | MEDLINE | ID: mdl-38978642

ABSTRACT

Pediatric glioma recurrence can cause morbidity and mortality; however, recurrence pattern and severity are heterogeneous and challenging to predict with established clinical and genomic markers. As a result, almost all children undergo frequent, long-term, magnetic resonance (MR) brain surveillance regardless of individual recurrence risk. Deep learning analysis of longitudinal MR may be an effective approach for improving individualized recurrence prediction in gliomas and other cancers but has thus far been infeasible with current frameworks. Here, we propose a self-supervised, deep learning approach to longitudinal medical imaging analysis, temporal learning, that models the spatiotemporal information from a patient's current and prior brain MRs to predict future recurrence. We apply temporal learning to pediatric glioma surveillance imaging for 715 patients (3,994 scans) from four distinct clinical settings. We find that longitudinal imaging analysis with temporal learning improves recurrence prediction performance by up to 41% compared to traditional approaches, with improvements in performance in both low- and high-grade glioma. We find that recurrence prediction accuracy increases incrementally with the number of historical scans available per patient. Temporal deep learning may enable point-of-care decision-support for pediatric brain tumors and be adaptable more broadly to patients with other cancers and chronic diseases undergoing surveillance imaging.

13.

Stepwise Transfer Learning for Expert-Level Pediatric Brain Tumor MRI Segmentation in a Limited Data Scenario.

Boyd, Aidan; Ye, Zezhong; Prabhu, Sanjay; Tjong, Michael C; Zha, Yining; Zapaischykova, Anna; Vajapeyam, Sridhar; Catalano, Paul J; Hayat, Hasaan; Chopra, Rishi; Liu, Kevin X; Nabavizadeh, Ali; Resnick, Adam; Mueller, Sabine; Haas-Kogan, Daphne; Aerts, Hugo J W L; Poussaint, Tina; Kann, Benjamin H.

Radiol Artif Intell ; : e230254, 2024 Jul 10.

Article in English | MEDLINE | ID: mdl-38984985

ABSTRACT

Purpose To develop, externally test, and evaluate clinical acceptability of a deep learning (DL) pediatric brain tumor segmentation model using stepwise transfer learning. Materials and Methods In this retrospective study, the authors leveraged two T2-weighted MRI datasets (May 2001-December 2015) from a national brain tumor consortium (n = 184; median age, 7 years (range: 1-23 years); 94 male) and a pediatric cancer center (n = 100; median age, 8 years (range: 1-19 years); 47 male) to develop and evaluate DL neural networks for pediatric low-grade glioma segmentation using a novel stepwise transfer learning approach to maximize performance in a limited data scenario. The best model was externally tested on an independent test set and subjected to randomized, blinded evaluation by three clinicians, wherein they assessed clinical acceptability of expert- and artificial intelligence (AI)-generated segmentations via 10-point Likert scales and Turing tests. Results The best AI model used in-domain, stepwise transfer learning (median DSC: 0.88 [IQR 0.72-0.91] versus 0.812 [0.56-0.89] for baseline model; P = .049). On external testing, the AI model yielded excellent accuracy using reference standards from three clinical experts (Expert-1: 0.83 [0.75-0.90]; Expert-2: 0.81 [0.70-0.89]; Expert-3: 0.81 [0.68-0.88]; mean accuracy: 0.82). On clinical benchmarking (n = 100 scans), experts rated AI-based segmentations higher on average than those of other experts (median Likert score: 9 [IQR 7-9] versus 7 [IQR 7-9]) and rated more AI segmentations as clinically acceptable (80.2% versus 65.4%). 
Experts correctly predicted the origin of AI segmentations in an average of 26.0% of cases. Conclusion Stepwise transfer learning enabled expert-level, automated pediatric brain tumor auto-segmentation and volumetric measurement with a high level of clinical acceptability. ©RSNA, 2024.
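
The abstract above reports segmentation quality as a Dice similarity coefficient (DSC). As a reading aid only, here is a minimal sketch of the standard DSC definition, 2|A ∩ B| / (|A| + |B|), on flattened binary masks. The function name `dice_similarity`, the toy masks, and the empty-mask convention are illustrative assumptions, not the authors' implementation.

```python
from typing import Sequence


def dice_similarity(pred: Sequence[int], ref: Sequence[int]) -> float:
    """Dice similarity coefficient between two flattened binary masks."""
    if len(pred) != len(ref):
        raise ValueError("masks must have the same number of voxels")
    # Voxels labeled as tumor in both masks.
    intersection = sum(1 for p, r in zip(pred, ref) if p and r)
    # Total tumor voxels in each mask, summed.
    total = sum(1 for p in pred if p) + sum(1 for r in ref if r)
    if total == 0:
        # Assumed convention: two empty masks count as perfect agreement.
        return 1.0
    return 2.0 * intersection / total


# Toy example (hypothetical data): 3 voxels each, 2 shared.
predicted_mask = [1, 1, 0, 0, 1, 0]
reference_mask = [1, 0, 0, 0, 1, 1]
dsc = dice_similarity(predicted_mask, reference_mask)
print(f"DSC = {dsc:.3f}")
```

A DSC of 1 means the two masks coincide exactly and 0 means they share no voxels, which is why the reported medians near 0.8 to 0.9 indicate substantial overlap.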

14.

Large language models to identify social determinants of health in electronic health records.

Guevara, Marco; Chen, Shan; Thomas, Spencer; Chaunzwa, Tafadzwa L; Franco, Idalid; Kann, Benjamin H; Moningi, Shalini; Qian, Jack M; Goldstein, Madeleine; Harper, Susan; Aerts, Hugo J W L; Catalano, Paul J; Savova, Guergana K; Mak, Raymond H; Bitterman, Danielle S.

NPJ Digit Med ; 7(1): 6, 2024 Jan 11.

Article in English | MEDLINE | ID: mdl-38200151

ABSTRACT

Social determinants of health (SDoH) play a critical role in patient outcomes, yet their documentation is often missing or incomplete in the structured data of electronic health records (EHRs). Large language models (LLMs) could enable high-throughput extraction of SDoH from the EHR to support research and clinical care. However, class imbalance and data limitations present challenges for this sparsely documented yet critical information. Here, we investigated the optimal methods for using LLMs to extract six SDoH categories from narrative text in the EHR: employment, housing, transportation, parental status, relationship, and social support. The best-performing models were fine-tuned Flan-T5 XL for any SDoH mentions (macro-F1 0.71), and Flan-T5 XXL for adverse SDoH mentions (macro-F1 0.70). The effect of adding LLM-generated synthetic data to training varied across models and architectures, but it improved the performance of smaller Flan-T5 models (delta F1 +0.12 to +0.23). Our best fine-tuned models outperformed ChatGPT-family models in the zero- and few-shot setting, except GPT4 with 10-shot prompting for adverse SDoH. Fine-tuned models were less likely than ChatGPT to change their prediction when race/ethnicity and gender descriptors were added to the text, suggesting less algorithmic bias (p < 0.05). Our models identified 93.8% of patients with adverse SDoH, while ICD-10 codes captured 2.0%. These results demonstrate the potential of LLMs in improving real-world evidence on SDoH and assisting in identifying patients who could benefit from resource support.
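
The macro-F1 scores quoted in the abstract above average per-class F1 with equal weight, so rare classes count as much as common ones, which matters under the class imbalance the authors mention. The sketch below illustrates that definition for a single-label toy case. The function name `macro_f1` and the example labels are hypothetical, and the study's actual (possibly multi-label) evaluation setup is not described in the abstract.

```python
from typing import Hashable, Sequence


def macro_f1(y_true: Sequence[Hashable], y_pred: Sequence[Hashable]) -> float:
    """Unweighted mean of per-class F1 scores (single-label case)."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    labels = set(y_true) | set(y_pred)
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class is never hit.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy, imbalanced example (hypothetical labels): most sentences mention no SDoH.
gold = ["housing", "none", "none", "none", "employment", "none"]
pred = ["housing", "none", "none", "housing", "none", "none"]
score = macro_f1(gold, pred)
print(f"macro-F1 = {score:.3f}")
```

In this toy case the missed "employment" mention contributes an F1 of 0 and pulls the macro average well below plain accuracy (4 of 6 correct).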

15.

Towards Consistency in Pediatric Brain Tumor Measurements: Challenges, Solutions, and the Role of AI-Based Segmentation.

Familiar, Ariana M; Fathi Kazerooni, Anahita; Vossough, Arastoo; Ware, Jeffrey B; Bagheri, Sina; Khalili, Nastaran; Anderson, Hannah; Haldar, Debanjan; Storm, Phillip B; Resnick, Adam C; Kann, Benjamin H; Aboian, Mariam; Kline, Cassie; Weller, Michael; Huang, Raymond Y; Chang, Susan M; Fangusaro, Jason R; Hoffman, Lindsey M; Mueller, Sabine; Prados, Michael; Nabavizadeh, Ali.

Neuro Oncol ; 2024 May 21.

Article in English | MEDLINE | ID: mdl-38769022

ABSTRACT

MR imaging is central to the assessment of tumor burden and changes over time in neuro-oncology. Several response assessment guidelines have been set forth by the Response Assessment in Pediatric Neuro-Oncology (RAPNO) working groups in different tumor histologies; however, the visual delineation of tumor components using MRIs is not always straightforward, and complexities not currently addressed by these criteria can introduce inter- and intra-observer variability in manual assessments. Differentiation of non-enhancing tumor from peritumoral edema, mild enhancement from absence of enhancement, and various cystic components can be challenging, particularly given a lack of sufficient and uniform imaging protocols in clinical practice. Automated tumor segmentation with artificial intelligence (AI) may be able to provide more objective delineations, but relies on accurate and consistent training data created manually (ground truth). Herein, this paper reviews existing challenges and potential solutions to identifying and defining subregions of pediatric brain tumors (PBTs) that are not explicitly addressed by current guidelines. The goal is to assert the importance of defining and adopting criteria for addressing these challenges, as it will be critical to achieving standardized tumor measurements and reproducible response assessment in PBTs, ultimately leading to more precise outcome metrics and accurate comparisons among clinical studies.

16.

Noninvasive Molecular Subtyping of Pediatric Low-Grade Glioma with Self-Supervised Transfer Learning.

Tak, Divyanshu; Ye, Zezhong; Zapaischykova, Anna; Zha, Yining; Boyd, Aidan; Vajapeyam, Sridhar; Chopra, Rishi; Hayat, Hasaan; Prabhu, Sanjay P; Liu, Kevin X; Elhalawani, Hesham; Nabavizadeh, Ali; Familiar, Ariana; Resnick, Adam C; Mueller, Sabine; Aerts, Hugo J W L; Bandopadhayay, Pratiti; Ligon, Keith L; Haas-Kogan, Daphne A; Poussaint, Tina Y; Kann, Benjamin H.

Radiol Artif Intell ; 6(3): e230333, 2024 May.

Article in English | MEDLINE | ID: mdl-38446044

ABSTRACT

Purpose To develop and externally test a scan-to-prediction deep learning pipeline for noninvasive, MRI-based BRAF mutational status classification for pediatric low-grade glioma. Materials and Methods This retrospective study included two pediatric low-grade glioma datasets with linked genomic and diagnostic T2-weighted MRI data of patients: Dana-Farber/Boston Children's Hospital (development dataset, n = 214 [113 (52.8%) male; 104 (48.6%) BRAF wild type, 60 (28.0%) BRAF fusion, and 50 (23.4%) BRAF V600E]) and the Children's Brain Tumor Network (external testing, n = 112 [55 (49.1%) male; 35 (31.2%) BRAF wild type, 60 (53.6%) BRAF fusion, and 17 (15.2%) BRAF V600E]). A deep learning pipeline was developed to classify BRAF mutational status (BRAF wild type vs BRAF fusion vs BRAF V600E) via a two-stage process: (a) three-dimensional tumor segmentation and extraction of axial tumor images and (b) section-wise, deep learning-based classification of mutational status. Knowledge-transfer and self-supervised approaches were investigated to prevent model overfitting, with a primary end point of the area under the receiver operating characteristic curve (AUC). To enhance model interpretability, a novel metric, center of mass distance, was developed to quantify the model attention around the tumor. Results A combination of transfer learning from a pretrained medical imaging-specific network and self-supervised label cross-training (TransferX) coupled with consensus logic yielded the highest classification performance with an AUC of 0.82 (95% CI: 0.72, 0.91), 0.87 (95% CI: 0.61, 0.97), and 0.85 (95% CI: 0.66, 0.95) for BRAF wild type, BRAF fusion, and BRAF V600E, respectively, on internal testing. On external testing, the pipeline yielded an AUC of 0.72 (95% CI: 0.64, 0.86), 0.78 (95% CI: 0.61, 0.89), and 0.72 (95% CI: 0.64, 0.88) for BRAF wild type, BRAF fusion, and BRAF V600E, respectively. 
Conclusion Transfer learning and self-supervised cross-training improved classification performance and generalizability for noninvasive pediatric low-grade glioma mutational status prediction in a limited data scenario. Keywords: Pediatrics, MRI, CNS, Brain/Brain Stem, Oncology, Feature Detection, Diagnosis, Supervised Learning, Transfer Learning, Convolutional Neural Network (CNN) Supplemental material is available for this article. © RSNA, 2024.

Subject(s)

Brain Neoplasms , Glioma , Humans , Child , Male , Female , Brain Neoplasms/diagnostic imaging , Retrospective Studies , Proto-Oncogene Proteins B-raf/genetics , Glioma/diagnosis , Machine Learning

17.

Using ChatGPT to evaluate cancer myths and misconceptions: artificial intelligence and cancer information.

Johnson, Skyler B; King, Andy J; Warner, Echo L; Aneja, Sanjay; Kann, Benjamin H; Bylund, Carma L.

JNCI Cancer Spectr ; 7(2), 2023 Mar 01.

Article in English | MEDLINE | ID: mdl-36929393

ABSTRACT

Data about the quality of cancer information that chatbots and other artificial intelligence systems provide are limited. Here, we evaluate the accuracy of cancer information on ChatGPT compared with the National Cancer Institute's (NCI's) answers by using the questions on the "Common Cancer Myths and Misconceptions" web page. The NCI's answers and ChatGPT answers to each question were blinded, and then evaluated for accuracy (accurate: yes vs no). Ratings were evaluated independently for each question, and then compared between the blinded NCI and ChatGPT answers. Additionally, word count and Flesch-Kincaid readability grade level for each individual response were evaluated. Following expert review, the percentage of overall agreement for accuracy was 100% for NCI answers and 96.9% for ChatGPT outputs for questions 1 through 13 (κ = -0.03, standard error = 0.08). There were few noticeable differences in the number of words or the readability of the answers from NCI or ChatGPT. Overall, the results suggest that ChatGPT provides accurate information about common cancer myths and misconceptions.

Subject(s)

Artificial Intelligence , Neoplasms , United States/epidemiology , Humans , Neoplasms/diagnosis , National Cancer Institute (U.S.)

18.

Segmentation stability of human head and neck cancer medical images for radiotherapy applications under de-identification conditions: Benchmarking data sharing and artificial intelligence use-cases.

Sahlsten, Jaakko; Wahid, Kareem A; Glerean, Enrico; Jaskari, Joel; Naser, Mohamed A; He, Renjie; Kann, Benjamin H; Mäkitie, Antti; Fuller, Clifton D; Kaski, Kimmo.

Front Oncol ; 13: 1120392, 2023.

Article in English | MEDLINE | ID: mdl-36925936

ABSTRACT

Background: Demand for head and neck cancer (HNC) radiotherapy data in algorithmic development has prompted increased image dataset sharing. Medical images must comply with data protection requirements so that re-use is enabled without disclosing patient identifiers. Defacing, i.e., the removal of facial features from images, is often considered a reasonable compromise between data protection and re-usability for neuroimaging data. While defacing tools have been developed by the neuroimaging community, their acceptability for radiotherapy applications has not been explored. Therefore, this study systematically investigated the impact of available defacing algorithms on HNC organs at risk (OARs). Methods: A publicly available dataset of magnetic resonance imaging scans for 55 HNC patients with eight segmented OARs (bilateral submandibular glands, parotid glands, level II neck lymph nodes, level III neck lymph nodes) was utilized. Eight publicly available defacing algorithms were investigated: afni_refacer, DeepDefacer, defacer, fsl_deface, mask_face, mri_deface, pydeface, and quickshear. Using a subset of scans where defacing succeeded (N=29), a 5-fold cross-validation 3D U-net based OAR auto-segmentation model was utilized to perform two main experiments: (1) comparing original and defaced data for training when evaluated on original data; (2) using original data for training and comparing the model evaluation on original and defaced data. Models were primarily assessed using the Dice similarity coefficient (DSC). Results: Most defacing methods were unable to produce any usable images for evaluation, while mask_face, fsl_deface, and pydeface were unable to remove the face for 29%, 18%, and 24% of subjects, respectively. 
When using the original data for evaluation, the composite OAR DSC was statistically higher (p ≤ 0.05) for the model trained with the original data with a DSC of 0.760 compared to the mask_face, fsl_deface, and pydeface models with DSCs of 0.742, 0.736, and 0.449, respectively. Moreover, the model trained with original data had decreased performance (p ≤ 0.05) when evaluated on the defaced data with DSCs of 0.673, 0.693, and 0.406 for mask_face, fsl_deface, and pydeface, respectively. Conclusion: Defacing algorithms may have a significant impact on HNC OAR auto-segmentation model training and testing. This work highlights the need for further development of HNC-specific image anonymization methods.

19.

Application of simultaneous uncertainty quantification for image segmentation with probabilistic deep learning: Performance benchmarking of oropharyngeal cancer target delineation as a use-case.

Sahlsten, Jaakko; Jaskari, Joel; Wahid, Kareem A; Ahmed, Sara; Glerean, Enrico; He, Renjie; Kann, Benjamin H; Mäkitie, Antti; Fuller, Clifton D; Naser, Mohamed A; Kaski, Kimmo.

medRxiv ; 2023 Feb 24.

Article in English | MEDLINE | ID: mdl-36865296

ABSTRACT

Background: Oropharyngeal cancer (OPC) is a widespread disease, with radiotherapy being a core treatment modality. Manual segmentation of the primary gross tumor volume (GTVp) is currently employed for OPC radiotherapy planning, but is subject to significant interobserver variability. Deep learning (DL) approaches have shown promise in automating GTVp segmentation, but comparative (auto)confidence metrics of these models' predictions have not been well explored. Quantifying instance-specific DL model uncertainty is crucial to improving clinician trust and facilitating broad clinical implementation. Therefore, in this study, probabilistic DL models for GTVp auto-segmentation were developed using large-scale PET/CT datasets, and various uncertainty auto-estimation methods were systematically investigated and benchmarked. Methods: We utilized the publicly available 2021 HECKTOR Challenge training dataset with 224 co-registered PET/CT scans of OPC patients with corresponding GTVp segmentations as a development set. A separate set of 67 co-registered PET/CT scans of OPC patients with corresponding GTVp segmentations was used for external validation. Two approximate Bayesian deep learning methods, the MC Dropout Ensemble and Deep Ensemble, both with five submodels, were evaluated for GTVp segmentation and uncertainty performance. The segmentation performance was evaluated using the volumetric Dice similarity coefficient (DSC), mean surface distance (MSD), and Hausdorff distance at 95% (95HD). The uncertainty was evaluated using four measures from the literature: coefficient of variation (CV), structure expected entropy, structure predictive entropy, and structure mutual information, and additionally with our novel Dice-risk measure. 
The utility of uncertainty information was evaluated with the accuracy of uncertainty-based segmentation performance prediction using the Accuracy vs Uncertainty (AvU) metric, and by examining the linear correlation between uncertainty estimates and DSC. In addition, batch-based and instance-based referral processes were examined, where the patients with high uncertainty were rejected from the set. In the batch referral process, the area under the referral curve with DSC (R-DSC AUC) was used for evaluation, whereas in the instance referral process, the DSC at various uncertainty thresholds was examined. Results: Both models behaved similarly in terms of the segmentation performance and uncertainty estimation. Specifically, the MC Dropout Ensemble had 0.776 DSC, 1.703 mm MSD, and 5.385 mm 95HD. The Deep Ensemble had 0.767 DSC, 1.717 mm MSD, and 5.477 mm 95HD. The uncertainty measure with the highest DSC correlation was structure predictive entropy with correlation coefficients of 0.699 and 0.692 for the MC Dropout Ensemble and the Deep Ensemble, respectively. The highest AvU value was 0.866 for both models. The best performing uncertainty measure for both models was the CV, which had an R-DSC AUC of 0.783 and 0.782 for the MC Dropout Ensemble and Deep Ensemble, respectively. When patients were referred based on uncertainty thresholds from 0.85 validation DSC for all uncertainty measures, the DSC improved on average over the full dataset by 4.7% and 5.0% while referring 21.8% and 22% of patients for MC Dropout Ensemble and Deep Ensemble, respectively. Conclusion: We found that many of the investigated methods provide overall similar but distinct utility in terms of predicting segmentation quality and referral performance. These findings are a critical first step towards more widespread implementation of uncertainty quantification in OPC GTVp segmentation.
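
The entropy-based uncertainty measures named in the abstract above can be illustrated for a single voxel and a five-member ensemble. This is a sketch of the textbook definitions only: predictive entropy is the entropy of the averaged probability, expected entropy is the average of each member's entropy, and mutual information is their difference. How the authors aggregate these into "structure" level measures is not stated in the abstract, and the names `binary_entropy` and `ensemble_uncertainty` are assumptions made for illustration.

```python
import math
from typing import Dict, Sequence


def binary_entropy(p: float) -> float:
    """Entropy (in nats) of a Bernoulli variable with foreground probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p))


def ensemble_uncertainty(member_probs: Sequence[float]) -> Dict[str, float]:
    """Per-voxel uncertainty from the foreground probabilities of ensemble members."""
    if not member_probs:
        raise ValueError("need at least one ensemble member")
    n = len(member_probs)
    mean_p = sum(member_probs) / n
    predictive = binary_entropy(mean_p)  # total uncertainty
    expected = sum(binary_entropy(p) for p in member_probs) / n  # data uncertainty
    return {
        "predictive_entropy": predictive,
        "expected_entropy": expected,
        # Disagreement between members (model uncertainty).
        "mutual_information": predictive - expected,
    }


# Hypothetical voxels: members agree on the first and disagree on the second.
agreeing = ensemble_uncertainty([0.9, 0.9, 0.9, 0.9, 0.9])
disagreeing = ensemble_uncertainty([0.1, 0.9, 0.1, 0.9, 0.5])
print(agreeing["mutual_information"], disagreeing["mutual_information"])
```

Mutual information is close to zero when all members give the same probability and grows as they disagree, which is the property that makes it usable as a referral signal.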

20.

Prediction of Distant Metastases After Stereotactic Body Radiation Therapy for Early Stage NSCLC: Development and External Validation of a Multi-Institutional Model.

Gao, Sarah J; Jin, Lan; Meadows, Hugh W; Shafman, Timothy D; Gross, Cary P; Yu, James B; Aerts, Hugo J W L; Miccio, Joseph A; Stahl, John M; Mak, Raymond H; Decker, Roy H; Kann, Benjamin H.

J Thorac Oncol ; 18(3): 339-349, 2023 Mar.

Article in English | MEDLINE | ID: mdl-36396062

ABSTRACT

INTRODUCTION: Distant metastases (DMs) are the primary driver of mortality for patients with early stage NSCLC receiving stereotactic body radiation therapy (SBRT), yet patient-level risk is difficult to predict. We developed and validated a model to predict individualized risk of DM in this population. METHODS: We used a multi-institutional database of 1280 patients with cT1-3N0M0 NSCLC treated with SBRT from 2006 to 2015 for model development and internal validation. A Fine and Gray (FG) regression model was built to predict 1-year DM risk and compared with a random survival forests model. The higher performing model was evaluated on an external data set of 130 patients from a separate institution. Discriminatory performance was evaluated using the time-dependent area under the curve (AUC). Calibration was assessed graphically and with Brier scores. RESULTS: The FG model yielded an AUC of 0.71 (95% confidence interval [CI]: 0.57-0.86) compared with the AUC of random survival forest at 0.69 (95% CI: 0.63-0.85) in the internal test set and was selected for further testing. On external validation, the FG model yielded an AUC of 0.70 (95% CI: 0.57-0.83) with good calibration (Brier score: 0.08). The model identified a high-risk patient subgroup with greater 1-year DM rates in the internal test (20.0% [3 of 15] versus 2.9% [7 of 241], p = 0.001) and external validation (21.4% [3 of 15] versus 7.8% [9 of 116], p = 0.095). A model nomogram and online application were made available. CONCLUSIONS: We developed and externally validated a practical model that predicts DM risk in patients with NSCLC receiving SBRT, which may help select patients for systemic therapy.

Subject(s)

Carcinoma, Non-Small-Cell Lung , Lung Neoplasms , Radiosurgery , Humans , Prognosis , Lung Neoplasms/pathology , Carcinoma, Non-Small-Cell Lung/pathology , Nomograms
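
The record above reports calibration as a Brier score. For orientation, the basic Brier score is the mean squared difference between predicted probabilities and observed binary outcomes, with lower values being better. The sketch below shows only that basic form; it ignores censoring and competing risks, which a time-dependent analysis such as the one described would need to handle, and the function name and toy numbers are hypothetical.

```python
from typing import Sequence


def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared error between predicted risks and 0/1 outcomes."""
    if len(probs) != len(outcomes) or not probs:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)


# Hypothetical 1-year risk predictions and observed events (1 = event occurred).
predicted_risk = [0.05, 0.10, 0.30, 0.02]
observed_event = [0, 0, 1, 0]
score = brier_score(predicted_risk, observed_event)
print(f"Brier score = {score:.4f}")
```

A model that always predicted the event rate of a rare outcome would already score low, so the Brier score is usually read alongside a discrimination measure such as the AUC.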
