Search | VHL Regional Portal

1.

Application of simultaneous uncertainty quantification and segmentation for oropharyngeal cancer use-case with Bayesian deep learning.

Sahlsten, Jaakko; Jaskari, Joel; Wahid, Kareem A; Ahmed, Sara; Glerean, Enrico; He, Renjie; Kann, Benjamin H; Mäkitie, Antti; Fuller, Clifton D; Naser, Mohamed A; Kaski, Kimmo.

Commun Med (Lond) ; 4(1): 110, 2024 Jun 08.

Article in English | MEDLINE | ID: mdl-38851837

ABSTRACT

BACKGROUND: Radiotherapy is a core treatment modality for oropharyngeal cancer (OPC), where the primary gross tumor volume (GTVp) is manually segmented with high interobserver variability. This calls for reliable and trustworthy automated tools in clinician workflow. Therefore, accurate uncertainty quantification and its downstream utilization is critical. METHODS: Here we propose uncertainty-aware deep learning for OPC GTVp segmentation, and illustrate the utility of uncertainty in multiple applications. We examine two Bayesian deep learning (BDL) models and eight uncertainty measures, and utilize a large multi-institute dataset of 292 PET/CT scans to systematically analyze our approach. RESULTS: We show that our uncertainty-based approach accurately predicts the quality of the deep learning segmentation in 86.6% of cases, identifies low performance cases for semi-automated correction, and visualizes regions of the scans where the segmentations likely fail. CONCLUSIONS: Our BDL-based analysis provides a first-step towards more widespread implementation of uncertainty quantification in OPC GTVp segmentation.

Radiotherapy is used as a treatment for people with oropharyngeal cancer. It is important to distinguish the areas where cancer is present so the radiotherapy treatment can be targeted at the cancer. Computational methods based on artificial intelligence can automate this task but need to be able to distinguish areas where it is unclear whether cancer is present. In this study we compare these computational methods that are able to highlight areas where it is unclear whether or not cancer is present. Our approach accurately predicts how well these areas are distinguished by the models. Our results could be applied to improve the computational methods used during radiotherapy treatment. This could enable more targeted treatment to be used in the future, which could result in better outcomes for people with oropharyngeal cancer.

2.

Multi-modal segmentation with missing image data for automatic delineation of gross tumor volumes in head and neck cancers.

Zhao, Yao; Wang, Xin; Phan, Jack; Chen, Xinru; Lee, Anna; Yu, Cenji; Huang, Kai; Court, Laurence E; Pan, Tinsu; Wang, He; Wahid, Kareem Abdul; Mohamed, Abdalah S R; Naser, Mohamed; Fuller, Clifton D; Yang, Jinzhong.

Med Phys ; 2024 Jun 19.

Article in English | MEDLINE | ID: mdl-38896829

ABSTRACT

BACKGROUND: Head and neck (HN) gross tumor volume (GTV) auto-segmentation is challenging due to the morphological complexity and low image contrast of targets. Multi-modality images, including computed tomography (CT) and positron emission tomography (PET), are used in the routine clinic to assist radiation oncologists for accurate GTV delineation. However, the availability of PET imaging may not always be guaranteed. PURPOSE: To develop a deep learning segmentation framework for automated GTV delineation of HN cancers using a combination of PET/CT images, while addressing the challenge of missing PET data. METHODS: Two datasets were included for this study: Dataset I: 524 (training) and 359 (testing) oropharyngeal cancer patients from different institutions with their PET/CT pairs provided by the HECKTOR Challenge; Dataset II: 90 HN patients(testing) from a local institution with their planning CT, PET/CT pairs. To handle potentially missing PET images, a model training strategy named the "Blank Channel" method was implemented. To simulate the absence of a PET image, a blank array with the same dimensions as the CT image was generated to meet the dual-channel input requirement of the deep learning model. During the model training process, the model was randomly presented with either a real PET/CT pair or a blank/CT pair. This allowed the model to learn the relationship between the CT image and the corresponding GTV delineation based on available modalities. As a result, our model had the ability to handle flexible inputs during prediction, making it suitable for cases where PET images are missing. To evaluate the performance of our proposed model, we trained it using training patients from Dataset I and tested it with Dataset II. We compared our model (Model 1) with two other models which were trained for specific modality segmentations: Model 2 trained with only CT images, and Model 3 trained with real PET/CT pairs. The performance of the models was evaluated using quantitative metrics, including Dice similarity coefficient (DSC), mean surface distance (MSD), and 95% Hausdorff Distance (HD95). In addition, we evaluated our Model 1 and Model 3 using the 359 test cases in Dataset I. RESULTS: Our proposed model(Model 1) achieved promising results for GTV auto-segmentation using PET/CT images, with the flexibility of missing PET images. Specifically, when assessed with only CT images in Dataset II, Model 1 achieved DSC of 0.56 ± 0.16, MSD of 3.4 ± 2.1 mm, and HD95 of 13.9 ± 7.6 mm. When the PET images were included, the performance of our model was improved to DSC of 0.62 ± 0.14, MSD of 2.8 ± 1.7 mm, and HD95 of 10.5 ± 6.5 mm. These results are comparable to those achieved by Model 2 and Model 3, illustrating Model 1's effectiveness in utilizing flexible input modalities. Further analysis using the test dataset from Dataset I showed that Model 1 achieved an average DSC of 0.77, surpassing the overall average DSC of 0.72 among all participants in the HECKTOR Challenge. CONCLUSIONS: We successfully refined a multi-modal segmentation tool for accurate GTV delineation for HN cancer. Our method addressed the issue of missing PET images by allowing flexible data input, thereby providing a practical solution for clinical settings where access to PET imaging may be limited.

3.

Associations Between Radiation Oncologist Demographic Factors and Segmentation Similarity Benchmarks: Insights From a Crowd-Sourced Challenge Using Bayesian Estimation.

Wahid, Kareem A; Sahin, Onur; Kundu, Suprateek; Lin, Diana; Alanis, Anthony; Tehami, Salik; Kamel, Serageldin; Duke, Simon; Sherer, Michael V; Rasmussen, Mathis; Korreman, Stine; Fuentes, David; Cislo, Michael; Nelms, Benjamin E; Christodouleas, John P; Murphy, James D; Mohamed, Abdallah S R; He, Renjie; Naser, Mohammed A; Gillespie, Erin F; Fuller, Clifton D.

JCO Clin Cancer Inform ; 8: e2300174, 2024 Jun.

Article in English | MEDLINE | ID: mdl-38870441

ABSTRACT

PURPOSE: The quality of radiotherapy auto-segmentation training data, primarily derived from clinician observers, is of utmost importance. However, the factors influencing the quality of clinician-derived segmentations are poorly understood; our study aims to quantify these factors. METHODS: Organ at risk (OAR) and tumor-related segmentations provided by radiation oncologists from the Contouring Collaborative for Consensus in Radiation Oncology data set were used. Segmentations were derived from five disease sites: breast, sarcoma, head and neck (H&N), gynecologic (GYN), and GI. Segmentation quality was determined on a structure-by-structure basis by comparing the observer segmentations with an expert-derived consensus, which served as a reference standard benchmark. The Dice similarity coefficient (DSC) was primarily used as a metric for the comparisons. DSC was stratified into binary groups on the basis of structure-specific expert-derived interobserver variability (IOV) cutoffs. Generalized linear mixed-effects models using Bayesian estimation were used to investigate the association between demographic variables and the binarized DSC for each disease site. Variables with a highest density interval excluding zero were considered to substantially affect the outcome measure. RESULTS: Five hundred seventy-four, 110, 452, 112, and 48 segmentations were used for the breast, sarcoma, H&N, GYN, and GI cases, respectively. The median percentage of segmentations that crossed the expert DSC IOV cutoff when stratified by structure type was 55% and 31% for OARs and tumors, respectively. Regression analysis revealed that the structure being tumor-related had a substantial negative impact on binarized DSC for the breast, sarcoma, H&N, and GI cases. There were no recurring relationships between segmentation quality and demographic variables across the cases, with most variables demonstrating large standard deviations. CONCLUSION: Our study highlights substantial uncertainty surrounding conventionally presumed factors influencing segmentation quality relative to benchmarks.

Subject(s)

Bayes Theorem , Benchmarking , Radiation Oncologists , Humans , Benchmarking/methods , Female , Radiotherapy Planning, Computer-Assisted/methods , Neoplasms/epidemiology , Neoplasms/radiotherapy , Organs at Risk , Male , Radiation Oncology/standards , Radiation Oncology/methods , Demography , Observer Variation

4.

Artificial Intelligence Uncertainty Quantification in Radiotherapy Applications - A Scoping Review.

Wahid, Kareem A; Kaffey, Zaphanlene Y; Farris, David P; Humbert-Vidan, Laia; Moreno, Amy C; Rasmussen, Mathis; Ren, Jintao; Naser, Mohamed A; Netherton, Tucker J; Korreman, Stine; Balakrishnan, Guha; Fuller, Clifton D; Fuentes, David; Dohopolski, Michael J.

medRxiv ; 2024 May 13.

Article in English | MEDLINE | ID: mdl-38798581

ABSTRACT

Background/purpose: The use of artificial intelligence (AI) in radiotherapy (RT) is expanding rapidly. However, there exists a notable lack of clinician trust in AI models, underscoring the need for effective uncertainty quantification (UQ) methods. The purpose of this study was to scope existing literature related to UQ in RT, identify areas of improvement, and determine future directions. Methods: We followed the PRISMA-ScR scoping review reporting guidelines. We utilized the population (human cancer patients), concept (utilization of AI UQ), context (radiotherapy applications) framework to structure our search and screening process. We conducted a systematic search spanning seven databases, supplemented by manual curation, up to January 2024. Our search yielded a total of 8980 articles for initial review. Manuscript screening and data extraction was performed in Covidence. Data extraction categories included general study characteristics, RT characteristics, AI characteristics, and UQ characteristics. Results: We identified 56 articles published from 2015-2024. 10 domains of RT applications were represented; most studies evaluated auto-contouring (50%), followed by image-synthesis (13%), and multiple applications simultaneously (11%). 12 disease sites were represented, with head and neck cancer being the most common disease site independent of application space (32%). Imaging data was used in 91% of studies, while only 13% incorporated RT dose information. Most studies focused on failure detection as the main application of UQ (60%), with Monte Carlo dropout being the most commonly implemented UQ method (32%) followed by ensembling (16%). 55% of studies did not share code or datasets. Conclusion: Our review revealed a lack of diversity in UQ for RT applications beyond auto-contouring. Moreover, there was a clear need to study additional UQ methods, such as conformal prediction. Our results may incentivize the development of guidelines for reporting and implementation of UQ in RT.

5.

Evolving Horizons in Radiation Therapy Auto-Contouring: Distilling Insights, Embracing Data-Centric Frameworks, and Moving Beyond Geometric Quantification.

Wahid, Kareem A; Cardenas, Carlos E; Marquez, Barbara; Netherton, Tucker J; Kann, Benjamin H; Court, Laurence E; He, Renjie; Naser, Mohamed A; Moreno, Amy C; Fuller, Clifton D; Fuentes, David.

Adv Radiat Oncol ; 9(7): 101521, 2024 Jul.

Article in English | MEDLINE | ID: mdl-38799110

6.

Dataset of weekly intra-treatment diffusion weighted imaging in head and neck cancer patients treated with MR-Linac.

El-Habashy, Dina M; Wahid, Kareem A; He, Renjie; McDonald, Brigid; Mulder, Samuel J; Ding, Yao; Salzillo, Travis; Lai, Stephen Y; Christodouleas, John; Dresner, Alex; Wang, Jihong; Naser, Mohamed A; Fuller, Clifton D; Mohamed, Abdallah Sherif Radwan.

Sci Data ; 11(1): 487, 2024 May 11.

Article in English | MEDLINE | ID: mdl-38734679

ABSTRACT

Radiation therapy (RT) is a crucial treatment for head and neck squamous cell carcinoma (HNSCC); however, it can have adverse effects on patients' long-term function and quality of life. Biomarkers that can predict tumor response to RT are being explored to personalize treatment and improve outcomes. While tissue and blood biomarkers have limitations, imaging biomarkers derived from magnetic resonance imaging (MRI) offer detailed information. The integration of MRI and a linear accelerator in the MR-Linac system allows for MR-guided radiation therapy (MRgRT), offering precise visualization and treatment delivery. This data descriptor offers a valuable repository for weekly intra-treatment diffusion-weighted imaging (DWI) data obtained from head and neck cancer patients. By analyzing the sequential DWI changes and their correlation with treatment response, as well as oncological and survival outcomes, the study provides valuable insights into the clinical implications of DWI in HNSCC.

Subject(s)

Diffusion Magnetic Resonance Imaging , Head and Neck Neoplasms , Humans , Head and Neck Neoplasms/diagnostic imaging , Head and Neck Neoplasms/radiotherapy , Radiotherapy, Image-Guided , Squamous Cell Carcinoma of Head and Neck/diagnostic imaging , Squamous Cell Carcinoma of Head and Neck/radiotherapy , Particle Accelerators

7.

Prognostic value of tumor volume doubling time in lung-metastatic adenoid cystic carcinoma.

Dal Lago, Eduardo A; Sousa, Luana G; Yang, Zixi; Hoff, Camilla O; Bonini, Flavia; Sawyer, Matthew; Wang, Kaiwen; Lewis, Whitney; Wahid, Kareem A; Hanna, Ehab Y; El-Naggar, Adel; Fuller, Clifton D; Kundu, Suprateek; Godoy, Myrna; Ferrarotto, Renata.

Oral Oncol ; 151: 106759, 2024 Apr.

Article in English | MEDLINE | ID: mdl-38507991

ABSTRACT

OBJECTIVES: Lung metastases in adenoid cystic carcinoma (ACC) usually have indolent growth and the optimal timing to start systemic therapy is not established. We assessed ACC lung metastasis tumor growth dynamics and compared the prognostic value of time to progression (TTP) and tumor volume doubling time (TVDT). METHODS: The study included ACC patients with ≥1 pulmonary metastasis (≥5 mm) and at least 2 chest computed tomography scans. Radiology assessment was performed from the first scan showing metastasis until treatment initiation or death. Up to 5 lung nodules per patient were segmented for TVDT calculation. To assess tumor growth rate (TGR), the correlation coefficient (r) and coefficient of determination (R2) were calculated for measured lung nodules. TTP was assessed per RECIST 1.1; TVDT was calculated using the Schwartz formula. Overall survival was analyzed using the Kaplan-Meier method. RESULTS: The study included 75 patients. Sixty-seven patients (89%) had lung-only metastasis on first CT scan. The TGR was overall constant (median R2 = 0.974). Median TTP and TVDT were 11.2 months and 7.5 months. Shorter TVDT (<6 months) was associated with poor overall survival (HR = 0.48; p = 0.037), but TTP was not associated with survival (HR = 1.02; p = 0.96). Cox regression showed that TVDT but not TTP significantly correlated with OS. TVDT calculated using estimated tumor volume correlated with TVDT obtained by segmentation. CONCLUSION: Most ACC lung metastases have a constant TGR. TVDT may be a better prognostic indicator than TTP in lung-metastatic ACC. TVDT can be estimated by single longitudinal measurement in clinical practice.

Subject(s)

Carcinoma, Adenoid Cystic , Lung Neoplasms , Humans , Prognosis , Carcinoma, Adenoid Cystic/pathology , Tumor Burden , Time Factors , Lung Neoplasms/diagnostic imaging , Lung/pathology , Retrospective Studies

8.

Comparison of Machine Leaning Models for Prediction of Acute Pain Severity and On-Treatment Opioid Utilization in Oral Cavity and Oropharyngeal Cancer Patients Receiving Radiation Therapy: Exploratory Analysis from a Large-Scale Retrospective Cohort.

Salama, Vivian; Humbert-Vidan, Laia; Godinich, Brandon; Wahid, Kareem A; ElHabashy, Dina M; Naser, Mohamed A; He, Renjie; Mohamed, Abdallah S R; Sahli, Ariana J; Hutcheson, Katherine A; Gunn, Gary Brandon; Rosenthal, David I; Fuller, Clifton D; Moreno, Amy C.

medRxiv ; 2024 Feb 08.

Article in English | MEDLINE | ID: mdl-38370746

ABSTRACT

Background: Acute pain is a common and debilitating symptom experienced by oral cavity and oropharyngeal cancer (OC/OPC) patients undergoing radiation therapy (RT). Uncontrolled pain can result in opioid overuse and increased risks of long-term opioid dependence. The specific aim of this exploratory analysis was the prediction of severe acute pain and opioid use in the acute on-treatment setting, to develop risk-stratification models for pragmatic clinical trials. Materials and Methods: A retrospective study was conducted on 900 OC/OPC patients treated with RT during 2017 to 2023. Clinical data including demographics, tumor data, pain scores and medication data were extracted from patient records. On-treatment pain intensity scores were assessed using a numeric rating scale (0-none, 10-worst) and total opioid doses were calculated using morphine equivalent daily dose (MEDD) conversion factors. Analgesics efficacy was assessed based on the combined pain intensity and the total required MEDD. ML models, including Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and Gradient Boosting Model (GBM) were developed and validated using ten-fold cross-validation. Performance of models were evaluated using discrimination and calibration metrics. Feature importance was investigated using bootstrap and permutation techniques. Results: For predicting acute pain intensity, the GBM demonstrated superior area under the receiver operating curve (AUC) (0.71), recall (0.39), and F1 score (0.48). For predicting the total MEDD, LR outperformed other models in the AUC (0.67). For predicting the analgesics efficacy, SVM achieved the highest specificity (0.97), and best calibration (ECE of 0.06), while RF and GBM achieved the same highest AUC, 0.68. RF model emerged as the best calibrated model with ECE of 0.02 for pain intensity prediction and 0.05 for MEDD prediction. Baseline pain scores and vital signs demonstrated the most contributed features for the different predictive models. Conclusion: These ML models are promising in predicting end-of-treatment acute pain and opioid requirements and analgesics efficacy in OC/OPC patients undergoing RT. Baseline pain score, vital sign changes were identified as crucial predictors. Implementation of these models in clinical practice could facilitate early risk stratification and personalized pain management. Prospective multicentric studies and external validation are essential for further refinement and generalizability.

9.

Empirically Derived Principles for Research Funding Success: A Primer for Early Career Academic Investigators.

Wahid, Kareem A; Rooney, Michael K; Gunther, Jillian R; Moreno, Amy C; Pinnix, Chelsea C; Thomas, Charles R; Fuller, Clifton D.

Int J Radiat Oncol Biol Phys ; 118(3): 590-594, 2024 Mar 01.

Article in English | MEDLINE | ID: mdl-38340768

Subject(s)

Biomedical Research , Humans , United States , Career Choice

10.

Weak Supervision, Strong Results: Achieving High Performance in Intracranial Hemorrhage Detection with Fewer Annotation Labels.

Wahid, Kareem A; Fuentes, David.

Radiol Artif Intell ; 6(1): e230598, 2024 Jan.

Article in English | MEDLINE | ID: mdl-38294326

11.

Assessment of heavy metals at mangrove ecosystem, applying multiple approaches using in-situ and remote sensing techniques, Red Sea, Egypt.

Mohammed, Asmaa H; Khalifa, Ahmed M; Mohamed, Hagar M; Abd El-Wahid, Kareem H; Hanafy, Mahmoud H.

Environ Sci Pollut Res Int ; 31(5): 8118-8133, 2024 Jan.

Article in English | MEDLINE | ID: mdl-38177641

ABSTRACT

Mangrove areas are considered the most retention zone for heavy metal pollution as it work as an edge that aggregates land and sea sediments. This study aims to examine if the heavy metals' existence in the mangrove sediment is related to contamination or natural resources. In addition, it gives an interpretation of the origin of these metals along the Egyptian Red Sea coast. Twenty-two samples of mangrove sediments were collected and then, analyzed for metals (Mn, Ni, Cu, Fe, Cd, Ag, and Pb) using inductively coupled plasma mass spectroscopy (ICP-MS). Integration between the in-situ data, contamination indices, and remote sensing and geographical information science (GIS), and multivariate statistical analysis techniques (PCA) were analyzed to assess and clarify the spatial origin of heavy metals in sediment at a regional scale. The average concentration of heavy metals from mangrove sediments were shown to be substantially lower than the referenced value, ranging from moderate to significant except the levels of Ag were very high. The heavy metals concentrations were expected to be naturally origin rather than anthropogenic and that be confirmed by mapping of Red Sea alteration zones spots. These alteration zones are parallel to mangrove sites and rich by several mineralization types including heavy metals that are carried by flooding to the coastline. Remote sensing and GIS techniques successfully contributed to interpreting the pattern of the origin of heavy metals and discharging systems that control the heavy metals concentration along the Red Sea coast.

Subject(s)

Metals, Heavy , Water Pollutants, Chemical , Ecosystem , Indian Ocean , Egypt , Remote Sensing Technology , Water Pollutants, Chemical/analysis , Environmental Monitoring , Geologic Sediments/chemistry , Metals, Heavy/analysis , Risk Assessment

12.

Harnessing uncertainty in radiotherapy auto-segmentation quality assurance.

Wahid, Kareem A; Sahlsten, Jaakko; Jaskari, Joel; Dohopolski, Michael J; Kaski, Kimmo; He, Renjie; Glerean, Enrico; Kann, Benjamin H; Mäkitie, Antti; Fuller, Clifton D; Naser, Mohamed A; Fuentes, David.

Phys Imaging Radiat Oncol ; 29: 100526, 2024 Jan.

Article in English | MEDLINE | ID: mdl-38179210

13.

Investigation of autosegmentation techniques on T2-weighted MRI for off-line dose reconstruction in MR-linac workflow for head and neck cancers.

McDonald, Brigid A; Cardenas, Carlos E; O'Connell, Nicolette; Ahmed, Sara; Naser, Mohamed A; Wahid, Kareem A; Xu, Jiaofeng; Thill, Dan; Zuhour, Raed J; Mesko, Shane; Augustyn, Alexander; Buszek, Samantha M; Grant, Stephen; Chapman, Bhavana V; Bagley, Alexander F; He, Renjie; Mohamed, Abdallah S R; Christodouleas, John; Brock, Kristy K; Fuller, Clifton D.

Med Phys ; 51(1): 278-291, 2024 Jan.

Article in English | MEDLINE | ID: mdl-37475466

ABSTRACT

BACKGROUND: In order to accurately accumulate delivered dose for head and neck cancer patients treated with the Adapt to Position workflow on the 1.5T magnetic resonance imaging (MRI)-linear accelerator (MR-linac), the low-resolution T2-weighted MRIs used for daily setup must be segmented to enable reconstruction of the delivered dose at each fraction. PURPOSE: In this pilot study, we evaluate various autosegmentation methods for head and neck organs at risk (OARs) on on-board setup MRIs from the MR-linac for off-line reconstruction of delivered dose. METHODS: Seven OARs (parotid glands, submandibular glands, mandible, spinal cord, and brainstem) were contoured on 43 images by seven observers each. Ground truth contours were generated using a simultaneous truth and performance level estimation (STAPLE) algorithm. Twenty total autosegmentation methods were evaluated in ADMIRE: 1-9) atlas-based autosegmentation using a population atlas library (PAL) of 5/10/15 patients with STAPLE, patch fusion (PF), random forest (RF) for label fusion; 10-19) autosegmentation using images from a patient's 1-4 prior fractions (individualized patient prior [IPP]) using STAPLE/PF/RF; 20) deep learning (DL) (3D ResUNet trained on 43 ground truth structure sets plus 45 contoured by one observer). Execution time was measured for each method. Autosegmented structures were compared to ground truth structures using the Dice similarity coefficient, mean surface distance (MSD), Hausdorff distance (HD), and Jaccard index (JI). For each metric and OAR, performance was compared to the inter-observer variability using Dunn's test with control. Methods were compared pairwise using the Steel-Dwass test for each metric pooled across all OARs. Further dosimetric analysis was performed on three high-performing autosegmentation methods (DL, IPP with RF and 4 fractions [IPP_RF_4], IPP with 1 fraction [IPP_1]), and one low-performing (PAL with STAPLE and 5 atlases [PAL_ST_5]). For five patients, delivered doses from clinical plans were recalculated on setup images with ground truth and autosegmented structure sets. Differences in maximum and mean dose to each structure between the ground truth and autosegmented structures were calculated and correlated with geometric metrics. RESULTS: DL and IPP methods performed best overall, all significantly outperforming inter-observer variability and with no significant difference between methods in pairwise comparison. PAL methods performed worst overall; most were not significantly different from the inter-observer variability or from each other. DL was the fastest method (33 s per case) and PAL methods the slowest (3.7-13.8 min per case). Execution time increased with a number of prior fractions/atlases for IPP and PAL. For DL, IPP_1, and IPP_RF_4, the majority (95%) of dose differences were within ± 250 cGy from ground truth, but outlier differences up to 785 cGy occurred. Dose differences were much higher for PAL_ST_5, with outlier differences up to 1920 cGy. Dose differences showed weak but significant correlations with all geometric metrics (R2 between 0.030 and 0.314). CONCLUSIONS: The autosegmentation methods offering the best combination of performance and execution time are DL and IPP_1. Dose reconstruction on on-board T2-weighted MRIs is feasible with autosegmented structures with minimal dosimetric variation from ground truth, but contours should be visually inspected prior to dose reconstruction in an end-to-end dose accumulation workflow.

Subject(s)

Head and Neck Neoplasms , Radiotherapy Planning, Computer-Assisted , Humans , Pilot Projects , Workflow , Radiotherapy Planning, Computer-Assisted/methods , Tomography, X-Ray Computed/methods , Head and Neck Neoplasms/diagnostic imaging , Head and Neck Neoplasms/radiotherapy , Magnetic Resonance Imaging/methods , Organs at Risk

14.

Artificial Intelligence and Machine Learning in Cancer Related Pain: A Systematic Review.

Salama, Vivian; Godinich, Brandon; Geng, Yimin; Humbert-Vidan, Laia; Maule, Laura; Wahid, Kareem A; Naser, Mohamed A; He, Renjie; Mohamed, Abdallah S R; Fuller, Clifton D; Moreno, Amy C.

medRxiv ; 2023 Dec 08.

Article in English | MEDLINE | ID: mdl-38105979

ABSTRACT

Background/objective: Pain is a challenging multifaceted symptom reported by most cancer patients, resulting in a substantial burden on both patients and healthcare systems. This systematic review aims to explore applications of artificial intelligence/machine learning (AI/ML) in predicting pain-related outcomes and supporting decision-making processes in pain management in cancer. Methods: A comprehensive search of Ovid MEDLINE, EMBASE and Web of Science databases was conducted using terms including "Cancer", "Pain", "Pain Management", "Analgesics", "Opioids", "Artificial Intelligence", "Machine Learning", "Deep Learning", and "Neural Networks" published up to September 7, 2023. The screening process was performed using the Covidence screening tool. Only original studies conducted in human cohorts were included. AI/ML models, their validation and performance and adherence to TRIPOD guidelines were summarized from the final included studies. Results: This systematic review included 44 studies from 2006-2023. Most studies were prospective and uni-institutional. There was an increase in the trend of AI/ML studies in cancer pain in the last 4 years. Nineteen studies used AI/ML for classifying cancer patients' pain development after cancer therapy, with median AUC 0.80 (range 0.76-0.94). Eighteen studies focused on cancer pain research with median AUC 0.86 (range 0.50-0.99), and 7 focused on applying AI/ML for cancer pain management decisions with median AUC 0.71 (range 0.47-0.89). Multiple ML models were investigated with. median AUC across all models in all studies (0.77). Random forest models demonstrated the highest performance (median AUC 0.81), lasso models had the highest median sensitivity (1), while Support Vector Machine had the highest median specificity (0.74). Overall adherence of included studies to TRIPOD guidelines was 70.7%. Lack of external validation (14%) and clinical application (23%) of most included studies was detected. Reporting of model calibration was also missing in the majority of studies (5%). Conclusion: Implementation of various novel AI/ML tools promises significant advances in the classification, risk stratification, and management decisions for cancer pain. These advanced tools will integrate big health-related data for personalized pain management in cancer patients. Further research focusing on model calibration and rigorous external clinical validation in real healthcare settings is imperative for ensuring its practical and reliable application in clinical practice.

15.

Development and implementation of optimized endogenous contrast sequences for delineation in adaptive radiotherapy on a 1.5T MR-linear-accelerator: a prospective R-IDEAL stage 0-2a quantitative/qualitative evaluation of in vivo site-specific quality-assurance using a 3D T2 fat-suppressed platform for head and neck cancer.

Salzillo, Travis C; Dresner, M Alex; Way, Ashley; Wahid, Kareem A; McDonald, Brigid A; Mulder, Sam; Naser, Mohamed A; He, Renjie; Ding, Yao; Yoder, Alison; Ahmed, Sara; Corrigan, Kelsey L; Manzar, Gohar S; Andring, Lauren; Pinnix, Chelsea; Stafford, R Jason; Mohamed, Abdallah S R; Christodouleas, John; Wang, Jihong; Fuller, Clifton David.

J Med Imaging (Bellingham) ; 10(6): 065501, 2023 Nov.

Article in English | MEDLINE | ID: mdl-37937259

ABSTRACT

Purpose: To improve segmentation accuracy in head and neck cancer (HNC) radiotherapy treatment planning for the 1.5T hybrid magnetic resonance imaging/linear accelerator (MR-Linac), three-dimensional (3D), T2-weighted, fat-suppressed magnetic resonance imaging sequences were developed and optimized. Approach: After initial testing, spectral attenuated inversion recovery (SPAIR) was chosen as the fat suppression technique. Five candidate SPAIR sequences and a nonsuppressed, T2-weighted sequence were acquired for five HNC patients using a 1.5T MR-Linac. MR physicists identified persistent artifacts in two of the SPAIR sequences, so the remaining three SPAIR sequences were further analyzed. The gross primary tumor volume, metastatic lymph nodes, parotid glands, and pterygoid muscles were delineated using five segmentors. A robust image quality analysis platform was developed to objectively score the SPAIR sequences on the basis of qualitative and quantitative metrics. Results: Sequences were analyzed for the signal-to-noise ratio and the contrast-to-noise ratio and compared with fat and muscle, conspicuity, pairwise distance metrics, and segmentor assessments. In this analysis, the nonsuppressed sequence was inferior to each of the SPAIR sequences for the primary tumor, lymph nodes, and parotid glands, but it was superior for the pterygoid muscles. The SPAIR sequence that received the highest combined score among the analysis categories was recommended to Unity MR-Linac users for HNC radiotherapy treatment planning. Conclusions: Our study led to two developments: an optimized, 3D, T2-weighted, fat-suppressed sequence that can be disseminated to Unity MR-Linac users and a robust image quality analysis pathway that can be used to objectively score SPAIR sequences and can be customized and generalized to any image quality optimization protocol. Improved segmentation accuracy with the proposed SPAIR sequence will potentially lead to improved treatment outcomes and reduced toxicity for patients by maximizing the target coverage and minimizing the radiation exposure of organs at risk.

16.

Evolving Horizons in Radiotherapy Auto-Contouring: Distilling Insights, Embracing Data-Centric Frameworks, and Moving Beyond Geometric Quantification.

Wahid, Kareem A; Cardenas, Carlos E; Marquez, Barbara; Netherton, Tucker J; Kann, Benjamin H; Court, Laurence E; He, Renjie; Naser, Mohamed A; Moreno, Amy C; Fuller, Clifton D; Fuentes, David.

ArXiv ; 2023 Oct 16.

Article in English | MEDLINE | ID: mdl-37904737

17.

Determining The Role Of Radiation Oncologist Demographic Factors On Segmentation Quality: Insights From A Crowd-Sourced Challenge Using Bayesian Estimation.

Wahid, Kareem A; Sahin, Onur; Kundu, Suprateek; Lin, Diana; Alanis, Anthony; Tehami, Salik; Kamel, Serageldin; Duke, Simon; Sherer, Michael V; Rasmussen, Mathis; Korreman, Stine; Fuentes, David; Cislo, Michael; Nelms, Benjamin E; Christodouleas, John P; Murphy, James D; Mohamed, Abdallah S R; He, Renjie; Naser, Mohammed A; Gillespie, Erin F; Fuller, Clifton D.

medRxiv ; 2023 Sep 05.

Article in English | MEDLINE | ID: mdl-37693394

ABSTRACT

BACKGROUND: Medical image auto-segmentation is poised to revolutionize radiotherapy workflows. The quality of auto-segmentation training data, primarily derived from clinician observers, is of utmost importance. However, the factors influencing the quality of these clinician-derived segmentations have yet to be fully understood or quantified. Therefore, the purpose of this study was to determine the role of common observer demographic variables on quantitative segmentation performance. METHODS: Organ at risk (OAR) and tumor volume segmentations provided by radiation oncologist observers from the Contouring Collaborative for Consensus in Radiation Oncology public dataset were utilized for this study. Segmentations were derived from five separate disease sites comprised of one patient case each: breast, sarcoma, head and neck (H&N), gynecologic (GYN), and gastrointestinal (GI). Segmentation quality was determined on a structure-by-structure basis by comparing the observer segmentations with an expert-derived consensus gold standard primarily using the Dice Similarity Coefficient (DSC); surface DSC was investigated as a secondary metric. Metrics were stratified into binary groups based on previously established structure-specific expert-derived interobserver variability (IOV) cutoffs. Generalized linear mixed-effects models using Markov chain Monte Carlo Bayesian estimation were used to investigate the association between demographic variables and the binarized segmentation quality for each disease site separately. Variables with a highest density interval excluding zero - loosely analogous to frequentist significance - were considered to substantially impact the outcome measure. RESULTS: After filtering by practicing radiation oncologists, 574, 110, 452, 112, and 48 structure observations remained for the breast, sarcoma, H&N, GYN, and GI cases, respectively. The median percentage of observations that crossed the expert DSC IOV cutoff when stratified by structure type was 55% and 31% for OARs and tumor volumes, respectively. Bayesian regression analysis revealed tumor category had a substantial negative impact on binarized DSC for the breast (coefficient mean ± standard deviation: -0.97 ± 0.20), sarcoma (-1.04 ± 0.54), H&N (-1.00 ± 0.24), and GI (-2.95 ± 0.98) cases. There were no clear recurring relationships between segmentation quality and demographic variables across the cases, with most variables demonstrating large standard deviations and wide highest density intervals. CONCLUSION: Our study highlights substantial uncertainty surrounding conventionally presumed factors influencing segmentation quality. Future studies should investigate additional demographic variables, more patients and imaging modalities, and alternative metrics of segmentation acceptability.

18.

Weekly Intra-Treatment Diffusion Weighted Imaging Dataset for Head and Neck Cancer Patients Undergoing MR-linac Treatment.

El-Habashy, Dina M; Wahid, Kareem A; Renjie, He; McDonald, Brigid; Mulder, Samuel J; Ding, Yao; Salzillo, Travis; Stephen, Lai; Christodouleas, John; Dresner, Alex; Wang, Jihong; Naser, Mohamed A; Fuller, Clifton D; Mohamed, Abdallah Sherif Radwan.

medRxiv ; 2023 Aug 20.

Article in English | MEDLINE | ID: mdl-37645931

ABSTRACT

Radiation therapy (RT) is a crucial treatment for head and neck squamous cell carcinoma (HNSCC), however it can have adverse effects on patients' long-term function and quality of life. Biomarkers that can predict tumor response to RT are being explored to personalize treatment and improve outcomes. While tissue and blood biomarkers have limitations, imaging biomarkers derived from magnetic resonance imaging (MRI) offer detailed information. The integration of MRI and a linear accelerator in the MR-Linac system allows for MR-guided radiation therapy (MRgRT), offering precise visualization and treatment delivery. This data descriptor offers a valuable repository for weekly intra-treatment diffusion-weighted imaging (DWI) data obtained from head and neck cancer patients. By analyzing the sequential DWI changes and their correlation with treatment response, as well as oncological and survival outcomes, the study provides valuable insights into the clinical implications of DWI in HNSCC. [Table: see text].

19.

Development and Validation of an Automated Image-Based Deep Learning Platform for Sarcopenia Assessment in Head and Neck Cancer.

Ye, Zezhong; Saraf, Anurag; Ravipati, Yashwanth; Hoebers, Frank; Catalano, Paul J; Zha, Yining; Zapaishchykova, Anna; Likitlersuang, Jirapat; Guthier, Christian; Tishler, Roy B; Schoenfeld, Jonathan D; Margalit, Danielle N; Haddad, Robert I; Mak, Raymond H; Naser, Mohamed; Wahid, Kareem A; Sahlsten, Jaakko; Jaskari, Joel; Kaski, Kimmo; Mäkitie, Antti A; Fuller, Clifton D; Aerts, Hugo J W L; Kann, Benjamin H.

JAMA Netw Open ; 6(8): e2328280, 2023 08 01.

Article in English | MEDLINE | ID: mdl-37561460

ABSTRACT

Importance: Sarcopenia is an established prognostic factor in patients with head and neck squamous cell carcinoma (HNSCC); the quantification of sarcopenia assessed by imaging is typically achieved through the skeletal muscle index (SMI), which can be derived from cervical skeletal muscle segmentation and cross-sectional area. However, manual muscle segmentation is labor intensive, prone to interobserver variability, and impractical for large-scale clinical use. Objective: To develop and externally validate a fully automated image-based deep learning platform for cervical vertebral muscle segmentation and SMI calculation and evaluate associations with survival and treatment toxicity outcomes. Design, Setting, and Participants: For this prognostic study, a model development data set was curated from publicly available and deidentified data from patients with HNSCC treated at MD Anderson Cancer Center between January 1, 2003, and December 31, 2013. A total of 899 patients undergoing primary radiation for HNSCC with abdominal computed tomography scans and complete clinical information were selected. An external validation data set was retrospectively collected from patients undergoing primary radiation therapy between January 1, 1996, and December 31, 2013, at Brigham and Women's Hospital. The data analysis was performed between May 1, 2022, and March 31, 2023. Exposure: C3 vertebral skeletal muscle segmentation during radiation therapy for HNSCC. Main Outcomes and Measures: Overall survival and treatment toxicity outcomes of HNSCC. Results: The total patient cohort comprised 899 patients with HNSCC (median [range] age, 58 [24-90] years; 140 female [15.6%] and 755 male [84.0%]). Dice similarity coefficients for the validation set (n = 96) and internal test set (n = 48) were 0.90 (95% CI, 0.90-0.91) and 0.90 (95% CI, 0.89-0.91), respectively, with a mean 96.2% acceptable rate between 2 reviewers on external clinical testing (n = 377). Estimated cross-sectional area and SMI values were associated with manually annotated values (Pearson r = 0.99; P < .001) across data sets. On multivariable Cox proportional hazards regression, SMI-derived sarcopenia was associated with worse overall survival (hazard ratio, 2.05; 95% CI, 1.04-4.04; P = .04) and longer feeding tube duration (median [range], 162 [6-1477] vs 134 [15-1255] days; hazard ratio, 0.66; 95% CI, 0.48-0.89; P = .006) than no sarcopenia. Conclusions and Relevance: This prognostic study's findings show external validation of a fully automated deep learning pipeline to accurately measure sarcopenia in HNSCC and an association with important disease outcomes. The pipeline could enable the integration of sarcopenia assessment into clinical decision making for individuals with HNSCC.

Subject(s)

Deep Learning , Head and Neck Neoplasms , Sarcopenia , Humans , Male , Female , Middle Aged , Squamous Cell Carcinoma of Head and Neck/diagnostic imaging , Retrospective Studies , Sarcopenia/diagnostic imaging , Sarcopenia/complications , Head and Neck Neoplasms/complications , Head and Neck Neoplasms/diagnostic imaging

20.

Longitudinal diffusion and volumetric kinetics of head and neck cancer magnetic resonance on a 1.5T MR-Linear accelerator hybrid system: A prospective R-IDEAL Stage 2a imaging biomarker characterization/ pre-qualification study.

El-Habashy, Dina M; Wahid, Kareem A; He, Renjie; McDonald, Brigid; Rigert, Jillian; Mulder, Samuel J; Lim, Tze Yee; Wang, Xin; Yang, Jinzhong; Ding, Yao; Naser, Mohamed A; Ng, Sweet Ping; Bahig, Houda; Salzillo, Travis C; Preston, Kathryn E; Abobakr, Moamen; Shehata, Mohamed A; Elkhouly, Enas A; Alagizy, Hagar A; Hegazy, Amira H; Mohammadseid, Mustefa; Terhaard, Chris; Philippens, Marielle; Rosenthal, David I; Wang, Jihong; Lai, Stephen Y; Dresner, Alex; Christodouleas, John C; Mohamed, Abdallah Sherif Radwan; Fuller, Clifton D.

medRxiv ; 2023 May 05.

Article in English | MEDLINE | ID: mdl-37205359

ABSTRACT

Objectives: We aim to characterize the serial quantitative apparent diffusion coefficient (ADC) changes of the target disease volume using diffusion-weighted imaging (DWI) acquired weekly during radiation therapy (RT) on a 1.5T MR-Linac and correlate these changes with tumor response and oncologic outcomes for head and neck squamous cell carcinoma (HNSCC) patients as part of a programmatic R-IDEAL biomarker characterization effort. Methods: Thirty patients with pathologically confirmed HNSCC who received curative-intent RT at the University of Texas MD Anderson Cancer Center, were included in this prospective study. Baseline and weekly Magnetic resonance imaging (MRI) (weeks 1-6) were obtained, and various ADC parameters (mean, 5 th , 10 th , 20 th , 30 th , 40 th , 50 th , 60 th , 70 th , 80 th , 90 th and 95 th percentile) were extracted from the target regions of interest (ROIs). Baseline and weekly ADC parameters were correlated with response during RT, loco-regional control, and the development of recurrence using the Mann-Whitney U test. The Wilcoxon signed-rank test was used to compare the weekly ADC versus baseline values. Weekly volumetric changes (Δvolume) for each ROI were correlated with ΔADC using Spearman's Rho test. Recursive partitioning analysis (RPA) was performed to identify the optimal ΔADC threshold associated with different oncologic outcomes. Results: There was an overall significant rise in all ADC parameters during different time points of RT compared to baseline values for both gross primary disease volume (GTV-P) and gross nodal disease volumes (GTV-N). The increased ADC values for GTV-P were statistically significant only for primary tumors achieving complete remission (CR) during RT. RPA identified GTV-P ΔADC 5 th percentile >13% at the 3 rd week of RT as the most significant parameter associated with CR for primary tumor during RT (p <0.001). Baseline ADC parameters for GTV-P and GTV-N didn't significantly correlate with response to RT or other oncologic outcomes. There was a significant decrease in residual volume of both GTV-P & GTV-N throughout the course of RT. Additionally, a significant negative correlation between mean ΔADC and Δvolume for GTV-P at the 3 rd and 4 th week of RT was detected (r = -0.39, p = 0.044 & r = -0.45, p = 0.019, respectively). Conclusion: Assessment of ADC kinetics at regular intervals throughout RT seems to be correlated with RT response. Further studies with larger cohorts and multi-institutional data are needed for validation of ΔADC as a model for prediction of response to RT.

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL