RESUMO
BACKGROUND: Video-based assessment by experts may structurally measure surgical performance using procedure-specific competency assessment tools (CATs). A CAT for minimally invasive esophagectomy (MIE-CAT) was developed and validated previously. However, surgeon's time is scarce and video assessment is time-consuming and labor intensive. This study investigated non-procedure-specific assessment of MIE video clips by MIE experts and crowdsourcing, collective surgical performance evaluation by anonymous and untrained laypeople, to assist procedure-specific expert review. METHODS: Two surgical performance scoring frameworks were used to assess eight MIE videos. First, global performance was assessed with the non-procedure-specific Global Operative Assessment of Laparoscopic Skills (GOALS) of 64 procedural phase-based video clips < 10 min. Each clip was assessed by two MIE experts and > 30 crowd workers. Second, the same experts assessed procedure-specific performance with the MIE-CAT of the corresponding full-length video. Reliability and convergent validity of GOALS for MIE were investigated using hypothesis testing with correlations (experience, blood loss, operative time, and MIE-CAT). RESULTS: Less than 75% of hypothesized correlations between GOALS scores and experience of the surgical team (r < 0.3), blood loss (r = - 0.82 to 0.02), operative time (r = - 0.42 to 0.07), and the MIE-CAT scores (r = - 0.04 to 0.76) were met for both crowd workers and experts. Interestingly, experts' GOALS and MIE-CAT scores correlated strongly (r = 0.40 to 0.79), while crowd workers' GOALS and experts' MIE-CAT scores correlations were weak (r = - 0.04 to 0.49). Expert and crowd worker GOALS scores correlated poorly (ICC ≤ 0.42). CONCLUSION: GOALS assessments by crowd workers lacked convergent validity and showed poor reliability. It is likely that MIE is technically too difficult to assess for laypeople. Convergent validity of GOALS assessments by experts could also not be established. GOALS might not be comprehensive enough to assess detailed MIE performance. However, expert's GOALS and MIE-CAT scores strongly correlated indicating video clip (instead of full-length video) assessments could be useful to shorten assessment time.
Assuntos
Crowdsourcing , Neoplasias Esofágicas , Laparoscopia , Humanos , Reprodutibilidade dos Testes , Esofagectomia , Competência ClínicaRESUMO
Importance: Suboptimal surgical performance is hypothesized to be associated with less favorable patient outcomes in minimally invasive esophagectomy (MIE). Establishing this association may lead to programs that promote better surgical performance of MIE and improve patient outcomes. Objective: To investigate associations between surgical performance and postoperative outcomes after MIE. Design, Setting, and Participants: In this nationwide cohort study of 15 Dutch hospitals that perform more than 20 MIEs per year, 7 masked expert MIE surgeons assessed surgical performance using videos and a previously developed and validated competency assessment tool (CAT). Each hospital submitted 2 representative videos of MIEs performed between November 4, 2021, and September 13, 2022. Patients registered in the Dutch Upper Gastrointestinal Cancer Audit between January 1, 2020, and December 31, 2021, were included to examine patient outcomes. Exposure: Hospitals were divided into quartiles based on their MIE-CAT performance score. Outcomes were compared between highest (top 25%) and lowest (bottom 25%) performing quartiles. Transthoracic MIE with gastric tube reconstruction. Main Outcome and Measure: The primary outcome was severe postoperative complications (Clavien-Dindo ≥3) within 30 days after surgery. Multilevel logistic regression, with clustering of patients within hospitals, was used to analyze associations between performance and outcomes. Results: In total, 30 videos and 970 patients (mean [SD] age, 66.6 [9.1] years; 719 men [74.1%]) were included. The mean (SD) MIE-CAT score was 113.6 (5.5) in the highest performance quartile vs 94.1 (5.9) in the lowest. Severe postoperative complications occurred in 18.7% (41 of 219) of patients in the highest performance quartile vs 39.2% (40 of 102) in the lowest (risk ratio [RR], 0.50; 95% CI, 0.24-0.99). The highest vs the lowest performance quartile showed lower rates of conversions (1.8% vs 8.9%; RR, 0.21; 95% CI, 0.21-0.21), intraoperative complications (2.7% vs 7.8%; RR, 0.21; 95% CI, 0.04-0.94), and overall postoperative complications (46.1% vs 65.7%; RR, 0.54; 95% CI, 0.24-0.96). The R0 resection rate (96.8% vs 94.2%; RR, 1.03; 95% CI, 0.97-1.05) and lymph node yield (mean [SD], 38.9 [14.7] vs 26.2 [9.0]; RR, 3.20; 95% CI, 0.27-3.21) increased with oncologic-specific performance (eg, hiatus dissection, lymph node dissection). In addition, a high anastomotic phase score was associated with a lower anastomotic leakage rate (4.6% vs 17.7%; RR, 0.14; 95% CI, 0.06-0.31). Conclusions and Relevance: These findings suggest that better surgical performance is associated with fewer perioperative complications for patients with esophageal cancer on a national level. If surgical performance of MIE can be improved with MIE-CAT implementation, substantially better patient outcomes may be achievable.
Assuntos
Neoplasias Esofágicas , Esofagectomia , Masculino , Humanos , Idoso , Estudos de Coortes , Resultado do Tratamento , Esofagectomia/efeitos adversos , Procedimentos Cirúrgicos Minimamente Invasivos , Complicações Pós-Operatórias/etiologia , Neoplasias Esofágicas/cirurgia , Neoplasias Esofágicas/complicaçõesRESUMO
Importance: Minimally invasive esophagectomy (MIE) is a complex procedure with substantial learning curves. In other complex minimally invasive procedures, suboptimal surgical performance has convincingly been associated with less favorable patient outcomes as assessed by peer review of the surgical procedure. Objective: To develop and validate a procedure-specific competency assessment tool (CAT) for MIE. Design, Setting, and Participants: In this international quality improvement study, a procedure-specific MIE-CAT was developed and validated. The MIE-CAT contains 8 procedural phases, and 4 quality components per phase are scored with a Likert scale ranging from 1 to 4. For evaluation of the MIE-CAT, intraoperative MIE videos performed by a single surgical team in the Esophageal Center East Netherlands were peer reviewed by 18 independent international MIE experts (with more than 120 MIEs performed). Each video was assessed by 2 or 3 blinded experts to evaluate feasibility, content validity, reliability, and construct validity. MIE-CAT version 2 was composed with refined content aimed at improving interrater reliability. A total of 32 full-length MIE videos from patients who underwent MIE between 2011 and 2020 were analyzed. Data were analyzed from January 2021 to January 2023. Exposure: Performance assessment of transthoracic MIE with an intrathoracic anastomosis. Main Outcomes and Measures: Feasibility, content validity, interrater and intrarater reliability, and construct validity, including correlations with both experience of the surgical team and clinical parameters, of the developed MIE-CAT. Results: Experts found the MIE-CAT easy to understand and easy to use to grade surgical performance. The MIE-CAT demonstrated good intrarater reliability (range of intraclass correlation coefficients [ICCs], 0.807 [95% CI, 0.656 to 0.892] for quality component score to 0.898 [95% CI, 0.846 to 0.932] for phase score). Interrater reliability was moderate (range of ICCs, 0.536 [95% CI, -0.220 to 0.994] for total MIE-CAT score to 0.705 [95% CI, 0.473 to 0.846] for quality component score), and most discrepancies originated in the lymphadenectomy phases. Hypothesis testing for construct validity showed more than 75% of hypotheses correct: MIE-CAT performance scores correlated with experience of the surgical team (r = 0.288 to 0.622), blood loss (r = -0.034 to -0.545), operative time (r = -0.309 to -0.611), intraoperative complications (r = -0.052 to -0.319), and severe postoperative complications (r = -0.207 to -0.395). MIE-CAT version 2 increased usability. Interrater reliability improved but remained moderate (range of ICCs, 0.666 to 0.743), and most discrepancies between raters remained in the lymphadenectomy phases. Conclusions and Relevance: The MIE-CAT was developed and its feasibility, content validity, reliability, and construct validity were demonstrated. By providing insight into surgical performance of MIE, the MIE-CAT might be used for clinical, training, and research purposes.
Assuntos
Neoplasias Esofágicas , Esofagectomia , Humanos , Esofagectomia/efeitos adversos , Neoplasias Esofágicas/cirurgia , Reprodutibilidade dos Testes , Excisão de Linfonodo/efeitos adversos , Complicações Pós-Operatórias/etiologiaRESUMO
BACKGROUND: Three-dimensional (3D)-printed saw guides are frequently used to optimize osteotomy results and are usually designed based on computed tomography (CT), despite the radiation burden, as radiation-less alternatives like magnetic resonance imaging (MRI) have inferior bone visualization capabilities. This study investigated the usability of MR-based synthetic-CT (sCT), a novel radiation-less bone visualization technique for 3D planning and design of patient-specific saw guides. METHODS: Eight human cadaveric lower arms (mean age: 78y) received MRI and CT scans as well as high-resolution micro-CT. From the MRI scans, sCT were generated using a conditional generative adversarial network. Digital 3D bone surface models based on the sCT and general CT were compared to the surface model from the micro-CT that was used as ground truth for image resolution. From both the sCT and CT digital bone models saw guides were designed and 3D-printed in nylon for one proximal and one distal bone position for each radius and ulna. Six blinded observers placed these saw guides as accurately as possible on dissected bones. The position of each guide was assessed by optical 3D-scanning of each bone with positioned saw guide and compared to the preplanning. Eight placement errors were evaluated: three translational errors (along each axis), three rotational errors (around each axis), a total translation (∆T) and a total rotation error (∆R). RESULTS: Surface models derived from micro-CT were on average smaller than sCT and CT-based models with average differences of 0.27 ± 0.30 mm for sCT and 0.24 ± 0.12 mm for CT. No statistically significant positioning differences on the bones were found between sCT- and CT-based saw guides for any axis specific translational or rotational errors nor between the ∆T (p = .284) and ∆R (p = .216). On Bland-Altman plots, the ∆T and ∆R limits of agreement (LoA) were within the inter-observer variability LoA. CONCLUSIONS: This research showed a similar error for sCT and CT digital surface models when comparing to ground truth micro-CT models. Additionally, the saw guide study showed equivalent CT- and sCT-based saw guide placement errors. Therefore, MRI-based synthetic CT is a promising radiation-less alternative to CT for the creation of patient-specific osteotomy surgical saw guides.