Automated detection of steps in videos of strabismus surgery using deep learning.
BMC Ophthalmol
; 24(1): 242, 2024 Jun 10.
Article
in En
| MEDLINE
| ID: mdl-38853240
ABSTRACT
BACKGROUND:
Learning to perform strabismus surgery is an essential aspect of ophthalmologists' surgical training. Automated classification strategy for surgical steps can improve the effectiveness of training curricula and the efficient evaluation of residents' performance. To this end, we aimed to develop and validate a deep learning (DL) model for automated detecting strabismus surgery steps in the videos.METHODS:
In this study, we gathered 479 strabismus surgery videos from Shanghai Children's Hospital, affiliated to Shanghai Jiao Tong University School of Medicine, spanning July 2017 to October 2021. The videos were manually cut into 3345 clips of the eight strabismus surgical steps based on the International Council of Ophthalmology's Ophthalmology Surgical Competency Assessment Rubrics (ICO-OSCAR strabismus). The videos dataset was randomly split by eye-level into a training (60%), validation (20%) and testing dataset (20%). We evaluated two hybrid DL algorithms a Recurrent Neural Network (RNN) based and a Transformer-based model. The evaluation metrics included accuracy, area under the receiver operating characteristic curve, precision, recall and F1-score.RESULTS:
DL models identified the steps in video clips of strabismus surgery achieved macro-average AUC of 1.00 (95% CI 1.00-1.00) with Transformer-based model and 0.98 (95% CI 0.97-1.00) with RNN-based model, respectively. The Transformer-based model yielded a higher accuracy compared with RNN-based models (0.96 vs. 0.83, p < 0.001). In detecting different steps of strabismus surgery, the predictive ability of the Transformer-based model was better than that of the RNN. Precision ranged between 0.90 and 1 for the Transformer-based model and 0.75 to 0.94 for the RNN-based model. The f1-score ranged between 0.93 and 1 for the Transformer-based model and 0.78 to 0.92 for the RNN-based model.CONCLUSION:
The DL models can automate identify video steps of strabismus surgery with high accuracy and Transformer-based algorithms show excellent performance when modeling spatiotemporal features of video frames.Key words
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Ophthalmologic Surgical Procedures
/
Video Recording
/
Strabismus
/
Deep Learning
/
Oculomotor Muscles
Limits:
Humans
Language:
En
Journal:
BMC Ophthalmol
Journal subject:
OFTALMOLOGIA
Year:
2024
Document type:
Article
Affiliation country:
China
Country of publication:
United kingdom