Prediction of remaining surgery duration in laparoscopic videos based on visual saliency and the transformer network.
Int J Med Robot
; 20(2): e2632, 2024 Apr.
Article
em En
| MEDLINE
| ID: mdl-38630888
ABSTRACT
BACKGROUND:
Real-time prediction of the remaining surgery duration (RSD) is important for optimal scheduling of resources in the operating room.METHODS:
We focus on the intraoperative prediction of RSD from laparoscopic video. An extensive evaluation of seven common deep learning models, a proposed one based on the Transformer architecture (TransLocal) and four baseline approaches, is presented. The proposed pipeline includes a CNN-LSTM for feature extraction from salient regions within short video segments and a Transformer with local attention mechanisms.RESULTS:
Using the Cholec80 dataset, TransLocal yielded the best performance (mean absolute error (MAE) = 7.1 min). For long and short surgeries, the MAE was 10.6 and 4.4 min, respectively. Thirty minutes before the end of surgery MAE = 6.2 min, 7.2 and 5.5 min for all long and short surgeries, respectively.CONCLUSIONS:
The proposed technique achieves state-of-the-art results. In the future, we aim to incorporate intraoperative indicators and pre-operative data.Palavras-chave
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Laparoscopia
Limite:
Humans
Idioma:
En
Revista:
Int J Med Robot
Ano de publicação:
2024
Tipo de documento:
Article
País de afiliação:
Grécia
País de publicação:
Reino Unido