Pesquisa | Biblioteca Virtual em Saúde

1.

EndoViT: pretraining vision transformers on a large collection of endoscopic images.

Batic, Dominik; Holm, Felix; Özsoy, Ege; Czempiel, Tobias; Navab, Nassir.

Int J Comput Assist Radiol Surg ; 2024 Apr 03.

Artigo em Inglês | MEDLINE | ID: mdl-38570373

RESUMO

PURPOSE: Automated endoscopy video analysis is essential for assisting surgeons during medical procedures, but it faces challenges due to complex surgical scenes and limited annotated data. Large-scale pretraining has shown great success in natural language processing and computer vision communities in recent years. These approaches reduce the need for annotated data, which is of great interest in the medical domain. In this work, we investigate endoscopy domain-specific self-supervised pretraining on large collections of data. METHODS: To this end, we first collect Endo700k, the largest publicly available corpus of endoscopic images, extracted from nine public Minimally Invasive Surgery (MIS) datasets. Endo700k comprises more than 700,000 images. Next, we introduce EndoViT, an endoscopy-pretrained Vision Transformer (ViT), and evaluate it on a diverse set of surgical downstream tasks. RESULTS: Our findings indicate that domain-specific pretraining with EndoViT yields notable advantages in complex downstream tasks. In the case of action triplet recognition, our approach outperforms ImageNet pretraining. In semantic segmentation, we surpass the state-of-the-art (SOTA) performance. These results demonstrate the effectiveness of our domain-specific pretraining approach in addressing the challenges of automated endoscopy video analysis. CONCLUSION: Our study contributes to the field of medical computer vision by showcasing the benefits of domain-specific large-scale self-supervised pretraining for vision transformers. We release both our code and pretrained models to facilitate further research in this direction: https://github.com/DominikBatic/EndoViT .

2.

Multimodal graph attention network for COVID-19 outcome prediction.

Keicher, Matthias; Burwinkel, Hendrik; Bani-Harouni, David; Paschali, Magdalini; Czempiel, Tobias; Burian, Egon; Makowski, Marcus R; Braren, Rickmer; Navab, Nassir; Wendler, Thomas.

Sci Rep ; 13(1): 19539, 2023 11 09.

Artigo em Inglês | MEDLINE | ID: mdl-37945590

RESUMO

When dealing with a newly emerging disease such as COVID-19, the impact of patient- and disease-specific factors (e.g., body weight or known co-morbidities) on the immediate course of the disease is largely unknown. An accurate prediction of the most likely individual disease progression can improve the planning of limited resources and finding the optimal treatment for patients. In the case of COVID-19, the need for intensive care unit (ICU) admission of pneumonia patients can often only be determined on short notice by acute indicators such as vital signs (e.g., breathing rate, blood oxygen levels), whereas statistical analysis and decision support systems that integrate all of the available data could enable an earlier prognosis. To this end, we propose a holistic, multimodal graph-based approach combining imaging and non-imaging information. Specifically, we introduce a multimodal similarity metric to build a population graph that shows a clustering of patients. For each patient in the graph, we extract radiomic features from a segmentation network that also serves as a latent image feature encoder. Together with clinical patient data like vital signs, demographics, and lab results, these modalities are combined into a multimodal representation of each patient. This feature extraction is trained end-to-end with an image-based Graph Attention Network to process the population graph and predict the COVID-19 patient outcomes: admission to ICU, need for ventilation, and mortality. To combine multiple modalities, radiomic features are extracted from chest CTs using a segmentation neural network. Results on a dataset collected in Klinikum rechts der Isar in Munich, Germany and the publicly available iCTCF dataset show that our approach outperforms single modality and non-graph baselines. Moreover, our clustering and graph attention increases understanding of the patient relationships within the population graph and provides insight into the network's decision-making process.

Assuntos

COVID-19 , Humanos , Prognóstico , Pulmão , Progressão da Doença , Hospitalização

3.

Holistic OR domain modeling: a semantic scene graph approach.

Özsoy, Ege; Czempiel, Tobias; Örnek, Evin Pinar; Eck, Ulrich; Tombari, Federico; Navab, Nassir.

Int J Comput Assist Radiol Surg ; 2023 Oct 12.

Artigo em Inglês | MEDLINE | ID: mdl-37823976

RESUMO

PURPOSE: Surgical procedures take place in highly complex operating rooms (OR), involving medical staff, patients, devices and their interactions. Until now, only medical professionals are capable of comprehending these intricate links and interactions. This work advances the field toward automated, comprehensive and semantic understanding and modeling of the OR domain by introducing semantic scene graphs (SSG) as a novel approach to describing and summarizing surgical environments in a structured and semantically rich manner. METHODS: We create the first open-source 4D SSG dataset. 4D-OR includes simulated total knee replacement surgeries captured by RGB-D sensors in a realistic OR simulation center. It includes annotations for SSGs, human and object pose, clinical roles and surgical phase labels. We introduce a neural network-based SSG generation pipeline for semantic reasoning in the OR and apply our approach to two downstream tasks: clinical role prediction and surgical phase recognition. RESULTS: We show that our pipeline can successfully reason within the OR domain. The capabilities of our scene graphs are further highlighted by their successful application to clinical role prediction and surgical phase recognition tasks. CONCLUSION: This work paves the way for multimodal holistic operating room modeling, with the potential to significantly enhance the state of the art in surgical data analysis, such as enabling more efficient and precise decision-making during surgical procedures, and ultimately improving patient safety and surgical outcomes. We release our code and dataset at github.com/egeozsoy/4D-OR.

4.

CholecTriplet2022: Show me a tool and tell me the triplet - An endoscopic vision challenge for surgical action triplet detection.

Nwoye, Chinedu Innocent; Yu, Tong; Sharma, Saurav; Murali, Aditya; Alapatt, Deepak; Vardazaryan, Armine; Yuan, Kun; Hajek, Jonas; Reiter, Wolfgang; Yamlahi, Amine; Smidt, Finn-Henri; Zou, Xiaoyang; Zheng, Guoyan; Oliveira, Bruno; Torres, Helena R; Kondo, Satoshi; Kasai, Satoshi; Holm, Felix; Özsoy, Ege; Gui, Shuangchun; Li, Han; Raviteja, Sista; Sathish, Rachana; Poudel, Pranav; Bhattarai, Binod; Wang, Ziheng; Rui, Guo; Schellenberg, Melanie; Vilaça, João L; Czempiel, Tobias; Wang, Zhenkun; Sheet, Debdoot; Thapa, Shrawan Kumar; Berniker, Max; Godau, Patrick; Morais, Pedro; Regmi, Sudarshan; Tran, Thuy Nuong; Fonseca, Jaime; Nölke, Jan-Hinrich; Lima, Estevão; Vazquez, Eduard; Maier-Hein, Lena; Navab, Nassir; Mascagni, Pietro; Seeliger, Barbara; Gonzalez, Cristians; Mutter, Didier; Padoy, Nicolas.

Med Image Anal ; 89: 102888, 2023 10.

Artigo em Inglês | MEDLINE | ID: mdl-37451133

RESUMO

Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier efforts and the CholecTriplet challenge introduced in 2021 have put together techniques aimed at recognizing these triplets from surgical footage. Estimating also the spatial locations of the triplets would offer a more precise intraoperative context-aware decision support for computer-assisted intervention. This paper presents the CholecTriplet2022 challenge, which extends surgical action triplet modeling from recognition to detection. It includes weakly-supervised bounding box localization of every visible surgical instrument (or tool), as the key actors, and the modeling of each tool-activity in the form of triplet. The paper describes a baseline method and 10 new deep learning algorithms presented at the challenge to solve the task. It also provides thorough methodological comparisons of the methods, an in-depth analysis of the obtained results across multiple metrics, visual and procedural challenges; their significance, and useful insights for future research directions and applications in surgery.

Assuntos

Inteligência Artificial , Cirurgia Assistida por Computador , Humanos , Endoscopia , Algoritmos , Cirurgia Assistida por Computador/métodos , Instrumentos Cirúrgicos

5.

DisguisOR: holistic face anonymization for the operating room.

Bastian, Lennart; Wang, Tony Danjun; Czempiel, Tobias; Busam, Benjamin; Navab, Nassir.

Int J Comput Assist Radiol Surg ; 18(7): 1209-1215, 2023 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-37219807

RESUMO

PURPOSE: Recent advances in Surgical Data Science (SDS) have contributed to an increase in video recordings from hospital environments. While methods such as surgical workflow recognition show potential in increasing the quality of patient care, the quantity of video data has surpassed the scale at which images can be manually anonymized. Existing automated 2D anonymization methods under-perform in Operating Rooms (OR), due to occlusions and obstructions. We propose to anonymize multi-view OR recordings using 3D data from multiple camera streams. METHODS: RGB and depth images from multiple cameras are fused into a 3D point cloud representation of the scene. We then detect each individual's face in 3D by regressing a parametric human mesh model onto detected 3D human keypoints and aligning the face mesh with the fused 3D point cloud. The mesh model is rendered into every acquired camera view, replacing each individual's face. RESULTS: Our method shows promise in locating faces at a higher rate than existing approaches. DisguisOR produces geometrically consistent anonymizations for each camera view, enabling more realistic anonymization that is less detrimental to downstream tasks. CONCLUSION: Frequent obstructions and crowding in operating rooms leaves significant room for improvement for off-the-shelf anonymization methods. DisguisOR addresses privacy on a scene level and has the potential to facilitate further research in SDS.

Assuntos

Salas Cirúrgicas , Humanos , Gravação em Vídeo

6.

CholecTriplet2021: A benchmark challenge for surgical action triplet recognition.

Nwoye, Chinedu Innocent; Alapatt, Deepak; Yu, Tong; Vardazaryan, Armine; Xia, Fangfang; Zhao, Zixuan; Xia, Tong; Jia, Fucang; Yang, Yuxuan; Wang, Hao; Yu, Derong; Zheng, Guoyan; Duan, Xiaotian; Getty, Neil; Sanchez-Matilla, Ricardo; Robu, Maria; Zhang, Li; Chen, Huabin; Wang, Jiacheng; Wang, Liansheng; Zhang, Bokai; Gerats, Beerend; Raviteja, Sista; Sathish, Rachana; Tao, Rong; Kondo, Satoshi; Pang, Winnie; Ren, Hongliang; Abbing, Julian Ronald; Sarhan, Mohammad Hasan; Bodenstedt, Sebastian; Bhasker, Nithya; Oliveira, Bruno; Torres, Helena R; Ling, Li; Gaida, Finn; Czempiel, Tobias; Vilaça, João L; Morais, Pedro; Fonseca, Jaime; Egging, Ruby Mae; Wijma, Inge Nicole; Qian, Chen; Bian, Guibin; Li, Zhen; Balasubramanian, Velmurugan; Sheet, Debdoot; Luengo, Imanol; Zhu, Yuanbo; Ding, Shuai.

Med Image Anal ; 86: 102803, 2023 05.

Artigo em Inglês | MEDLINE | ID: mdl-37004378

RESUMO

Context-aware decision support in the operating room can foster surgical safety and efficiency by leveraging real-time feedback from surgical workflow analysis. Most existing works recognize surgical activities at a coarse-grained level, such as phases, steps or events, leaving out fine-grained interaction details about the surgical activity; yet those are needed for more helpful AI assistance in the operating room. Recognizing surgical actions as triplets of combination delivers more comprehensive details about the activities taking place in surgical videos. This paper presents CholecTriplet2021: an endoscopic vision challenge organized at MICCAI 2021 for the recognition of surgical action triplets in laparoscopic videos. The challenge granted private access to the large-scale CholecT50 dataset, which is annotated with action triplet information. In this paper, we present the challenge setup and the assessment of the state-of-the-art deep learning methods proposed by the participants during the challenge. A total of 4 baseline methods from the challenge organizers and 19 new deep learning algorithms from the competing teams are presented to recognize surgical action triplets directly from surgical videos, achieving mean average precision (mAP) ranging from 4.2% to 38.1%. This study also analyzes the significance of the results obtained by the presented approaches, performs a thorough methodological comparison between them, in-depth result analysis, and proposes a novel ensemble method for enhanced recognition. Our analysis shows that surgical workflow analysis is not yet solved, and also highlights interesting directions for future research on fine-grained surgical activity recognition which is of utmost importance for the development of AI in surgery.

Assuntos

Benchmarking , Laparoscopia , Humanos , Algoritmos , Salas Cirúrgicas , Fluxo de Trabalho , Aprendizado Profundo

7.

PLAFOKON: a new concept for a patient-individual and intervention-specific flexible surgical platform.

Bernhard, Lukas; Krumpholz, Roman; Krieger, Yannick; Czempiel, Tobias; Meining, Alexander; Navab, Nassir; Lüth, Tim; Wilhelm, Dirk.

Surg Endosc ; 36(7): 5303-5312, 2022 07.

Artigo em Inglês | MEDLINE | ID: mdl-34919177

RESUMO

BACKGROUND: Research in the field of surgery is mainly driven by aiming for trauma reduction as well as for personalized treatment concepts. Beyond laparoscopy, other proposed approaches for further reduction of the therapeutic trauma have failed to achieve clinical translation, with few notable exceptions. We believe that this is mainly due to a lack of flexibility and high associated costs. We aimed at addressing these issues by developing a novel minimally invasive operating platform and a preoperative design workflow for patient-individual adaptation and cost-effective rapid manufacturing of surgical manipulators. In this article, we report on the first in-vitro cholecystectomy performed with our operating platform. METHODS: The single-port overtube (SPOT) is a snake-like surgical manipulator for minimally invasive interventions. The system layout is highly flexible and can be adapted in design and dimensions for different kinds of surgery, based on patient- and disease-specific parameters. For collecting and analyzing this data, we developed a graphical user interface, which assists clinicians during the preoperative planning phase. Other major components of our operating platform include an instrument management system and a non-sterile user interface. For the trial surgery, we used a validated phantom which was further equipped with a porcine liver including the gallbladder. RESULTS: Following our envisioned preoperative design workflow, a suitable geometry of the surgical manipulator was determined for our trial surgery and rapidly manufactured by means of 3D printing. With this setup, we successfully performed a first in-vitro cholecystectomy, which was completed in 78 min. CONCLUSIONS: By conducting the trial surgery, we demonstrated the effectiveness of our PLAFOKON operating platform. While some aspects - especially regarding usability and ergonomics - can be further optimized, the overall performance of the system is highly promising, with sufficient flexibility and strength for conducting the necessary tissue manipulations.

Assuntos

Laparoscopia , Animais , Colecistectomia , Desenho de Equipamento , Ergonomia , Humanos , Impressão Tridimensional , Instrumentos Cirúrgicos , Suínos

8.

AI for Doctors-A Course to Educate Medical Professionals in Artificial Intelligence for Medical Imaging.

Hedderich, Dennis M; Keicher, Matthias; Wiestler, Benedikt; Gruber, Martin J; Burwinkel, Hendrik; Hinterwimmer, Florian; Czempiel, Tobias; Spiro, Judith E; Pinto Dos Santos, Daniel; Heim, Dominik; Zimmer, Claus; Rückert, Daniel; Kirschke, Jan S; Navab, Nassir.

Healthcare (Basel) ; 9(10)2021 Sep 28.

Artigo em Inglês | MEDLINE | ID: mdl-34682958

RESUMO

Successful adoption of artificial intelligence (AI) in medical imaging requires medical professionals to understand underlying principles and techniques. However, educational offerings tailored to the need of medical professionals are scarce. To fill this gap, we created the course "AI for Doctors: Medical Imaging". An analysis of participants' opinions on AI and self-perceived skills rated on a five-point Likert scale was conducted before and after the course. The participants' attitude towards AI in medical imaging was very optimistic before and after the course. However, deeper knowledge of AI and the process for validating and deploying it resulted in significantly less overoptimism with respect to perceivable patient benefits through AI (p = 0.020). Self-assessed skill ratings significantly improved after the course, and the appreciation of the course content was very positive. However, we observed a substantial drop-out rate, mostly attributed to the lack of time of medical professionals. There is a high demand for educational offerings regarding AI in medical imaging among medical professionals, and better education may lead to a more realistic appreciation of clinical adoption. However, time constraints imposed by a busy clinical schedule need to be taken into account for successful education of medical professionals.

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA