Pesquisa | Prevenção e Controle de Câncer

1.

A Framework for Multimodal Medical Image Interaction.

Schutz, Laura; Matinfar, Sasan; Schafroth, Gideon; Navab, Navid; Fairhurst, Merle; Wagner, Arthur; Wiestler, Benedikt; Eck, Ulrich; Navab, Nassir.

IEEE Trans Vis Comput Graph ; PP2024 Sep 10.

Artigo em Inglês | MEDLINE | ID: mdl-39255140

RESUMO

Medical doctors rely on images of the human anatomy, such as magnetic resonance imaging (MRI), to localize regions of interest in the patient during diagnosis and treatment. Despite advances in medical imaging technology, the information conveyance remains unimodal. This visual representation fails to capture the complexity of the real, multisensory interaction with human tissue. However, perceiving multimodal information about the patient's anatomy and disease in real-time is critical for the success of medical procedures and patient outcome. We introduce a Multimodal Medical Image Interaction (MMII) framework to allow medical experts a dynamic, audiovisual interaction with human tissue in three-dimensional space. In a virtual reality environment, the user receives physically informed audiovisual feedback to improve the spatial perception of anatomical structures. MMII uses a model-based sonification approach to generate sounds derived from the geometry and physical properties of tissue, thereby eliminating the need for hand-crafted sound design. Two user studies involving 34 general and nine clinical experts were conducted to evaluate the proposed interaction framework's learnability, usability, and accuracy. Our results showed excellent learnability of audiovisual correspondence as the rate of correct associations significantly improved (p < 0.001) over the course of the study. MMII resulted in superior brain tumor localization accuracy (p < 0.05) compared to conventional medical image interaction. Our findings substantiate the potential of this novel framework to enhance interaction with medical images, for example, during surgical procedures where immediate and precise feedback is needed.

2.

The sound of surgery-development of an acoustic trocar system enabling laparoscopic sound analysis.

Ostler-Mildner, Daniel; Wegener, Luca; Fuchtmann, Jonas; Feussner, Hubertus; Wilhelm, Dirk; Navab, Nassir.

Int J Comput Assist Radiol Surg ; 2024 Jun 03.

Artigo em Inglês | MEDLINE | ID: mdl-38831175

RESUMO

PURPOSE: Acoustic information can contain viable information in medicine and specifically in surgery. While laparoscopy depends mainly on visual information, our goal is to develop the means to capture and process acoustic information during laparoscopic surgery. METHODS: To achieve this, we iteratively developed three prototypes that will overcome the abdominal wall as a sound barrier and can be used with standard trocars. We evaluated them in terms of clinical applicability and sound transmission quality. Furthermore, the applicability of each prototype for sound classification based on machine learning was evaluated. RESULTS: Our developed prototypes for recording airborne sound from the intraperitoneal cavity represent a promising solution suitable for real-world clinical usage All three prototypes fulfill our set requirements in terms of clinical applicability (i.e., air-tightness, invasiveness, sterility) and show promising results regarding their acoustic characteristics and the associated results on ML-based sound classification. CONCLUSION: In summary, our prototypes for capturing acoustic information during laparoscopic surgeries integrate seamlessly with existing procedures and have the potential to augment the surgeon's perception. This advancement could change how surgeons interact with and understand the surgical field.

3.

DosePatch: physics-inspired cropping layout for patch-based Monte Carlo simulations to provide fast and accurate internal dosimetry.

De Benetti, Francesca; Brosch-Lenz, Julia; Guerra González, Jorge Mario; Uribe, Carlos; Eiber, Matthias; Navab, Nassir; Wendler, Thomas.

EJNMMI Phys ; 11(1): 51, 2024 Jun 26.

Artigo em Inglês | MEDLINE | ID: mdl-38922372

RESUMO

BACKGROUND: Dosimetry-based personalized therapy was shown to have clinical benefits e.g. in liver selective internal radiation therapy (SIRT). Yet, there is no consensus about its introduction into clinical practice, mainly as Monte Carlo simulations (gold standard for dosimetry) involve massive computation time. We addressed the problem of computation time and tested a patch-based approach for Monte Carlo simulations for internal dosimetry to improve parallelization. We introduce a physics-inspired cropping layout for patch-based MC dosimetry, and compare it to cropping layouts of the literature as well as dosimetry using organ-S-values, and dose kernels, taking whole-body Monte Carlo simulations as ground truth. This was evaluated in five patients receiving Yttrium-90 liver SIRT. RESULTS: The patch-based Monte Carlo approach yielded the closest results to the ground truth, making it a valid alternative to the conventional approach. Our physics-inspired cropping layout and mosaicking scheme yielded a voxel-wise error of < 2% compared to whole-body Monte Carlo in soft tissue, while requiring only ≈ 10% of the time. CONCLUSIONS: This work demonstrates the feasibility and accuracy of physics-inspired cropping layouts for patch-based Monte Carlo simulations.

4.

Shape completion in the dark: completing vertebrae morphology from 3D ultrasound.

Gafencu, Miruna-Alexandra; Velikova, Yordanka; Saleh, Mahdi; Ungi, Tamas; Navab, Nassir; Wendler, Thomas; Azampour, Mohammad Farid.

Int J Comput Assist Radiol Surg ; 19(7): 1339-1347, 2024 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-38748052

RESUMO

PURPOSE: Ultrasound (US) imaging, while advantageous for its radiation-free nature, is challenging to interpret due to only partially visible organs and a lack of complete 3D information. While performing US-based diagnosis or investigation, medical professionals therefore create a mental map of the 3D anatomy. In this work, we aim to replicate this process and enhance the visual representation of anatomical structures. METHODS: We introduce a point cloud-based probabilistic deep learning (DL) method to complete occluded anatomical structures through 3D shape completion and choose US-based spine examinations as our application. To enable training, we generate synthetic 3D representations of partially occluded spinal views by mimicking US physics and accounting for inherent artifacts. RESULTS: The proposed model performs consistently on synthetic and patient data, with mean and median differences of 2.02 and 0.03 in Chamfer Distance (CD), respectively. Our ablation study demonstrates the importance of US physics-based data generation, reflected in the large mean and median difference of 11.8 CD and 9.55 CD, respectively. Additionally, we demonstrate that anatomical landmarks, such as the spinous process (with reconstruction CD of 4.73) and the facet joints (mean distance to ground truth (GT) of 4.96 mm), are preserved in the 3D completion. CONCLUSION: Our work establishes the feasibility of 3D shape completion for lumbar vertebrae, ensuring the preservation of level-wise characteristics and successful generalization from synthetic to real data. The incorporation of US physics contributes to more accurate patient data completions. Notably, our method preserves essential anatomical landmarks and reconstructs crucial injections sites at their correct locations.

Assuntos

Aprendizado Profundo , Imageamento Tridimensional , Ultrassonografia , Humanos , Imageamento Tridimensional/métodos , Ultrassonografia/métodos , Coluna Vertebral/diagnóstico por imagem , Coluna Vertebral/anatomia & histologia , Pontos de Referência Anatômicos

5.

Advancing surgical VQA with scene graph knowledge.

Yuan, Kun; Kattel, Manasi; Lavanchy, Joël L; Navab, Nassir; Srivastav, Vinkle; Padoy, Nicolas.

Int J Comput Assist Radiol Surg ; 19(7): 1409-1417, 2024 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-38780829

RESUMO

PURPOSE: The modern operating room is becoming increasingly complex, requiring innovative intra-operative support systems. While the focus of surgical data science has largely been on video analysis, integrating surgical computer vision with natural language capabilities is emerging as a necessity. Our work aims to advance visual question answering (VQA) in the surgical context with scene graph knowledge, addressing two main challenges in the current surgical VQA systems: removing question-condition bias in the surgical VQA dataset and incorporating scene-aware reasoning in the surgical VQA model design. METHODS: First, we propose a surgical scene graph-based dataset, SSG-VQA, generated by employing segmentation and detection models on publicly available datasets. We build surgical scene graphs using spatial and action information of instruments and anatomies. These graphs are fed into a question engine, generating diverse QA pairs. We then propose SSG-VQA-Net, a novel surgical VQA model incorporating a lightweight Scene-embedded Interaction Module, which integrates geometric scene knowledge in the VQA model design by employing cross-attention between the textual and the scene features. RESULTS: Our comprehensive analysis shows that our SSG-VQA dataset provides a more complex, diverse, geometrically grounded, unbiased and surgical action-oriented dataset compared to existing surgical VQA datasets and SSG-VQA-Net outperforms existing methods across different question types and complexities. We highlight that the primary limitation in the current surgical VQA systems is the lack of scene knowledge to answer complex queries. CONCLUSION: We present a novel surgical VQA dataset and model and show that results can be significantly improved by incorporating geometric scene features in the VQA model design. We point out that the bottleneck of the current surgical visual question-answer model lies in learning the encoded representation rather than decoding the sequence. Our SSG-VQA dataset provides a diagnostic benchmark to test the scene understanding and reasoning capabilities of the model. The source code and the dataset will be made publicly available at: https://github.com/CAMMA-public/SSG-VQA .

Assuntos

Salas Cirúrgicas , Humanos , Cirurgia Assistida por Computador/métodos , Processamento de Linguagem Natural , Gravação em Vídeo

6.

Ultrasound segmentation analysis via distinct and completed anatomical borders.

Duque, Vanessa Gonzalez; Marquardt, Alexandra; Velikova, Yordanka; Lacourpaille, Lilian; Nordez, Antoine; Crouzier, Marion; Lee, Hong Joo; Mateus, Diana; Navab, Nassir.

Int J Comput Assist Radiol Surg ; 19(7): 1419-1427, 2024 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-38789884

RESUMO

PURPOSE: Segmenting ultrasound images is important for precise area and/or volume calculations, ensuring reliable diagnosis and effective treatment evaluation for diseases. Recently, many segmentation methods have been proposed and shown impressive performance. However, currently, there is no deeper understanding of how networks segment target regions or how they define the boundaries. In this paper, we present a new approach that analyzes ultrasound segmentation networks in terms of learned borders because border delimitation is challenging in ultrasound. METHODS: We propose a way to split the boundaries for ultrasound images into distinct and completed. By exploiting the Grad-CAM of the split borders, we analyze the areas each network pays attention to. Further, we calculate the ratio of correct predictions for distinct and completed borders. We conducted experiments on an in-house leg ultrasound dataset (LEG-3D-US) as well as on two additional public datasets of thyroid, nerves, and one private for prostate. RESULTS: Quantitatively, the networks exhibit around 10% improvement in handling completed borders compared to distinct borders. Similar to doctors, the network struggles to define the borders in less visible areas. Additionally, the Seg-Grad-CAM analysis underscores how completion uses distinct borders and landmarks, while distinct focuses mainly on the shiny structures. We also observe variations depending on the attention mechanism of each architecture. CONCLUSION: In this work, we highlight the importance of studying ultrasound borders differently than other modalities such as MRI or CT. We split the borders into distinct and completed, similar to clinicians, and show the quality of the network-learned information for these two types of borders. Additionally, we open-source a 3D leg ultrasound dataset to the community https://github.com/Al3xand1a/segmentation-border-analysis .

Assuntos

Ultrassonografia , Humanos , Ultrassonografia/métodos , Masculino , Glândula Tireoide/diagnóstico por imagem , Próstata/diagnóstico por imagem , Perna (Membro)/diagnóstico por imagem , Imageamento Tridimensional/métodos

7.

ReMIND: The Brain Resection Multimodal Imaging Database.

Juvekar, Parikshit; Dorent, Reuben; Kögl, Fryderyk; Torio, Erickson; Barr, Colton; Rigolo, Laura; Galvin, Colin; Jowkar, Nick; Kazi, Anees; Haouchine, Nazim; Cheema, Harneet; Navab, Nassir; Pieper, Steve; Wells, William M; Bi, Wenya Linda; Golby, Alexandra; Frisken, Sarah; Kapur, Tina.

Sci Data ; 11(1): 494, 2024 May 14.

Artigo em Inglês | MEDLINE | ID: mdl-38744868

RESUMO

The standard of care for brain tumors is maximal safe surgical resection. Neuronavigation augments the surgeon's ability to achieve this but loses validity as surgery progresses due to brain shift. Moreover, gliomas are often indistinguishable from surrounding healthy brain tissue. Intraoperative magnetic resonance imaging (iMRI) and ultrasound (iUS) help visualize the tumor and brain shift. iUS is faster and easier to incorporate into surgical workflows but offers a lower contrast between tumorous and healthy tissues than iMRI. With the success of data-hungry Artificial Intelligence algorithms in medical image analysis, the benefits of sharing well-curated data cannot be overstated. To this end, we provide the largest publicly available MRI and iUS database of surgically treated brain tumors, including gliomas (n = 92), metastases (n = 11), and others (n = 11). This collection contains 369 preoperative MRI series, 320 3D iUS series, 301 iMRI series, and 356 segmentations collected from 114 consecutive patients at a single institution. This database is expected to help brain shift and image analysis research and neurosurgical training in interpreting iUS and iMRI.

Assuntos

Neoplasias Encefálicas , Bases de Dados Factuais , Imageamento por Ressonância Magnética , Imagem Multimodal , Humanos , Neoplasias Encefálicas/diagnóstico por imagem , Neoplasias Encefálicas/cirurgia , Encéfalo/diagnóstico por imagem , Encéfalo/cirurgia , Glioma/diagnóstico por imagem , Glioma/cirurgia , Ultrassonografia , Neuronavegação/métodos

8.

EndoViT: pretraining vision transformers on a large collection of endoscopic images.

Batic, Dominik; Holm, Felix; Özsoy, Ege; Czempiel, Tobias; Navab, Nassir.

Int J Comput Assist Radiol Surg ; 19(6): 1085-1091, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38570373

RESUMO

PURPOSE: Automated endoscopy video analysis is essential for assisting surgeons during medical procedures, but it faces challenges due to complex surgical scenes and limited annotated data. Large-scale pretraining has shown great success in natural language processing and computer vision communities in recent years. These approaches reduce the need for annotated data, which is of great interest in the medical domain. In this work, we investigate endoscopy domain-specific self-supervised pretraining on large collections of data. METHODS: To this end, we first collect Endo700k, the largest publicly available corpus of endoscopic images, extracted from nine public Minimally Invasive Surgery (MIS) datasets. Endo700k comprises more than 700,000 images. Next, we introduce EndoViT, an endoscopy-pretrained Vision Transformer (ViT), and evaluate it on a diverse set of surgical downstream tasks. RESULTS: Our findings indicate that domain-specific pretraining with EndoViT yields notable advantages in complex downstream tasks. In the case of action triplet recognition, our approach outperforms ImageNet pretraining. In semantic segmentation, we surpass the state-of-the-art (SOTA) performance. These results demonstrate the effectiveness of our domain-specific pretraining approach in addressing the challenges of automated endoscopy video analysis. CONCLUSION: Our study contributes to the field of medical computer vision by showcasing the benefits of domain-specific large-scale self-supervised pretraining for vision transformers. We release both our code and pretrained models to facilitate further research in this direction: https://github.com/DominikBatic/EndoViT .

Assuntos

Endoscopia , Humanos , Endoscopia/métodos , Endoscopia/educação , Processamento de Imagem Assistida por Computador/métodos , Gravação em Vídeo , Procedimentos Cirúrgicos Minimamente Invasivos/educação , Procedimentos Cirúrgicos Minimamente Invasivos/métodos

9.

Review of robotic systems for thoracoabdominal puncture interventional surgery.

Wang, Cheng; Guo, Li; Zhu, Jianjun; Zhu, Lifeng; Li, Chichi; Zhu, Haidong; Song, Aiguo; Lu, Ligong; Teng, Gao-Jun; Navab, Nassir; Jiang, Zhongliang.

APL Bioeng ; 8(2): 021501, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38572313

RESUMO

Cancer, with high morbidity and high mortality, is one of the major burdens threatening human health globally. Intervention procedures via percutaneous puncture have been widely used by physicians due to its minimally invasive surgical approach. However, traditional manual puncture intervention depends on personal experience and faces challenges in terms of precisely puncture, learning-curve, safety and efficacy. The development of puncture interventional surgery robotic (PISR) systems could alleviate the aforementioned problems to a certain extent. This paper attempts to review the current status and prospective of PISR systems for thoracic and abdominal application. In this review, the key technologies related to the robotics, including spatial registration, positioning navigation, puncture guidance feedback, respiratory motion compensation, and motion control, are discussed in detail.

10.

CACTUSS: Common Anatomical CT-US Space for US examinations.

Velikova, Yordanka; Simson, Walter; Azampour, Mohammad Farid; Paprottka, Philipp; Navab, Nassir.

Int J Comput Assist Radiol Surg ; 19(5): 861-869, 2024 May.

Artigo em Inglês | MEDLINE | ID: mdl-38270811

RESUMO

PURPOSE: The detection and treatment of abdominal aortic aneurysm (AAA), a vascular disorder with life-threatening consequences, is challenging due to its lack of symptoms until it reaches a critical size. Abdominal ultrasound (US) is utilized for diagnosis; however, its inherent low image quality and reliance on operator expertise make computed tomography (CT) the preferred choice for monitoring and treatment. Moreover, CT datasets have been effectively used for training deep neural networks for aorta segmentation. In this work, we demonstrate how leveraging CT labels can be used to improve segmentation in ultrasound and hence save manual annotations. METHODS: We introduce CACTUSS: a common anatomical CT-US space that inherits properties from both CT and ultrasound modalities to produce an image in intermediate representation (IR) space. CACTUSS acts as a virtual third modality between CT and US to address the scarcity of annotated ultrasound training data. The generation of IR images is facilitated by re-parametrizing a physics-based US simulator. In CACTUSS we use IR images as training data for ultrasound segmentation, eliminating the need for manual labeling. In addition, an image-to-image translation network is employed for the model's application on real B-modes. RESULTS: The model's performance is evaluated quantitatively for the task of aorta segmentation by comparison against a fully supervised method in terms of Dice Score and diagnostic metrics. CACTUSS outperforms the fully supervised network in segmentation and meets clinical requirements for AAA screening and diagnosis. CONCLUSION: CACTUSS provides a promising approach to improve US segmentation accuracy by leveraging CT labels, reducing the need for manual annotations. We generate IRs that inherit properties from both modalities while preserving the anatomical structure and are optimized for the task of aorta segmentation. Future work involves integrating CACTUSS into robotic ultrasound platforms for automated screening and conducting clinical feasibility studies.

Assuntos

Aneurisma da Aorta Abdominal , Tomografia Computadorizada por Raios X , Ultrassonografia , Humanos , Aneurisma da Aorta Abdominal/diagnóstico por imagem , Tomografia Computadorizada por Raios X/métodos , Ultrassonografia/métodos , Aorta Abdominal/diagnóstico por imagem , Imagem Multimodal/métodos

11.

ReMIND: The Brain Resection Multimodal Imaging Database.

Juvekar, Parikshit; Dorent, Reuben; Kögl, Fryderyk; Torio, Erickson; Barr, Colton; Rigolo, Laura; Galvin, Colin; Jowkar, Nick; Kazi, Anees; Haouchine, Nazim; Cheema, Harneet; Navab, Nassir; Pieper, Steve; Wells, William M; Bi, Wenya Linda; Golby, Alexandra; Frisken, Sarah; Kapur, Tina.

medRxiv ; 2024 Apr 08.

Artigo em Inglês | MEDLINE | ID: mdl-37745329

RESUMO

The standard of care for brain tumors is maximal safe surgical resection. Neuronavigation augments the surgeon's ability to achieve this but loses validity as surgery progresses due to brain shift. Moreover, gliomas are often indistinguishable from surrounding healthy brain tissue. Intraoperative magnetic resonance imaging (iMRI) and ultrasound (iUS) help visualize the tumor and brain shift. iUS is faster and easier to incorporate into surgical workflows but offers a lower contrast between tumorous and healthy tissues than iMRI. With the success of data-hungry Artificial Intelligence algorithms in medical image analysis, the benefits of sharing well-curated data cannot be overstated. To this end, we provide the largest publicly available MRI and iUS database of surgically treated brain tumors, including gliomas (n=92), metastases (n=11), and others (n=11). This collection contains 369 preoperative MRI series, 320 3D iUS series, 301 iMRI series, and 356 segmentations collected from 114 consecutive patients at a single institution. This database is expected to help brain shift and image analysis research and neurosurgical training in interpreting iUS and iMRI.

12.

Investigating pulse-echo sound speed estimation in breast ultrasound with deep learning.

Simson, Walter A; Paschali, Magdalini; Sideri-Lampretsa, Vasiliki; Navab, Nassir; Dahl, Jeremy J.

Ultrasonics ; 137: 107179, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-37939413

RESUMO

Ultrasound is an adjunct tool to mammography that can quickly and safely aid physicians in diagnosing breast abnormalities. Clinical ultrasound often assumes a constant sound speed to form diagnostic B-mode images. However, the components of breast tissue, such as glandular tissue, fat, and lesions, differ in sound speed. Given a constant sound speed assumption, these differences can degrade the quality of reconstructed images via phase aberration. Sound speed images can be a powerful tool for improving image quality and identifying diseases if properly estimated. To this end, we propose a supervised deep-learning approach for sound speed estimation from analytic ultrasound signals. We develop a large-scale simulated ultrasound dataset that generates representative breast tissue samples by modeling breast gland, skin, and lesions with varying echogenicity and sound speed. We adopt a fully convolutional neural network architecture trained on a simulated dataset to produce an estimated sound speed map. The simulated tissue is interrogated with a plane wave transmit sequence, and the complex-value reconstructed images are used as input for the convolutional network. The network is trained on the sound speed distribution map of the simulated data, and the trained model can estimate sound speed given reconstructed pulse-echo signals. We further incorporate thermal noise augmentation during training to enhance model robustness to artifacts found in real ultrasound data. To highlight the ability of our model to provide accurate sound speed estimations, we evaluate it on simulated, phantom, and in-vivo breast ultrasound data.

Assuntos

Aprendizado Profundo , Humanos , Feminino , Algoritmos , Ultrassonografia Mamária , Som , Ultrassonografia/métodos , Imagens de Fantasmas , Processamento de Imagem Assistida por Computador/métodos

13.

RCIT: A Robust Catadioptric-based Instrument 3D Tracking Method For Microsurgical Instruments In a Single-Camera System.

Alikhani, Alireza; Osner, Sebastian; Dehghani, Shervin; Busam, Benjamin; Inagaki, Satoshi; Maier, Mathias; Navab, Nassir; Nasseri, M Ali.

Annu Int Conf IEEE Eng Med Biol Soc ; 2023: 1-5, 2023 07.

Artigo em Inglês | MEDLINE | ID: mdl-38083453

RESUMO

The field of robotic microsurgery and micro-manipulation has undergone a profound evolution in recent years, particularly with regard to the accuracy, precision, versatility, and dexterity. These advancements have the potential to revolutionize high-precision biomedical procedures, such as neurosurgery, vitreoretinal surgery, and cell micro-manipulation. However, a critical challenge in developing micron-precision robotic systems is accurately verifying the end-effector motion in 3D. Such verification is complicated due to environmental vibrations, inaccuracy of mechanical assembly, and other physical uncertainties. To overcome these challenges, this paper proposes a novel single-camera framework that utilizes mirrors with known geometric parameters to estimate the 3D position of the microsurgical instrument. Euclidean distance between reconstructed points by the algorithm and the robot movement recorded by the highly accurate encoders is considered an error. Our method exhibits an accurate estimation with the mean absolute error of 0.044 mm when tested on a 23G surgical cannula with a diameter of 0.640 mm and operates at a resolution of 4024 × 3036 at 30 frames per second.

Assuntos

Robótica , Cirurgia Assistida por Computador , Microcirurgia , Movimento (Física) , Movimento

14.

Robotic Navigation Autonomy for Subretinal Injection via Intelligent Real-Time Virtual iOCT Volume Slicing.

Dehghani, Shervin; Sommersperger, Michael; Zhang, Peiyao; Martin-Gomez, Alejandro; Busam, Benjamin; Gehlbach, Peter; Navab, Nassir; Nasseri, M Ali; Iordachita, Iulian.

IEEE Int Conf Robot Autom ; 2023: 4724-4731, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-38125032

RESUMO

In the last decade, various robotic platforms have been introduced that could support delicate retinal surgeries. Concurrently, to provide semantic understanding of the surgical area, recent advances have enabled microscope-integrated intraoperative Optical Coherent Tomography (iOCT) with high-resolution 3D imaging at near video rate. The combination of robotics and semantic understanding enables task autonomy in robotic retinal surgery, such as for subretinal injection. This procedure requires precise needle insertion for best treatment outcomes. However, merging robotic systems with iOCT introduces new challenges. These include, but are not limited to high demands on data processing rates and dynamic registration of these systems during the procedure. In this work, we propose a framework for autonomous robotic navigation for subretinal injection, based on intelligent real-time processing of iOCT volumes. Our method consists of an instrument pose estimation method, an online registration between the robotic and the iOCT system, and trajectory planning tailored for navigation to an injection target. We also introduce intelligent virtual B-scans, a volume slicing approach for rapid instrument pose estimation, which is enabled by Convolutional Neural Networks (CNNs). Our experiments on ex-vivo porcine eyes demonstrate the precision and repeatability of the method. Finally, we discuss identified challenges in this work and suggest potential solutions to further the development of such systems.

15.

Theoretical error analysis of spotlight-based instrument localization for retinal surgery.

Zhou, Mingchuan; Hennerkes, Felix; Liu, Jingsong; Jiang, Zhongliang; Wendler, Thomas; Nasseri, M Ali; Iordachita, Iulian; Navab, Nassir.

Robotica ; 41(5): 1536-1549, 2023 May.

Artigo em Inglês | MEDLINE | ID: mdl-37982126

RESUMO

Retinal surgery is widely considered to be a complicated and challenging task even for specialists. Image-guided robot-assisted intervention is among the novel and promising solutions that may enhance human capabilities therein. In this paper, we demonstrate the possibility of using spotlights for 5D guidance of a microsurgical instrument. The theoretical basis of the localization for the instrument based on the projection of a single spotlight is analyzed to deduce the position and orientation of the spotlight source. The usage of multiple spotlights is also proposed to check the possibility of further improvements for the performance boundaries. The proposed method is verified within a high-fidelity simulation environment using the 3D creation suite Blender. Experimental results show that the average positioning error is 0.029 mm using a single spotlight and 0.025 mm with three spotlights, respectively, while the rotational errors are 0.124 and 0.101, which shows the application to be promising in instrument localization for retinal surgery.

16.

Holistic OR domain modeling: a semantic scene graph approach.

Özsoy, Ege; Czempiel, Tobias; Örnek, Evin Pinar; Eck, Ulrich; Tombari, Federico; Navab, Nassir.

Int J Comput Assist Radiol Surg ; 2023 Oct 12.

Artigo em Inglês | MEDLINE | ID: mdl-37823976

RESUMO

PURPOSE: Surgical procedures take place in highly complex operating rooms (OR), involving medical staff, patients, devices and their interactions. Until now, only medical professionals are capable of comprehending these intricate links and interactions. This work advances the field toward automated, comprehensive and semantic understanding and modeling of the OR domain by introducing semantic scene graphs (SSG) as a novel approach to describing and summarizing surgical environments in a structured and semantically rich manner. METHODS: We create the first open-source 4D SSG dataset. 4D-OR includes simulated total knee replacement surgeries captured by RGB-D sensors in a realistic OR simulation center. It includes annotations for SSGs, human and object pose, clinical roles and surgical phase labels. We introduce a neural network-based SSG generation pipeline for semantic reasoning in the OR and apply our approach to two downstream tasks: clinical role prediction and surgical phase recognition. RESULTS: We show that our pipeline can successfully reason within the OR domain. The capabilities of our scene graphs are further highlighted by their successful application to clinical role prediction and surgical phase recognition tasks. CONCLUSION: This work paves the way for multimodal holistic operating room modeling, with the potential to significantly enhance the state of the art in surgical data analysis, such as enabling more efficient and precise decision-making during surgical procedures, and ultimately improving patient safety and surgical outcomes. We release our code and dataset at github.com/egeozsoy/4D-OR.

17.

Contact Endoscopy - Narrow Band Imaging (CE-NBI) data set for laryngeal lesion assessment.

Esmaeili, Nazila; Davaris, Nikolaos; Boese, Axel; Illanes, Alfredo; Navab, Nassir; Friebe, Michael; Arens, Christoph.

Sci Data ; 10(1): 733, 2023 10 21.

Artigo em Inglês | MEDLINE | ID: mdl-37865668

RESUMO

The endoscopic examination of subepithelial vascular patterns within the vocal fold is crucial for clinicians seeking to distinguish between benign lesions and laryngeal cancer. Among innovative techniques, Contact Endoscopy combined with Narrow Band Imaging (CE-NBI) offers real-time visualization of these vascular structures. Despite the advent of CE-NBI, concerns have arisen regarding the subjective interpretation of its images. As a result, several computer-based solutions have been developed to address this issue. This study introduces the CE-NBI data set, the first publicly accessible data set that features enhanced and magnified visualizations of subepithelial blood vessels within the vocal fold. This data set encompasses 11144 images from 210 adult patients with pathological vocal fold conditions, where CE-NBI images are annotated using three distinct label categories. The data set has proven invaluable for numerous clinical assessments geared toward diagnosing laryngeal cancer using Optical Biopsy. Furthermore, given its versatility for various image analysis tasks, we have devised and implemented diverse image classification scenarios using Machine Learning (ML) approaches to address critical clinical challenges in assessing laryngeal lesions.

Assuntos

Neoplasias Laríngeas , Laringoscopia , Laringe , Adulto , Humanos , Neoplasias Laríngeas/diagnóstico por imagem , Neoplasias Laríngeas/patologia , Laringe/diagnóstico por imagem , Imagem de Banda Estreita , Prega Vocal/diagnóstico por imagem

18.

Robotic ultrasound imaging: State-of-the-art and future perspectives.

Jiang, Zhongliang; Salcudean, Septimiu E; Navab, Nassir.

Med Image Anal ; 89: 102878, 2023 10.

Artigo em Inglês | MEDLINE | ID: mdl-37541100

RESUMO

Ultrasound (US) is one of the most widely used modalities for clinical intervention and diagnosis due to the merits of providing non-invasive, radiation-free, and real-time images. However, free-hand US examinations are highly operator-dependent. Robotic US System (RUSS) aims at overcoming this shortcoming by offering reproducibility, while also aiming at improving dexterity, and intelligent anatomy and disease-aware imaging. In addition to enhancing diagnostic outcomes, RUSS also holds the potential to provide medical interventions for populations suffering from the shortage of experienced sonographers. In this paper, we categorize RUSS as teleoperated or autonomous. Regarding teleoperated RUSS, we summarize their technical developments, and clinical evaluations, respectively. This survey then focuses on the review of recent work on autonomous robotic US imaging. We demonstrate that machine learning and artificial intelligence present the key techniques, which enable intelligent patient and process-specific, motion and deformation-aware robotic image acquisition. We also show that the research on artificial intelligence for autonomous RUSS has directed the research community toward understanding and modeling expert sonographers' semantic reasoning and action. Here, we call this process, the recovery of the "language of sonography". This side result of research on autonomous robotic US acquisitions could be considered as valuable and essential as the progress made in the robotic US examination itself. This article will provide both engineers and clinicians with a comprehensive understanding of RUSS by surveying underlying techniques. Additionally, we present the challenges that the scientific community needs to face in the coming years in order to achieve its ultimate goal of developing intelligent robotic sonographer colleagues. These colleagues are expected to be capable of collaborating with human sonographers in dynamic environments to enhance both diagnostic and intraoperative imaging.

Assuntos

Procedimentos Cirúrgicos Robóticos , Robótica , Humanos , Inteligência Artificial , Reprodutibilidade dos Testes , Ultrassonografia/métodos

19.

CholecTriplet2022: Show me a tool and tell me the triplet - An endoscopic vision challenge for surgical action triplet detection.

Nwoye, Chinedu Innocent; Yu, Tong; Sharma, Saurav; Murali, Aditya; Alapatt, Deepak; Vardazaryan, Armine; Yuan, Kun; Hajek, Jonas; Reiter, Wolfgang; Yamlahi, Amine; Smidt, Finn-Henri; Zou, Xiaoyang; Zheng, Guoyan; Oliveira, Bruno; Torres, Helena R; Kondo, Satoshi; Kasai, Satoshi; Holm, Felix; Özsoy, Ege; Gui, Shuangchun; Li, Han; Raviteja, Sista; Sathish, Rachana; Poudel, Pranav; Bhattarai, Binod; Wang, Ziheng; Rui, Guo; Schellenberg, Melanie; Vilaça, João L; Czempiel, Tobias; Wang, Zhenkun; Sheet, Debdoot; Thapa, Shrawan Kumar; Berniker, Max; Godau, Patrick; Morais, Pedro; Regmi, Sudarshan; Tran, Thuy Nuong; Fonseca, Jaime; Nölke, Jan-Hinrich; Lima, Estevão; Vazquez, Eduard; Maier-Hein, Lena; Navab, Nassir; Mascagni, Pietro; Seeliger, Barbara; Gonzalez, Cristians; Mutter, Didier; Padoy, Nicolas.

Med Image Anal ; 89: 102888, 2023 10.

Artigo em Inglês | MEDLINE | ID: mdl-37451133

RESUMO

Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier efforts and the CholecTriplet challenge introduced in 2021 have put together techniques aimed at recognizing these triplets from surgical footage. Estimating also the spatial locations of the triplets would offer a more precise intraoperative context-aware decision support for computer-assisted intervention. This paper presents the CholecTriplet2022 challenge, which extends surgical action triplet modeling from recognition to detection. It includes weakly-supervised bounding box localization of every visible surgical instrument (or tool), as the key actors, and the modeling of each tool-activity in the form of triplet. The paper describes a baseline method and 10 new deep learning algorithms presented at the challenge to solve the task. It also provides thorough methodological comparisons of the methods, an in-depth analysis of the obtained results across multiple metrics, visual and procedural challenges; their significance, and useful insights for future research directions and applications in surgery.

Assuntos

Inteligência Artificial , Cirurgia Assistida por Computador , Humanos , Endoscopia , Algoritmos , Cirurgia Assistida por Computador/métodos , Instrumentos Cirúrgicos

20.

AR visualizations in laparoscopy: surgeon preferences and depth assessment of vascular anatomy.

Heiliger, Christian; Heiliger, Thomas; Deodati, Alessandra; Winkler, Alexander; Grimm, Matthias; Kalim, Faisal; Esteban, Javier; Mihatsch, Lorenz; Ehrlich V Treuenstätt, Viktor H; Mohamed, Khaled Ahmed; Andrade, Dorian; Frank, Alexander; Solyanik, Olga; Mandal, Subhamoy; Werner, Jens; Eck, Ulrich; Navab, Nassir; Karcz, Konrad.

Minim Invasive Ther Allied Technol ; 32(4): 190-198, 2023 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-37293947

RESUMO

Introduction: This study compares five augmented reality (AR) vasculature visualization techniques in a mixed-reality laparoscopy simulator with 50 medical professionals and analyzes their impact on the surgeon. Material and methods: ââThe different visualization techniques' abilities to convey depth were measured using the participant's accuracy in an objective depth sorting task. Demographic data and subjective measures, such as the preference of each AR visualization technique and potential application areas, were collected with questionnaires. Results: Despite measuring differences in objective measurements across the visualization techniques, they were not statistically significant. In the subjective measures, however, 55% of the participants rated visualization technique II, 'Opaque with single-color Fresnel highlights', as their favorite. Participants felt that AR could be useful for various surgeries, especially complex surgeries (100%). Almost all participants agreed that AR could potentially improve surgical parameters, such as patient safety (88%), complication rate (84%), and identifying risk structures (96%). Conclusions: More studies are needed on the effect of different visualizations on task performance, as well as more sophisticated and effective visualization techniques for the operating room. With the findings of this study, we encourage the development of new study setups to advance surgical AR.

Assuntos

Realidade Aumentada , Laparoscopia , Cirurgiões , Cirurgia Assistida por Computador , Humanos , Laparoscopia/métodos , Cirurgia Assistida por Computador/métodos

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA