Pesquisa | Biblioteca Virtual em Saúde

1.

A retrieval-augmented chatbot based on GPT-4 provides appropriate differential diagnosis in gastrointestinal radiology: a proof of concept study.

Rau, Stephan; Rau, Alexander; Nattenmüller, Johanna; Fink, Anna; Bamberg, Fabian; Reisert, Marco; Russe, Maximilian F.

Eur Radiol Exp ; 8(1): 60, 2024 May 17.

Artigo em Inglês | MEDLINE | ID: mdl-38755410

RESUMO

BACKGROUND: We investigated the potential of an imaging-aware GPT-4-based chatbot in providing diagnoses based on imaging descriptions of abdominal pathologies. METHODS: Utilizing zero-shot learning via the LlamaIndex framework, GPT-4 was enhanced using the 96 documents from the Radiographics Top 10 Reading List on gastrointestinal imaging, creating a gastrointestinal imaging-aware chatbot (GIA-CB). To assess its diagnostic capability, 50 cases on a variety of abdominal pathologies were created, comprising radiological findings in fluoroscopy, MRI, and CT. We compared the GIA-CB to the generic GPT-4 chatbot (g-CB) in providing the primary and 2 additional differential diagnoses, using interpretations from senior-level radiologists as ground truth. The trustworthiness of the GIA-CB was evaluated by investigating the source documents as provided by the knowledge-retrieval mechanism. Mann-Whitney U test was employed. RESULTS: The GIA-CB demonstrated a high capability to identify the most appropriate differential diagnosis in 39/50 cases (78%), significantly surpassing the g-CB in 27/50 cases (54%) (p = 0.006). Notably, the GIA-CB offered the primary differential in the top 3 differential diagnoses in 45/50 cases (90%) versus g-CB with 37/50 cases (74%) (p = 0.022) and always with appropriate explanations. The median response time was 29.8 s for GIA-CB and 15.7 s for g-CB, and the mean cost per case was $0.15 and $0.02, respectively. CONCLUSIONS: The GIA-CB not only provided an accurate diagnosis for gastrointestinal pathologies, but also direct access to source documents, providing insight into the decision-making process, a step towards trustworthy and explainable AI. Integrating context-specific data into AI models can support evidence-based clinical decision-making. RELEVANCE STATEMENT: A context-aware GPT-4 chatbot demonstrates high accuracy in providing differential diagnoses based on imaging descriptions, surpassing the generic GPT-4. It provided formulated rationale and source excerpts supporting the diagnoses, thus enhancing trustworthy decision-support. KEY POINTS: â¢ Knowledge retrieval enhances differential diagnoses in a gastrointestinal imaging-aware chatbot (GIA-CB). â¢ GIA-CB outperformed the generic counterpart, providing formulated rationale and source excerpts. â¢ GIA-CB has the potential to pave the way for AI-assisted decision support systems.

Assuntos

Estudo de Prova de Conceito , Humanos , Diagnóstico Diferencial , Gastroenteropatias/diagnóstico por imagem

2.

Enhancing LLM Application in Radiology: A Call for Expanded Research and Comparative Analysis.

Russe, Maximilian Frederik; Reisert, Marco; Bamberg, Fabian; Rau, Alexander.

Rofo ; 2024 May 14.

Artigo em Inglês | MEDLINE | ID: mdl-38744321

3.

Imaging Markers Derived From MRI-Based Automated Kidney Segmentation.

Kellner, Elias; Sekula, Peggy; Lipovsek, Jan; Russe, Maximilian; Horbach, Harald; Schlett, Christopher L; Nauck, Matthias; Völzke, Henry; Kroencke, Thomas; Bette, Stefanie; Kauczor, Hans-Ulrich; Keil, Thomas; Pischon, Tobias; Heid, Iris M; Peters, Annette; Niendorf, Thoralf; Lieb, Wolfgang; Bamberg, Fabian; Büchert, Martin; Reichardt, Wilfried; Reisert, Marco; Köttgen, Anna.

Dtsch Arztebl Int ; 121(9): 284-290, 2024 May 03.

Artigo em Inglês | MEDLINE | ID: mdl-38530931

RESUMO

BACKGROUND: Population-wide research on potential new imaging biomarkers of the kidney depends on accurate automated segmentation of the kidney and its compartments (cortex, medulla, and sinus). METHODS: We developed a robust deep-learning framework for kidney (sub-)segmentation based on a hierarchical, three-dimensional convolutional neural network (CNN) that was optimized for multiscale problems of combined localization and segmentation. We applied the CNN to abdominal magnetic resonance images from the population-based German National Cohort (NAKO) study. RESULTS: There was good to excellent agreement between the model predictions and manual segmentations. The median values for the body-surface normalized total kidney, cortex, medulla, and sinus volumes of 9934 persons were 158, 115, 43, and 24 mL/m2. Distributions of these markers are provided both for the overall study population and for a subgroup of persons without kidney disease or any associated conditions. Multivariable adjusted regression analyses revealed that diabetes, male sex, and a higher estimated glomerular filtration rate (eGFR) are important predictors of higher total and cortical volumes. Each increase of eGFR by one unit (i.e., 1 mL/min per 1.73 m2 body surface area) was associated with a 0.98 mL/m2 increase in total kidney volume, and this association was significant. Volumes were lower in persons with eGFR-defined chronic kidney disease. CONCLUSION: The extraction of image-based biomarkers through CNN-based renal sub-segmentation using data from a population-based study yields reliable results, forming a solid foundation for future investigations.

Assuntos

Rim , Imageamento por Ressonância Magnética , Humanos , Masculino , Feminino , Imageamento por Ressonância Magnética/métodos , Imageamento por Ressonância Magnética/estatística & dados numéricos , Rim/diagnóstico por imagem , Pessoa de Meia-Idade , Idoso , Adulto , Alemanha , Taxa de Filtração Glomerular/fisiologia , Biomarcadores/análise , Redes Neurais de Computação , Aprendizado Profundo , Estudos de Coortes

4.

Effective management of recurrent Doege-Potter syndrome with somatostatin-analogues: A case report.

Schöler, Felix; Storz, Maximilian Andreas; Khavaran, Ashkan; Hümmler, Nicolas; Russe, Maximilian Frederik; Wielenberg, Christoph-Ferdinand; Laubner, Katharina; Seufert, Jochen.

Cancer Rep (Hoboken) ; 7(3): e1992, 2024 03.

Artigo em Inglês | MEDLINE | ID: mdl-38441351

RESUMO

BACKGROUND: Doege-Potter syndrome is defined as paraneoplastic hypoinsulinemic hypoglycemia associated with a benign or malignant solitary fibrous tumor frequently located in pleural, but also extrapleural sites. Hypoglycemia can be attributed to paraneoplastic secretion of "Big-IGF-II," a precursor of Insulin-like growth factor-II. This prohormone aberrantly binds to and activates insulin receptors, with consecutive initiation of common insulin actions such as inhibition of gluconeogenesis, activation of glycolysis and stimulation of cellular glucose uptake culminating in recurrent tumor-induced hypoglycemic episodes. Complete tumor resection or debulking surgery is considered the most promising treatment for DPS. CASE: Here, we report a rare case of a recurrent Doege-Poter Syndrome with atypical gelatinous tumor lesions of the lung, pleura and pericardial fat tissue in an 87-year-old woman. Although previously described as ineffective, we propose that adjuvant treatment with Octreotide in conjunction with intravenous glucose helped to maintain tolerable blood glucose levels before tumor resection. The somatostatin-analogue Lanreotide was successfully used after tumor debulking surgery (R2-resection) to maintain adequate blood glucose control. CONCLUSION: We conclude that somatostatin-analogues bear the potential of being effective in conjunction with limited surgical approaches for the treatment of hypoglycemia in recurrent or non-totally resectable SFT entities underlying DPS.

Assuntos

Anormalidades Congênitas , Hipoglicemia , Nefropatias/congênito , Rim/anormalidades , Neoplasias , Feminino , Humanos , Idoso de 80 Anos ou mais , Somatostatina , Hipoglicemia/etiologia

5.

Multicentric development and validation of a multi-scale and multi-task deep learning model for comprehensive lower extremity alignment analysis.

Wilhelm, Nikolas J; von Schacky, Claudio E; Lindner, Felix J; Feucht, Matthias J; Ehmann, Yannick; Pogorzelski, Jonas; Haddadin, Sami; Neumann, Jan; Hinterwimmer, Florian; von Eisenhart-Rothe, Rüdiger; Jung, Matthias; Russe, Maximilian F; Izadpanah, Kaywan; Siebenlist, Sebastian; Burgkart, Rainer; Rupp, Marco-Christopher.

Artif Intell Med ; 150: 102843, 2024 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-38553152

RESUMO

Osteoarthritis of the knee, a widespread cause of knee disability, is commonly treated in orthopedics due to its rising prevalence. Lower extremity misalignment, pivotal in knee injury etiology and management, necessitates comprehensive mechanical alignment evaluation via frequently-requested weight-bearing long leg radiographs (LLR). Despite LLR's routine use, current analysis techniques are error-prone and time-consuming. To address this, we conducted a multicentric study to develop and validate a deep learning (DL) model for fully automated leg alignment assessment on anterior-posterior LLR, targeting enhanced reliability and efficiency. The DL model, developed using 594 patients' LLR and a 60%/10%/30% data split for training, validation, and testing, executed alignment analyses via a multi-step process, employing a detection network and nine specialized networks. It was designed to assess all vital anatomical and mechanical parameters for standard clinical leg deformity analysis and preoperative planning. Accuracy, reliability, and assessment duration were compared with three specialized orthopedic surgeons across two distinct institutional datasets (136 and 143 radiographs). The algorithm exhibited equivalent performance to the surgeons in terms of alignment accuracy (DL: 0.21 ± 0.18°to 1.06 ± 1.3°vs. OS: 0.21 ± 0.16°to 1.72 ± 1.96°), interrater reliability (ICC DL: 0.90 ± 0.05 to 1.0 ± 0.0 vs. ICC OS: 0.90 ± 0.03 to 1.0 ± 0.0), and clinically acceptable accuracy (DL: 53.9%-100% vs OS 30.8%-100%). Further, automated analysis significantly reduced analysis time compared to manual annotation (DL: 22 ± 0.6 s vs. OS; 101.7 ± 7 s, p ≤ 0.01). By demonstrating that our algorithm not only matches the precision of expert surgeons but also significantly outpaces them in both speed and consistency of measurements, our research underscores a pivotal advancement in harnessing AI to enhance clinical efficiency and decision-making in orthopaedics.

Assuntos

Aprendizado Profundo , Humanos , Reprodutibilidade dos Testes , Extremidade Inferior/diagnóstico por imagem , Extremidade Inferior/cirurgia , Articulação do Joelho , Radiografia , Estudos Retrospectivos

6.

Longitudinal cardiac dimensions in patients undergoing LVAD implantation.

Meissner, Florian; Szvetics, Sophie; Galbas, Michelle Costa; Russe, Maximilian; Schibilsky, David; Kaier, Klaus; Czerny, Martin; Bothe, Wolfgang.

Artif Organs ; 48(5): 550-558, 2024 May.

Artigo em Inglês | MEDLINE | ID: mdl-38409825

RESUMO

BACKGROUND: In conventional left ventricular assist devices (LVAD), a separate outflow graft is sutured to the ascending aorta. Novel device designs may include a transventricular outflow cannula crossing the aortic valve (AV). While transversal ventricular dimensions are well investigated in patients with severe heart failure, little is known about the longitudinal dimensions. These dimensions are, however, particularly critical for the design and development of mechanical circulatory support (MCS) devices with transaortic outflow cannula. METHODS: In an explorative retrospective cohort study at the University Medical Center Freiburg, Germany, the longitudinal cardiac dimensions of patients undergoing computed tomography angiography (CTA) before and, if available, after LVAD implantation were analyzed. Among others, the following dimensions were assessed: (a) apex to AV, (b) apex to mitral valve, (c) AV to sinotubular junction (STJ), (d) apex to STJ, (e) apex to brachiocephalic artery (BCA), and (f) AV to BCA. RESULTS: In total, 44 LVAD patients (36 male, age 55.8 years, height 1.75 m) were included. The longitudinal cardiac dimensions were (a) 114.5 ± 12.1 mm, (b) 108.0 ± 12.4 mm, (c) 20.9 ± 2.9, (d) 135.4 ± 13.4 mm, (e) 206.0 ± 18.3, and (f) 91.5 ± 9.8 mm. Postoperatively, (a) and (b) decreased by 31.5% and 39.5%, respectively (N = 14). CONCLUSIONS: Longitudinal cardiac dimensions may be reduced by up to 40% after LVAD implantation. A better knowledge of these dimensions and their postoperative alterations in LVAD patients may improve surgical planning and help to design MCS devices with transventricular outflow cannula.

Assuntos

Insuficiência Cardíaca , Coração Auxiliar , Procedimentos Cirúrgicos Torácicos , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Retrospectivos , Aorta Torácica/cirurgia , Aorta , Valva Aórtica , Coração Auxiliar/efeitos adversos , Insuficiência Cardíaca/cirurgia , Resultado do Tratamento

7.

Improving the use of LLMs in radiology through prompt engineering: from precision prompts to zero-shot learning.

Russe, Maximilian Frederik; Reisert, Marco; Bamberg, Fabian; Rau, Alexander.

Rofo ; 2024 Feb 26.

Artigo em Inglês | MEDLINE | ID: mdl-38408477

RESUMO

PURPOSE: Large language models (LLMs) such as ChatGPT have shown significant potential in radiology. Their effectiveness often depends on prompt engineering, which optimizes the interaction with the chatbot for accurate results. Here, we highlight the critical role of prompt engineering in tailoring the LLMs' responses to specific medical tasks. MATERIALS AND METHODS: Using a clinical case, we elucidate different prompting strategies to adapt the LLM ChatGPT using GPT4 to new tasks without additional training of the base model. These approaches range from precision prompts to advanced in-context methods such as few-shot and zero-shot learning. Additionally, the significance of embeddings, which serve as a data representation technique, is discussed. RESULTS: Prompt engineering substantially improved and focused the chatbot's output. Moreover, embedding of specialized knowledge allows for more transparent insight into the model's decision-making and thus enhances trust. CONCLUSION: Despite certain challenges, prompt engineering plays a pivotal role in harnessing the potential of LLMs for specialized tasks in the medical domain, particularly radiology. As LLMs continue to evolve, techniques like few-shot learning, zero-shot learning, and embedding-based retrieval mechanisms will become indispensable in delivering tailored outputs. KEY POINTS: · Large language models might impact radiological practice and decision-masking.. · However, implementation and performance are dependent on the assigned task.. · Optimization of prompting strategies can substantially improve model performance.. · Strategies for prompt engineering range from precision prompts to zero-shot learning..

8.

A deep learning approach for projection and body-side classification in musculoskeletal radiographs.

Fink, Anna; Tran, Hien; Reisert, Marco; Rau, Alexander; Bayer, Jörg; Kotter, Elmar; Bamberg, Fabian; Russe, Maximilian F.

Eur Radiol Exp ; 8(1): 23, 2024 Feb 14.

Artigo em Inglês | MEDLINE | ID: mdl-38353812

RESUMO

BACKGROUND: The growing prevalence of musculoskeletal diseases increases radiologic workload, highlighting the need for optimized workflow management and automated metadata classification systems. We developed a large-scale, well-characterized dataset of musculoskeletal radiographs and trained deep learning neural networks to classify radiographic projection and body side. METHODS: In this IRB-approved retrospective single-center study, a dataset of musculoskeletal radiographs from 2011 to 2019 was retrieved and manually labeled for one of 45 possible radiographic projections and the depicted body side. Two classification networks were trained for the respective tasks using the Xception architecture with a custom network top and pretrained weights. Performance was evaluated on a hold-out test sample, and gradient-weighted class activation mapping (Grad-CAM) heatmaps were computed to visualize the influential image regions for network predictions. RESULTS: A total of 13,098 studies comprising 23,663 radiographs were included with a patient-level dataset split, resulting in 19,183 training, 2,145 validation, and 2,335 test images. Focusing on paired body regions, training for side detection included 16,319 radiographs (13,284 training, 1,443 validation, and 1,592 test images). The models achieved an overall accuracy of 0.975 for projection and 0.976 for body-side classification on the respective hold-out test sample. Errors were primarily observed in projections with seamless anatomical transitions or non-orthograde adjustment techniques. CONCLUSIONS: The deep learning neural networks demonstrated excellent performance in classifying radiographic projection and body side across a wide range of musculoskeletal radiographs. These networks have the potential to serve as presorting algorithms, optimizing radiologic workflow and enhancing patient care. RELEVANCE STATEMENT: The developed networks excel at classifying musculoskeletal radiographs, providing valuable tools for research data extraction, standardized image sorting, and minimizing misclassifications in artificial intelligence systems, ultimately enhancing radiology workflow efficiency and patient care. KEY POINTS: â¢ A large-scale, well-characterized dataset was developed, covering a broad spectrum of musculoskeletal radiographs. â¢ Deep learning neural networks achieved high accuracy in classifying radiographic projection and body side. â¢ Grad-CAM heatmaps provided insight into network decisions, contributing to their interpretability and trustworthiness. â¢ The trained models can help optimize radiologic workflow and manage large amounts of data.

Assuntos

Aprendizado Profundo , Radiologia , Humanos , Inteligência Artificial , Estudos Retrospectivos , Radiografia

9.

AI-based X-ray fracture analysis of the distal radius: accuracy between representative classification, detection and segmentation deep learning models for clinical practice.

Russe, Maximilian Frederik; Rebmann, Philipp; Tran, Phuong Hien; Kellner, Elias; Reisert, Marco; Bamberg, Fabian; Kotter, Elmar; Kim, Suam.

BMJ Open ; 14(1): e076954, 2024 01 23.

Artigo em Inglês | MEDLINE | ID: mdl-38262641

RESUMO

OBJECTIVES: To aid in selecting the optimal artificial intelligence (AI) solution for clinical application, we directly compared performances of selected representative custom-trained or commercial classification, detection and segmentation models for fracture detection on musculoskeletal radiographs of the distal radius by aligning their outputs. DESIGN AND SETTING: This single-centre retrospective study was conducted on a random subset of emergency department radiographs from 2008 to 2018 of the distal radius in Germany. MATERIALS AND METHODS: An image set was created to be compatible with training and testing classification and segmentation models by annotating examinations for fractures and overlaying fracture masks, if applicable. Representative classification and segmentation models were trained on 80% of the data. After output binarisation, their derived fracture detection performances as well as that of a standard commercially available solution were compared on the remaining X-rays (20%) using mainly accuracy and area under the receiver operating characteristic (AUROC). RESULTS: A total of 2856 examinations with 712 (24.9%) fractures were included in the analysis. Accuracies reached up to 0.97 for the classification model, 0.94 for the segmentation model and 0.95 for BoneView. Cohen's kappa was at least 0.80 in pairwise comparisons, while Fleiss' kappa was 0.83 for all models. Fracture predictions were visualised with all three methods at different levels of detail, ranking from downsampled image region for classification over bounding box for detection to single pixel-level delineation for segmentation. CONCLUSIONS: All three investigated approaches reached high performances for detection of distal radius fractures with simple preprocessing and postprocessing protocols on the custom-trained models. Despite their underlying structural differences, selection of one's fracture analysis AI tool in the frame of this study reduces to the desired flavour of automation: automated classification, AI-assisted manual fracture reading or minimised false negatives.

Assuntos

Aprendizado Profundo , Fraturas Ósseas , Humanos , Raios X , Inteligência Artificial , Rádio (Anatomia) , Estudos Retrospectivos

10.

A content-aware chatbot based on GPT 4 provides trustworthy recommendations for Cone-Beam CT guidelines in dental imaging.

Russe, Maximilian Frederik; Rau, Alexander; Ermer, Michael Andreas; Rothweiler, René; Wenger, Sina; Klöble, Klara; Schulze, Ralf K W; Bamberg, Fabian; Schmelzeisen, Rainer; Reisert, Marco; Semper-Hogg, Wiebke.

Dentomaxillofac Radiol ; 53(2): 109-114, 2024 Feb 08.

Artigo em Inglês | MEDLINE | ID: mdl-38180877

RESUMO

OBJECTIVES: To develop a content-aware chatbot based on GPT-3.5-Turbo and GPT-4 with specialized knowledge on the German S2 Cone-Beam CT (CBCT) dental imaging guideline and to compare the performance against humans. METHODS: The LlamaIndex software library was used to integrate the guideline context into the chatbots. Based on the CBCT S2 guideline, 40 questions were posed to content-aware chatbots and early career and senior practitioners with different levels of experience served as reference. The chatbots' performance was compared in terms of recommendation accuracy and explanation quality. Chi-square test and one-tailed Wilcoxon signed rank test evaluated accuracy and explanation quality, respectively. RESULTS: The GPT-4 based chatbot provided 100% correct recommendations and superior explanation quality compared to the one based on GPT3.5-Turbo (87.5% vs. 57.5% for GPT-3.5-Turbo; P = .003). Moreover, it outperformed early career practitioners in correct answers (P = .002 and P = .032) and earned higher trust than the chatbot using GPT-3.5-Turbo (P = 0.006). CONCLUSIONS: A content-aware chatbot using GPT-4 reliably provided recommendations according to current consensus guidelines. The responses were deemed trustworthy and transparent, and therefore facilitate the integration of artificial intelligence into clinical decision-making.

Assuntos

Inteligência Artificial , Software , Humanos , Tomada de Decisão Clínica , Tomografia Computadorizada de Feixe Cônico , Consenso

11.

Photon-counting computed tomography - clinical application in oncological, cardiovascular, and pediatric radiology. / Photon-Counting Computertomographie klinische Anwendungen in der onkologischen, kardiovaskulären und pädiatrischen Radiologie.

Hagen, Florian; Soschynski, Martin; Weis, Meike; Hagar, Muhammad Taha; Krumm, Patrick; Ayx, Isabelle; Taron, Jana; Krauss, Tobias; Hein, Manuel; Ruile, Philipp; von Zur Muehlen, Constantin; Schlett, Christopher L; Neubauer, Jakob; Tsiflikas, Ilias; Russe, Maximilian Frederik; Arnold, Philipp; Faby, Sebastian; Froelich, Matthias F; Weiß, Jakob; Stein, Thomas; Overhoff, Daniel; Bongers, Malte; Nikolaou, Konstantin; Schönberg, Stefan O; Bamberg, Fabian; Horger, Marius.

Rofo ; 196(1): 25-35, 2024 Jan.

Artigo em Inglês, Alemão | MEDLINE | ID: mdl-37793417

RESUMO

BACKGROUND: Photon-counting detector computed tomography (PCD-CT) is a promising new technology with the potential to fundamentally change workflows in the daily routine and provide new quantitative imaging information to improve clinical decision-making and patient management. METHOD: The contents of this review are based on an unrestricted literature search of PubMed and Google Scholar using the search terms "photon-counting CT", "photon-counting detector", "spectral CT", "computed tomography" as well as on the authors' own experience. RESULTS: The fundamental difference with respect to the currently established energy-integrating CT detectors is that PCD-CT allows for the counting of every single photon at the detector level. Based on the identified literature, PCD-CT phantom measurements and initial clinical studies have demonstrated that the new technology allows for improved spatial resolution, reduced image noise, and new possibilities for advanced quantitative image postprocessing. CONCLUSION: For clinical practice, the potential benefits include fewer beam hardening artifacts, a radiation dose reduction, and the use of new or combinations of contrast agents. In particular, critical patient groups such as oncological, cardiovascular, lung, and head & neck as well as pediatric patient collectives benefit from the clinical advantages. KEY POINTS: · Photon-counting computed tomography (PCD-CT) is being used for the first time in routine clinical practice, enabling a significant dose reduction in critical patient populations such as oncology, cardiology, and pediatrics.. · Compared to conventional CT, PCD-CT enables a reduction in electronic image noise.. · Due to the spectral data sets, PCD-CT enables fully comprehensive post-processing applications.. CITATION FORMAT: · Hagen F, Soschynski M, Weis M etâal. Photon-counting computed tomography - clinical application in oncological, cardiovascular, and pediatric radiology. Fortschr Röntgenstr 2024; 196: 25â-â34.

Assuntos

Radiologia , Tomografia Computadorizada por Raios X , Humanos , Criança , Tomografia Computadorizada por Raios X/métodos , Meios de Contraste , Tórax , Imagens de Fantasmas , Pulmão

12.

Automated image quality assessment for selecting among multiple magnetic resonance image acquisitions in the German National Cohort study.

Schuppert, Christopher; Rospleszcz, Susanne; Hirsch, Jochen G; Hoinkiss, Daniel C; Köhn, Alexander; von Krüchten, Ricarda; Russe, Maximilian F; Keil, Thomas; Krist, Lilian; Schmidt, Börge; Michels, Karin B; Schipf, Sabine; Brenner, Hermann; Kröncke, Thomas J; Pischon, Tobias; Niendorf, Thoralf; Schulz-Menger, Jeanette; Forsting, Michael; Völzke, Henry; Hosten, Norbert; Bülow, Robin; Zaitsev, Maxim; Kauczor, Hans-Ulrich; Bamberg, Fabian; Günther, Matthias; Schlett, Christopher L.

Sci Rep ; 13(1): 22745, 2023 12 20.

Artigo em Inglês | MEDLINE | ID: mdl-38123791

RESUMO

In magnetic resonance imaging (MRI), the perception of substandard image quality may prompt repetition of the respective image acquisition protocol. Subsequently selecting the preferred high-quality image data from a series of acquisitions can be challenging. An automated workflow may facilitate and improve this selection. We therefore aimed to investigate the applicability of an automated image quality assessment for the prediction of the subjectively preferred image acquisition. Our analysis included data from 11,347 participants with whole-body MRI examinations performed as part of the ongoing prospective multi-center German National Cohort (NAKO) study. Trained radiologic technologists repeated any of the twelve examination protocols due to induced setup errors and/or subjectively unsatisfactory image quality and chose a preferred acquisition from the resultant series. Up to 11 quantitative image quality parameters were automatically derived from all acquisitions. Regularized regression and standard estimates of diagnostic accuracy were calculated. Controlling for setup variations in 2342 series of two or more acquisitions, technologists preferred the repetition over the initial acquisition in 1116 of 1396 series in which the initial setup was retained (79.9%, range across protocols: 73-100%). Image quality parameters then commonly showed statistically significant differences between chosen and discarded acquisitions. In regularized regression across all protocols, 'structured noise maximum' was the strongest predictor for the technologists' choice, followed by 'N/2 ghosting average'. Combinations of the automatically derived parameters provided an area under the ROC curve between 0.51 and 0.74 for the prediction of the technologists' choice. It is concluded that automated image quality assessment can, despite considerable performance differences between protocols and anatomical regions, contribute substantially to identifying the subjective preference in a series of MRI acquisitions and thus provide effective decision support to readers.

Assuntos

Imageamento por Ressonância Magnética , Humanos , Estudos de Coortes , Estudos Prospectivos , Imageamento por Ressonância Magnética/métodos , Curva ROC , Estudos Longitudinais

13.

Pilot study on high-resolution radiological methods for the analysis of cerebrospinal fluid (CSF) shunt valves.

Pichotka, Martin P; Weigt, Moritz; Shah, Mukesch J; Russe, Maximilian F; Stein, Thomas; Billoud, T; Beck, Jürgen; Straehle, Jakob; Schlett, Christopher L; Elverfeldt, Dominik V; Reisert, Marco.

Z Med Phys ; 2023 Dec 15.

Artigo em Inglês | MEDLINE | ID: mdl-38104007

RESUMO

OBJECTIVES: Despite their life-saving capabilities, cerebrospinal fluid (CSF) shunts exhibit high failure rates, with a large fraction of failures attributed to the regulating valve. Due to a lack of methods for the detailed analysis of valve malfunctions, failure mechanisms are not well understood, and valves often have to be surgically explanted on the mere suspicion of malfunction. The presented pilot study aims to demonstrate radiological methods for comprehensive analysis of CSF shunt valves, considering both the potential for failure analysis in design optimization, and for future clinical in-vivo application to reduce the number of required shunt revision surgeries. The proposed method could also be utilized to develop and support in situ repair methods (e.g. by lysis or ultrasound) of malfunctioning CSF shunt valves. MATERIALS AND METHODS: The primary methods described are contrast-enhanced radiographic time series of CSF shunt valves, taken in a favorable projection geometry at low radiation dose, and the machine-learning-based diagnosis of CSF shunt valve obstructions. Complimentarily, we investigate CT-based methods capable of providing accurate ground truth for the training of such diagnostic tools. Using simulated test and training data, the performance of the machine-learning diagnostics in identifying and localizing obstructions within a shunt valve is evaluated regarding per-pixel sensitivity and specificity, the Dice similarity coefficient, and the false positive rate in the case of obstruction free test samples. RESULTS: Contrast enhanced subtraction radiography allows high-resolution, time-resolved, low-dose analysis of fluid transport in CSF shunt valves. Complementarily, photon-counting micro-CT allows to investigate valve obstruction mechanisms in detail, and to generate valid ground truth for machine learning-based diagnostics. Machine-learning-based detection of valve obstructions in simulated radiographies shows promising results, with a per-pixel sensitivity >70%, per-pixel specificity >90%, a median Dice coefficient >0.8 and <10% false positives at a detection threshold of 0.5. CONCLUSIONS: This ex-vivo study demonstrates obstruction detection in cerebro-spinal fluid shunt valves, combining radiological methods with machine learning under conditions compatible to future in-vivo application. Results indicate that high-resolution contrast-enhanced subtraction radiography, possibly including time-series data, combined with machine-learning image analysis, has the potential to strongly improve the diagnostics of CSF shunt valve failures. The presented method is in principle suitable for in-vivo application, considering both measurement geometry and radiological dose. Further research is needed to validate these results on real-world data and to refine the employed methods. In combination, the presented methods enable comprehensive analysis of valve failure mechanisms, paving the way for improved product development and clinical diagnostics of CSF shunt valves.

14.

Performance of ChatGPT, human radiologists, and context-aware ChatGPT in identifying AO codes from radiology reports.

Russe, Maximilian F; Fink, Anna; Ngo, Helen; Tran, Hien; Bamberg, Fabian; Reisert, Marco; Rau, Alexander.

Sci Rep ; 13(1): 14215, 2023 08 30.

Artigo em Inglês | MEDLINE | ID: mdl-37648742

RESUMO

While radiologists can describe a fracture's morphology and complexity with ease, the translation into classification systems such as the Arbeitsgemeinschaft Osteosynthesefragen (AO) Fracture and Dislocation Classification Compendium is more challenging. We tested the performance of generic chatbots and chatbots aware of specific knowledge of the AO classification provided by a vector-index and compared it to human readers. In the 100 radiological reports we created based on random AO codes, chatbots provided AO codes significantly faster than humans (mean 3.2 s per case vs. 50 s per case, p < .001) though not reaching human performance (max. chatbot performance of 86% correct full AO codes vs. 95% in human readers). In general, chatbots based on GPT 4 outperformed the ones based on GPT 3.5-Turbo. Further, we found that providing specific knowledge substantially enhances the chatbot's performance and consistency as the context-aware chatbot based on GPT 4 provided 71% consistent correct full AO codes for the compared to the 2% consistent correct full AO codes for the generic ChatGPT 4. This provides evidence, that refining and providing specific context to ChatGPT will be the next essential step in harnessing its power.

Assuntos

Fraturas Ósseas , Radiologia , Humanos , Conscientização , Medicamentos Genéricos , Radiologistas

15.

Automated detection of cephalometric landmarks using deep neural patchworks.

Weingart, Julia Vera; Schlager, Stefan; Metzger, Marc Christian; Brandenburg, Leonard Simon; Hein, Anna; Schmelzeisen, Rainer; Bamberg, Fabian; Kim, Suam; Kellner, Elias; Reisert, Marco; Russe, Maximilian Frederik.

Dentomaxillofac Radiol ; 52(6): 20230059, 2023 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-37427585

RESUMO

OBJECTIVES: This study evaluated the accuracy of deep neural patchworks (DNPs), a deep learning-based segmentation framework, for automated identification of 60 cephalometric landmarks (bone-, soft tissue- and tooth-landmarks) on CT scans. The aim was to determine whether DNP could be used for routine three-dimensional cephalometric analysis in diagnostics and treatment planning in orthognathic surgery and orthodontics. METHODS: Full skull CT scans of 30 adult patients (18 female, 12 male, mean age 35.6 years) were randomly divided into a training and test data set (each n = 15). Clinician A annotated 60 landmarks in all 30 CT scans. Clinician B annotated 60 landmarks in the test data set only. The DNP was trained using spherical segmentations of the adjacent tissue for each landmark. Automated landmark predictions in the separate test data set were created by calculating the center of mass of the predictions. The accuracy of the method was evaluated by comparing these annotations to the manual annotations. RESULTS: The DNP was successfully trained to identify all 60 landmarks. The mean error of our method was 1.94 mm (SD 1.45 mm) compared to a mean error of 1.32 mm (SD 1.08 mm) for manual annotations. The minimum error was found for landmarks ANS 1.11 mm, SN 1.2 mm, and CP_R 1.25 mm. CONCLUSION: The DNP-algorithm was able to accurately identify cephalometric landmarks with mean errors <2 mm. This method could improve the workflow of cephalometric analysis in orthodontics and orthognathic surgery. Low training requirements while still accomplishing high precision make this method particularly promising for clinical use.

Assuntos

Pontos de Referência Anatômicos , Crânio , Adulto , Humanos , Masculino , Feminino , Reprodutibilidade dos Testes , Cefalometria/métodos , Crânio/diagnóstico por imagem , Algoritmos

16.

A Context-based Chatbot Surpasses Trained Radiologists and Generic ChatGPT in Following the ACR Appropriateness Guidelines.

Rau, Alexander; Rau, Stephan; Zoeller, Daniela; Fink, Anna; Tran, Hien; Wilpert, Caroline; Nattenmueller, Johanna; Neubauer, Jakob; Bamberg, Fabian; Reisert, Marco; Russe, Maximilian F.

Radiology ; 308(1): e230970, 2023 07.

Artigo em Inglês | MEDLINE | ID: mdl-37489981

RESUMO

Background Radiological imaging guidelines are crucial for accurate diagnosis and optimal patient care as they result in standardized decisions and thus reduce inappropriate imaging studies. Purpose In the present study, we investigated the potential to support clinical decision-making using an interactive chatbot designed to provide personalized imaging recommendations from American College of Radiology (ACR) appropriateness criteria documents using semantic similarity processing. Methods We utilized 209 ACR appropriateness criteria documents as specialized knowledge base and employed LlamaIndex, a framework that allows to connect large language models with external data, and the ChatGPT 3.5-Turbo to create an appropriateness criteria contexted chatbot (accGPT). Fifty clinical case files were used to compare the accGPT's performance against general radiologists at varying experience levels and to generic ChatGPT 3.5 and 4.0. Results All chatbots reached at least human performance level. For the 50 case files, the accGPT performed best in providing correct recommendations that were "usually appropriate" according to the ACR criteria and also did provide the highest proportion of consistently correct answers in comparison with generic chatbots and radiologists. Further, the chatbots provided substantial time and cost savings, with an average decision time of 5 minutes and a cost of 0.19 for all cases, compared to 50 minutes and 29.99 for radiologists (both p < 0.01). Conclusion ChatGPT-based algorithms have the potential to substantially improve the decision-making for clinical imaging studies in accordance with ACR guidelines. Specifically, a context-based algorithm performed superior to its generic counterpart, demonstrating the value of tailoring AI solutions to specific healthcare applications.

Assuntos

Algoritmos , Software , Humanos , Tomada de Decisão Clínica , Redução de Custos , Radiologistas

17.

RF-induced heating of interventional devices at 23.66 MHz.

Özen, Ali Caglar; Russe, Maximilian Frederik; Lottner, Thomas; Reiss, Simon; Littin, Sebastian; Zaitsev, Maxim; Bock, Michael.

MAGMA ; 36(3): 439-449, 2023 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-37195365

RESUMO

OBJECTIVE: Low-field MRI systems are expected to cause less RF heating in conventional interventional devices due to lower Larmor frequency. We systematically evaluate RF-induced heating of commonly used intravascular devices at the Larmor frequency of a 0.55 T system (23.66 MHz) with a focus on the effect of patient size, target organ, and device position on maximum temperature rise. MATERIALS AND METHODS: To assess RF-induced heating, high-resolution measurements of the electric field, temperature, and transfer function were combined. Realistic device trajectories were derived from vascular models to evaluate the variation of the temperature increase as a function of the device trajectory. At a low-field RF test bench, the effects of patient size and positioning, target organ (liver and heart) and body coil type were measured for six commonly used interventional devices (two guidewires, two catheters, an applicator and a biopsy needle). RESULTS: Electric field mapping shows that the hotspots are not necessarily localized at the device tip. Of all procedures, the liver catheterizations showed the lowest heating, and a modification of the transmit body coil could further reduce the temperature increase. For common commercial needles no significant heating was measured at the needle tip. Comparable local SAR values were found in the temperature measurements and the TF-based calculations. CONCLUSION: At low fields, interventions with shorter insertion lengths such as hepatic catheterizations result in less RF-induced heating than coronary interventions. The maximum temperature increase depends on body coil design.

Assuntos

Calefação , Ondas de Rádio , Humanos , Imageamento por Ressonância Magnética/métodos , Temperatura , Imagens de Fantasmas , Temperatura Alta

18.

Deep learning segmentation results in precise delineation of the putamen in multiple system atrophy.

Rau, Alexander; Schröter, Nils; Rijntjes, Michel; Bamberg, Fabian; Jost, Wolfgang H; Zaitsev, Maxim; Weiller, Cornelius; Rau, Stephan; Urbach, Horst; Reisert, Marco; Russe, Maximilian F.

Eur Radiol ; 33(10): 7160-7167, 2023 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-37121929

RESUMO

OBJECTIVES: The precise segmentation of atrophic structures remains challenging in neurodegenerative diseases. We determined the performance of a Deep Neural Patchwork (DNP) in comparison to established segmentation algorithms regarding the ability to delineate the putamen in multiple system atrophy (MSA), Parkinson's disease (PD), and healthy controls. METHODS: We retrospectively included patients with MSA and PD as well as healthy controls. A DNP was trained on manual segmentations of the putamen as ground truth. For this, the cohort was randomly split into a training (N = 131) and test set (N = 120). The DNP's performance was compared with putaminal segmentations as derived by Automatic Anatomic Labelling, Freesurfer and Fastsurfer. For validation, we assessed the diagnostic accuracy of the resulting segmentations in the delineation of MSA vs. PD and healthy controls. RESULTS: A total of 251 subjects (61 patients with MSA, 158 patients with PD, and 32 healthy controls; mean age of 61.5 ± 8.8 years) were included. Compared to the dice-coefficient of the DNP (0.96), we noted significantly weaker performance for AAL3 (0.72; p < .001), Freesurfer (0.82; p < .001), and Fastsurfer (0.84, p < .001). This was corroborated by the superior diagnostic performance of MSA vs. PD and HC of the DNP (AUC 0.93) versus the AUC of 0.88 for AAL3 (p = 0.02), 0.86 for Freesurfer (p = 0.048), and 0.85 for Fastsurfer (p = 0.04). CONCLUSION: By utilization of a DNP, accurate segmentations of the putamen can be obtained even if substantial atrophy is present. This allows for more precise extraction of imaging parameters or shape features from the putamen in relevant patient cohorts. CLINICAL RELEVANCE STATEMENT: Deep learning-based segmentation of the putamen was superior to currently available algorithms and is beneficial for the diagnosis of multiple system atrophy. KEY POINTS: â¢ A Deep Neural Patchwork precisely delineates the putamen and performs equal to human labeling in multiple system atrophy, even when pronounced putaminal volume loss is present. â¢ The Deep Neural Patchwork-based segmentation was more capable to differentiate between multiple system atrophy and Parkinson's disease than the AAL3 atlas, Freesurfer, or Fastsurfer.

Assuntos

Aprendizado Profundo , Atrofia de Múltiplos Sistemas , Doença de Parkinson , Humanos , Pessoa de Meia-Idade , Idoso , Atrofia de Múltiplos Sistemas/diagnóstico por imagem , Doença de Parkinson/diagnóstico por imagem , Putamen/diagnóstico por imagem , Estudos Retrospectivos , Imageamento por Ressonância Magnética/métodos

19.

Photon-Counting Computed Tomography - Basic Principles, Potenzial Benefits, and Initial Clinical Experience.

Stein, Thomas; Rau, Alexander; Russe, Maximilian Frederik; Arnold, Philipp; Faby, Sebastian; Ulzheimer, Stefan; Weis, Meike; Froelich, Matthias F; Overhoff, Daniel; Horger, Marius; Hagen, Florian; Bongers, Malte; Nikolaou, Konstantin; Schönberg, Stefan O; Bamberg, Fabian; Weiß, Jakob.

Rofo ; 195(8): 691-698, 2023 08.

Artigo em Inglês | MEDLINE | ID: mdl-36863367

RESUMO

BACKGROUND: Photon-counting computed tomography (PCCT) is a promising new technology with the potential to fundamentally change today's workflows in the daily routine and to provide new quantitative imaging information to improve clinical decision-making and patient management. METHOD: The content of this review is based on an unrestricted literature search on PubMed and Google Scholar using the search terms "Photon-Counting CT", "Photon-Counting detector", "spectral CT", "Computed Tomography" as well as on the authors' experience. RESULTS: The fundamental difference with respect to the currently established energy-integrating CT detectors is that PCCT allows counting of every single photon at the detector level. Based on the identified literature, PCCT phantom measurements and initial clinical studies have demonstrated that the new technology allows improved spatial resolution, reduced image noise, and new possibilities for advanced quantitative image postprocessing. CONCLUSION: For clinical practice, the potential benefits include fewer beam hardening artifacts, radiation dose reduction, and the use of new contrast agents. In this review, we will discuss basic technical principles and potential clinical benefits and demonstrate first clinical use cases. KEY POINTS: · Photon-counting computed tomography (PCCT) has been implemented in the clinical routine. · Compared to energy-integrating detector CT, PCCT allows the reduction of electronic image noise. · PCCT provides increased spatial resolution and a higher contrast-to-noise ratio. · The novel detector technology allows the quantification of spectral information. CITATION FORMAT: · Stein T, Rau A, Russe MF etâal. Photon-Counting Computed Tomography - Basic Principles, Potenzial Benefits, and Initial Clinical Experience. Fortschr Röntgenstr 2023; 195: 691â-â698.

Assuntos

Fótons , Tomografia Computadorizada por Raios X , Humanos , Tomografia Computadorizada por Raios X/métodos , Imagens de Fantasmas

20.

Multiclass datasets expand neural network utility: an example on ankle radiographs.

Kim, Suam; Rebmann, Philipp; Tran, Phuong Hien; Kellner, Elias; Reisert, Marco; Steybe, David; Bayer, Jörg; Bamberg, Fabian; Kotter, Elmar; Russe, Maximilian.

Int J Comput Assist Radiol Surg ; 18(5): 819-826, 2023 May.

Artigo em Inglês | MEDLINE | ID: mdl-36729290

RESUMO

PURPOSE: Artificial intelligence in computer vision has been increasingly adapted in clinical application since the implementation of neural networks, potentially providing incremental information beyond the mere detection of pathology. As its algorithmic approach propagates input variation, neural networks could be used to identify and evaluate relevant image features. In this study, we introduce a basic dataset structure and demonstrate a pertaining use case. METHODS: A multidimensional classification of ankle x-rays (n = 1493) rating a variety of features including fracture certainty was used to confirm its usability for separating input variations. We trained a customized neural network on the task of fracture detection using a state-of-the-art preprocessing and training protocol. By grouping the radiographs into subsets according to their image features, the influence of selected features on model performance was evaluated via selective training. RESULTS: The models trained on our dataset outperformed most comparable models of current literature with an ROC AUC of 0.943. Excluding ankle x-rays with signs of surgery improved fracture classification performance (AUC 0.955), while limiting the training set to only healthy ankles with and without fracture had no consistent effect. CONCLUSION: Using multiclass datasets and comparing model performance, we were able to demonstrate signs of surgery as a confounding factor, which, following elimination, improved our model. Also eliminating pathologies other than fracture in contrast had no effect on model performance, suggesting a beneficial influence of feature variability for robust model training. Thus, multiclass datasets allow for evaluation of distinct image features, deepening our understanding of pathology imaging.

Assuntos

Inteligência Artificial , Fraturas Ósseas , Humanos , Tornozelo , Redes Neurais de Computação , Radiografia , Diagnóstico por Imagem , Fraturas Ósseas/diagnóstico por imagem

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA