Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 51
Filter
1.
Eur J Radiol ; 178: 111633, 2024 Jul 15.
Article in English | MEDLINE | ID: mdl-39067266

ABSTRACT

PURPOSE: To assess the image quality and impact on acquisition time of a novel deep learning based T2 Dixon sequence (T2DL) of the spine. METHODS: This prospective, single center study included n = 44 consecutive patients with a clinical indication for lumbar MRI at our university radiology department between September 2022 and March 2023. MRI examinations were performed on 1.5-T and 3-T scanners (MAGNETOM Aera and Vida; Siemens Healthineers, Erlangen, Germany) using dedicated spine coils. The MR study protocol consisted of our standard clinical protocol, including a T2 weighted standard Dixon sequence (T2std) and an additional T2DL acquisition. The latter used a conventional sampling pattern with a higher parallel acceleration factor. The individual contrasts acquired for Dixon water-fat separation were then reconstructed using a dedicated research application. After reconstruction of the contrast images from k-space data, a conventional water-fat separation was performed to provide derived water images. Two readers with 6 and 4 years of experience in interpreting MSK imaging, respectively, analyzed the images in a randomized fashion. Regarding overall image quality, banding artifacts, artifacts, sharpness, noise, and diagnostic confidence were analyzed using a 5-point Likert scale (from 1 = non-diagnostic to 5 = excellent image quality). Statistical analyses included the Wilcoxon signed-rank test and weighted Cohen's kappa statistics. RESULTS: Forty-four patients (mean age 53 years (±18), male sex: 39 %) were prospectively included. Thirty-one examinations were performed on 1.5 T and 13 examinations on 3 T scanners. A sequence was successfully acquired in all patients. The total acquisition time of T2DL was 93 s at 1.5-T and 86 s at 3-T, compared to 235 s, and 257 s, respectively for T2std (reduction of acquisition time: 60.4 % at 1.5-T, and 66.5 % at 3-T; p < 0.01). Overall image quality was rated equal for both sequences (median T2DL: 5[3 -5], and median T2std: 5 [2 -5]; p = 0.57). T2DL showed significantly reduced noise levels compared to T2std (5 [4 -5] versus 4 [3 -4]; p < 0.001). In addition, sharpness was rated to be significantly higher in T2DL (5 [4 -5] versus 4 [3 -5]; p < 0.001). Although T2DL displayed significantly more banding artifacts (5 [2 -5] versus 5 [4 -5]; p < 0.001), no significant impact on readers diagnostic confidence between sequences was noted (T2std: 5 [2 -5], and T2DL: 5 [3 -5]; p = 0.61). Substantial inter-reader and intrareader agreement was observed for T2DL overall image quality (κ: 0.77, and κ: 0.8, respectively). CONCLUSION: T2DL is feasible, yields an image quality comparable to the reference standard while substantially reducing the acquisition time.

3.
Eur Radiol Exp ; 8(1): 60, 2024 May 17.
Article in English | MEDLINE | ID: mdl-38755410

ABSTRACT

BACKGROUND: We investigated the potential of an imaging-aware GPT-4-based chatbot in providing diagnoses based on imaging descriptions of abdominal pathologies. METHODS: Utilizing zero-shot learning via the LlamaIndex framework, GPT-4 was enhanced using the 96 documents from the Radiographics Top 10 Reading List on gastrointestinal imaging, creating a gastrointestinal imaging-aware chatbot (GIA-CB). To assess its diagnostic capability, 50 cases on a variety of abdominal pathologies were created, comprising radiological findings in fluoroscopy, MRI, and CT. We compared the GIA-CB to the generic GPT-4 chatbot (g-CB) in providing the primary and 2 additional differential diagnoses, using interpretations from senior-level radiologists as ground truth. The trustworthiness of the GIA-CB was evaluated by investigating the source documents as provided by the knowledge-retrieval mechanism. Mann-Whitney U test was employed. RESULTS: The GIA-CB demonstrated a high capability to identify the most appropriate differential diagnosis in 39/50 cases (78%), significantly surpassing the g-CB in 27/50 cases (54%) (p = 0.006). Notably, the GIA-CB offered the primary differential in the top 3 differential diagnoses in 45/50 cases (90%) versus g-CB with 37/50 cases (74%) (p = 0.022) and always with appropriate explanations. The median response time was 29.8 s for GIA-CB and 15.7 s for g-CB, and the mean cost per case was $0.15 and $0.02, respectively. CONCLUSIONS: The GIA-CB not only provided an accurate diagnosis for gastrointestinal pathologies, but also direct access to source documents, providing insight into the decision-making process, a step towards trustworthy and explainable AI. Integrating context-specific data into AI models can support evidence-based clinical decision-making. RELEVANCE STATEMENT: A context-aware GPT-4 chatbot demonstrates high accuracy in providing differential diagnoses based on imaging descriptions, surpassing the generic GPT-4. It provided formulated rationale and source excerpts supporting the diagnoses, thus enhancing trustworthy decision-support. KEY POINTS: • Knowledge retrieval enhances differential diagnoses in a gastrointestinal imaging-aware chatbot (GIA-CB). • GIA-CB outperformed the generic counterpart, providing formulated rationale and source excerpts. • GIA-CB has the potential to pave the way for AI-assisted decision support systems.


Subject(s)
Proof of Concept Study , Humans , Diagnosis, Differential , Gastrointestinal Diseases/diagnostic imaging
4.
Dtsch Arztebl Int ; 121(9): 284-290, 2024 May 03.
Article in English | MEDLINE | ID: mdl-38530931

ABSTRACT

BACKGROUND: Population-wide research on potential new imaging biomarkers of the kidney depends on accurate automated segmentation of the kidney and its compartments (cortex, medulla, and sinus). METHODS: We developed a robust deep-learning framework for kidney (sub-)segmentation based on a hierarchical, three-dimensional convolutional neural network (CNN) that was optimized for multiscale problems of combined localization and segmentation. We applied the CNN to abdominal magnetic resonance images from the population-based German National Cohort (NAKO) study. RESULTS: There was good to excellent agreement between the model predictions and manual segmentations. The median values for the body-surface normalized total kidney, cortex, medulla, and sinus volumes of 9934 persons were 158, 115, 43, and 24 mL/m2. Distributions of these markers are provided both for the overall study population and for a subgroup of persons without kidney disease or any associated conditions. Multivariable adjusted regression analyses revealed that diabetes, male sex, and a higher estimated glomerular filtration rate (eGFR) are important predictors of higher total and cortical volumes. Each increase of eGFR by one unit (i.e., 1 mL/min per 1.73 m2 body surface area) was associated with a 0.98 mL/m2 increase in total kidney volume, and this association was significant. Volumes were lower in persons with eGFR-defined chronic kidney disease. CONCLUSION: The extraction of image-based biomarkers through CNN-based renal sub-segmentation using data from a population-based study yields reliable results, forming a solid foundation for future investigations.


Subject(s)
Kidney , Magnetic Resonance Imaging , Humans , Male , Female , Magnetic Resonance Imaging/methods , Magnetic Resonance Imaging/statistics & numerical data , Kidney/diagnostic imaging , Middle Aged , Aged , Adult , Germany , Glomerular Filtration Rate/physiology , Biomarkers/analysis , Neural Networks, Computer , Deep Learning , Cohort Studies
5.
Artif Intell Med ; 150: 102843, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38553152

ABSTRACT

Osteoarthritis of the knee, a widespread cause of knee disability, is commonly treated in orthopedics due to its rising prevalence. Lower extremity misalignment, pivotal in knee injury etiology and management, necessitates comprehensive mechanical alignment evaluation via frequently-requested weight-bearing long leg radiographs (LLR). Despite LLR's routine use, current analysis techniques are error-prone and time-consuming. To address this, we conducted a multicentric study to develop and validate a deep learning (DL) model for fully automated leg alignment assessment on anterior-posterior LLR, targeting enhanced reliability and efficiency. The DL model, developed using 594 patients' LLR and a 60%/10%/30% data split for training, validation, and testing, executed alignment analyses via a multi-step process, employing a detection network and nine specialized networks. It was designed to assess all vital anatomical and mechanical parameters for standard clinical leg deformity analysis and preoperative planning. Accuracy, reliability, and assessment duration were compared with three specialized orthopedic surgeons across two distinct institutional datasets (136 and 143 radiographs). The algorithm exhibited equivalent performance to the surgeons in terms of alignment accuracy (DL: 0.21 ± 0.18°to 1.06 ± 1.3°vs. OS: 0.21 ± 0.16°to 1.72 ± 1.96°), interrater reliability (ICC DL: 0.90 ± 0.05 to 1.0 ± 0.0 vs. ICC OS: 0.90 ± 0.03 to 1.0 ± 0.0), and clinically acceptable accuracy (DL: 53.9%-100% vs OS 30.8%-100%). Further, automated analysis significantly reduced analysis time compared to manual annotation (DL: 22 ± 0.6 s vs. OS; 101.7 ± 7 s, p ≤ 0.01). By demonstrating that our algorithm not only matches the precision of expert surgeons but also significantly outpaces them in both speed and consistency of measurements, our research underscores a pivotal advancement in harnessing AI to enhance clinical efficiency and decision-making in orthopaedics.


Subject(s)
Deep Learning , Humans , Reproducibility of Results , Lower Extremity/diagnostic imaging , Lower Extremity/surgery , Knee Joint , Radiography , Retrospective Studies
6.
Cancer Rep (Hoboken) ; 7(3): e1992, 2024 03.
Article in English | MEDLINE | ID: mdl-38441351

ABSTRACT

BACKGROUND: Doege-Potter syndrome is defined as paraneoplastic hypoinsulinemic hypoglycemia associated with a benign or malignant solitary fibrous tumor frequently located in pleural, but also extrapleural sites. Hypoglycemia can be attributed to paraneoplastic secretion of "Big-IGF-II," a precursor of Insulin-like growth factor-II. This prohormone aberrantly binds to and activates insulin receptors, with consecutive initiation of common insulin actions such as inhibition of gluconeogenesis, activation of glycolysis and stimulation of cellular glucose uptake culminating in recurrent tumor-induced hypoglycemic episodes. Complete tumor resection or debulking surgery is considered the most promising treatment for DPS. CASE: Here, we report a rare case of a recurrent Doege-Poter Syndrome with atypical gelatinous tumor lesions of the lung, pleura and pericardial fat tissue in an 87-year-old woman. Although previously described as ineffective, we propose that adjuvant treatment with Octreotide in conjunction with intravenous glucose helped to maintain tolerable blood glucose levels before tumor resection. The somatostatin-analogue Lanreotide was successfully used after tumor debulking surgery (R2-resection) to maintain adequate blood glucose control. CONCLUSION: We conclude that somatostatin-analogues bear the potential of being effective in conjunction with limited surgical approaches for the treatment of hypoglycemia in recurrent or non-totally resectable SFT entities underlying DPS.


Subject(s)
Congenital Abnormalities , Hypoglycemia , Kidney Diseases/congenital , Kidney/abnormalities , Neoplasms , Female , Humans , Aged, 80 and over , Somatostatin , Hypoglycemia/etiology
7.
Artif Organs ; 48(5): 550-558, 2024 May.
Article in English | MEDLINE | ID: mdl-38409825

ABSTRACT

BACKGROUND: In conventional left ventricular assist devices (LVAD), a separate outflow graft is sutured to the ascending aorta. Novel device designs may include a transventricular outflow cannula crossing the aortic valve (AV). While transversal ventricular dimensions are well investigated in patients with severe heart failure, little is known about the longitudinal dimensions. These dimensions are, however, particularly critical for the design and development of mechanical circulatory support (MCS) devices with transaortic outflow cannula. METHODS: In an explorative retrospective cohort study at the University Medical Center Freiburg, Germany, the longitudinal cardiac dimensions of patients undergoing computed tomography angiography (CTA) before and, if available, after LVAD implantation were analyzed. Among others, the following dimensions were assessed: (a) apex to AV, (b) apex to mitral valve, (c) AV to sinotubular junction (STJ), (d) apex to STJ, (e) apex to brachiocephalic artery (BCA), and (f) AV to BCA. RESULTS: In total, 44 LVAD patients (36 male, age 55.8 years, height 1.75 m) were included. The longitudinal cardiac dimensions were (a) 114.5 ± 12.1 mm, (b) 108.0 ± 12.4 mm, (c) 20.9 ± 2.9, (d) 135.4 ± 13.4 mm, (e) 206.0 ± 18.3, and (f) 91.5 ± 9.8 mm. Postoperatively, (a) and (b) decreased by 31.5% and 39.5%, respectively (N = 14). CONCLUSIONS: Longitudinal cardiac dimensions may be reduced by up to 40% after LVAD implantation. A better knowledge of these dimensions and their postoperative alterations in LVAD patients may improve surgical planning and help to design MCS devices with transventricular outflow cannula.


Subject(s)
Heart Failure , Heart-Assist Devices , Thoracic Surgical Procedures , Humans , Male , Middle Aged , Retrospective Studies , Aorta, Thoracic/surgery , Aorta , Aortic Valve , Heart-Assist Devices/adverse effects , Heart Failure/surgery , Treatment Outcome
8.
Eur Radiol Exp ; 8(1): 23, 2024 Feb 14.
Article in English | MEDLINE | ID: mdl-38353812

ABSTRACT

BACKGROUND: The growing prevalence of musculoskeletal diseases increases radiologic workload, highlighting the need for optimized workflow management and automated metadata classification systems. We developed a large-scale, well-characterized dataset of musculoskeletal radiographs and trained deep learning neural networks to classify radiographic projection and body side. METHODS: In this IRB-approved retrospective single-center study, a dataset of musculoskeletal radiographs from 2011 to 2019 was retrieved and manually labeled for one of 45 possible radiographic projections and the depicted body side. Two classification networks were trained for the respective tasks using the Xception architecture with a custom network top and pretrained weights. Performance was evaluated on a hold-out test sample, and gradient-weighted class activation mapping (Grad-CAM) heatmaps were computed to visualize the influential image regions for network predictions. RESULTS: A total of 13,098 studies comprising 23,663 radiographs were included with a patient-level dataset split, resulting in 19,183 training, 2,145 validation, and 2,335 test images. Focusing on paired body regions, training for side detection included 16,319 radiographs (13,284 training, 1,443 validation, and 1,592 test images). The models achieved an overall accuracy of 0.975 for projection and 0.976 for body-side classification on the respective hold-out test sample. Errors were primarily observed in projections with seamless anatomical transitions or non-orthograde adjustment techniques. CONCLUSIONS: The deep learning neural networks demonstrated excellent performance in classifying radiographic projection and body side across a wide range of musculoskeletal radiographs. These networks have the potential to serve as presorting algorithms, optimizing radiologic workflow and enhancing patient care. RELEVANCE STATEMENT: The developed networks excel at classifying musculoskeletal radiographs, providing valuable tools for research data extraction, standardized image sorting, and minimizing misclassifications in artificial intelligence systems, ultimately enhancing radiology workflow efficiency and patient care. KEY POINTS: • A large-scale, well-characterized dataset was developed, covering a broad spectrum of musculoskeletal radiographs. • Deep learning neural networks achieved high accuracy in classifying radiographic projection and body side. • Grad-CAM heatmaps provided insight into network decisions, contributing to their interpretability and trustworthiness. • The trained models can help optimize radiologic workflow and manage large amounts of data.


Subject(s)
Deep Learning , Radiology , Humans , Artificial Intelligence , Retrospective Studies , Radiography
9.
Rofo ; 2024 Feb 26.
Article in English | MEDLINE | ID: mdl-38408477

ABSTRACT

PURPOSE: Large language models (LLMs) such as ChatGPT have shown significant potential in radiology. Their effectiveness often depends on prompt engineering, which optimizes the interaction with the chatbot for accurate results. Here, we highlight the critical role of prompt engineering in tailoring the LLMs' responses to specific medical tasks. MATERIALS AND METHODS: Using a clinical case, we elucidate different prompting strategies to adapt the LLM ChatGPT using GPT4 to new tasks without additional training of the base model. These approaches range from precision prompts to advanced in-context methods such as few-shot and zero-shot learning. Additionally, the significance of embeddings, which serve as a data representation technique, is discussed. RESULTS: Prompt engineering substantially improved and focused the chatbot's output. Moreover, embedding of specialized knowledge allows for more transparent insight into the model's decision-making and thus enhances trust. CONCLUSION: Despite certain challenges, prompt engineering plays a pivotal role in harnessing the potential of LLMs for specialized tasks in the medical domain, particularly radiology. As LLMs continue to evolve, techniques like few-shot learning, zero-shot learning, and embedding-based retrieval mechanisms will become indispensable in delivering tailored outputs. KEY POINTS: · Large language models might impact radiological practice and decision-masking.. · However, implementation and performance are dependent on the assigned task.. · Optimization of prompting strategies can substantially improve model performance.. · Strategies for prompt engineering range from precision prompts to zero-shot learning..

10.
Dentomaxillofac Radiol ; 53(2): 109-114, 2024 Feb 08.
Article in English | MEDLINE | ID: mdl-38180877

ABSTRACT

OBJECTIVES: To develop a content-aware chatbot based on GPT-3.5-Turbo and GPT-4 with specialized knowledge on the German S2 Cone-Beam CT (CBCT) dental imaging guideline and to compare the performance against humans. METHODS: The LlamaIndex software library was used to integrate the guideline context into the chatbots. Based on the CBCT S2 guideline, 40 questions were posed to content-aware chatbots and early career and senior practitioners with different levels of experience served as reference. The chatbots' performance was compared in terms of recommendation accuracy and explanation quality. Chi-square test and one-tailed Wilcoxon signed rank test evaluated accuracy and explanation quality, respectively. RESULTS: The GPT-4 based chatbot provided 100% correct recommendations and superior explanation quality compared to the one based on GPT3.5-Turbo (87.5% vs. 57.5% for GPT-3.5-Turbo; P = .003). Moreover, it outperformed early career practitioners in correct answers (P = .002 and P = .032) and earned higher trust than the chatbot using GPT-3.5-Turbo (P = 0.006). CONCLUSIONS: A content-aware chatbot using GPT-4 reliably provided recommendations according to current consensus guidelines. The responses were deemed trustworthy and transparent, and therefore facilitate the integration of artificial intelligence into clinical decision-making.


Subject(s)
Artificial Intelligence , Software , Humans , Clinical Decision-Making , Cone-Beam Computed Tomography , Consensus
11.
BMJ Open ; 14(1): e076954, 2024 01 23.
Article in English | MEDLINE | ID: mdl-38262641

ABSTRACT

OBJECTIVES: To aid in selecting the optimal artificial intelligence (AI) solution for clinical application, we directly compared performances of selected representative custom-trained or commercial classification, detection and segmentation models for fracture detection on musculoskeletal radiographs of the distal radius by aligning their outputs. DESIGN AND SETTING: This single-centre retrospective study was conducted on a random subset of emergency department radiographs from 2008 to 2018 of the distal radius in Germany. MATERIALS AND METHODS: An image set was created to be compatible with training and testing classification and segmentation models by annotating examinations for fractures and overlaying fracture masks, if applicable. Representative classification and segmentation models were trained on 80% of the data. After output binarisation, their derived fracture detection performances as well as that of a standard commercially available solution were compared on the remaining X-rays (20%) using mainly accuracy and area under the receiver operating characteristic (AUROC). RESULTS: A total of 2856 examinations with 712 (24.9%) fractures were included in the analysis. Accuracies reached up to 0.97 for the classification model, 0.94 for the segmentation model and 0.95 for BoneView. Cohen's kappa was at least 0.80 in pairwise comparisons, while Fleiss' kappa was 0.83 for all models. Fracture predictions were visualised with all three methods at different levels of detail, ranking from downsampled image region for classification over bounding box for detection to single pixel-level delineation for segmentation. CONCLUSIONS: All three investigated approaches reached high performances for detection of distal radius fractures with simple preprocessing and postprocessing protocols on the custom-trained models. Despite their underlying structural differences, selection of one's fracture analysis AI tool in the frame of this study reduces to the desired flavour of automation: automated classification, AI-assisted manual fracture reading or minimised false negatives.


Subject(s)
Deep Learning , Fractures, Bone , Humans , X-Rays , Artificial Intelligence , Radius , Retrospective Studies
12.
Rofo ; 196(1): 25-35, 2024 Jan.
Article in English, German | MEDLINE | ID: mdl-37793417

ABSTRACT

BACKGROUND: Photon-counting detector computed tomography (PCD-CT) is a promising new technology with the potential to fundamentally change workflows in the daily routine and provide new quantitative imaging information to improve clinical decision-making and patient management. METHOD: The contents of this review are based on an unrestricted literature search of PubMed and Google Scholar using the search terms "photon-counting CT", "photon-counting detector", "spectral CT", "computed tomography" as well as on the authors' own experience. RESULTS: The fundamental difference with respect to the currently established energy-integrating CT detectors is that PCD-CT allows for the counting of every single photon at the detector level. Based on the identified literature, PCD-CT phantom measurements and initial clinical studies have demonstrated that the new technology allows for improved spatial resolution, reduced image noise, and new possibilities for advanced quantitative image postprocessing. CONCLUSION: For clinical practice, the potential benefits include fewer beam hardening artifacts, a radiation dose reduction, and the use of new or combinations of contrast agents. In particular, critical patient groups such as oncological, cardiovascular, lung, and head & neck as well as pediatric patient collectives benefit from the clinical advantages. KEY POINTS: · Photon-counting computed tomography (PCD-CT) is being used for the first time in routine clinical practice, enabling a significant dose reduction in critical patient populations such as oncology, cardiology, and pediatrics.. · Compared to conventional CT, PCD-CT enables a reduction in electronic image noise.. · Due to the spectral data sets, PCD-CT enables fully comprehensive post-processing applications.. CITATION FORMAT: · Hagen F, Soschynski M, Weis M et al. Photon-counting computed tomography - clinical application in oncological, cardiovascular, and pediatric radiology. Fortschr Röntgenstr 2024; 196: 25 - 34.


Subject(s)
Radiology , Tomography, X-Ray Computed , Humans , Child , Tomography, X-Ray Computed/methods , Contrast Media , Thorax , Phantoms, Imaging , Lung
13.
Sci Rep ; 13(1): 22745, 2023 12 20.
Article in English | MEDLINE | ID: mdl-38123791

ABSTRACT

In magnetic resonance imaging (MRI), the perception of substandard image quality may prompt repetition of the respective image acquisition protocol. Subsequently selecting the preferred high-quality image data from a series of acquisitions can be challenging. An automated workflow may facilitate and improve this selection. We therefore aimed to investigate the applicability of an automated image quality assessment for the prediction of the subjectively preferred image acquisition. Our analysis included data from 11,347 participants with whole-body MRI examinations performed as part of the ongoing prospective multi-center German National Cohort (NAKO) study. Trained radiologic technologists repeated any of the twelve examination protocols due to induced setup errors and/or subjectively unsatisfactory image quality and chose a preferred acquisition from the resultant series. Up to 11 quantitative image quality parameters were automatically derived from all acquisitions. Regularized regression and standard estimates of diagnostic accuracy were calculated. Controlling for setup variations in 2342 series of two or more acquisitions, technologists preferred the repetition over the initial acquisition in 1116 of 1396 series in which the initial setup was retained (79.9%, range across protocols: 73-100%). Image quality parameters then commonly showed statistically significant differences between chosen and discarded acquisitions. In regularized regression across all protocols, 'structured noise maximum' was the strongest predictor for the technologists' choice, followed by 'N/2 ghosting average'. Combinations of the automatically derived parameters provided an area under the ROC curve between 0.51 and 0.74 for the prediction of the technologists' choice. It is concluded that automated image quality assessment can, despite considerable performance differences between protocols and anatomical regions, contribute substantially to identifying the subjective preference in a series of MRI acquisitions and thus provide effective decision support to readers.


Subject(s)
Magnetic Resonance Imaging , Humans , Cohort Studies , Prospective Studies , Magnetic Resonance Imaging/methods , ROC Curve , Longitudinal Studies
14.
Z Med Phys ; 2023 Dec 15.
Article in English | MEDLINE | ID: mdl-38104007

ABSTRACT

OBJECTIVES: Despite their life-saving capabilities, cerebrospinal fluid (CSF) shunts exhibit high failure rates, with a large fraction of failures attributed to the regulating valve. Due to a lack of methods for the detailed analysis of valve malfunctions, failure mechanisms are not well understood, and valves often have to be surgically explanted on the mere suspicion of malfunction. The presented pilot study aims to demonstrate radiological methods for comprehensive analysis of CSF shunt valves, considering both the potential for failure analysis in design optimization, and for future clinical in-vivo application to reduce the number of required shunt revision surgeries. The proposed method could also be utilized to develop and support in situ repair methods (e.g. by lysis or ultrasound) of malfunctioning CSF shunt valves. MATERIALS AND METHODS: The primary methods described are contrast-enhanced radiographic time series of CSF shunt valves, taken in a favorable projection geometry at low radiation dose, and the machine-learning-based diagnosis of CSF shunt valve obstructions. Complimentarily, we investigate CT-based methods capable of providing accurate ground truth for the training of such diagnostic tools. Using simulated test and training data, the performance of the machine-learning diagnostics in identifying and localizing obstructions within a shunt valve is evaluated regarding per-pixel sensitivity and specificity, the Dice similarity coefficient, and the false positive rate in the case of obstruction free test samples. RESULTS: Contrast enhanced subtraction radiography allows high-resolution, time-resolved, low-dose analysis of fluid transport in CSF shunt valves. Complementarily, photon-counting micro-CT allows to investigate valve obstruction mechanisms in detail, and to generate valid ground truth for machine learning-based diagnostics. Machine-learning-based detection of valve obstructions in simulated radiographies shows promising results, with a per-pixel sensitivity >70%, per-pixel specificity >90%, a median Dice coefficient >0.8 and <10% false positives at a detection threshold of 0.5. CONCLUSIONS: This ex-vivo study demonstrates obstruction detection in cerebro-spinal fluid shunt valves, combining radiological methods with machine learning under conditions compatible to future in-vivo application. Results indicate that high-resolution contrast-enhanced subtraction radiography, possibly including time-series data, combined with machine-learning image analysis, has the potential to strongly improve the diagnostics of CSF shunt valve failures. The presented method is in principle suitable for in-vivo application, considering both measurement geometry and radiological dose. Further research is needed to validate these results on real-world data and to refine the employed methods. In combination, the presented methods enable comprehensive analysis of valve failure mechanisms, paving the way for improved product development and clinical diagnostics of CSF shunt valves.

15.
Sci Rep ; 13(1): 14215, 2023 08 30.
Article in English | MEDLINE | ID: mdl-37648742

ABSTRACT

While radiologists can describe a fracture's morphology and complexity with ease, the translation into classification systems such as the Arbeitsgemeinschaft Osteosynthesefragen (AO) Fracture and Dislocation Classification Compendium is more challenging. We tested the performance of generic chatbots and chatbots aware of specific knowledge of the AO classification provided by a vector-index and compared it to human readers. In the 100 radiological reports we created based on random AO codes, chatbots provided AO codes significantly faster than humans (mean 3.2 s per case vs. 50 s per case, p < .001) though not reaching human performance (max. chatbot performance of 86% correct full AO codes vs. 95% in human readers). In general, chatbots based on GPT 4 outperformed the ones based on GPT 3.5-Turbo. Further, we found that providing specific knowledge substantially enhances the chatbot's performance and consistency as the context-aware chatbot based on GPT 4 provided 71% consistent correct full AO codes for the compared to the 2% consistent correct full AO codes for the generic ChatGPT 4. This provides evidence, that refining and providing specific context to ChatGPT will be the next essential step in harnessing its power.


Subject(s)
Fractures, Bone , Radiology , Humans , Awareness , Drugs, Generic , Radiologists
16.
Dentomaxillofac Radiol ; 52(6): 20230059, 2023 Sep.
Article in English | MEDLINE | ID: mdl-37427585

ABSTRACT

OBJECTIVES: This study evaluated the accuracy of deep neural patchworks (DNPs), a deep learning-based segmentation framework, for automated identification of 60 cephalometric landmarks (bone-, soft tissue- and tooth-landmarks) on CT scans. The aim was to determine whether DNP could be used for routine three-dimensional cephalometric analysis in diagnostics and treatment planning in orthognathic surgery and orthodontics. METHODS: Full skull CT scans of 30 adult patients (18 female, 12 male, mean age 35.6 years) were randomly divided into a training and test data set (each n = 15). Clinician A annotated 60 landmarks in all 30 CT scans. Clinician B annotated 60 landmarks in the test data set only. The DNP was trained using spherical segmentations of the adjacent tissue for each landmark. Automated landmark predictions in the separate test data set were created by calculating the center of mass of the predictions. The accuracy of the method was evaluated by comparing these annotations to the manual annotations. RESULTS: The DNP was successfully trained to identify all 60 landmarks. The mean error of our method was 1.94 mm (SD 1.45 mm) compared to a mean error of 1.32 mm (SD 1.08 mm) for manual annotations. The minimum error was found for landmarks ANS 1.11 mm, SN 1.2 mm, and CP_R 1.25 mm. CONCLUSION: The DNP-algorithm was able to accurately identify cephalometric landmarks with mean errors <2 mm. This method could improve the workflow of cephalometric analysis in orthodontics and orthognathic surgery. Low training requirements while still accomplishing high precision make this method particularly promising for clinical use.


Subject(s)
Anatomic Landmarks , Skull , Adult , Humans , Male , Female , Reproducibility of Results , Cephalometry/methods , Skull/diagnostic imaging , Algorithms
17.
Radiology ; 308(1): e230970, 2023 07.
Article in English | MEDLINE | ID: mdl-37489981

ABSTRACT

Background Radiological imaging guidelines are crucial for accurate diagnosis and optimal patient care as they result in standardized decisions and thus reduce inappropriate imaging studies. Purpose In the present study, we investigated the potential to support clinical decision-making using an interactive chatbot designed to provide personalized imaging recommendations from American College of Radiology (ACR) appropriateness criteria documents using semantic similarity processing. Methods We utilized 209 ACR appropriateness criteria documents as specialized knowledge base and employed LlamaIndex, a framework that allows to connect large language models with external data, and the ChatGPT 3.5-Turbo to create an appropriateness criteria contexted chatbot (accGPT). Fifty clinical case files were used to compare the accGPT's performance against general radiologists at varying experience levels and to generic ChatGPT 3.5 and 4.0. Results All chatbots reached at least human performance level. For the 50 case files, the accGPT performed best in providing correct recommendations that were "usually appropriate" according to the ACR criteria and also did provide the highest proportion of consistently correct answers in comparison with generic chatbots and radiologists. Further, the chatbots provided substantial time and cost savings, with an average decision time of 5 minutes and a cost of 0.19 € for all cases, compared to 50 minutes and 29.99 € for radiologists (both p < 0.01). Conclusion ChatGPT-based algorithms have the potential to substantially improve the decision-making for clinical imaging studies in accordance with ACR guidelines. Specifically, a context-based algorithm performed superior to its generic counterpart, demonstrating the value of tailoring AI solutions to specific healthcare applications.


Subject(s)
Algorithms , Software , Humans , Clinical Decision-Making , Cost Savings , Radiologists
18.
MAGMA ; 36(3): 439-449, 2023 Jul.
Article in English | MEDLINE | ID: mdl-37195365

ABSTRACT

OBJECTIVE: Low-field MRI systems are expected to cause less RF heating in conventional interventional devices due to lower Larmor frequency. We systematically evaluate RF-induced heating of commonly used intravascular devices at the Larmor frequency of a 0.55 T system (23.66 MHz) with a focus on the effect of patient size, target organ, and device position on maximum temperature rise. MATERIALS AND METHODS: To assess RF-induced heating, high-resolution measurements of the electric field, temperature, and transfer function were combined. Realistic device trajectories were derived from vascular models to evaluate the variation of the temperature increase as a function of the device trajectory. At a low-field RF test bench, the effects of patient size and positioning, target organ (liver and heart) and body coil type were measured for six commonly used interventional devices (two guidewires, two catheters, an applicator and a biopsy needle). RESULTS: Electric field mapping shows that the hotspots are not necessarily localized at the device tip. Of all procedures, the liver catheterizations showed the lowest heating, and a modification of the transmit body coil could further reduce the temperature increase. For common commercial needles no significant heating was measured at the needle tip. Comparable local SAR values were found in the temperature measurements and the TF-based calculations. CONCLUSION: At low fields, interventions with shorter insertion lengths such as hepatic catheterizations result in less RF-induced heating than coronary interventions. The maximum temperature increase depends on body coil design.


Subject(s)
Heating , Radio Waves , Humans , Magnetic Resonance Imaging/methods , Temperature , Phantoms, Imaging , Hot Temperature
19.
Eur Radiol ; 33(10): 7160-7167, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37121929

ABSTRACT

OBJECTIVES: The precise segmentation of atrophic structures remains challenging in neurodegenerative diseases. We determined the performance of a Deep Neural Patchwork (DNP) in comparison to established segmentation algorithms regarding the ability to delineate the putamen in multiple system atrophy (MSA), Parkinson's disease (PD), and healthy controls. METHODS: We retrospectively included patients with MSA and PD as well as healthy controls. A DNP was trained on manual segmentations of the putamen as ground truth. For this, the cohort was randomly split into a training (N = 131) and test set (N = 120). The DNP's performance was compared with putaminal segmentations as derived by Automatic Anatomic Labelling, Freesurfer and Fastsurfer. For validation, we assessed the diagnostic accuracy of the resulting segmentations in the delineation of MSA vs. PD and healthy controls. RESULTS: A total of 251 subjects (61 patients with MSA, 158 patients with PD, and 32 healthy controls; mean age of 61.5 ± 8.8 years) were included. Compared to the dice-coefficient of the DNP (0.96), we noted significantly weaker performance for AAL3 (0.72; p < .001), Freesurfer (0.82; p < .001), and Fastsurfer (0.84, p < .001). This was corroborated by the superior diagnostic performance of MSA vs. PD and HC of the DNP (AUC 0.93) versus the AUC of 0.88 for AAL3 (p = 0.02), 0.86 for Freesurfer (p = 0.048), and 0.85 for Fastsurfer (p = 0.04). CONCLUSION: By utilization of a DNP, accurate segmentations of the putamen can be obtained even if substantial atrophy is present. This allows for more precise extraction of imaging parameters or shape features from the putamen in relevant patient cohorts. CLINICAL RELEVANCE STATEMENT: Deep learning-based segmentation of the putamen was superior to currently available algorithms and is beneficial for the diagnosis of multiple system atrophy. KEY POINTS: • A Deep Neural Patchwork precisely delineates the putamen and performs equal to human labeling in multiple system atrophy, even when pronounced putaminal volume loss is present. • The Deep Neural Patchwork-based segmentation was more capable to differentiate between multiple system atrophy and Parkinson's disease than the AAL3 atlas, Freesurfer, or Fastsurfer.


Subject(s)
Deep Learning , Multiple System Atrophy , Parkinson Disease , Humans , Middle Aged , Aged , Multiple System Atrophy/diagnostic imaging , Parkinson Disease/diagnostic imaging , Putamen/diagnostic imaging , Retrospective Studies , Magnetic Resonance Imaging/methods
20.
Rofo ; 195(8): 691-698, 2023 08.
Article in English | MEDLINE | ID: mdl-36863367

ABSTRACT

BACKGROUND: Photon-counting computed tomography (PCCT) is a promising new technology with the potential to fundamentally change today's workflows in the daily routine and to provide new quantitative imaging information to improve clinical decision-making and patient management. METHOD: The content of this review is based on an unrestricted literature search on PubMed and Google Scholar using the search terms "Photon-Counting CT", "Photon-Counting detector", "spectral CT", "Computed Tomography" as well as on the authors' experience. RESULTS: The fundamental difference with respect to the currently established energy-integrating CT detectors is that PCCT allows counting of every single photon at the detector level. Based on the identified literature, PCCT phantom measurements and initial clinical studies have demonstrated that the new technology allows improved spatial resolution, reduced image noise, and new possibilities for advanced quantitative image postprocessing. CONCLUSION: For clinical practice, the potential benefits include fewer beam hardening artifacts, radiation dose reduction, and the use of new contrast agents. In this review, we will discuss basic technical principles and potential clinical benefits and demonstrate first clinical use cases. KEY POINTS: · Photon-counting computed tomography (PCCT) has been implemented in the clinical routine. · Compared to energy-integrating detector CT, PCCT allows the reduction of electronic image noise. · PCCT provides increased spatial resolution and a higher contrast-to-noise ratio. · The novel detector technology allows the quantification of spectral information. CITATION FORMAT: · Stein T, Rau A, Russe MF et al. Photon-Counting Computed Tomography - Basic Principles, Potenzial Benefits, and Initial Clinical Experience. Fortschr Röntgenstr 2023; 195: 691 - 698.


Subject(s)
Photons , Tomography, X-Ray Computed , Humans , Tomography, X-Ray Computed/methods , Phantoms, Imaging
SELECTION OF CITATIONS
SEARCH DETAIL