Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 68
Filter
Add more filters

Publication year range
1.
Eur Radiol ; 34(4): 2791-2804, 2024 Apr.
Article in English | MEDLINE | ID: mdl-37733025

ABSTRACT

OBJECTIVES: To investigate the intra- and inter-rater reliability of the total radiomics quality score (RQS) and the reproducibility of individual RQS items' score in a large multireader study. METHODS: Nine raters with different backgrounds were randomly assigned to three groups based on their proficiency with RQS utilization: Groups 1 and 2 represented the inter-rater reliability groups with or without prior training in RQS, respectively; group 3 represented the intra-rater reliability group. Thirty-three original research papers on radiomics were evaluated by raters of groups 1 and 2. Of the 33 papers, 17 were evaluated twice with an interval of 1 month by raters of group 3. Intraclass coefficient (ICC) for continuous variables, and Fleiss' and Cohen's kappa (k) statistics for categorical variables were used. RESULTS: The inter-rater reliability was poor to moderate for total RQS (ICC 0.30-055, p < 0.001) and very low to good for item's reproducibility (k - 0.12 to 0.75) within groups 1 and 2 for both inexperienced and experienced raters. The intra-rater reliability for total RQS was moderate for the less experienced rater (ICC 0.522, p = 0.009), whereas experienced raters showed excellent intra-rater reliability (ICC 0.91-0.99, p < 0.001) between the first and second read. Intra-rater reliability on RQS items' score reproducibility was higher and most of the items had moderate to good intra-rater reliability (k - 0.40 to 1). CONCLUSIONS: Reproducibility of the total RQS and the score of individual RQS items is low. There is a need for a robust and reproducible assessment method to assess the quality of radiomics research. CLINICAL RELEVANCE STATEMENT: There is a need for reproducible scoring systems to improve quality of radiomics research and consecutively close the translational gap between research and clinical implementation. KEY POINTS: • Radiomics quality score has been widely used for the evaluation of radiomics studies. • Although the intra-rater reliability was moderate to excellent, intra- and inter-rater reliability of total score and point-by-point scores were low with radiomics quality score. • A robust, easy-to-use scoring system is needed for the evaluation of radiomics research.


Subject(s)
Radiomics , Reading , Humans , Observer Variation , Reproducibility of Results
2.
Can Assoc Radiol J ; 75(2): 226-244, 2024 May.
Article in English | MEDLINE | ID: mdl-38251882

ABSTRACT

Artificial Intelligence (AI) carries the potential for unprecedented disruption in radiology, with possible positive and negative consequences. The integration of AI in radiology holds the potential to revolutionize healthcare practices by advancing diagnosis, quantification, and management of multiple medical conditions. Nevertheless, the ever­growing availability of AI tools in radiology highlights an increasing need to critically evaluate claims for its utility and to differentiate safe product offerings from potentially harmful, or fundamentally unhelpful ones. This multi­society paper, presenting the views of Radiology Societies in the USA, Canada, Europe, Australia, and New Zealand, defines the potential practical problems and ethical issues surrounding the incorporation of AI into radiological practice. In addition to delineating the main points of concern that developers, regulators, and purchasers of AI tools should consider prior to their introduction into clinical practice, this statement also suggests methods to monitor their stability and safety in clinical use, and their suitability for possible autonomous function. This statement is intended to serve as a useful summary of the practical issues which should be considered by all parties involved in the development of radiology AI resources, and their implementation as clinical tools.


Subject(s)
Artificial Intelligence , Radiology , Societies, Medical , Humans , Canada , Europe , New Zealand , United States , Australia
3.
Eur Radiol ; 33(3): 1884-1894, 2023 Mar.
Article in English | MEDLINE | ID: mdl-36282312

ABSTRACT

OBJECTIVE: The main aim of the present systematic review was a comprehensive overview of the Radiomics Quality Score (RQS)-based systematic reviews to highlight common issues and challenges of radiomics research application and evaluate the relationship between RQS and review features. METHODS: The literature search was performed on multiple medical literature archives according to PRISMA guidelines for systematic reviews that reported radiomic quality assessment through the RQS. Reported scores were converted to a 0-100% scale. The Mann-Whitney and Kruskal-Wallis tests were used to compare RQS scores and review features. RESULTS: The literature research yielded 345 articles, from which 44 systematic reviews were finally included in the analysis. Overall, the median of RQS was 21.00% (IQR = 11.50). No significant differences of RQS were observed in subgroup analyses according to targets (oncological/not oncological target, neuroradiology/body imaging focus and one imaging technique/more than one imaging technique, characterization/prognosis/detection/other). CONCLUSIONS: Our review did not reveal a significant difference of quality of radiomic articles reported in systematic reviews, divided in different subgroups. Furthermore, low overall methodological quality of radiomics research was found independent of specific application domains. While the RQS can serve as a reference tool to improve future study designs, future research should also be aimed at improving its reliability and developing new tools to meet an ever-evolving research space. KEY POINTS: • Radiomics is a promising high-throughput method that may generate novel imaging biomarkers to improve clinical decision-making process, but it is an inherently complex analysis and often lacks reproducibility and generalizability. • The Radiomics Quality Score serves a necessary role as the de facto reference tool for assessing radiomics studies. • External auditing of radiomics studies, in addition to the standard peer-review process, is valuable to highlight common limitations and provide insights to improve future study designs and practical applicability of the radiomics models.


Subject(s)
Diagnostic Imaging , Humans , Reproducibility of Results , Prognosis , Biomarkers
4.
Eur Radiol ; 31(8): 6001-6012, 2021 Aug.
Article in English | MEDLINE | ID: mdl-33492473

ABSTRACT

Existing quantitative imaging biomarkers (QIBs) are associated with known biological tissue characteristics and follow a well-understood path of technical, biological and clinical validation before incorporation into clinical trials. In radiomics, novel data-driven processes extract numerous visually imperceptible statistical features from the imaging data with no a priori assumptions on their correlation with biological processes. The selection of relevant features (radiomic signature) and incorporation into clinical trials therefore requires additional considerations to ensure meaningful imaging endpoints. Also, the number of radiomic features tested means that power calculations would result in sample sizes impossible to achieve within clinical trials. This article examines how the process of standardising and validating data-driven imaging biomarkers differs from those based on biological associations. Radiomic signatures are best developed initially on datasets that represent diversity of acquisition protocols as well as diversity of disease and of normal findings, rather than within clinical trials with standardised and optimised protocols as this would risk the selection of radiomic features being linked to the imaging process rather than the pathology. Normalisation through discretisation and feature harmonisation are essential pre-processing steps. Biological correlation may be performed after the technical and clinical validity of a radiomic signature is established, but is not mandatory. Feature selection may be part of discovery within a radiomics-specific trial or represent exploratory endpoints within an established trial; a previously validated radiomic signature may even be used as a primary/secondary endpoint, particularly if associations are demonstrated with specific biological processes and pathways being targeted within clinical trials. KEY POINTS: • Data-driven processes like radiomics risk false discoveries due to high-dimensionality of the dataset compared to sample size, making adequate diversity of the data, cross-validation and external validation essential to mitigate the risks of spurious associations and overfitting. • Use of radiomic signatures within clinical trials requires multistep standardisation of image acquisition, image analysis and data mining processes. • Biological correlation may be established after clinical validation but is not mandatory.


Subject(s)
Radiology , Tomography, X-Ray Computed , Biomarkers , Consensus , Humans , Image Processing, Computer-Assisted
5.
Radiologe ; 61(11): 979-985, 2021 Nov.
Article in German | MEDLINE | ID: mdl-34661685

ABSTRACT

Numerous publications have shown the value of structured reporting in communication with referring physicians and for further usage of clinical report data in other contexts. Despite the topic being present and known in radiology for many years now, widespread adoption of structured reporting in clinical routine is still lacking. All major radiological societies have published position statements in favor of structured reporting and are pursuing various related initiatives. Among those are the development and the maintenance of openly available structured report template repositories as well as quality control of report templates and development of standardized terminologies. This article highlights the efforts of the German Radiological Society and the European Society of Radiology on the topic of structured reporting and provides an overview of pros and cons as well as publicly available resources.


Subject(s)
Radiology Information Systems , Radiology , Communication , Humans , Radiography
6.
Radiology ; 294(1): 199-209, 2020 01.
Article in English | MEDLINE | ID: mdl-31714194

ABSTRACT

Background Multicenter studies are required to validate the added benefit of using deep convolutional neural network (DCNN) software for detecting malignant pulmonary nodules on chest radiographs. Purpose To compare the performance of radiologists in detecting malignant pulmonary nodules on chest radiographs when assisted by deep learning-based DCNN software with that of radiologists or DCNN software alone in a multicenter setting. Materials and Methods Investigators at four medical centers retrospectively identified 600 lung cancer-containing chest radiographs and 200 normal chest radiographs. Each radiograph with a lung cancer had at least one malignant nodule confirmed by CT and pathologic examination. Twelve radiologists from the four centers independently analyzed the chest radiographs and marked regions of interest. Commercially available deep learning-based computer-aided detection software separately trained, tested, and validated with 19 330 radiographs was used to find suspicious nodules. The radiologists then reviewed the images with the assistance of DCNN software. The sensitivity and number of false-positive findings per image of DCNN software, radiologists alone, and radiologists with the use of DCNN software were analyzed by using logistic regression and Poisson regression. Results The average sensitivity of radiologists improved (from 65.1% [1375 of 2112; 95% confidence interval {CI}: 62.0%, 68.1%] to 70.3% [1484 of 2112; 95% CI: 67.2%, 73.1%], P < .001) and the number of false-positive findings per radiograph declined (from 0.2 [488 of 2400; 95% CI: 0.18, 0.22] to 0.18 [422 of 2400; 95% CI: 0.16, 0.2], P < .001) when the radiologists re-reviewed radiographs with the DCNN software. For the 12 radiologists in this study, 104 of 2400 radiographs were positively changed (from false-negative to true-positive or from false-positive to true-negative) using the DCNN, while 56 of 2400 radiographs were changed negatively. Conclusion Radiologists had better performance with deep convolutional network software for the detection of malignant pulmonary nodules on chest radiographs than without. © RSNA, 2019 Online supplemental material is available for this article. See also the editorial by Jacobson in this issue.


Subject(s)
Lung Neoplasms/diagnostic imaging , Multiple Pulmonary Nodules/diagnostic imaging , Neural Networks, Computer , Radiographic Image Interpretation, Computer-Assisted/methods , Radiography, Thoracic/methods , Solitary Pulmonary Nodule/diagnostic imaging , Adult , Aged , Female , Humans , Lung/diagnostic imaging , Male , Middle Aged , Reproducibility of Results , Retrospective Studies , Sensitivity and Specificity , Young Adult
7.
Radiology ; 293(2): 436-440, 2019 11.
Article in English | MEDLINE | ID: mdl-31573399

ABSTRACT

This is a condensed summary of an international multisociety statement on ethics of artificial intelligence (AI) in radiology produced by the ACR, European Society of Radiology, RSNA, Society for Imaging Informatics in Medicine, European Society of Medical Imaging Informatics, Canadian Association of Radiologists, and American Association of Physicists in Medicine. AI has great potential to increase efficiency and accuracy throughout radiology, but it also carries inherent pitfalls and biases. Widespread use of AI-based intelligent and autonomous systems in radiology can increase the risk of systemic errors with high consequence and highlights complex ethical and societal issues. Currently, there is little experience using AI for patient care in diverse clinical settings. Extensive research is needed to understand how to best deploy AI in clinical practice. This statement highlights our consensus that ethical use of AI in radiology should promote well-being, minimize harm, and ensure that the benefits and harms are distributed among stakeholders in a just manner. We believe AI should respect human rights and freedoms, including dignity and privacy. It should be designed for maximum transparency and dependability. Ultimate responsibility and accountability for AI remains with its human designers and operators for the foreseeable future. The radiology community should start now to develop codes of ethics and practice for AI that promote any use that helps patients and the common good and should block use of radiology data and algorithms for financial gain without those two attributes. This article is a simultaneous joint publication in Radiology, Journal of the American College of Radiology, Canadian Association of Radiologists Journal, and Insights into Imaging. Published under a CC BY-NC-ND 4.0 license. Online supplemental material is available for this article.


Subject(s)
Artificial Intelligence/ethics , Radiology/ethics , Canada , Consensus , Europe , Humans , Radiologists/ethics , Societies, Medical , United States
8.
Can Assoc Radiol J ; 70(4): 329-334, 2019 Nov.
Article in English | MEDLINE | ID: mdl-31585825

ABSTRACT

This is a condensed summary of an international multisociety statement on ethics of artificial intelligence (AI) in radiology produced by the ACR, European Society of Radiology, RSNA, Society for Imaging Informatics in Medicine, European Society of Medical Imaging Informatics, Canadian Association of Radiologists, and American Association of Physicists in Medicine. AI has great potential to increase efficiency and accuracy throughout radiology, but it also carries inherent pitfalls and biases. Widespread use of AI-based intelligent and autonomous systems in radiology can increase the risk of systemic errors with high consequence and highlights complex ethical and societal issues. Currently, there is little experience using AI for patient care in diverse clinical settings. Extensive research is needed to understand how to best deploy AI in clinical practice. This statement highlights our consensus that ethical use of AI in radiology should promote well-being, minimize harm, and ensure that the benefits and harms are distributed among stakeholders in a just manner. We believe AI should respect human rights and freedoms, including dignity and privacy. It should be designed for maximum transparency and dependability. Ultimate responsibility and accountability for AI remains with its human designers and operators for the foreseeable future. The radiology community should start now to develop codes of ethics and practice for AI that promote any use that helps patients and the common good and should block use of radiology data and algorithms for financial gain without those two attributes.


Subject(s)
Artificial Intelligence/ethics , Radiology/ethics , Canada , Consensus , Europe , Humans , Radiologists/ethics , Societies, Medical , United States
9.
Transpl Infect Dis ; 20(6): e12993, 2018 Dec.
Article in English | MEDLINE | ID: mdl-30187615

ABSTRACT

Long-term success of lung transplantation is limited by allograft dysfunction and frequent infections. Varicella zoster virus infection (VZV) is one of the most common opportunistic infections among solid organ transplantation recipients. However the occurrence of visceral involvement or disseminated disease, as seen after bone marrow transplantation, is rare. We report a case of a 59-year-old woman who underwent double-lung transplantation with a fatal visceral and disseminated varicella zoster virus infection.


Subject(s)
Herpesvirus 3, Human/isolation & purification , Immunosuppression Therapy/adverse effects , Lung Transplantation/adverse effects , Pulmonary Fibrosis/surgery , Shock, Septic/immunology , Varicella Zoster Virus Infection/immunology , Abdominal Pain/immunology , Abdominal Pain/virology , Exanthema/immunology , Exanthema/microbiology , Fatal Outcome , Female , Humans , Immunosuppression Therapy/methods , Middle Aged , Shock, Septic/virology , Unconsciousness/immunology , Unconsciousness/virology , Varicella Zoster Virus Infection/complications , Varicella Zoster Virus Infection/virology
10.
J Arthroplasty ; 33(8): 2652-2659.e3, 2018 08.
Article in English | MEDLINE | ID: mdl-29615377

ABSTRACT

BACKGROUND: Correct positioning of the cup is an important factor in total hip arthroplasty. Assessing its position from a plain anteroposterior pelvic radiograph is known to be hampered by systemic errors. This study focuses on developing a correction method to adjust for these potential sources of error and to eliminate them based on a 3D geometric analysis. METHODS: Computed tomography scans of 113 (66 male, 47 female) pelvices were reconstructed and virtually projected onto a plain radiograph with varying rotational and translational positions. Thus cup inclination and anteversion as measured on a 2D-radiograph and in the 3D environment were correlated. Projected offset of the symphysis from the mid-sacrum served as a mean to measure pelvic right/left-rotation. Pelvic tilt was determined from the projected height of the contour of the small pelvis. Correction formulas were verified by projecting a gimbal-mounted artificial pelvis with a cup implanted in a known position. RESULTS: We found gender-specific formulas that correct for malrotated and off-centered radiographs. Applying these formulas cup inclination was assessed as close as 1.3° (±1.90°) to the true 3D value and cup anteversion as close as 1° (±1.91°) although deviations between directly measured plain values and corrected values rose up to 18°. CONCLUSION: Inherent effects of central projection and malrotations due to pelvic tilt, pelvic rotation, and noncentered radiographs are corrected. Evaluation of radiographic inclination and anteversion of acetabular cups from plain 2D-radiographs show improved precision. Real values are approached better than 1.3° when applying our correction formulas.


Subject(s)
Arthroplasty, Replacement, Hip/methods , Hip Prosthesis , Tomography, X-Ray Computed/methods , Acetabulum/diagnostic imaging , Acetabulum/surgery , Adult , Aged , Aged, 80 and over , Female , Humans , Image Processing, Computer-Assisted , Imaging, Three-Dimensional , Male , Middle Aged , Patient Positioning , Pelvis/diagnostic imaging , Pelvis/surgery , Radiography , Rotation , Sex Factors
13.
Eur Radiol ; 25(6): 1768-75, 2015 Jun.
Article in English | MEDLINE | ID: mdl-25576230

ABSTRACT

PURPOSE: To develop and validate a decision support tool for mammographic mass lesions based on a standardized descriptor terminology (BI-RADS lexicon) to reduce variability of practice. MATERIALS AND METHODS: We used separate training data (1,276 lesions, 138 malignant) and validation data (1,177 lesions, 175 malignant). We created naïve Bayes (NB) classifiers from the training data with tenfold cross-validation. Our "inclusive model" comprised BI-RADS categories, BI-RADS descriptors, and age as predictive variables; our "descriptor model" comprised BI-RADS descriptors and age. The resulting NB classifiers were applied to the validation data. We evaluated and compared classifier performance with ROC-analysis. RESULTS: In the training data, the inclusive model yields an AUC of 0.959; the descriptor model yields an AUC of 0.910 (P < 0.001). The inclusive model is superior to the clinical performance (BI-RADS categories alone, P < 0.001); the descriptor model performs similarly. When applied to the validation data, the inclusive model yields an AUC of 0.935; the descriptor model yields an AUC of 0.876 (P < 0.001). Again, the inclusive model is superior to the clinical performance (P < 0.001); the descriptor model performs similarly. CONCLUSION: We consider our classifier a step towards a more uniform interpretation of combinations of BI-RADS descriptors. We provide our classifier at www.ebm-radiology.com/nbmm/index.html . KEY POINTS: • We provide a decision support tool for mammographic masses at www.ebm-radiology.com/nbmm/index.html . • Our tool may reduce variability of practice in BI-RADS category assignment. • A formal analysis of BI-RADS descriptors may enhance radiologists' diagnostic performance.


Subject(s)
Breast Neoplasms/diagnostic imaging , Decision Support Systems, Clinical/standards , Mammography/methods , Adult , Aged , Bayes Theorem , Female , Humans , Internet , Mammography/standards , Middle Aged , ROC Curve , Radiology/education , Terminology as Topic , United States
14.
J Neuroradiol ; 41(4): 259-68, 2014 Oct.
Article in English | MEDLINE | ID: mdl-24411522

ABSTRACT

BACKGROUND AND PURPOSE: CT angiography (CTA) is an increasingly used method for evaluation of stented vessel segments. Our aim was to compare the appearance of different carotid artery stents in vitro on CTA using different CT scanners. Of particular interest was the measurement of artificial lumen narrowing (ALN) caused by the stent material within the stented vessel segment to determine whether CTA can be used to detect in-stent restenosis. MATERIAL AND METHODS: CTA appearances of 16 carotid artery stents of different designs and sizes (4.0 to 11.0 mm) were investigated in vitro. CTA was performed using 16-, 64- and 320-row CT scanners. For each stent, artificial lumen narrowing (ALN) was calculated. RESULTS: ALN ranged from 18.77% to 59.86%. ALN in different stents differed significantly. In most stents, ALN decreased with increasing stent diameter. In all but one stents, ALN using sharp image kernels was significantly lower than ALN using medium image kernels. Considering all stents, ALN did not significantly differ using different CT scanners or imaging protocols. CONCLUSION: CTA evaluation of vessel patency after stent placement is possible, but is considerably impaired by ALN. Investigators should be informed about the method of choice for every stent and stent manufacturers should be aware of potential artifacts caused by their stents during noninvasive diagnostic methods such as CTA.


Subject(s)
Angiography/instrumentation , Blood Vessel Prosthesis/adverse effects , Carotid Arteries/diagnostic imaging , Carotid Artery Diseases/diagnostic imaging , Graft Occlusion, Vascular/diagnostic imaging , Stents/adverse effects , Tomography, X-Ray Computed/instrumentation , Angiography/methods , Carotid Arteries/surgery , Carotid Artery Diseases/etiology , Equipment Design , Equipment Failure Analysis , Graft Occlusion, Vascular/etiology , In Vitro Techniques , Reproducibility of Results , Sensitivity and Specificity , Tomography, X-Ray Computed/methods
15.
Radiologie (Heidelb) ; 64(6): 498-502, 2024 Jun.
Article in German | MEDLINE | ID: mdl-38499692

ABSTRACT

The introduction of artificial intelligence (AI) into radiology promises to enhance efficiency and improve diagnostic accuracy, yet it also raises manifold ethical questions. These include data protection issues, the future role of radiologists, liability when using AI systems, and the avoidance of bias. To prevent data bias, the datasets need to be compiled carefully and to be representative of the target population. Accordingly, the upcoming European Union AI act sets particularly high requirements for the datasets used in training medical AI systems. Cognitive bias occurs when radiologists place too much trust in the results provided by AI systems (overreliance). So far, diagnostic AI systems are used almost exclusively as "second look" systems. If diagnostic AI systems are to be used in the future as "first look" systems or even as autonomous AI systems in order to enhance efficiency in radiology, the question of liability needs to be addressed, comparable to liability for autonomous driving. Such use of AI would also significantly change the role of radiologists.


Subject(s)
Artificial Intelligence , Radiology , Humans , Artificial Intelligence/ethics , Computer Security/ethics , Radiology/ethics
16.
J Cardiothorac Surg ; 19(1): 184, 2024 Apr 06.
Article in English | MEDLINE | ID: mdl-38582893

ABSTRACT

The occurrence of ectopic pancreas in the mediastinum is rare. Herein, we report a 22-year-old female who presented with right shoulder pain, dysphagia, fever and headaches. Chest computer tomography revealed a mass in the posterior mediastinum with accompanying signs of acute mediastinitis. Needle biopsy and fine-needle aspiration revealed ectopic gastral tissue and ectopic pancreas tissue, respectively. Surgical resection was attempted due to recurring acute pancreatitis episodes. However, due to chronic-inflammatory adhesions of the mass to the tracheal wall, en-bloc resection was not possible without major tracheal resection. Since then, recurring pancreatitis episodes have been treated conservatively with antibiotics. We report this case due to its differing clinical and radiological findings in comparison to previous case reports, none of which pertained a case of ectopic pancreas tissue in the posterior mediastinum with recurring acute pancreatitis and mediastinitis.


Subject(s)
Choristoma , Mediastinitis , Pancreatitis , Female , Humans , Young Adult , Acute Disease , Choristoma/surgery , Choristoma/diagnosis , Mediastinitis/diagnosis , Mediastinitis/surgery , Mediastinitis/complications , Mediastinum/diagnostic imaging , Mediastinum/pathology , Pancreas/pathology , Pancreatitis/complications , Pancreatitis/diagnosis
17.
Eur Radiol Exp ; 8(1): 23, 2024 Feb 14.
Article in English | MEDLINE | ID: mdl-38353812

ABSTRACT

BACKGROUND: The growing prevalence of musculoskeletal diseases increases radiologic workload, highlighting the need for optimized workflow management and automated metadata classification systems. We developed a large-scale, well-characterized dataset of musculoskeletal radiographs and trained deep learning neural networks to classify radiographic projection and body side. METHODS: In this IRB-approved retrospective single-center study, a dataset of musculoskeletal radiographs from 2011 to 2019 was retrieved and manually labeled for one of 45 possible radiographic projections and the depicted body side. Two classification networks were trained for the respective tasks using the Xception architecture with a custom network top and pretrained weights. Performance was evaluated on a hold-out test sample, and gradient-weighted class activation mapping (Grad-CAM) heatmaps were computed to visualize the influential image regions for network predictions. RESULTS: A total of 13,098 studies comprising 23,663 radiographs were included with a patient-level dataset split, resulting in 19,183 training, 2,145 validation, and 2,335 test images. Focusing on paired body regions, training for side detection included 16,319 radiographs (13,284 training, 1,443 validation, and 1,592 test images). The models achieved an overall accuracy of 0.975 for projection and 0.976 for body-side classification on the respective hold-out test sample. Errors were primarily observed in projections with seamless anatomical transitions or non-orthograde adjustment techniques. CONCLUSIONS: The deep learning neural networks demonstrated excellent performance in classifying radiographic projection and body side across a wide range of musculoskeletal radiographs. These networks have the potential to serve as presorting algorithms, optimizing radiologic workflow and enhancing patient care. RELEVANCE STATEMENT: The developed networks excel at classifying musculoskeletal radiographs, providing valuable tools for research data extraction, standardized image sorting, and minimizing misclassifications in artificial intelligence systems, ultimately enhancing radiology workflow efficiency and patient care. KEY POINTS: • A large-scale, well-characterized dataset was developed, covering a broad spectrum of musculoskeletal radiographs. • Deep learning neural networks achieved high accuracy in classifying radiographic projection and body side. • Grad-CAM heatmaps provided insight into network decisions, contributing to their interpretability and trustworthiness. • The trained models can help optimize radiologic workflow and manage large amounts of data.


Subject(s)
Deep Learning , Radiology , Humans , Artificial Intelligence , Retrospective Studies , Radiography
18.
BMJ Open ; 14(1): e076954, 2024 01 23.
Article in English | MEDLINE | ID: mdl-38262641

ABSTRACT

OBJECTIVES: To aid in selecting the optimal artificial intelligence (AI) solution for clinical application, we directly compared performances of selected representative custom-trained or commercial classification, detection and segmentation models for fracture detection on musculoskeletal radiographs of the distal radius by aligning their outputs. DESIGN AND SETTING: This single-centre retrospective study was conducted on a random subset of emergency department radiographs from 2008 to 2018 of the distal radius in Germany. MATERIALS AND METHODS: An image set was created to be compatible with training and testing classification and segmentation models by annotating examinations for fractures and overlaying fracture masks, if applicable. Representative classification and segmentation models were trained on 80% of the data. After output binarisation, their derived fracture detection performances as well as that of a standard commercially available solution were compared on the remaining X-rays (20%) using mainly accuracy and area under the receiver operating characteristic (AUROC). RESULTS: A total of 2856 examinations with 712 (24.9%) fractures were included in the analysis. Accuracies reached up to 0.97 for the classification model, 0.94 for the segmentation model and 0.95 for BoneView. Cohen's kappa was at least 0.80 in pairwise comparisons, while Fleiss' kappa was 0.83 for all models. Fracture predictions were visualised with all three methods at different levels of detail, ranking from downsampled image region for classification over bounding box for detection to single pixel-level delineation for segmentation. CONCLUSIONS: All three investigated approaches reached high performances for detection of distal radius fractures with simple preprocessing and postprocessing protocols on the custom-trained models. Despite their underlying structural differences, selection of one's fracture analysis AI tool in the frame of this study reduces to the desired flavour of automation: automated classification, AI-assisted manual fracture reading or minimised false negatives.


Subject(s)
Deep Learning , Fractures, Bone , Humans , X-Rays , Artificial Intelligence , Radius , Retrospective Studies
19.
Insights Imaging ; 15(1): 54, 2024 Feb 27.
Article in English | MEDLINE | ID: mdl-38411750

ABSTRACT

OBJECTIVE: To systematically review radiomic feature reproducibility and model validation strategies in recent studies dealing with CT and MRI radiomics of bone and soft-tissue sarcomas, thus updating a previous version of this review which included studies published up to 2020. METHODS: A literature search was conducted on EMBASE and PubMed databases for papers published between January 2021 and March 2023. Data regarding radiomic feature reproducibility and model validation strategies were extracted and analyzed. RESULTS: Out of 201 identified papers, 55 were included. They dealt with radiomics of bone (n = 23) or soft-tissue (n = 32) tumors. Thirty-two (out of 54 employing manual or semiautomatic segmentation, 59%) studies included a feature reproducibility analysis. Reproducibility was assessed based on intra/interobserver segmentation variability in 30 (55%) and geometrical transformations of the region of interest in 2 (4%) studies. At least one machine learning validation technique was used for model development in 34 (62%) papers, and K-fold cross-validation was employed most frequently. A clinical validation of the model was reported in 38 (69%) papers. It was performed using a separate dataset from the primary institution (internal test) in 22 (40%), an independent dataset from another institution (external test) in 14 (25%) and both in 2 (4%) studies. CONCLUSIONS: Compared to papers published up to 2020, a clear improvement was noted with almost double publications reporting methodological aspects related to reproducibility and validation. Larger multicenter investigations including external clinical validation and the publication of databases in open-access repositories could further improve methodology and bring radiomics from a research area to the clinical stage. CRITICAL RELEVANCE STATEMENT: An improvement in feature reproducibility and model validation strategies has been shown in this updated systematic review on radiomics of bone and soft-tissue sarcomas, highlighting efforts to enhance methodology and bring radiomics from a research area to the clinical stage. KEY POINTS: • 2021-2023 radiomic studies on CT and MRI of musculoskeletal sarcomas were reviewed. • Feature reproducibility was assessed in more than half (59%) of the studies. • Model clinical validation was performed in 69% of the studies. • Internal (44%) and/or external (29%) test datasets were employed for clinical validation.

20.
Insights Imaging ; 15(1): 16, 2024 Jan 22.
Article in English | MEDLINE | ID: mdl-38246898

ABSTRACT

Artificial Intelligence (AI) carries the potential for unprecedented disruption in radiology, with possible positive and negative consequences. The integration of AI in radiology holds the potential to revolutionize healthcare practices by advancing diagnosis, quantification, and management of multiple medical conditions. Nevertheless, the ever-growing availability of AI tools in radiology highlights an increasing need to critically evaluate claims for its utility and to differentiate safe product offerings from potentially harmful, or fundamentally unhelpful ones.This multi-society paper, presenting the views of Radiology Societies in the USA, Canada, Europe, Australia, and New Zealand, defines the potential practical problems and ethical issues surrounding the incorporation of AI into radiological practice. In addition to delineating the main points of concern that developers, regulators, and purchasers of AI tools should consider prior to their introduction into clinical practice, this statement also suggests methods to monitor their stability and safety in clinical use, and their suitability for possible autonomous function. This statement is intended to serve as a useful summary of the practical issues which should be considered by all parties involved in the development of radiology AI resources, and their implementation as clinical tools.Key points • The incorporation of artificial intelligence (AI) in radiological practice demands increased monitoring of its utility and safety.• Cooperation between developers, clinicians, and regulators will allow all involved to address ethical issues and monitor AI performance.• AI can fulfil its promise to advance patient well-being if all steps from development to integration in healthcare are rigorously evaluated.

SELECTION OF CITATIONS
SEARCH DETAIL