Your browser doesn't support javascript.
loading
Montrer: 20 | 50 | 100
Résultats 1 - 20 de 302
Filtrer
1.
J Am Coll Radiol ; 2024 Jul 01.
Article de Anglais | MEDLINE | ID: mdl-38960083

RÉSUMÉ

PURPOSE: We compared the performance of generative AI (G-AI, ATARI) and natural language processing (NLP) tools for identifying laterality errors in radiology reports and images. METHODS: We used an NLP-based (mPower) tool to identify radiology reports flagged for laterality errors in its QA Dashboard. The NLP model detects and highlights laterality mismatches in radiology reports. From an initial pool of 1124 radiology reports flagged by the NLP for laterality errors, we selected and evaluated 898 reports that encompassed radiography, CT, MRI, and ultrasound modalities to ensure comprehensive coverage. A radiologist reviewed each radiology report to assess if the flagged laterality errors were present (reporting error - true positive) or absent (NLP error - false positive). Next, we applied ATARI to 237 radiology reports and images with consecutive NLP true positive (118 reports) and false positive (119 reports) laterality errors. We estimated accuracy of NLP and G-AI tools to identify overall and modality-wise laterality errors. RESULTS: Among the 898 NLP-flagged laterality errors, 64% (574/898) had NLP errors and 36% (324/898) were reporting errors. The text query ATARI feature correctly identified the absence of laterality mismatch (NLP false positives) with a 97.4% accuracy (115/118 reports; 95% CI = 96.5% - 98.3%). Combined Vision and text query resulted in 98.3% accuracy (116/118 reports/images; 95% CI = 97.6% - 99.0%) query alone had a 98.3% accuracy (116/118 images; 95% CI = 97.6% - 99.0%). CONCLUSION: The generative AI-empowered ATARI prototype outperformed the assessed NLP tool for determining true and false laterality errors in radiology reports while enabling an image-based laterality determination. Underlying errors in ATARI text query in complex radiology reports emphasize the need for further improvement in the technology.

2.
Diagnostics (Basel) ; 14(14)2024 Jul 16.
Article de Anglais | MEDLINE | ID: mdl-39061671

RÉSUMÉ

Background: Diagnosing lung diseases accurately is crucial for proper treatment. Convolutional neural networks (CNNs) have advanced medical image processing, but challenges remain in their accurate explainability and reliability. This study combines U-Net with attention and Vision Transformers (ViTs) to enhance lung disease segmentation and classification. We hypothesize that Attention U-Net will enhance segmentation accuracy and that ViTs will improve classification performance. The explainability methodologies will shed light on model decision-making processes, aiding in clinical acceptance. Methodology: A comparative approach was used to evaluate deep learning models for segmenting and classifying lung illnesses using chest X-rays. The Attention U-Net model is used for segmentation, and architectures consisting of four CNNs and four ViTs were investigated for classification. Methods like Gradient-weighted Class Activation Mapping plus plus (Grad-CAM++) and Layer-wise Relevance Propagation (LRP) provide explainability by identifying crucial areas influencing model decisions. Results: The results support the conclusion that ViTs are outstanding in identifying lung disorders. Attention U-Net obtained a Dice Coefficient of 98.54% and a Jaccard Index of 97.12%. ViTs outperformed CNNs in classification tasks by 9.26%, reaching an accuracy of 98.52% with MobileViT. An 8.3% increase in accuracy was seen while moving from raw data classification to segmented image classification. Techniques like Grad-CAM++ and LRP provided insights into the decision-making processes of the models. Conclusions: This study highlights the benefits of integrating Attention U-Net and ViTs for analyzing lung diseases, demonstrating their importance in clinical settings. Emphasizing explainability clarifies deep learning processes, enhancing confidence in AI solutions and perhaps enhancing clinical acceptance for improved healthcare results.

5.
Phys Med ; 124: 103431, 2024 Jul 25.
Article de Anglais | MEDLINE | ID: mdl-39059250

RÉSUMÉ

PURPOSE: The objective of our IAEA-coordinated international study was to assess CT practices and radiation doses from multiple hospitals across several African countries. METHODS: The study included 13 hospitals from Africa which contributed information on minimum of 20 consecutive patients who underwent head, chest, and/or abdomen-pelvis CT. Prior to the data recording step, all hospitals had a mandatory one-hour training on the best practices in recording the relevant data elements. The recorded data elements included patient age, weight, protocol name, scanner information, acquisition parameters, and radiation dose descriptors including phase-specific CT dose index volume (CTDIvol in mGy) and dose length product (DLP in mGy.cm). We estimated the median and interquartile range of body-region specific CTDIvol and DLP and compared data across sites and countries using the Kruskal-Wallis H Test for non-normal distribution, analysis of variance. RESULTS: A total of 1061 patients (mean age 50 ± 19 years) were included in the study. 16 % of CT exams had no stated clinical indications for CT examinations of the head (32/343, 9 %), chest (50/281, 18 %), abdomen-pelvis (67/243, 28 %), and/or chest-abdomen-pelvis CT (24/194, 12 %). Most hospitals used multiphase CT protocols for abdomen-pelvis (9/11 hospitals) and chest CT (10/12 hospitals), regardless of clinical indications. Total median DLP values for head (953 mGy.cm), chest (405 mGy.cm), and abdomen-pelvis (1195 mGy.cm) CT were above the UK, German, and American College of Radiology Diagnostic Reference Levels (DRLs). CONCLUSIONS: Concerning variations in CT practices and protocols across several hospitals in Africa were demonstrated, emphasizing the need for better protocol optimization to improve patient safety.

6.
Clin Imaging ; 112: 110207, 2024 Aug.
Article de Anglais | MEDLINE | ID: mdl-38838448

RÉSUMÉ

PURPOSE: We created an infrastructure for no code machine learning (NML) platform for non-programming physicians to create NML model. We tested the platform by creating an NML model for classifying radiographs for the presence and absence of clavicle fractures. METHODS: Our IRB-approved retrospective study included 4135 clavicle radiographs from 2039 patients (mean age 52 ± 20 years, F:M 1022:1017) from 13 hospitals. Each patient had two-view clavicle radiographs with axial and anterior-posterior projections. The positive radiographs had either displaced or non-displaced clavicle fractures. We configured the NML platform to automatically retrieve the eligible exams using the series' unique identification from the hospital virtual network archive via web access to DICOM Objects. The platform trained a model until the validation loss plateaus. Once the testing was complete, the platform provided the receiver operating characteristics curve and confusion matrix for estimating sensitivity, specificity, and accuracy. RESULTS: The NML platform successfully retrieved 3917 radiographs (3917/4135, 94.7 %) and parsed them for creating a ML classifier with 2151 radiographs in the training, 100 radiographs for validation, and 1666 radiographs in testing datasets (772 radiographs with clavicle fracture, 894 without clavicle fracture). The network identified clavicle fracture with 90 % sensitivity, 87 % specificity, and 88 % accuracy with AUC of 0.95 (confidence interval 0.94-0.96). CONCLUSION: A NML platform can help physicians create and test machine learning models from multicenter imaging datasets such as the one in our study for classifying radiographs based on the presence of clavicle fracture.


Sujet(s)
Clavicule , Fractures osseuses , Apprentissage machine , Humains , Clavicule/traumatismes , Clavicule/imagerie diagnostique , Fractures osseuses/imagerie diagnostique , Fractures osseuses/classification , Femelle , Adulte d'âge moyen , Mâle , Études rétrospectives , Sensibilité et spécificité , Adulte , Radiographie/méthodes
8.
Eur J Radiol ; 175: 111448, 2024 Jun.
Article de Anglais | MEDLINE | ID: mdl-38574510

RÉSUMÉ

PURPOSE: Aim of the recent study is to point out a method to optimize quality of CT scans in oncological patients with port systems. This study investigates the potential of photon counting computed tomography (PCCT) for reduction of beam hardening artifacts caused by port-implants in chest imaging by means of spectral reconstructions. METHOD: In this retrospective single-center study, 8 ROIs for 19 spectral reconstructions (polyenergetic imaging, monoenergetic reconstructions from 40 to 190 keV as well as iodine maps and virtual non contrast (VNC)) of 49 patients with pectoral port systems undergoing PCCT of the chest for staging of oncologic disease were measured. Mean values and standard deviation (SD) Hounsfield unit measurements of port-chamber associated hypo- and hyperdense artifacts, bilateral muscles and vessels has been carried out. Also, a structured assessment of artifacts and imaging findings was performed by two radiologists. RESULTS: A significant association of keV with iodine contrast as well as artifact intensity was noted (all p < 0.001). In qualitative assessment, utilization of 120 keV monoenergetic reconstructions could reduce severe and pronounced artifacts completely, as compared to lower keV reconstructions (p < 0.001). Regarding imaging findings, no significant difference between monoenergetic reconstructions was noted (all p > 0.05). In cases with very high iodine concentrations in the subclavian vein, image distortions were noted at 40 keV images (p < 0.01). CONCLUSIONS: The present study demonstrates that PCCT derived spectral reconstructions can be used in oncological imaging of the thorax to reduce port-derived beam-hardening artefacts. When evaluating image data sets within a staging, it can be particularly helpful to consider the 120 keV VMIs, in which the artefacts are comparatively low.


Sujet(s)
Artéfacts , Radiographie thoracique , Tomodensitométrie , Humains , Mâle , Femelle , Adulte d'âge moyen , Sujet âgé , Tomodensitométrie/méthodes , Radiographie thoracique/méthodes , Études rétrospectives , Adulte , Sujet âgé de 80 ans ou plus , Interprétation d'images radiographiques assistée par ordinateur/méthodes , Photons , Reproductibilité des résultats
9.
Crit Care Explor ; 6(2): e1040, 2024 Feb.
Article de Anglais | MEDLINE | ID: mdl-38511125

RÉSUMÉ

OBJECTIVES: To investigate the contribution of mechanical obstruction and pulmonary vasoconstriction to pulmonary vascular resistance (PVR) in acute pulmonary embolism (PE) in pigs. DESIGN: Controlled, animal study. SETTING: Tertiary university hospital, animal research laboratory. SUBJECTS: Female Danish slaughter pigs (n = 12, ~60 kg). INTERVENTIONS: None. MEASUREMENTS AND MAIN RESULTS: PE was induced by infusion of autologous blood clots in pigs. CT pulmonary angiograms were performed at baseline, after PE (first experimental day [PEd0]) and the following 2 days (second experimental day [PEd1] and third experimental day [PEd2]), and clot burden quantified by a modified Qanadli Obstruction Score. Hemodynamics were evaluated with left and right heart catheterization and systemic invasive pressures each day before, under, and after treatment with the pulmonary vasodilators sildenafil (0.1 mg/kg) and oxygen (Fio2 40%). PE increased PVR (baseline vs. PEd0: 178 ± 54 vs. 526 ± 160 dynes; p < 0.0001) and obstruction score (baseline vs. PEd0: 0% vs. 45% ± 13%; p < 0.0001). PVR decreased toward baseline at day 1 (baseline vs. PEd1: 178 ± 54 vs. 219 ± 48; p = 0.16) and day 2 (baseline vs. PEd2: 178 ± 54 vs. 201 ± 50; p = 0.51). Obstruction score decreased only slightly at day 1 (PEd0 vs. PEd1: 45% ± 12% vs. 43% ± 14%; p = 0.04) and remained elevated throughout the study (PEd1 vs. PEd2: 43% ± 14% vs. 42% ± 17%; p = 0.74). Sildenafil and oxygen in combination decreased PVR at day 0 (-284 ± 154 dynes; p = 0.0064) but had no effects at day 1 (-8 ± 27 dynes; p = 0.4827) or day 2 (-18 ± 32 dynes; p = 0.0923). CONCLUSIONS: Pulmonary vasoconstriction, and not mechanical obstruction, was the predominant cause of increased PVR in acute PE in pigs. PVR rapidly declined over the first 2 days after onset despite a persistent mechanical obstruction of the pulmonary circulation from emboli. The findings suggest that treatment with pulmonary vasodilators might only be effective in the acute phase of PE thereby limiting the window for such therapy.

10.
Sci Rep ; 14(1): 7154, 2024 03 26.
Article de Anglais | MEDLINE | ID: mdl-38531923

RÉSUMÉ

Due to the intricate relationship between the small non-coding ribonucleic acid (miRNA) sequences, the classification of miRNA species, namely Human, Gorilla, Rat, and Mouse is challenging. Previous methods are not robust and accurate. In this study, we present AtheroPoint's GeneAI 3.0, a powerful, novel, and generalized method for extracting features from the fixed patterns of purines and pyrimidines in each miRNA sequence in ensemble paradigms in machine learning (EML) and convolutional neural network (CNN)-based deep learning (EDL) frameworks. GeneAI 3.0 utilized five conventional (Entropy, Dissimilarity, Energy, Homogeneity, and Contrast), and three contemporary (Shannon entropy, Hurst exponent, Fractal dimension) features, to generate a composite feature set from given miRNA sequences which were then passed into our ML and DL classification framework. A set of 11 new classifiers was designed consisting of 5 EML and 6 EDL for binary/multiclass classification. It was benchmarked against 9 solo ML (SML), 6 solo DL (SDL), 12 hybrid DL (HDL) models, resulting in a total of 11 + 27 = 38 models were designed. Four hypotheses were formulated and validated using explainable AI (XAI) as well as reliability/statistical tests. The order of the mean performance using accuracy (ACC)/area-under-the-curve (AUC) of the 24 DL classifiers was: EDL > HDL > SDL. The mean performance of EDL models with CNN layers was superior to that without CNN layers by 0.73%/0.92%. Mean performance of EML models was superior to SML models with improvements of ACC/AUC by 6.24%/6.46%. EDL models performed significantly better than EML models, with a mean increase in ACC/AUC of 7.09%/6.96%. The GeneAI 3.0 tool produced expected XAI feature plots, and the statistical tests showed significant p-values. Ensemble models with composite features are highly effective and generalized models for effectively classifying miRNA sequences.


Sujet(s)
Apprentissage profond , microARN , Humains , Animaux , Souris , Rats , Nucléotides , Reproductibilité des résultats , Aire sous la courbe
11.
Radiol Artif Intell ; 6(1): e220221, 2024 Jan.
Article de Anglais | MEDLINE | ID: mdl-38166328

RÉSUMÉ

Purpose To determine whether saliency maps in radiology artificial intelligence (AI) are vulnerable to subtle perturbations of the input, which could lead to misleading interpretations, using prediction-saliency correlation (PSC) for evaluating the sensitivity and robustness of saliency methods. Materials and Methods In this retrospective study, locally trained deep learning models and a research prototype provided by a commercial vendor were systematically evaluated on 191 229 chest radiographs from the CheXpert dataset and 7022 MR images from a human brain tumor classification dataset. Two radiologists performed a reader study on 270 chest radiograph pairs. A model-agnostic approach for computing the PSC coefficient was used to evaluate the sensitivity and robustness of seven commonly used saliency methods. Results The saliency methods had low sensitivity (maximum PSC, 0.25; 95% CI: 0.12, 0.38) and weak robustness (maximum PSC, 0.12; 95% CI: 0.0, 0.25) on the CheXpert dataset, as demonstrated by leveraging locally trained model parameters. Further evaluation showed that the saliency maps generated from a commercial prototype could be irrelevant to the model output, without knowledge of the model specifics (area under the receiver operating characteristic curve decreased by 8.6% without affecting the saliency map). The human observer studies confirmed that it is difficult for experts to identify the perturbed images; the experts had less than 44.8% correctness. Conclusion Popular saliency methods scored low PSC values on the two datasets of perturbed chest radiographs, indicating weak sensitivity and robustness. The proposed PSC metric provides a valuable quantification tool for validating the trustworthiness of medical AI explainability. Keywords: Saliency Maps, AI Trustworthiness, Dynamic Consistency, Sensitivity, Robustness Supplemental material is available for this article. © RSNA, 2023 See also the commentary by Yanagawa and Sato in this issue.


Sujet(s)
Intelligence artificielle , Radiologie , Humains , Études rétrospectives , Radiographie , Radiologues
12.
Pediatr Radiol ; 54(3): 457-467, 2024 03.
Article de Anglais | MEDLINE | ID: mdl-37227466

RÉSUMÉ

We established a framework for collecting radiation doses for head, chest and abdomen-pelvis computed tomography (CT) in children scanned at multiple imaging sites across Latin America with an aim towards establishing diagnostic reference levels (DRLs) and achievable doses (ADs) in pediatric CT in Latin America. Our study included 12 Latin American sites (in Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Honduras and Panama) contributing data on the four most common pediatric CT examinations (non-contrast head, non-contrast chest, post-contrast chest and post-contrast abdomen-pelvis). Sites contributed data on patients' age, sex and weight, scan factors (tube current and potential), volume CT dose index (CTDIvol) and dose length product (DLP). Data were verified, leading to the exclusion of two sites with missing or incorrect data entries. We estimated overall and site-specific 50th (AD) and 75th (diagnostic reference level [DRL]) percentile CTDIvol and DLP for each CT protocol. Non-normal data were compared using the Kruskal-Wallis test. Sites contributed data from 3,934 children (1,834 females) for different CT exams (head CT 1,568/3,934, 40%; non-contrast chest CT 945/3,934, 24%; post-contrast chest CT 581/3,934, 15%; abdomen-pelvis CT 840/3,934, 21%). There were significant statistical differences in 50th and 75th percentile CTDIvol and DLP values across the participating sites (P<0.001). The 50th and 75th percentile doses for most CT protocols were substantially higher than the corresponding doses reported from the United States of America. Our study demonstrates substantial disparities and variations in pediatric CT examinations performed in multiple sites in Latin America. We will use the collected data to improve scan protocols and perform a follow-up CT study to establish DRLs and ADs based on clinical indications.


Sujet(s)
Niveaux de référence diagnostiques , Tomodensitométrie , Femelle , Humains , Enfant , Amérique latine , Dose de rayonnement , Valeurs de référence , Tomodensitométrie/méthodes
15.
Acad Radiol ; 2023 Dec 11.
Article de Anglais | MEDLINE | ID: mdl-38087718

RÉSUMÉ

RATIONALE AND OBJECTIVES: To assess differences in radiomics derived from semi-automatic segmentation of liver metastases for stable disease (SD), partial response (PR), and progressive disease (PD) based on RECIST1.1 and to assess if radiomics alone at baseline can predict response. MATERIALS AND METHODS: Our IRB-approved study included 203 women (mean age 54 ± 11 years) with metastatic liver disease from breast cancer. All patients underwent contrast abdomen-pelvis CT in the portal venous phase at two points: baseline (pre-treatment) and follow-up (between 3 and 12 months following treatment). Patients were subcategorized into three subgroups based on RECIST 1.1 criteria (Response Evaluation Criteria in Solid Tumors version 1.1): 66 with SD, 69 with PR, and 68 with PD on follow-up CT. The deidentified baseline and follow-up CT images were exported to the radiomics prototype. The prototype enabled semi-automatic segmentation of the target liver lesions for the extraction of first and high order radiomics. Statistical analyses with logistic regression and random forest classifiers were performed to differentiate SD from PD and PR. RESULTS: There was no significant difference between the radiomics on the baseline and follow-up CT images of patients with SD (area under the curve (AUC): 0.3). Random forest classifier differentiated patients with PR with an AUC of 0.845. The most relevant feature was the large dependence emphasis's high and low pass wavelet filter (derived gray level dependence matrix features). Random forest classifier differentiated PD with an AUC of 0.731, with the most relevant feature being the surface-to-volume ratio. There was no difference in radiomics among the three groups at baseline; therefore, a response could not be predicted. CONCLUSION: Radiomics of liver metastases with semi-automatic segmentation demonstrate differences between SD from PR and PD. SUMMARY STATEMENT: Semiautomatic segmentation and radiomics of metastatic liver disease demonstrate differences in SD from the PR and progressive metastatic on the baseline and follow-up CT. Despite substantial variations in the scanners, acquisition, and reconstruction parameters, radiomics had an AUC of 0.84-0.89 for differentiating stable hepatic metastases from decreasing and increasing metastatic disease.

16.
J Korean Med Sci ; 38(46): e395, 2023 Nov 27.
Article de Anglais | MEDLINE | ID: mdl-38013648

RÉSUMÉ

Cardiovascular disease (CVD) related mortality and morbidity heavily strain society. The relationship between external risk factors and our genetics have not been well established. It is widely acknowledged that environmental influence and individual behaviours play a significant role in CVD vulnerability, leading to the development of polygenic risk scores (PRS). We employed the PRISMA search method to locate pertinent research and literature to extensively review artificial intelligence (AI)-based PRS models for CVD risk prediction. Furthermore, we analyzed and compared conventional vs. AI-based solutions for PRS. We summarized the recent advances in our understanding of the use of AI-based PRS for risk prediction of CVD. Our study proposes three hypotheses: i) Multiple genetic variations and risk factors can be incorporated into AI-based PRS to improve the accuracy of CVD risk predicting. ii) AI-based PRS for CVD circumvents the drawbacks of conventional PRS calculators by incorporating a larger variety of genetic and non-genetic components, allowing for more precise and individualised risk estimations. iii) Using AI approaches, it is possible to significantly reduce the dimensionality of huge genomic datasets, resulting in more accurate and effective disease risk prediction models. Our study highlighted that the AI-PRS model outperformed traditional PRS calculators in predicting CVD risk. Furthermore, using AI-based methods to calculate PRS may increase the precision of risk predictions for CVD and have significant ramifications for individualized prevention and treatment plans.


Sujet(s)
Maladies cardiovasculaires , Humains , Maladies cardiovasculaires/diagnostic , Maladies cardiovasculaires/génétique , Intelligence artificielle , Facteurs de risque
17.
Diagnostics (Basel) ; 13(19)2023 Oct 09.
Article de Anglais | MEDLINE | ID: mdl-37835902

RÉSUMÉ

Skin lesion classification plays a crucial role in dermatology, aiding in the early detection, diagnosis, and management of life-threatening malignant lesions. However, standalone transfer learning (TL) models failed to deliver optimal performance. In this study, we present an attention-enabled ensemble-based deep learning technique, a powerful, novel, and generalized method for extracting features for the classification of skin lesions. This technique holds significant promise in enhancing diagnostic accuracy by using seven pre-trained TL models for classification. Six ensemble-based DL (EBDL) models were created using stacking, softmax voting, and weighted average techniques. Furthermore, we investigated the attention mechanism as an effective paradigm and created seven attention-enabled transfer learning (aeTL) models before branching out to construct three attention-enabled ensemble-based DL (aeEBDL) models to create a reliable, adaptive, and generalized paradigm. The mean accuracy of the TL models is 95.30%, and the use of an ensemble-based paradigm increased it by 4.22%, to 99.52%. The aeTL models' performance was superior to the TL models in accuracy by 3.01%, and aeEBDL models outperformed aeTL models by 1.29%. Statistical tests show significant p-value and Kappa coefficient along with a 99.6% reliability index for the aeEBDL models. The approach is highly effective and generalized for the classification of skin lesions.

19.
Diagnostics (Basel) ; 13(11)2023 Jun 02.
Article de Anglais | MEDLINE | ID: mdl-37296806

RÉSUMÉ

BACKGROUND AND MOTIVATION: Lung computed tomography (CT) techniques are high-resolution and are well adopted in the intensive care unit (ICU) for COVID-19 disease control classification. Most artificial intelligence (AI) systems do not undergo generalization and are typically overfitted. Such trained AI systems are not practical for clinical settings and therefore do not give accurate results when executed on unseen data sets. We hypothesize that ensemble deep learning (EDL) is superior to deep transfer learning (TL) in both non-augmented and augmented frameworks. METHODOLOGY: The system consists of a cascade of quality control, ResNet-UNet-based hybrid deep learning for lung segmentation, and seven models using TL-based classification followed by five types of EDL's. To prove our hypothesis, five different kinds of data combinations (DC) were designed using a combination of two multicenter cohorts-Croatia (80 COVID) and Italy (72 COVID and 30 controls)-leading to 12,000 CT slices. As part of generalization, the system was tested on unseen data and statistically tested for reliability/stability. RESULTS: Using the K5 (80:20) cross-validation protocol on the balanced and augmented dataset, the five DC datasets improved TL mean accuracy by 3.32%, 6.56%, 12.96%, 47.1%, and 2.78%, respectively. The five EDL systems showed improvements in accuracy of 2.12%, 5.78%, 6.72%, 32.05%, and 2.40%, thus validating our hypothesis. All statistical tests proved positive for reliability and stability. CONCLUSION: EDL showed superior performance to TL systems for both (a) unbalanced and unaugmented and (b) balanced and augmented datasets for both (i) seen and (ii) unseen paradigms, validating both our hypotheses.

SÉLECTION CITATIONS
DÉTAIL DE RECHERCHE