Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 43
Filtrar
1.
Eur Radiol ; 2024 Jul 23.
Artículo en Inglés | MEDLINE | ID: mdl-39042303

RESUMEN

OBJECTIVES: This study aims to externally validate a commercially available Computer-Aided Detection (CAD)-system for the automatic detection and characterization of solid, part-solid, and ground-glass lung nodules (LN) on CT scans. METHODS: This retrospective study encompasses 263 chest CT scans performed between January 2020 and December 2021 at a Dutch university hospital. All scans were read by a radiologist (R1) and compared with the initial radiology report. Conflicting scans were assessed by an adjudicating radiologist (R2). All scans were also processed by CAD. The standalone performance of CAD in terms of sensitivity and false-positive (FP)-rate for detection was calculated together with the sensitivity for characterization, including texture, calcification, speculation, and location. The R1's detection sensitivity was also assessed. RESULTS: A total of 183 true nodules were identified in 121 nodule-containing scans (142 non-nodule-containing scans), of which R1 identified 165/183 (90.2%). CAD detected 149 nodules, of which 12 were not identified by R1, achieving a sensitivity of 149/183 (81.4%) with an FP-rate of 49/121 (0.405). CAD's detection sensitivity for solid, part-solid, and ground-glass LNs was 82/94 (87.2%), 42/47 (89.4%), and 25/42 (59.5%), respectively. The classification accuracy for solid, part-solid, and ground-glass LNs was 81/82 (98.8%), 16/42 (38.1%), and 18/25 (72.0%), respectively. Additionally, CAD demonstrated overall classification accuracies of 137/149 (91.9%), 123/149 (82.6%), and 141/149 (94.6%) for calcification, spiculation, and location, respectively. CONCLUSIONS: Although the overall detection rate of this system slightly lags behind that of a radiologist, CAD is capable of detecting different LNs and thereby has the potential to enhance a reader's detection rate. While promising characterization performances are obtained, the tool's performance in terms of texture classification remains a subject of concern. CLINICAL RELEVANCE STATEMENT: Numerous lung nodule computer-aided detection-systems are commercially available, with some of them solely being externally validated based on their detection performance on solid nodules. We encourage researchers to assess performances by incorporating all relevant characteristics, including part-solid and ground-glass nodules. KEY POINTS: Few computer-aided detection (CAD) systems are externally validated for automatic detection and characterization of lung nodules. A detection sensitivity of 81.4% and an overall texture classification sensitivity of 77.2% were measured utilizing CAD. CAD has the potential to increase single reader detection rate, however, improvement in texture classification is required.

2.
Radiology ; 312(1): e233341, 2024 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-38980184

RESUMEN

Background Due to conflicting findings in the literature, there are concerns about a lack of objectivity in grading knee osteoarthritis (KOA) on radiographs. Purpose To examine how artificial intelligence (AI) assistance affects the performance and interobserver agreement of radiologists and orthopedists of various experience levels when evaluating KOA on radiographs according to the established Kellgren-Lawrence (KL) grading system. Materials and Methods In this retrospective observer performance study, consecutive standing knee radiographs from patients with suspected KOA were collected from three participating European centers between April 2019 and May 2022. Each center recruited four readers across radiology and orthopedic surgery at in-training and board-certified experience levels. KL grading (KL-0 = no KOA, KL-4 = severe KOA) on the frontal view was assessed by readers with and without assistance from a commercial AI tool. The majority vote of three musculoskeletal radiology consultants established the reference standard. The ordinal receiver operating characteristic method was used to estimate grading performance. Light kappa was used to estimate interrater agreement, and bootstrapped t statistics were used to compare groups. Results Seventy-five studies were included from each center, totaling 225 studies (mean patient age, 55 years ± 15 [SD]; 113 female patients). The KL grades were KL-0, 24.0% (n = 54); KL-1, 28.0% (n = 63); KL-2, 21.8% (n = 49); KL-3, 18.7% (n = 42); and KL-4, 7.6% (n = 17). Eleven readers completed their readings. Three of the six junior readers showed higher KL grading performance with versus without AI assistance (area under the receiver operating characteristic curve, 0.81 ± 0.017 [SEM] vs 0.88 ± 0.011 [P < .001]; 0.76 ± 0.018 vs 0.86 ± 0.013 [P < .001]; and 0.89 ± 0.011 vs 0.91 ± 0.009 [P = .008]). Interobserver agreement for KL grading among all readers was higher with versus without AI assistance (κ = 0.77 ± 0.018 [SEM] vs 0.85 ± 0.013; P < .001). Board-certified radiologists achieved almost perfect agreement for KL grading when assisted by AI (κ = 0.90 ± 0.01), which was higher than that achieved by the reference readers independently (κ = 0.84 ± 0.017; P = .01). Conclusion AI assistance increased junior readers' radiographic KOA grading performance and increased interobserver agreement for osteoarthritis grading across all readers and experience levels. Published under a CC BY 4.0 license. Supplemental material is available for this article.


Asunto(s)
Inteligencia Artificial , Variaciones Dependientes del Observador , Osteoartritis de la Rodilla , Humanos , Femenino , Masculino , Osteoartritis de la Rodilla/diagnóstico por imagen , Persona de Mediana Edad , Estudios Retrospectivos , Radiografía/métodos , Anciano
3.
Quant Imaging Med Surg ; 14(6): 3778-3788, 2024 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-38846290

RESUMEN

Background: While current preoperative and postoperative assessment of the fractured and surgically reconstructed calcaneus relies on computed tomography (CT)-imaging, there are no established methods to quantify calcaneus morphology on CT-images. This study aims to develop a semi-automated method for morphological measurements of the calcaneus on three-dimensional (3D) models derived from CT-imaging. Methods: Using CT data, 3D models were created from healthy, fractured, and surgically reconstructed calcanei. Böhler's angle (BA) and Critical angle of Gissane (CAG) were measured on conventional lateral radiographs and corresponding 3D CT reconstructions using a novel point-based method with semi-automatic landmark placement by three observers. Intraobserver and interobserver reliability scores were calculated using intra-class correlation coefficient (ICC). In addition, consensus among observers was calculated for a maximal allowable discrepancy of 5 and 10 degrees for both methods. Results: Imaging data from 119 feet were obtained (40 healthy, 39 fractured, 40 reconstructed). Semi-automated measurements on 3D models of BA and CAG showed excellent reliability (ICC: 0.87-1.00). The manual measurements on conventional radiographs had a poor-to-excellent reliability (ICC: 0.22-0.96). In addition, the percentage of consensus among observers was much higher for the 3D method when compared to conventional two-dimensional (2D) measurements. Conclusions: The proposed method enables reliable and reproducible quantification of calcaneus morphology in 3D models of healthy, fractured and reconstructed calcanei.

4.
Cancers (Basel) ; 16(11)2024 May 28.
Artículo en Inglés | MEDLINE | ID: mdl-38893158

RESUMEN

Malignant peripheral nerve sheath tumors (MPNSTs) are aggressive soft-tissue tumors prevalent in neurofibromatosis type 1 (NF1) patients, posing a significant risk of metastasis and recurrence. Current magnetic resonance imaging (MRI) imaging lacks decisiveness in distinguishing benign peripheral nerve sheath tumors (BPNSTs) and MPNSTs, necessitating invasive biopsies. This study aims to develop a radiomics model using quantitative imaging features and machine learning to distinguish MPNSTs from BPNSTs. Clinical data and MRIs from MPNST and BPNST patients (2000-2019) were collected at a tertiary sarcoma referral center. Lesions were manually and semi-automatically segmented on MRI scans, and radiomics features were extracted using the Workflow for Optimal Radiomics Classification (WORC) algorithm, employing automated machine learning. The evaluation was conducted using a 100× random-split cross-validation. A total of 35 MPNSTs and 74 BPNSTs were included. The T1-weighted (T1w) MRI radiomics model outperformed others with an area under the curve (AUC) of 0.71. The incorporation of additional MRI scans did not enhance performance. Combining T1w MRI with clinical features achieved an AUC of 0.74. Experienced radiologists achieved AUCs of 0.75 and 0.66, respectively. Radiomics based on T1w MRI scans and clinical features show some ability to distinguish MPNSTs from BPNSTs, potentially aiding in the management of these tumors.

5.
Skeletal Radiol ; 2024 Jun 20.
Artículo en Inglés | MEDLINE | ID: mdl-38902420

RESUMEN

This article will provide a perspective review of the most extensively investigated deep learning (DL) applications for musculoskeletal disease detection that have the best potential to translate into routine clinical practice over the next decade. Deep learning methods for detecting fractures, estimating pediatric bone age, calculating bone measurements such as lower extremity alignment and Cobb angle, and grading osteoarthritis on radiographs have been shown to have high diagnostic performance with many of these applications now commercially available for use in clinical practice. Many studies have also documented the feasibility of using DL methods for detecting joint pathology and characterizing bone tumors on magnetic resonance imaging (MRI). However, musculoskeletal disease detection on MRI is difficult as it requires multi-task, multi-class detection of complex abnormalities on multiple image slices with different tissue contrasts. The generalizability of DL methods for musculoskeletal disease detection on MRI is also challenging due to fluctuations in image quality caused by the wide variety of scanners and pulse sequences used in routine MRI protocols. The diagnostic performance of current DL methods for musculoskeletal disease detection must be further evaluated in well-designed prospective studies using large image datasets acquired at different institutions with different imaging parameters and imaging hardware before they can be fully implemented in clinical practice. Future studies must also investigate the true clinical benefits of current DL methods and determine whether they could enhance quality, reduce error rates, improve workflow, and decrease radiologist fatigue and burnout with all of this weighed against the costs.

6.
J Orthop Res ; 2024 May 06.
Artículo en Inglés | MEDLINE | ID: mdl-38711242

RESUMEN

In 3D-analysis of the calcaneus, a consistent coordinate system aligned with the original anatomical directions is crucial for pre- and postoperative analysis. This importance stems from the calcaneus's key role in weight-bearing and biomechanical alignment. However, defining a reliable coordinate system based solely on fractured or surgically reconstructed calcanei presents significant challenges. Given its anatomical prominence and consistent orientation, the talus offers a potential solution to this challenge. Our work explores the feasibility of talus-derived coordinate systems for 3D-modeling of the calcaneus across its various conditions. Four methods were tested on nonfractured, fractured and surgically reconstructed calcanei, utilizing Principal Component Analysis, anatomical landmarks, bounding box, and an atlas-based approach. The methods were compared with a self-defined calcaneus reference coordinate system. Additionally, the impact of deviation of the coordinate system on morphological measurements was investigated. Among methods for constructing nonfractured calcanei coordinate systems, the atlas-based method displayed the lowest Root Mean Square value in comparison with the reference coordinate system. For morphological measures like Böhler's Angle and the Critical angle of Gissane, the atlas talus-based system closely aligned with ground truth, yielding differences of 0.6° and 1.2°, respectively, compared to larger deviations seen in other talus-based coordinate systems. In conclusion, all tested methods were feasible for creating a talus derived coordinate system. A talus derived coordinate system showed potential, offering benefits for morphological measurements and clinical scenarios involving fractured and surgically reconstructed calcanei. Further research is recommended to assess the impact of these coordinate systems on surgical planning and outcomes.

7.
Eur Radiol ; 2024 Mar 11.
Artículo en Inglés | MEDLINE | ID: mdl-38466390

RESUMEN

OBJECTIVES: To evaluate an artificial intelligence (AI)-assisted double reading system for detecting clinically relevant missed findings on routinely reported chest radiographs. METHODS: A retrospective study was performed in two institutions, a secondary care hospital and tertiary referral oncology centre. Commercially available AI software performed a comparative analysis of chest radiographs and radiologists' authorised reports using a deep learning and natural language processing algorithm, respectively. The AI-detected discrepant findings between images and reports were assessed for clinical relevance by an external radiologist, as part of the commercial service provided by the AI vendor. The selected missed findings were subsequently returned to the institution's radiologist for final review. RESULTS: In total, 25,104 chest radiographs of 21,039 patients (mean age 61.1 years ± 16.2 [SD]; 10,436 men) were included. The AI software detected discrepancies between imaging and reports in 21.1% (5289 of 25,104). After review by the external radiologist, 0.9% (47 of 5289) of cases were deemed to contain clinically relevant missed findings. The institution's radiologists confirmed 35 of 47 missed findings (74.5%) as clinically relevant (0.1% of all cases). Missed findings consisted of lung nodules (71.4%, 25 of 35), pneumothoraces (17.1%, 6 of 35) and consolidations (11.4%, 4 of 35). CONCLUSION: The AI-assisted double reading system was able to identify missed findings on chest radiographs after report authorisation. The approach required an external radiologist to review the AI-detected discrepancies. The number of clinically relevant missed findings by radiologists was very low. CLINICAL RELEVANCE STATEMENT: The AI-assisted double reader workflow was shown to detect diagnostic errors and could be applied as a quality assurance tool. Although clinically relevant missed findings were rare, there is potential impact given the common use of chest radiography. KEY POINTS: • A commercially available double reading system supported by artificial intelligence was evaluated to detect reporting errors in chest radiographs (n=25,104) from two institutions. • Clinically relevant missed findings were found in 0.1% of chest radiographs and consisted of unreported lung nodules, pneumothoraces and consolidations. • Applying AI software as a secondary reader after report authorisation can assist in reducing diagnostic errors without interrupting the radiologist's reading workflow. However, the number of AI-detected discrepancies was considerable and required review by a radiologist to assess their relevance.

9.
Insights Imaging ; 15(1): 34, 2024 Feb 05.
Artículo en Inglés | MEDLINE | ID: mdl-38315288

RESUMEN

OBJECTIVE: To provide a comprehensive framework for value assessment of artificial intelligence (AI) in radiology. METHODS: This paper presents the RADAR framework, which has been adapted from Fryback and Thornbury's imaging efficacy framework to facilitate the valuation of radiology AI from conception to local implementation. Local efficacy has been newly introduced to underscore the importance of appraising an AI technology within its local environment. Furthermore, the RADAR framework is illustrated through a myriad of study designs that help assess value. RESULTS: RADAR presents a seven-level hierarchy, providing radiologists, researchers, and policymakers with a structured approach to the comprehensive assessment of value in radiology AI. RADAR is designed to be dynamic and meet the different valuation needs throughout the AI's lifecycle. Initial phases like technical and diagnostic efficacy (RADAR-1 and RADAR-2) are assessed pre-clinical deployment via in silico clinical trials and cross-sectional studies. Subsequent stages, spanning from diagnostic thinking to patient outcome efficacy (RADAR-3 to RADAR-5), require clinical integration and are explored via randomized controlled trials and cohort studies. Cost-effectiveness efficacy (RADAR-6) takes a societal perspective on financial feasibility, addressed via health-economic evaluations. The final level, RADAR-7, determines how prior valuations translate locally, evaluated through budget impact analysis, multi-criteria decision analyses, and prospective monitoring. CONCLUSION: The RADAR framework offers a comprehensive framework for valuing radiology AI. Its layered, hierarchical structure, combined with a focus on local relevance, aligns RADAR seamlessly with the principles of value-based radiology. CRITICAL RELEVANCE STATEMENT: The RADAR framework advances artificial intelligence in radiology by delineating a much-needed framework for comprehensive valuation. KEYPOINTS: • Radiology artificial intelligence lacks a comprehensive approach to value assessment. • The RADAR framework provides a dynamic, hierarchical method for thorough valuation of radiology AI. • RADAR advances clinical radiology by bridging the artificial intelligence implementation gap.

10.
Eur J Radiol ; 173: 111375, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38377894

RESUMEN

BACKGROUND: Artificial intelligence (AI) applications can facilitate detection of cervical spine fractures on CT and reduce time to diagnosis by prioritizing suspected cases. PURPOSE: To assess the effect on time to diagnose cervical spine fractures on CT and diagnostic accuracy of a commercially available AI application. MATERIALS AND METHODS: In this study (June 2020 - March 2022) with historic controls and prospective evaluation, we evaluated regulatory-cleared AI-software to prioritize cervical spine fractures on CT. All patients underwent non-contrast CT of the cervical spine. The time between CT acquisition and the moment the scan was first opened (DNT) was compared between the retrospective and prospective cohorts. The reference standard for determining diagnostic accuracy was the radiology report created in routine clinical workflow and adjusted by a senior radiologist. Discrepant cases were reviewed and clinical relevance of missed fractures was determined. RESULTS: 2973 (mean age, 55.4 ± 19.7 [standard deviation]; 1857 men) patients were analyzed by AI, including 2036 retrospective and 938 prospective cases. Overall prevalence of cervical spine fractures was 7.6 %. The DNT was 18 % (5 min) shorter in the prospective cohort. In scans positive for cervical spine fracture according to the reference standard, DNT was 46 % (16 min) shorter in the prospective cohort. Overall sensitivity of the AI application was 89.8 % (95 % CI: 84.2-94.0 %), specificity was 95.3 % (95 % CI: 94.2-96.2 %), and diagnostic accuracy was 94.8 % (95 % CI: 93.8-95.8 %). Negative predictive value was 99.1 % (95 % CI: 98.5-99.4 %) and positive predictive value was 63.0 % (95 % CI: 58.0-67.8 %). 22 fractures were missed by AI of which 5 required stabilizing therapy. CONCLUSION: A time gain of 16 min to diagnosis for fractured cases was observed after introducing AI. Although AI-assisted workflow prioritization of cervical spine fractures on CT shows high diagnostic accuracy, clinically relevant cases were missed.


Asunto(s)
Fracturas Óseas , Fracturas de la Columna Vertebral , Masculino , Humanos , Adulto , Persona de Mediana Edad , Anciano , Inteligencia Artificial , Estudios Retrospectivos , Tomografía Computarizada por Rayos X , Fracturas de la Columna Vertebral/diagnóstico por imagen , Vértebras Cervicales/diagnóstico por imagen , Vértebras Cervicales/lesiones , Algoritmos
11.
Nat Commun ; 14(1): 7994, 2023 Dec 02.
Artículo en Inglés | MEDLINE | ID: mdl-38042913

RESUMEN

Aortic aneurysms, which may dissect or rupture acutely and be lethal, can be a part of multisystem disorders that have a heritable basis. We report four patients with deficiency of selenocysteine-containing proteins due to selenocysteine Insertion Sequence Binding Protein 2 (SECISBP2) mutations who show early-onset, progressive, aneurysmal dilatation of the ascending aorta due to cystic medial necrosis. Zebrafish and male mice with global or vascular smooth muscle cell (VSMC)-targeted disruption of Secisbp2 respectively show similar aortopathy. Aortas from patients and animal models exhibit raised cellular reactive oxygen species, oxidative DNA damage and VSMC apoptosis. Antioxidant exposure or chelation of iron prevents oxidative damage in patient's cells and aortopathy in the zebrafish model. Our observations suggest a key role for oxidative stress and cell death, including via ferroptosis, in mediating aortic degeneration.


Asunto(s)
Aneurisma de la Aorta , Pez Cebra , Humanos , Masculino , Ratones , Animales , Selenocisteína , Músculo Liso Vascular/metabolismo , Aneurisma de la Aorta/genética , Aneurisma de la Aorta/metabolismo , Selenoproteínas/genética , Miocitos del Músculo Liso/metabolismo
12.
Radiol Cardiothorac Imaging ; 5(2): e220163, 2023 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-37124638

RESUMEN

Purpose: To evaluate the diagnostic efficacy of artificial intelligence (AI) software in detecting incidental pulmonary embolism (IPE) at CT and shorten the time to diagnosis with use of radiologist reading worklist prioritization. Materials and Methods: In this study with historical controls and prospective evaluation, regulatory-cleared AI software was evaluated to prioritize IPE on routine chest CT scans with intravenous contrast agent in adult oncology patients. Diagnostic accuracy metrics were calculated, and temporal end points, including detection and notification times (DNTs), were assessed during three time periods (April 2019 to September 2020): routine workflow without AI, human triage without AI, and worklist prioritization with AI. Results: In total, 11 736 CT scans in 6447 oncology patients (mean age, 63 years ± 12 [SD]; 3367 men) were included. Prevalence of IPE was 1.3% (51 of 3837 scans), 1.4% (54 of 3920 scans), and 1.0% (38 of 3979 scans) for the respective time periods. The AI software detected 131 true-positive, 12 false-negative, 31 false-positive, and 11 559 true-negative results, achieving 91.6% sensitivity, 99.7% specificity, 99.9% negative predictive value, and 80.9% positive predictive value. During prospective evaluation, AI-based worklist prioritization reduced the median DNT for IPE-positive examinations to 87 minutes (vs routine workflow of 7714 minutes and human triage of 4973 minutes). Radiologists' missed rate of IPE was significantly reduced from 44.8% (47 of 105 scans) without AI to 2.6% (one of 38 scans) when assisted by the AI tool (P < .001). Conclusion: AI-assisted workflow prioritization of IPE on routine CT scans in oncology patients showed high diagnostic accuracy and significantly shortened the time to diagnosis in a setting with a backlog of examinations.Keywords: CT, Computer Applications, Detection, Diagnosis, Embolism, Thorax, ThrombosisSupplemental material is available for this article.© RSNA, 2023See also the commentary by Elicker in this issue.

13.
PLoS One ; 18(5): e0285121, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37130128

RESUMEN

BACKGROUND: Recently, artificial intelligence (AI)-based applications for chest imaging have emerged as potential tools to assist clinicians in the diagnosis and management of patients with coronavirus disease 2019 (COVID-19). OBJECTIVES: To develop a deep learning-based clinical decision support system for automatic diagnosis of COVID-19 on chest CT scans. Secondarily, to develop a complementary segmentation tool to assess the extent of lung involvement and measure disease severity. METHODS: The Imaging COVID-19 AI initiative was formed to conduct a retrospective multicentre cohort study including 20 institutions from seven different European countries. Patients with suspected or known COVID-19 who underwent a chest CT were included. The dataset was split on the institution-level to allow external evaluation. Data annotation was performed by 34 radiologists/radiology residents and included quality control measures. A multi-class classification model was created using a custom 3D convolutional neural network. For the segmentation task, a UNET-like architecture with a backbone Residual Network (ResNet-34) was selected. RESULTS: A total of 2,802 CT scans were included (2,667 unique patients, mean [standard deviation] age = 64.6 [16.2] years, male/female ratio 1.3:1). The distribution of classes (COVID-19/Other type of pulmonary infection/No imaging signs of infection) was 1,490 (53.2%), 402 (14.3%), and 910 (32.5%), respectively. On the external test dataset, the diagnostic multiclassification model yielded high micro-average and macro-average AUC values (0.93 and 0.91, respectively). The model provided the likelihood of COVID-19 vs other cases with a sensitivity of 87% and a specificity of 94%. The segmentation performance was moderate with Dice similarity coefficient (DSC) of 0.59. An imaging analysis pipeline was developed that returned a quantitative report to the user. CONCLUSION: We developed a deep learning-based clinical decision support system that could become an efficient concurrent reading tool to assist clinicians, utilising a newly created European dataset including more than 2,800 CT scans.


Asunto(s)
COVID-19 , Aprendizaje Profundo , Humanos , Femenino , Masculino , Persona de Mediana Edad , COVID-19/diagnóstico por imagen , Inteligencia Artificial , Pulmón/diagnóstico por imagen , Prueba de COVID-19 , Estudios de Cohortes , SARS-CoV-2 , Tomografía Computarizada por Rayos X/métodos
14.
Eur Radiol ; 33(6): 4249-4258, 2023 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-36651954

RESUMEN

OBJECTIVES: Only few published artificial intelligence (AI) studies for COVID-19 imaging have been externally validated. Assessing the generalizability of developed models is essential, especially when considering clinical implementation. We report the development of the International Consortium for COVID-19 Imaging AI (ICOVAI) model and perform independent external validation. METHODS: The ICOVAI model was developed using multicenter data (n = 1286 CT scans) to quantify disease extent and assess COVID-19 likelihood using the COVID-19 Reporting and Data System (CO-RADS). A ResUNet model was modified to automatically delineate lung contours and infectious lung opacities on CT scans, after which a random forest predicted the CO-RADS score. After internal testing, the model was externally validated on a multicenter dataset (n = 400) by independent researchers. CO-RADS classification performance was calculated using linearly weighted Cohen's kappa and segmentation performance using Dice Similarity Coefficient (DSC). RESULTS: Regarding internal versus external testing, segmentation performance of lung contours was equally excellent (DSC = 0.97 vs. DSC = 0.97, p = 0.97). Lung opacities segmentation performance was adequate internally (DSC = 0.76), but significantly worse on external validation (DSC = 0.59, p < 0.0001). For CO-RADS classification, agreement with radiologists on the internal set was substantial (kappa = 0.78), but significantly lower on the external set (kappa = 0.62, p < 0.0001). CONCLUSION: In this multicenter study, a model developed for CO-RADS score prediction and quantification of COVID-19 disease extent was found to have a significant reduction in performance on independent external validation versus internal testing. The limited reproducibility of the model restricted its potential for clinical use. The study demonstrates the importance of independent external validation of AI models. KEY POINTS: • The ICOVAI model for prediction of CO-RADS and quantification of disease extent on chest CT of COVID-19 patients was developed using a large sample of multicenter data. • There was substantial performance on internal testing; however, performance was significantly reduced on external validation, performed by independent researchers. The limited generalizability of the model restricts its potential for clinical use. • Results of AI models for COVID-19 imaging on internal tests may not generalize well to external data, demonstrating the importance of independent external validation.


Asunto(s)
Inteligencia Artificial , COVID-19 , Humanos , Reproducibilidad de los Resultados , Tomografía Computarizada por Rayos X , Algoritmos , Estudios Retrospectivos
15.
J Pers Med ; 12(5)2022 Apr 30.
Artículo en Inglés | MEDLINE | ID: mdl-35629148

RESUMEN

Approximately 25% of the patients with muscle-invasive bladder cancer (MIBC) who are clinically node negative have occult lymph node metastases at radical cystectomy (RC) and pelvic lymph node dissection. The aim of this study was to evaluate preoperative CT-based radiomics to differentiate between pN+ and pN0 disease in patients with clinical stage cT2-T4aN0-N1M0 MIBC. Patients with cT2-T4aN0-N1M0 MIBC, of whom preoperative CT scans and pathology reports were available, were included from the prospective, multicenter CirGuidance trial. After manual segmentation of the lymph nodes, 564 radiomics features were extracted. A combination of different machine-learning methods was used to develop various decision models to differentiate between patients with pN+ and pN0 disease. A total of 209 patients (159 pN0; 50 pN+) were included, with a total of 3153 segmented lymph nodes. None of the individual radiomics features showed significant differences between pN+ and pN0 disease, and none of the radiomics models performed substantially better than random guessing. Hence, CT-based radiomics does not contribute to differentiation between pN+ and pN0 disease in patients with cT2-T4aN0-N1M0 MIBC.

16.
Eur J Surg Oncol ; 48(7): 1543-1549, 2022 07.
Artículo en Inglés | MEDLINE | ID: mdl-35393169

RESUMEN

INTRODUCTION: There is no clear evidence regarding the benefit of restaging for distant metastases after neoadjuvant radiotherapy (RTX) in patients with soft tissue sarcoma (STS) of the extremities and trunk wall. This study aimed to determine how often restaging of the chest identified metastatic disease that altered management in these patients. METHODS: We performed a single-centre retrospective study from 2010 to 2020. All patients with non-metastatic STS of the extremities and trunk wall who were treated with neoadjuvant RTX and received a staging and restaging chest CT scan or X-ray for distant metastasis were included. The outcome of interest was change in treatment strategy due to restaging after neoadjuvant RTX. RESULTS: Within the 144 patients who were staged and treated with neoadjuvant RTX, a restaging chest CT or X-ray was performed in 134 patients (93%). A change in treatment strategy due to new findings at restaging after RTX was observed in 26 out of 134 patients (19%). In 24 patients the scheduled resection of the primary STS was cancelled at restaging (24/134, 18%), given the findings at restaging. The other two patients did receive the intended local resection, but either with palliative intent, or as a part of a previously unplanned multimodality treatment. CONCLUSION: In approximately one in five patients restaging results in a change in treatment strategy. This underlines the added value of routine restaging for distant metastases with chest CT or X-ray after neoadjuvant RTX in patients with STS.


Asunto(s)
Sarcoma , Neoplasias de los Tejidos Blandos , Extremidades/patología , Humanos , Terapia Neoadyuvante/métodos , Estadificación de Neoplasias , Estudios Retrospectivos , Sarcoma/diagnóstico por imagen , Sarcoma/radioterapia , Sarcoma/cirugía , Neoplasias de los Tejidos Blandos/patología
17.
BMJ Health Care Inform ; 29(1)2022 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-35185012

RESUMEN

OBJECTIVE: Although the role of artificial intelligence (AI) in medicine is increasingly studied, most patients do not benefit because the majority of AI models remain in the testing and prototyping environment. The development and implementation trajectory of clinical AI models are complex and a structured overview is missing. We therefore propose a step-by-step overview to enhance clinicians' understanding and to promote quality of medical AI research. METHODS: We summarised key elements (such as current guidelines, challenges, regulatory documents and good practices) that are needed to develop and safely implement AI in medicine. CONCLUSION: This overview complements other frameworks in a way that it is accessible to stakeholders without prior AI knowledge and as such provides a step-by-step approach incorporating all the key elements and current guidelines that are essential for implementation, and can thereby help to move AI from bytes to bedside.


Asunto(s)
Inteligencia Artificial , Investigación Biomédica , Humanos
18.
Eur Radiol ; 32(6): 3996-4002, 2022 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-34989840

RESUMEN

OBJECTIVES: To develop and validate classifiers for automatic detection of actionable findings and documentation of nonroutine communication in routinely delivered radiology reports. METHODS: Two radiologists annotated all actionable findings and communication mentions in a training set of 1,306 radiology reports and a test set of 1,000 reports randomly selected from the electronic health record system of a large tertiary hospital. Various feature sets were constructed based on the impression section of the reports using different preprocessing steps (stemming, removal of stop words, negations, and previously known or stable findings) and n-grams. Random forest classifiers were trained to detect actionable findings, and a decision-rule classifier was trained to find communication mentions. Classifier performance was evaluated by the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity. RESULTS: On the training set, the actionable finding classifier with the highest cross-validated performance was obtained for a feature set of unigrams, after stemming and removal of negated, known, and stable findings. On the test set, this classifier achieved an AUC of 0.876 (95% CI 0.854-0.898). The classifier for communication detection was trained after negation removal, using unigrams as features. The resultant decision rule had a sensitivity of 0.841 (95% CI 0.706-0.921) and specificity of 0.990 (95% CI 0.981-0.994) on the test set. CONCLUSIONS: Automatic detection of actionable findings and subsequent communication in routinely delivered radiology reports is possible. This can serve quality control purposes and may alert radiologists to the presence of actionable findings during reporting. KEY POINTS: • Classifiers were developed for automatic detection of the broad spectrum of actionable findings and subsequent communication mentions in routinely delivered radiology reports. • Straightforward report preprocessing and simple feature sets can produce well-performing classifiers. • The resultant classifiers show good performance for detection of actionable findings and excellent performance for detection of communication mentions.


Asunto(s)
Procesamiento de Lenguaje Natural , Radiología , Comunicación , Humanos , Aprendizaje Automático
19.
Neuroradiology ; 64(7): 1359-1366, 2022 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-35032183

RESUMEN

PURPOSE: To compare two artificial intelligence software packages performing normative brain volumetry and explore whether they could differently impact dementia diagnostics in a clinical context. METHODS: Sixty patients (20 Alzheimer's disease, 20 frontotemporal dementia, 20 mild cognitive impairment) and 20 controls were included retrospectively. One MRI per subject was processed by software packages from two proprietary manufacturers, producing two quantitative reports per subject. Two neuroradiologists assigned forced-choice diagnoses using only the normative volumetry data in these reports. They classified the volumetric profile as "normal," or "abnormal", and if "abnormal," they specified the most likely dementia subtype. Differences between the packages' clinical impact were assessed by comparing (1) agreement between diagnoses based on software output; (2) diagnostic accuracy, sensitivity, and specificity; and (3) diagnostic confidence. Quantitative outputs were also compared to provide context to any diagnostic differences. RESULTS: Diagnostic agreement between packages was moderate, for distinguishing normal and abnormal volumetry (K = .41-.43) and for specific diagnoses (K = .36-.38). However, each package yielded high inter-observer agreement when distinguishing normal and abnormal profiles (K = .73-.82). Accuracy, sensitivity, and specificity were not different between packages. Diagnostic confidence was different between packages for one rater. Whole brain intracranial volume output differed between software packages (10.73%, p < .001), and normative regional data interpreted for diagnosis correlated weakly to moderately (rs = .12-.80). CONCLUSION: Different artificial intelligence software packages for quantitative normative assessment of brain MRI can produce distinct effects at the level of clinical interpretation. Clinics should not assume that different packages are interchangeable, thus recommending internal evaluation of packages before adoption.


Asunto(s)
Enfermedad de Alzheimer , Inteligencia Artificial , Enfermedad de Alzheimer/diagnóstico por imagen , Encéfalo/diagnóstico por imagen , Humanos , Imagen por Resonancia Magnética/métodos , Estudios Retrospectivos , Programas Informáticos
20.
J Digit Imaging ; 35(2): 127-136, 2022 04.
Artículo en Inglés | MEDLINE | ID: mdl-35088185

RESUMEN

Treatment planning of gastrointestinal stromal tumors (GISTs) includes distinguishing GISTs from other intra-abdominal tumors and GISTs' molecular analysis. The aim of this study was to evaluate radiomics for distinguishing GISTs from other intra-abdominal tumors, and in GISTs, predict the c-KIT, PDGFRA, BRAF mutational status, and mitotic index (MI). Patients diagnosed at the Erasmus MC between 2004 and 2017, with GIST or non-GIST intra-abdominal tumors and a contrast-enhanced venous-phase CT, were retrospectively included. Tumors were segmented, from which 564 image features were extracted. Prediction models were constructed using a combination of machine learning approaches. The evaluation was performed in a 100 × random-split cross-validation. Model performance was compared to that of three radiologists. One hundred twenty-five GISTs and 122 non-GISTs were included. The GIST vs. non-GIST radiomics model had a mean area under the curve (AUC) of 0.77. Three radiologists had an AUC of 0.69, 0.76, and 0.84, respectively. The radiomics model had an AUC of 0.52 for c-KIT, 0.56 for c-KIT exon 11, and 0.52 for the MI. The numbers of PDGFRA, BRAF, and other c-KIT mutations were too low for analysis. Our radiomics model was able to distinguish GISTs from non-GISTs with a performance similar to three radiologists, but less observer dependent. Therefore, it may aid in the early diagnosis of GIST, facilitating rapid referral to specialized treatment centers. As the model was not able to predict any genetic or molecular features, it cannot aid in treatment planning yet.


Asunto(s)
Neoplasias Abdominales , Tumores del Estroma Gastrointestinal , Diagnóstico Diferencial , Tumores del Estroma Gastrointestinal/diagnóstico por imagen , Tumores del Estroma Gastrointestinal/genética , Tumores del Estroma Gastrointestinal/patología , Humanos , Proteínas Proto-Oncogénicas B-raf/genética , Proteínas Proto-Oncogénicas c-kit/genética , Estudios Retrospectivos , Tomografía Computarizada por Rayos X
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA