Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 43
Filtrar
1.
Radiology ; 312(1): e233341, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38980184

RESUMO

Background Due to conflicting findings in the literature, there are concerns about a lack of objectivity in grading knee osteoarthritis (KOA) on radiographs. Purpose To examine how artificial intelligence (AI) assistance affects the performance and interobserver agreement of radiologists and orthopedists of various experience levels when evaluating KOA on radiographs according to the established Kellgren-Lawrence (KL) grading system. Materials and Methods In this retrospective observer performance study, consecutive standing knee radiographs from patients with suspected KOA were collected from three participating European centers between April 2019 and May 2022. Each center recruited four readers across radiology and orthopedic surgery at in-training and board-certified experience levels. KL grading (KL-0 = no KOA, KL-4 = severe KOA) on the frontal view was assessed by readers with and without assistance from a commercial AI tool. The majority vote of three musculoskeletal radiology consultants established the reference standard. The ordinal receiver operating characteristic method was used to estimate grading performance. Light kappa was used to estimate interrater agreement, and bootstrapped t statistics were used to compare groups. Results Seventy-five studies were included from each center, totaling 225 studies (mean patient age, 55 years ± 15 [SD]; 113 female patients). The KL grades were KL-0, 24.0% (n = 54); KL-1, 28.0% (n = 63); KL-2, 21.8% (n = 49); KL-3, 18.7% (n = 42); and KL-4, 7.6% (n = 17). Eleven readers completed their readings. Three of the six junior readers showed higher KL grading performance with versus without AI assistance (area under the receiver operating characteristic curve, 0.81 ± 0.017 [SEM] vs 0.88 ± 0.011 [P < .001]; 0.76 ± 0.018 vs 0.86 ± 0.013 [P < .001]; and 0.89 ± 0.011 vs 0.91 ± 0.009 [P = .008]). Interobserver agreement for KL grading among all readers was higher with versus without AI assistance (κ = 0.77 ± 0.018 [SEM] vs 0.85 ± 0.013; P < .001). Board-certified radiologists achieved almost perfect agreement for KL grading when assisted by AI (κ = 0.90 ± 0.01), which was higher than that achieved by the reference readers independently (κ = 0.84 ± 0.017; P = .01). Conclusion AI assistance increased junior readers' radiographic KOA grading performance and increased interobserver agreement for osteoarthritis grading across all readers and experience levels. Published under a CC BY 4.0 license. Supplemental material is available for this article.


Assuntos
Inteligência Artificial , Variações Dependentes do Observador , Osteoartrite do Joelho , Humanos , Feminino , Masculino , Osteoartrite do Joelho/diagnóstico por imagem , Pessoa de Meia-Idade , Estudos Retrospectivos , Radiografia/métodos , Idoso
2.
Eur Radiol ; 2024 Mar 11.
Artigo em Inglês | MEDLINE | ID: mdl-38466390

RESUMO

OBJECTIVES: To evaluate an artificial intelligence (AI)-assisted double reading system for detecting clinically relevant missed findings on routinely reported chest radiographs. METHODS: A retrospective study was performed in two institutions, a secondary care hospital and tertiary referral oncology centre. Commercially available AI software performed a comparative analysis of chest radiographs and radiologists' authorised reports using a deep learning and natural language processing algorithm, respectively. The AI-detected discrepant findings between images and reports were assessed for clinical relevance by an external radiologist, as part of the commercial service provided by the AI vendor. The selected missed findings were subsequently returned to the institution's radiologist for final review. RESULTS: In total, 25,104 chest radiographs of 21,039 patients (mean age 61.1 years ± 16.2 [SD]; 10,436 men) were included. The AI software detected discrepancies between imaging and reports in 21.1% (5289 of 25,104). After review by the external radiologist, 0.9% (47 of 5289) of cases were deemed to contain clinically relevant missed findings. The institution's radiologists confirmed 35 of 47 missed findings (74.5%) as clinically relevant (0.1% of all cases). Missed findings consisted of lung nodules (71.4%, 25 of 35), pneumothoraces (17.1%, 6 of 35) and consolidations (11.4%, 4 of 35). CONCLUSION: The AI-assisted double reading system was able to identify missed findings on chest radiographs after report authorisation. The approach required an external radiologist to review the AI-detected discrepancies. The number of clinically relevant missed findings by radiologists was very low. CLINICAL RELEVANCE STATEMENT: The AI-assisted double reader workflow was shown to detect diagnostic errors and could be applied as a quality assurance tool. Although clinically relevant missed findings were rare, there is potential impact given the common use of chest radiography. KEY POINTS: • A commercially available double reading system supported by artificial intelligence was evaluated to detect reporting errors in chest radiographs (n=25,104) from two institutions. • Clinically relevant missed findings were found in 0.1% of chest radiographs and consisted of unreported lung nodules, pneumothoraces and consolidations. • Applying AI software as a secondary reader after report authorisation can assist in reducing diagnostic errors without interrupting the radiologist's reading workflow. However, the number of AI-detected discrepancies was considerable and required review by a radiologist to assess their relevance.

3.
Eur Radiol ; 2024 Jul 23.
Artigo em Inglês | MEDLINE | ID: mdl-39042303

RESUMO

OBJECTIVES: This study aims to externally validate a commercially available Computer-Aided Detection (CAD)-system for the automatic detection and characterization of solid, part-solid, and ground-glass lung nodules (LN) on CT scans. METHODS: This retrospective study encompasses 263 chest CT scans performed between January 2020 and December 2021 at a Dutch university hospital. All scans were read by a radiologist (R1) and compared with the initial radiology report. Conflicting scans were assessed by an adjudicating radiologist (R2). All scans were also processed by CAD. The standalone performance of CAD in terms of sensitivity and false-positive (FP)-rate for detection was calculated together with the sensitivity for characterization, including texture, calcification, speculation, and location. The R1's detection sensitivity was also assessed. RESULTS: A total of 183 true nodules were identified in 121 nodule-containing scans (142 non-nodule-containing scans), of which R1 identified 165/183 (90.2%). CAD detected 149 nodules, of which 12 were not identified by R1, achieving a sensitivity of 149/183 (81.4%) with an FP-rate of 49/121 (0.405). CAD's detection sensitivity for solid, part-solid, and ground-glass LNs was 82/94 (87.2%), 42/47 (89.4%), and 25/42 (59.5%), respectively. The classification accuracy for solid, part-solid, and ground-glass LNs was 81/82 (98.8%), 16/42 (38.1%), and 18/25 (72.0%), respectively. Additionally, CAD demonstrated overall classification accuracies of 137/149 (91.9%), 123/149 (82.6%), and 141/149 (94.6%) for calcification, spiculation, and location, respectively. CONCLUSIONS: Although the overall detection rate of this system slightly lags behind that of a radiologist, CAD is capable of detecting different LNs and thereby has the potential to enhance a reader's detection rate. While promising characterization performances are obtained, the tool's performance in terms of texture classification remains a subject of concern. CLINICAL RELEVANCE STATEMENT: Numerous lung nodule computer-aided detection-systems are commercially available, with some of them solely being externally validated based on their detection performance on solid nodules. We encourage researchers to assess performances by incorporating all relevant characteristics, including part-solid and ground-glass nodules. KEY POINTS: Few computer-aided detection (CAD) systems are externally validated for automatic detection and characterization of lung nodules. A detection sensitivity of 81.4% and an overall texture classification sensitivity of 77.2% were measured utilizing CAD. CAD has the potential to increase single reader detection rate, however, improvement in texture classification is required.

4.
Skeletal Radiol ; 2024 Jun 20.
Artigo em Inglês | MEDLINE | ID: mdl-38902420

RESUMO

This article will provide a perspective review of the most extensively investigated deep learning (DL) applications for musculoskeletal disease detection that have the best potential to translate into routine clinical practice over the next decade. Deep learning methods for detecting fractures, estimating pediatric bone age, calculating bone measurements such as lower extremity alignment and Cobb angle, and grading osteoarthritis on radiographs have been shown to have high diagnostic performance with many of these applications now commercially available for use in clinical practice. Many studies have also documented the feasibility of using DL methods for detecting joint pathology and characterizing bone tumors on magnetic resonance imaging (MRI). However, musculoskeletal disease detection on MRI is difficult as it requires multi-task, multi-class detection of complex abnormalities on multiple image slices with different tissue contrasts. The generalizability of DL methods for musculoskeletal disease detection on MRI is also challenging due to fluctuations in image quality caused by the wide variety of scanners and pulse sequences used in routine MRI protocols. The diagnostic performance of current DL methods for musculoskeletal disease detection must be further evaluated in well-designed prospective studies using large image datasets acquired at different institutions with different imaging parameters and imaging hardware before they can be fully implemented in clinical practice. Future studies must also investigate the true clinical benefits of current DL methods and determine whether they could enhance quality, reduce error rates, improve workflow, and decrease radiologist fatigue and burnout with all of this weighed against the costs.

5.
Eur Radiol ; 33(6): 4249-4258, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-36651954

RESUMO

OBJECTIVES: Only few published artificial intelligence (AI) studies for COVID-19 imaging have been externally validated. Assessing the generalizability of developed models is essential, especially when considering clinical implementation. We report the development of the International Consortium for COVID-19 Imaging AI (ICOVAI) model and perform independent external validation. METHODS: The ICOVAI model was developed using multicenter data (n = 1286 CT scans) to quantify disease extent and assess COVID-19 likelihood using the COVID-19 Reporting and Data System (CO-RADS). A ResUNet model was modified to automatically delineate lung contours and infectious lung opacities on CT scans, after which a random forest predicted the CO-RADS score. After internal testing, the model was externally validated on a multicenter dataset (n = 400) by independent researchers. CO-RADS classification performance was calculated using linearly weighted Cohen's kappa and segmentation performance using Dice Similarity Coefficient (DSC). RESULTS: Regarding internal versus external testing, segmentation performance of lung contours was equally excellent (DSC = 0.97 vs. DSC = 0.97, p = 0.97). Lung opacities segmentation performance was adequate internally (DSC = 0.76), but significantly worse on external validation (DSC = 0.59, p < 0.0001). For CO-RADS classification, agreement with radiologists on the internal set was substantial (kappa = 0.78), but significantly lower on the external set (kappa = 0.62, p < 0.0001). CONCLUSION: In this multicenter study, a model developed for CO-RADS score prediction and quantification of COVID-19 disease extent was found to have a significant reduction in performance on independent external validation versus internal testing. The limited reproducibility of the model restricted its potential for clinical use. The study demonstrates the importance of independent external validation of AI models. KEY POINTS: • The ICOVAI model for prediction of CO-RADS and quantification of disease extent on chest CT of COVID-19 patients was developed using a large sample of multicenter data. • There was substantial performance on internal testing; however, performance was significantly reduced on external validation, performed by independent researchers. The limited generalizability of the model restricts its potential for clinical use. • Results of AI models for COVID-19 imaging on internal tests may not generalize well to external data, demonstrating the importance of independent external validation.


Assuntos
Inteligência Artificial , COVID-19 , Humanos , Reprodutibilidade dos Testes , Tomografia Computadorizada por Raios X , Algoritmos , Estudos Retrospectivos
6.
Eur Radiol ; 32(6): 3996-4002, 2022 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-34989840

RESUMO

OBJECTIVES: To develop and validate classifiers for automatic detection of actionable findings and documentation of nonroutine communication in routinely delivered radiology reports. METHODS: Two radiologists annotated all actionable findings and communication mentions in a training set of 1,306 radiology reports and a test set of 1,000 reports randomly selected from the electronic health record system of a large tertiary hospital. Various feature sets were constructed based on the impression section of the reports using different preprocessing steps (stemming, removal of stop words, negations, and previously known or stable findings) and n-grams. Random forest classifiers were trained to detect actionable findings, and a decision-rule classifier was trained to find communication mentions. Classifier performance was evaluated by the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity. RESULTS: On the training set, the actionable finding classifier with the highest cross-validated performance was obtained for a feature set of unigrams, after stemming and removal of negated, known, and stable findings. On the test set, this classifier achieved an AUC of 0.876 (95% CI 0.854-0.898). The classifier for communication detection was trained after negation removal, using unigrams as features. The resultant decision rule had a sensitivity of 0.841 (95% CI 0.706-0.921) and specificity of 0.990 (95% CI 0.981-0.994) on the test set. CONCLUSIONS: Automatic detection of actionable findings and subsequent communication in routinely delivered radiology reports is possible. This can serve quality control purposes and may alert radiologists to the presence of actionable findings during reporting. KEY POINTS: • Classifiers were developed for automatic detection of the broad spectrum of actionable findings and subsequent communication mentions in routinely delivered radiology reports. • Straightforward report preprocessing and simple feature sets can produce well-performing classifiers. • The resultant classifiers show good performance for detection of actionable findings and excellent performance for detection of communication mentions.


Assuntos
Processamento de Linguagem Natural , Radiologia , Comunicação , Humanos , Aprendizado de Máquina
7.
Neuroradiology ; 64(7): 1359-1366, 2022 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-35032183

RESUMO

PURPOSE: To compare two artificial intelligence software packages performing normative brain volumetry and explore whether they could differently impact dementia diagnostics in a clinical context. METHODS: Sixty patients (20 Alzheimer's disease, 20 frontotemporal dementia, 20 mild cognitive impairment) and 20 controls were included retrospectively. One MRI per subject was processed by software packages from two proprietary manufacturers, producing two quantitative reports per subject. Two neuroradiologists assigned forced-choice diagnoses using only the normative volumetry data in these reports. They classified the volumetric profile as "normal," or "abnormal", and if "abnormal," they specified the most likely dementia subtype. Differences between the packages' clinical impact were assessed by comparing (1) agreement between diagnoses based on software output; (2) diagnostic accuracy, sensitivity, and specificity; and (3) diagnostic confidence. Quantitative outputs were also compared to provide context to any diagnostic differences. RESULTS: Diagnostic agreement between packages was moderate, for distinguishing normal and abnormal volumetry (K = .41-.43) and for specific diagnoses (K = .36-.38). However, each package yielded high inter-observer agreement when distinguishing normal and abnormal profiles (K = .73-.82). Accuracy, sensitivity, and specificity were not different between packages. Diagnostic confidence was different between packages for one rater. Whole brain intracranial volume output differed between software packages (10.73%, p < .001), and normative regional data interpreted for diagnosis correlated weakly to moderately (rs = .12-.80). CONCLUSION: Different artificial intelligence software packages for quantitative normative assessment of brain MRI can produce distinct effects at the level of clinical interpretation. Clinics should not assume that different packages are interchangeable, thus recommending internal evaluation of packages before adoption.


Assuntos
Doença de Alzheimer , Inteligência Artificial , Doença de Alzheimer/diagnóstico por imagem , Encéfalo/diagnóstico por imagem , Humanos , Imageamento por Ressonância Magnética/métodos , Estudos Retrospectivos , Software
8.
J Digit Imaging ; 35(2): 127-136, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35088185

RESUMO

Treatment planning of gastrointestinal stromal tumors (GISTs) includes distinguishing GISTs from other intra-abdominal tumors and GISTs' molecular analysis. The aim of this study was to evaluate radiomics for distinguishing GISTs from other intra-abdominal tumors, and in GISTs, predict the c-KIT, PDGFRA, BRAF mutational status, and mitotic index (MI). Patients diagnosed at the Erasmus MC between 2004 and 2017, with GIST or non-GIST intra-abdominal tumors and a contrast-enhanced venous-phase CT, were retrospectively included. Tumors were segmented, from which 564 image features were extracted. Prediction models were constructed using a combination of machine learning approaches. The evaluation was performed in a 100 × random-split cross-validation. Model performance was compared to that of three radiologists. One hundred twenty-five GISTs and 122 non-GISTs were included. The GIST vs. non-GIST radiomics model had a mean area under the curve (AUC) of 0.77. Three radiologists had an AUC of 0.69, 0.76, and 0.84, respectively. The radiomics model had an AUC of 0.52 for c-KIT, 0.56 for c-KIT exon 11, and 0.52 for the MI. The numbers of PDGFRA, BRAF, and other c-KIT mutations were too low for analysis. Our radiomics model was able to distinguish GISTs from non-GISTs with a performance similar to three radiologists, but less observer dependent. Therefore, it may aid in the early diagnosis of GIST, facilitating rapid referral to specialized treatment centers. As the model was not able to predict any genetic or molecular features, it cannot aid in treatment planning yet.


Assuntos
Neoplasias Abdominais , Tumores do Estroma Gastrointestinal , Diagnóstico Diferencial , Tumores do Estroma Gastrointestinal/diagnóstico por imagem , Tumores do Estroma Gastrointestinal/genética , Tumores do Estroma Gastrointestinal/patologia , Humanos , Proteínas Proto-Oncogênicas B-raf/genética , Proteínas Proto-Oncogênicas c-kit/genética , Estudos Retrospectivos , Tomografia Computadorizada por Raios X
9.
Radiology ; 298(3): 486-491, 2021 03.
Artigo em Inglês | MEDLINE | ID: mdl-33346696

RESUMO

Background The Value-Based Healthcare (VBH) concept is designed to improve individual healthcare outcomes without increasing expenditure, and is increasingly being used to determine resourcing of and reimbursement for medical services. Radiology is a major contributor to patient and societal healthcare at many levels. Despite this, some VBH models do not acknowledge radiology's central role; this may have future negative consequences for resource allocation. Methods, findings and interpretation This multi-society paper, representing the views of Radiology Societies in Europe, the USA, Canada, Australia, and New Zealand, describes the place of radiology in VBH models and the health-care value contributions of radiology. Potential steps to objectify and quantify the value contributed by radiology to healthcare are outlined. Published under a CC BY 4.0 license.


Assuntos
Atenção à Saúde/normas , Radiologia/normas , Aquisição Baseada em Valor , Consenso , Controle de Custos , Atenção à Saúde/economia , Humanos , Internacionalidade , Radiologia/economia , Sociedades Médicas
10.
Can Assoc Radiol J ; 72(2): 208-214, 2021 May.
Artigo em Inglês | MEDLINE | ID: mdl-33345576

RESUMO

BACKGROUND: The Value-Based Healthcare (VBH) concept is designed to improve individual healthcare outcomes without increasing expenditure, and is increasingly being used to determine resourcing of and reimbursement for medical services. Radiology is a major contributor to patient and societal healthcare at many levels. Despite this, some VBH models do not acknowledge radiology's central role; this may have future negative consequences for resource allocation. METHODS, FINDINGS AND INTERPRETATION: This multi-society paper, representing the views of Radiology Societies in Europe, the USA, Canada, Australia, and New Zealand, describes the place of radiology in VBH models and the health-care value contributions of radiology. Potential steps to objectify and quantify the value contributed by radiology to healthcare are outlined.


Assuntos
Atenção à Saúde/economia , Custos de Cuidados de Saúde , Radiologia/economia , Radiologia/métodos , Austrália , Canadá , Europa (Continente) , Humanos , Nova Zelândia , Sociedades Médicas , Estados Unidos
11.
Semin Musculoskelet Radiol ; 24(4): 460-474, 2020 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-32992373

RESUMO

Musculoskeletal imaging is mainly based on the subjective and qualitative analysis of imaging examinations. However, integration of quantitative assessment of imaging data could increase the value of imaging in both research and clinical practice. Some imaging modalities, such as perfusion magnetic resonance imaging (MRI), diffusion MRI, or T2 mapping, are intrinsically quantitative. But conventional morphological imaging can also be analyzed through the quantification of various parameters. The quantitative data retrieved from imaging examinations can serve as biomarkers and be used to support diagnosis, determine patient prognosis, or monitor therapy.We focus on the value, or clinical utility, of quantitative imaging in the musculoskeletal field. There is currently a trend to move from volume- to value-based payments. This review contains definitions and examines the role that quantitative imaging may play in the implementation of value-based health care. The influence of artificial intelligence on the value of quantitative musculoskeletal imaging is also discussed.


Assuntos
Doenças Musculoesqueléticas/diagnóstico por imagem , Aquisição Baseada em Valor , Humanos , Aumento da Imagem/métodos , Interpretação de Imagem Assistida por Computador/métodos
12.
J Med Internet Res ; 22(10): e21211, 2020 10 05.
Artigo em Inglês | MEDLINE | ID: mdl-32997642

RESUMO

The physical and social distancing measures that have been adopted worldwide because of COVID-19 will probably remain in place for a long time, especially for senior adults, people with chronic conditions, and other at-risk populations. Teleconsultations can be useful in ensuring that patients continue to receive clinical care while reducing physical crowding and avoiding unnecessary exposure of health care staff. Implementation processes that typically take months of planning, budgeting, pilot testing, and education were compressed into days. However, in the urgency to deal with the present crisis, we may be forgetting that the introduction of digital health is not exclusively a technological issue, but part of a complex organizational change problem. This viewpoint offers insight regarding issues that rapidly adopted teleconsultation systems may face in a post-COVID-19 world.


Assuntos
Infecções por Coronavirus/epidemiologia , Pneumonia Viral/epidemiologia , Consulta Remota/tendências , Telemedicina/tendências , Centros Médicos Acadêmicos , Betacoronavirus , COVID-19 , Humanos , Países Baixos/epidemiologia , Pandemias , Desenvolvimento de Programas , Avaliação de Programas e Projetos de Saúde , Consulta Remota/organização & administração , SARS-CoV-2 , Software , Telemedicina/organização & administração , Interface Usuário-Computador
13.
Semin Musculoskelet Radiol ; 21(1): 37-42, 2017 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-28253532

RESUMO

In the era of value-based health care, adding value is a key element in providing care. The choice of appropriate imaging modality and protocol should be based on consideration of patients' values, health care outcomes, and cost-effectiveness, taking into account the perspective of the decision maker, the health care system, and society at large. This article provides an overview of the available tools to measure value, outcomes, and cost-effectiveness in musculoskeletal radiology, illustrated with relevant examples.


Assuntos
Análise Custo-Benefício/economia , Análise Custo-Benefício/métodos , Diagnóstico por Imagem/economia , Doenças Musculoesqueléticas/diagnóstico por imagem , Aquisição Baseada em Valor/economia , Humanos , Doenças Musculoesqueléticas/economia , Sistema Musculoesquelético/diagnóstico por imagem
15.
J Orthop Res ; 2024 May 06.
Artigo em Inglês | MEDLINE | ID: mdl-38711242

RESUMO

In 3D-analysis of the calcaneus, a consistent coordinate system aligned with the original anatomical directions is crucial for pre- and postoperative analysis. This importance stems from the calcaneus's key role in weight-bearing and biomechanical alignment. However, defining a reliable coordinate system based solely on fractured or surgically reconstructed calcanei presents significant challenges. Given its anatomical prominence and consistent orientation, the talus offers a potential solution to this challenge. Our work explores the feasibility of talus-derived coordinate systems for 3D-modeling of the calcaneus across its various conditions. Four methods were tested on nonfractured, fractured and surgically reconstructed calcanei, utilizing Principal Component Analysis, anatomical landmarks, bounding box, and an atlas-based approach. The methods were compared with a self-defined calcaneus reference coordinate system. Additionally, the impact of deviation of the coordinate system on morphological measurements was investigated. Among methods for constructing nonfractured calcanei coordinate systems, the atlas-based method displayed the lowest Root Mean Square value in comparison with the reference coordinate system. For morphological measures like Böhler's Angle and the Critical angle of Gissane, the atlas talus-based system closely aligned with ground truth, yielding differences of 0.6° and 1.2°, respectively, compared to larger deviations seen in other talus-based coordinate systems. In conclusion, all tested methods were feasible for creating a talus derived coordinate system. A talus derived coordinate system showed potential, offering benefits for morphological measurements and clinical scenarios involving fractured and surgically reconstructed calcanei. Further research is recommended to assess the impact of these coordinate systems on surgical planning and outcomes.

16.
Insights Imaging ; 15(1): 34, 2024 Feb 05.
Artigo em Inglês | MEDLINE | ID: mdl-38315288

RESUMO

OBJECTIVE: To provide a comprehensive framework for value assessment of artificial intelligence (AI) in radiology. METHODS: This paper presents the RADAR framework, which has been adapted from Fryback and Thornbury's imaging efficacy framework to facilitate the valuation of radiology AI from conception to local implementation. Local efficacy has been newly introduced to underscore the importance of appraising an AI technology within its local environment. Furthermore, the RADAR framework is illustrated through a myriad of study designs that help assess value. RESULTS: RADAR presents a seven-level hierarchy, providing radiologists, researchers, and policymakers with a structured approach to the comprehensive assessment of value in radiology AI. RADAR is designed to be dynamic and meet the different valuation needs throughout the AI's lifecycle. Initial phases like technical and diagnostic efficacy (RADAR-1 and RADAR-2) are assessed pre-clinical deployment via in silico clinical trials and cross-sectional studies. Subsequent stages, spanning from diagnostic thinking to patient outcome efficacy (RADAR-3 to RADAR-5), require clinical integration and are explored via randomized controlled trials and cohort studies. Cost-effectiveness efficacy (RADAR-6) takes a societal perspective on financial feasibility, addressed via health-economic evaluations. The final level, RADAR-7, determines how prior valuations translate locally, evaluated through budget impact analysis, multi-criteria decision analyses, and prospective monitoring. CONCLUSION: The RADAR framework offers a comprehensive framework for valuing radiology AI. Its layered, hierarchical structure, combined with a focus on local relevance, aligns RADAR seamlessly with the principles of value-based radiology. CRITICAL RELEVANCE STATEMENT: The RADAR framework advances artificial intelligence in radiology by delineating a much-needed framework for comprehensive valuation. KEYPOINTS: • Radiology artificial intelligence lacks a comprehensive approach to value assessment. • The RADAR framework provides a dynamic, hierarchical method for thorough valuation of radiology AI. • RADAR advances clinical radiology by bridging the artificial intelligence implementation gap.

17.
Eur J Radiol ; 173: 111375, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38377894

RESUMO

BACKGROUND: Artificial intelligence (AI) applications can facilitate detection of cervical spine fractures on CT and reduce time to diagnosis by prioritizing suspected cases. PURPOSE: To assess the effect on time to diagnose cervical spine fractures on CT and diagnostic accuracy of a commercially available AI application. MATERIALS AND METHODS: In this study (June 2020 - March 2022) with historic controls and prospective evaluation, we evaluated regulatory-cleared AI-software to prioritize cervical spine fractures on CT. All patients underwent non-contrast CT of the cervical spine. The time between CT acquisition and the moment the scan was first opened (DNT) was compared between the retrospective and prospective cohorts. The reference standard for determining diagnostic accuracy was the radiology report created in routine clinical workflow and adjusted by a senior radiologist. Discrepant cases were reviewed and clinical relevance of missed fractures was determined. RESULTS: 2973 (mean age, 55.4 ± 19.7 [standard deviation]; 1857 men) patients were analyzed by AI, including 2036 retrospective and 938 prospective cases. Overall prevalence of cervical spine fractures was 7.6 %. The DNT was 18 % (5 min) shorter in the prospective cohort. In scans positive for cervical spine fracture according to the reference standard, DNT was 46 % (16 min) shorter in the prospective cohort. Overall sensitivity of the AI application was 89.8 % (95 % CI: 84.2-94.0 %), specificity was 95.3 % (95 % CI: 94.2-96.2 %), and diagnostic accuracy was 94.8 % (95 % CI: 93.8-95.8 %). Negative predictive value was 99.1 % (95 % CI: 98.5-99.4 %) and positive predictive value was 63.0 % (95 % CI: 58.0-67.8 %). 22 fractures were missed by AI of which 5 required stabilizing therapy. CONCLUSION: A time gain of 16 min to diagnosis for fractured cases was observed after introducing AI. Although AI-assisted workflow prioritization of cervical spine fractures on CT shows high diagnostic accuracy, clinically relevant cases were missed.


Assuntos
Fraturas Ósseas , Fraturas da Coluna Vertebral , Masculino , Humanos , Adulto , Pessoa de Meia-Idade , Idoso , Inteligência Artificial , Estudos Retrospectivos , Tomografia Computadorizada por Raios X , Fraturas da Coluna Vertebral/diagnóstico por imagem , Vértebras Cervicais/diagnóstico por imagem , Vértebras Cervicais/lesões , Algoritmos
18.
Cancers (Basel) ; 16(11)2024 May 28.
Artigo em Inglês | MEDLINE | ID: mdl-38893158

RESUMO

Malignant peripheral nerve sheath tumors (MPNSTs) are aggressive soft-tissue tumors prevalent in neurofibromatosis type 1 (NF1) patients, posing a significant risk of metastasis and recurrence. Current magnetic resonance imaging (MRI) imaging lacks decisiveness in distinguishing benign peripheral nerve sheath tumors (BPNSTs) and MPNSTs, necessitating invasive biopsies. This study aims to develop a radiomics model using quantitative imaging features and machine learning to distinguish MPNSTs from BPNSTs. Clinical data and MRIs from MPNST and BPNST patients (2000-2019) were collected at a tertiary sarcoma referral center. Lesions were manually and semi-automatically segmented on MRI scans, and radiomics features were extracted using the Workflow for Optimal Radiomics Classification (WORC) algorithm, employing automated machine learning. The evaluation was conducted using a 100× random-split cross-validation. A total of 35 MPNSTs and 74 BPNSTs were included. The T1-weighted (T1w) MRI radiomics model outperformed others with an area under the curve (AUC) of 0.71. The incorporation of additional MRI scans did not enhance performance. Combining T1w MRI with clinical features achieved an AUC of 0.74. Experienced radiologists achieved AUCs of 0.75 and 0.66, respectively. Radiomics based on T1w MRI scans and clinical features show some ability to distinguish MPNSTs from BPNSTs, potentially aiding in the management of these tumors.

19.
Quant Imaging Med Surg ; 14(6): 3778-3788, 2024 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-38846290

RESUMO

Background: While current preoperative and postoperative assessment of the fractured and surgically reconstructed calcaneus relies on computed tomography (CT)-imaging, there are no established methods to quantify calcaneus morphology on CT-images. This study aims to develop a semi-automated method for morphological measurements of the calcaneus on three-dimensional (3D) models derived from CT-imaging. Methods: Using CT data, 3D models were created from healthy, fractured, and surgically reconstructed calcanei. Böhler's angle (BA) and Critical angle of Gissane (CAG) were measured on conventional lateral radiographs and corresponding 3D CT reconstructions using a novel point-based method with semi-automatic landmark placement by three observers. Intraobserver and interobserver reliability scores were calculated using intra-class correlation coefficient (ICC). In addition, consensus among observers was calculated for a maximal allowable discrepancy of 5 and 10 degrees for both methods. Results: Imaging data from 119 feet were obtained (40 healthy, 39 fractured, 40 reconstructed). Semi-automated measurements on 3D models of BA and CAG showed excellent reliability (ICC: 0.87-1.00). The manual measurements on conventional radiographs had a poor-to-excellent reliability (ICC: 0.22-0.96). In addition, the percentage of consensus among observers was much higher for the 3D method when compared to conventional two-dimensional (2D) measurements. Conclusions: The proposed method enables reliable and reproducible quantification of calcaneus morphology in 3D models of healthy, fractured and reconstructed calcanei.

20.
Radiol Cardiothorac Imaging ; 5(2): e220163, 2023 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-37124638

RESUMO

Purpose: To evaluate the diagnostic efficacy of artificial intelligence (AI) software in detecting incidental pulmonary embolism (IPE) at CT and shorten the time to diagnosis with use of radiologist reading worklist prioritization. Materials and Methods: In this study with historical controls and prospective evaluation, regulatory-cleared AI software was evaluated to prioritize IPE on routine chest CT scans with intravenous contrast agent in adult oncology patients. Diagnostic accuracy metrics were calculated, and temporal end points, including detection and notification times (DNTs), were assessed during three time periods (April 2019 to September 2020): routine workflow without AI, human triage without AI, and worklist prioritization with AI. Results: In total, 11 736 CT scans in 6447 oncology patients (mean age, 63 years ± 12 [SD]; 3367 men) were included. Prevalence of IPE was 1.3% (51 of 3837 scans), 1.4% (54 of 3920 scans), and 1.0% (38 of 3979 scans) for the respective time periods. The AI software detected 131 true-positive, 12 false-negative, 31 false-positive, and 11 559 true-negative results, achieving 91.6% sensitivity, 99.7% specificity, 99.9% negative predictive value, and 80.9% positive predictive value. During prospective evaluation, AI-based worklist prioritization reduced the median DNT for IPE-positive examinations to 87 minutes (vs routine workflow of 7714 minutes and human triage of 4973 minutes). Radiologists' missed rate of IPE was significantly reduced from 44.8% (47 of 105 scans) without AI to 2.6% (one of 38 scans) when assisted by the AI tool (P < .001). Conclusion: AI-assisted workflow prioritization of IPE on routine CT scans in oncology patients showed high diagnostic accuracy and significantly shortened the time to diagnosis in a setting with a backlog of examinations.Keywords: CT, Computer Applications, Detection, Diagnosis, Embolism, Thorax, ThrombosisSupplemental material is available for this article.© RSNA, 2023See also the commentary by Elicker in this issue.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA