Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 89
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
J Med Syst ; 48(1): 41, 2024 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-38632172

RESUMO

Polypharmacy remains an important challenge for patients with extensive medical complexity. Given the primary care shortage and the increasing aging population, effective polypharmacy management is crucial to manage the increasing burden of care. The capacity of large language model (LLM)-based artificial intelligence to aid in polypharmacy management has yet to be evaluated. Here, we evaluate ChatGPT's performance in polypharmacy management via its deprescribing decisions in standardized clinical vignettes. We inputted several clinical vignettes originally from a study of general practicioners' deprescribing decisions into ChatGPT 3.5, a publicly available LLM, and evaluated its capacity for yes/no binary deprescribing decisions as well as list-based prompts in which the model was prompted to choose which of several medications to deprescribe. We recorded ChatGPT responses to yes/no binary deprescribing prompts and the number and types of medications deprescribed. In yes/no binary deprescribing decisions, ChatGPT universally recommended deprescribing medications regardless of ADL status in patients with no overlying CVD history; in patients with CVD history, ChatGPT's answers varied by technical replicate. Total number of medications deprescribed ranged from 2.67 to 3.67 (out of 7) and did not vary with CVD status, but increased linearly with severity of ADL impairment. Among medication types, ChatGPT preferentially deprescribed pain medications. ChatGPT's deprescribing decisions vary along the axes of ADL status, CVD history, and medication type, indicating some concordance of internal logic between general practitioners and the model. These results indicate that specifically trained LLMs may provide useful clinical support in polypharmacy management for primary care physicians.


Assuntos
Doenças Cardiovasculares , Desprescrições , Clínicos Gerais , Humanos , Idoso , Polimedicação , Inteligência Artificial
2.
Radiology ; 307(5): e222044, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37219444

RESUMO

Radiologic tests often contain rich imaging data not relevant to the clinical indication. Opportunistic screening refers to the practice of systematically leveraging these incidental imaging findings. Although opportunistic screening can apply to imaging modalities such as conventional radiography, US, and MRI, most attention to date has focused on body CT by using artificial intelligence (AI)-assisted methods. Body CT represents an ideal high-volume modality whereby a quantitative assessment of tissue composition (eg, bone, muscle, fat, and vascular calcium) can provide valuable risk stratification and help detect unsuspected presymptomatic disease. The emergence of "explainable" AI algorithms that fully automate these measurements could eventually lead to their routine clinical use. Potential barriers to widespread implementation of opportunistic CT screening include the need for buy-in from radiologists, referring providers, and patients. Standardization of acquiring and reporting measures is needed, in addition to expanded normative data according to age, sex, and race and ethnicity. Regulatory and reimbursement hurdles are not insurmountable but pose substantial challenges to commercialization and clinical use. Through demonstration of improved population health outcomes and cost-effectiveness, these opportunistic CT-based measures should be attractive to both payers and health care systems as value-based reimbursement models mature. If highly successful, opportunistic screening could eventually justify a practice of standalone "intended" CT screening.


Assuntos
Inteligência Artificial , Radiologia , Humanos , Algoritmos , Radiologistas , Programas de Rastreamento/métodos , Radiologia/métodos
3.
Radiology ; 306(2): e220101, 2023 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-36125375

RESUMO

Background Adrenal masses are common, but radiology reporting and recommendations for management can be variable. Purpose To create a machine learning algorithm to segment adrenal glands on contrast-enhanced CT images and classify glands as normal or mass-containing and to assess algorithm performance. Materials and Methods This retrospective study included two groups of contrast-enhanced abdominal CT examinations (development data set and secondary test set). Adrenal glands in the development data set were manually segmented by radiologists. Images in both the development data set and the secondary test set were manually classified as normal or mass-containing. Deep learning segmentation and classification models were trained on the development data set and evaluated on both data sets. Segmentation performance was evaluated with use of the Dice similarity coefficient (DSC), and classification performance with use of sensitivity and specificity. Results The development data set contained 274 CT examinations (251 patients; median age, 61 years; 133 women), and the secondary test set contained 991 CT examinations (991 patients; median age, 62 years; 578 women). The median model DSC on the development test set was 0.80 (IQR, 0.78-0.89) for normal glands and 0.84 (IQR, 0.79-0.90) for adrenal masses. On the development reader set, the median interreader DSC was 0.89 (IQR, 0.78-0.93) for normal glands and 0.89 (IQR, 0.85-0.97) for adrenal masses. Interreader DSC for radiologist manual segmentation did not differ from automated machine segmentation (P = .35). On the development test set, the model had a classification sensitivity of 83% (95% CI: 55, 95) and specificity of 89% (95% CI: 75, 96). On the secondary test set, the model had a classification sensitivity of 69% (95% CI: 58, 79) and specificity of 91% (95% CI: 90, 92). Conclusion A two-stage machine learning pipeline was able to segment the adrenal glands and differentiate normal adrenal glands from those containing masses. © RSNA, 2022 Online supplemental material is available for this article.


Assuntos
Aprendizado de Máquina , Tomografia Computadorizada por Raios X , Humanos , Feminino , Pessoa de Meia-Idade , Tomografia Computadorizada por Raios X/métodos , Estudos Retrospectivos , Algoritmos , Glândulas Suprarrenais
4.
J Med Internet Res ; 25: e48659, 2023 08 22.
Artigo em Inglês | MEDLINE | ID: mdl-37606976

RESUMO

BACKGROUND: Large language model (LLM)-based artificial intelligence chatbots direct the power of large training data sets toward successive, related tasks as opposed to single-ask tasks, for which artificial intelligence already achieves impressive performance. The capacity of LLMs to assist in the full scope of iterative clinical reasoning via successive prompting, in effect acting as artificial physicians, has not yet been evaluated. OBJECTIVE: This study aimed to evaluate ChatGPT's capacity for ongoing clinical decision support via its performance on standardized clinical vignettes. METHODS: We inputted all 36 published clinical vignettes from the Merck Sharpe & Dohme (MSD) Clinical Manual into ChatGPT and compared its accuracy on differential diagnoses, diagnostic testing, final diagnosis, and management based on patient age, gender, and case acuity. Accuracy was measured by the proportion of correct responses to the questions posed within the clinical vignettes tested, as calculated by human scorers. We further conducted linear regression to assess the contributing factors toward ChatGPT's performance on clinical tasks. RESULTS: ChatGPT achieved an overall accuracy of 71.7% (95% CI 69.3%-74.1%) across all 36 clinical vignettes. The LLM demonstrated the highest performance in making a final diagnosis with an accuracy of 76.9% (95% CI 67.8%-86.1%) and the lowest performance in generating an initial differential diagnosis with an accuracy of 60.3% (95% CI 54.2%-66.6%). Compared to answering questions about general medical knowledge, ChatGPT demonstrated inferior performance on differential diagnosis (ß=-15.8%; P<.001) and clinical management (ß=-7.4%; P=.02) question types. CONCLUSIONS: ChatGPT achieves impressive accuracy in clinical decision-making, with increasing strength as it gains more clinical information at its disposal. In particular, ChatGPT demonstrates the greatest accuracy in tasks of final diagnosis as compared to initial diagnosis. Limitations include possible model hallucinations and the unclear composition of ChatGPT's training data set.


Assuntos
Inteligência Artificial , Humanos , Tomada de Decisão Clínica , Organizações , Fluxo de Trabalho , Design Centrado no Usuário
5.
Radiology ; 305(3): 555-563, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-35916673

RESUMO

As the role of artificial intelligence (AI) in clinical practice evolves, governance structures oversee the implementation, maintenance, and monitoring of clinical AI algorithms to enhance quality, manage resources, and ensure patient safety. In this article, a framework is established for the infrastructure required for clinical AI implementation and presents a road map for governance. The road map answers four key questions: Who decides which tools to implement? What factors should be considered when assessing an application for implementation? How should applications be implemented in clinical practice? Finally, how should tools be monitored and maintained after clinical implementation? Among the many challenges for the implementation of AI in clinical practice, devising flexible governance structures that can quickly adapt to a changing environment will be essential to ensure quality patient care and practice improvement objectives.


Assuntos
Inteligência Artificial , Radiologia , Humanos , Radiografia , Algoritmos , Qualidade da Assistência à Saúde
6.
Eur Radiol ; 30(6): 3576-3584, 2020 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-32064565

RESUMO

Artificial intelligence (AI) has the potential to significantly disrupt the way radiology will be practiced in the near future, but several issues need to be resolved before AI can be widely implemented in daily practice. These include the role of the different stakeholders in the development of AI for imaging, the ethical development and use of AI in healthcare, the appropriate validation of each developed AI algorithm, the development of effective data sharing mechanisms, regulatory hurdles for the clearance of AI algorithms, and the development of AI educational resources for both practicing radiologists and radiology trainees. This paper details these issues and presents possible solutions based on discussions held at the 2019 meeting of the International Society for Strategic Studies in Radiology. KEY POINTS: • Radiologists should be aware of the different types of bias commonly encountered in AI studies, and understand their possible effects. • Methods for effective data sharing to train, validate, and test AI algorithms need to be developed. • It is essential for all radiologists to gain an understanding of the basic principles, potentials, and limits of AI.


Assuntos
Inteligência Artificial , Radiologia , Algoritmos , Aprendizado Profundo , Previsões , Humanos , Disseminação de Informação , Aprendizado de Máquina , Radiologistas , Reprodutibilidade dos Testes , Estudos de Validação como Assunto
7.
J Digit Imaging ; 33(2): 334-340, 2020 04.
Artigo em Inglês | MEDLINE | ID: mdl-31515753

RESUMO

The purpose of this study was to assess if clinical indications, patient location, and imaging sites predict the viewing pattern of referring physicians for CT and MR of the head, chest, and abdomen. Our study included 166,953 CT/MR images of head/chest/abdomen in 2016-2017 in the outpatient (OP, n = 83,981 CT/MR), inpatient (IP, n = 51,052), and emergency (ED, n = 31,920) settings. There were 125,329 CT/MR performed in the hospital setting and 41,624 in one of the nine off-campus locations. We extracted information regarding body region (head/chest/abdomen), patient location, and imaging site from the electronic medical records (EPIC). We recorded clinical indications and the number of times referring physicians viewed CT/MR (defined as the number of separate views of imaging in the EPIC). Data were analyzed with the Microsoft SQL and SPSS statistical software. About 33% of IP CT and MR studies are viewed > 6 times compared to 7% for OP and 19% of ED studies (p < 0.001). Conversely, most OP studies (55%) were viewed 1-2 times only, compared to 21% for IP and 38% for ED studies (p < 0.001). In-hospital exams are viewed (≥ 6 views; 39% studies) more frequently than off-campus imaging (≥ 6 views; 17% studies) (p < 0.001). For head CT/MR, certain clinical indications (i.e., stroke) had higher viewing rates compared to other clinical indications such as malignancy, headache, and dizziness. Conversely, for chest CT, dyspnea-hypoxia had much higher viewing rates (> 6 times) in IP (55%) and ED (46%) than in OP settings (22%). Patient location and imaging site regardless of clinical indications have a profound effect on viewing patterns of referring physicians. Understanding viewing patterns of the referring physicians can help guide interpretation priorities and finding communication for imaging exams based on patient location, imaging site, and clinical indications. The information can help in the efficient delivery of patient care.


Assuntos
Médicos , Tomografia Computadorizada por Raios X , Abdome , Comunicação , Registros Eletrônicos de Saúde , Humanos
8.
Radiology ; 288(2): 318-328, 2018 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-29944078

RESUMO

Recent advances and future perspectives of machine learning techniques offer promising applications in medical imaging. Machine learning has the potential to improve different steps of the radiology workflow including order scheduling and triage, clinical decision support systems, detection and interpretation of findings, postprocessing and dose estimation, examination quality control, and radiology reporting. In this article, the authors review examples of current applications of machine learning and artificial intelligence techniques in diagnostic radiology. In addition, the future impact and natural extension of these techniques in radiology practice are discussed.


Assuntos
Aprendizado de Máquina , Sistemas de Informação em Radiologia , Radiologia/métodos , Radiologia/tendências , Humanos
9.
Radiology ; 285(3): 713-718, 2017 12.
Artigo em Inglês | MEDLINE | ID: mdl-29155639

RESUMO

Artificial intelligence (AI), machine learning, and deep learning are terms now seen frequently, all of which refer to computer algorithms that change as they are exposed to more data. Many of these algorithms are surprisingly good at recognizing objects in images. The combination of large amounts of machine-consumable digital data, increased and cheaper computing power, and increasingly sophisticated statistical models combine to enable machines to find patterns in data in ways that are not only cost-effective but also potentially beyond humans' abilities. Building an AI algorithm can be surprisingly easy. Understanding the associated data structures and statistics, on the other hand, is often difficult and obscure. Converting the algorithm into a sophisticated product that works consistently in broad, general clinical use is complex and incompletely understood. To show how these AI products reduce costs and improve outcomes will require clinical translation and industrial-grade integration into routine workflow. Radiology has the chance to leverage AI to become a center of intelligently aggregated, quantitative, diagnostic information. Centaur radiologists, formed as a synergy of human plus computer, will provide interpretations using data extracted from images by humans and image-analysis computer algorithms, as well as the electronic health record, genomics, and other disparate sources. These interpretations will form the foundation of precision health care, or care customized to an individual patient. © RSNA, 2017.


Assuntos
Sistemas de Apoio a Decisões Clínicas/tendências , Diagnóstico por Imagem/tendências , Previsões , Interpretação de Imagem Assistida por Computador/métodos , Aprendizado de Máquina/tendências , Radiologia/tendências , Algoritmos , Humanos , Reconhecimento Automatizado de Padrão/tendências , Software
10.
Radiology ; 284(3): 766-776, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28430557

RESUMO

Purpose To quantify the effect of a comprehensive, long-term, provider-led utilization management (UM) program on high-cost imaging (computed tomography, magnetic resonance imaging, nuclear imaging, and positron emission tomography) performed on an outpatient basis. Materials and Methods This retrospective, 7-year cohort study included all patients regularly seen by primary care physicians (PCPs) at an urban academic medical center. The main outcome was the number of outpatient high-cost imaging examinations per patient per year ordered by the patient's PCP or by any specialist. The authors determined the probability of a patient undergoing any high-cost imaging procedure during a study year and the number of examinations per patient per year (intensity) in patients who underwent high-cost imaging. Risk-adjusted hierarchical models were used to directly quantify the physician component of variation in probability and intensity of high-cost imaging use, and clinicians were provided with regular comparative feedback on the basis of the results. Observed trends in high-cost imaging use and provider variation were compared with the same measures for outpatient laboratory studies because laboratory use was not subject to UM during this period. Finally, per-member per-year high-cost imaging use data were compared with statewide high-cost imaging use data from a major private payer on the basis of the same claim set. Results The patient cohort steadily increased in size from 88 959 in 2007 to 109 823 in 2013. Overall high-cost imaging utilization went from 0.43 examinations per year in 2007 to 0.34 examinations per year in 2013, a decrease of 21.33% (P < .0001). At the same time, similarly adjusted routine laboratory study utilization decreased by less than half that rate (9.4%, P < .0001). On the basis of unadjusted data, outpatient high-cost imaging utilization in this cohort decreased 28%, compared with a 20% decrease in statewide utilization (P = .0023). Conclusion Analysis of high-cost imaging utilization in a stable cohort of patients cared for by PCPs during a 7-year period showed that comprehensive UM can produce a significant and sustained reduction in risk-adjusted per-patient year outpatient high-cost imaging volume. © RSNA, 2017.


Assuntos
Diagnóstico por Imagem , Pacientes Ambulatoriais/estatística & dados numéricos , Atenção Primária à Saúde , Diagnóstico por Imagem/economia , Diagnóstico por Imagem/estatística & dados numéricos , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Médicos de Atenção Primária/estatística & dados numéricos , Atenção Primária à Saúde/economia , Atenção Primária à Saúde/estatística & dados numéricos , Estudos Retrospectivos
12.
Radiology ; 275(2): 469-79, 2015 May.
Artigo em Inglês | MEDLINE | ID: mdl-25423147

RESUMO

PURPOSE: To determine the relevant physician- and practice-related factors that jointly affect the rate of low-utility imaging examinations (score of 1-3 out of 9) ordered by means of an order entry system that provides normative appropriateness feedback. MATERIALS AND METHODS: This HIPAA-compliant study was approved by the institutional review board under an expedited protocol for analyzing anonymous aggregated administrative data. This is a retrospective study of approximately 250 000 consecutive scheduled outpatient advanced imaging examinations (computed tomography, magnetic resonance imaging, nuclear medicine) ordered by 164 primary care and 379 medical specialty physicians from 2008 to 2012. A hierarchical logistic regression model was used to identify multiple predictors of the probability that an examination received a low utility score. Physician- and practice-specific random effects were estimated to articulate (odds ratio) and quantify (intraclass correlation) interphysician variation. RESULTS: Fixed effects found to be statistically significant predictors of low-utility imaging included examination type, whether the examination was cancelled, status of the person entering the order, and the total number of examinations ordered by the clinician. Neither patient age nor sex had any effect, and there were no secular trends (year of study). The remaining amount of interphysician variation was moderate (intraclass correlation, 22%), whereas the variation between medical specialties and primary care practices was low (intraclass correlation, 5%). The estimated physician-specific effects had reliability of 70%, which makes them just suitable for identifying outliers. CONCLUSION: The authors found that 22% of the variation in the rate of low-utility examinations is attributable to ordering providers and 5% to their specialty or clinic.


Assuntos
Diagnóstico por Imagem/estatística & dados numéricos , Retroalimentação , Sistemas de Registro de Ordens Médicas/estatística & dados numéricos , Padrões de Prática Médica , Adulto , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Retrospectivos , Adulto Jovem
13.
AJR Am J Roentgenol ; 204(4): 716-20, 2015 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-25794061

RESUMO

OBJECTIVE: Informatics innovations of the past 30 years have improved radiology quality and efficiency immensely. Radiologists are groundbreaking leaders in clinical information technology (IT), and often radiologists and imaging informaticists created, specified, and implemented these technologies, while also carrying the ongoing burdens of training, maintenance, support, and operation of these IT solutions. Being pioneers of clinical IT had advantages of local radiology control and radiology-centric products and services. As health care businesses become more clinically IT savvy, however, they are standardizing IT products and procedures across the enterprise, resulting in the loss of radiologists' local control and flexibility. Although this inevitable consequence may provide new opportunities in the long run, several questions arise. CONCLUSION: What will happen to the informatics expertise within the radiology domain? Will radiology's current and future concerns be heard and their needs addressed? What should radiologists do to understand, obtain, and use informatics products to maximize efficiency and provide the most value and quality for patients and the greater health care community? This article will propose some insights and considerations as we rethink radiology informatics.


Assuntos
Diagnóstico por Imagem/tendências , Aplicações da Informática Médica , Difusão de Inovações , Eficiência Organizacional , Previsões , Humanos , Serviço Hospitalar de Radiologia/tendências , Sistemas de Informação em Radiologia/tendências
14.
AJR Am J Roentgenol ; 204(4): W405-20, 2015 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-25794090

RESUMO

OBJECTIVE: We propose a method of processing and displaying imaging utilization data for large populations. CONCLUSION: The comprehensive and finely grained picture of imaging utilization yielded by our methods is a first step toward population-based imaging utilization management. We believe that our methods for the categorization and display of imaging utilization will prove to be widely useful.


Assuntos
Apresentação de Dados/tendências , Diagnóstico por Imagem/estatística & dados numéricos , Aplicações da Informática Médica , Current Procedural Terminology , Diagnóstico por Imagem/economia , Pesquisa sobre Serviços de Saúde , Humanos , Medicare Part B/economia , Software , Estados Unidos
15.
J Digit Imaging ; 27(3): 292-6, 2014 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-24682743

RESUMO

The goal of this work is to provide radiologists an update regarding changes to stage 1 of meaningful use in 2014. These changes were promulgated in the final rulemaking released by the Centers for Medicare and Medicaid Services and the Office of the National Coordinator for Health Information Technology in September 2012. Under the new rules, radiologists are exempt from meaningful use penalties provided that they are listed as radiologists under the Provider Enrollment, Chain and Ownership System (PECOS). A major caveat is that this exemption can be removed at any time. Additional concerns are discussed in the main text. Additional changes discussed include software editions independent of meaningful use stage (i.e., 2011 edition versus 2014 edition), changes to the definition of certified electronic health record technology (CEHRT), and changes to specific measures and exemptions to those measures. The new changes regarding stage 1 add complexity to an already complex program, but overall make achieving meaningful use a win-win situation for radiologists. There are no penalties for failure and incentive payments for success. The cost of upgrading to CEHRT may be much less than the incentive payments, adding a potential new source of revenue. Additional benefits may be realized if the radiology department can build upon a modern electronic health record to improve their practice and billing patterns. Meaningful use and electronic health records represent an important evolutionary step in US healthcare, and it is imperative that radiologists are active participants in the process.


Assuntos
Registros Eletrônicos de Saúde/economia , Uso Significativo/economia , Informática Médica/economia , Radiologia/economia , Difusão de Inovações , Feminino , Humanos , Masculino , Medicaid/economia , Medicare/economia , Estados Unidos
16.
J Am Coll Radiol ; 21(4): 617-623, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-37843483

RESUMO

PURPOSE: Medical imaging accounts for 85% of digital health's venture capital funding. As funding grows, it is expected that artificial intelligence (AI) products will increase commensurately. The study's objective is to project the number of new AI products given the statistical association between historical funding and FDA-approved AI products. METHODS: The study used data from the ACR Data Science Institute and for the number of FDA-approved AI products (2008-2022) and data from Rock Health for AI funding (2013-2022). Employing a 6-year lag between funding and product approved, we used linear regression to estimate the association between new products approved in a certain year, based on the lagged funding (ie, product-year funding). Using this statistical relationship, we forecasted the number of new FDA-approved products. RESULTS: The results show that there are 11.33 (95% confidence interval: 7.03-15.64) new AI products for every $1 billion in funding assuming a 6-year lag between funding and product approval. In 2022 there were 69 new FDA-approved products associated with $4.8 billion in funding. In 2035, product-year funding is projected to reach $30.8 billion, resulting in 350 new products that year. CONCLUSIONS: FDA-approved AI products are expected to grow from 69 in 2022 to 350 in 2035 given the expected funding growth in the coming years. AI is likely to change the practice of diagnostic radiology as new products are developed and integrated into practice. As more AI products are integrated, it may incentivize increased investment for future AI products.


Assuntos
Inteligência Artificial , Financiamento de Capital , Academias e Institutos , Ciência de Dados , Investimentos em Saúde
17.
Clin Imaging ; 112: 110207, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-38838448

RESUMO

PURPOSE: We created an infrastructure for no code machine learning (NML) platform for non-programming physicians to create NML model. We tested the platform by creating an NML model for classifying radiographs for the presence and absence of clavicle fractures. METHODS: Our IRB-approved retrospective study included 4135 clavicle radiographs from 2039 patients (mean age 52 ± 20 years, F:M 1022:1017) from 13 hospitals. Each patient had two-view clavicle radiographs with axial and anterior-posterior projections. The positive radiographs had either displaced or non-displaced clavicle fractures. We configured the NML platform to automatically retrieve the eligible exams using the series' unique identification from the hospital virtual network archive via web access to DICOM Objects. The platform trained a model until the validation loss plateaus. Once the testing was complete, the platform provided the receiver operating characteristics curve and confusion matrix for estimating sensitivity, specificity, and accuracy. RESULTS: The NML platform successfully retrieved 3917 radiographs (3917/4135, 94.7 %) and parsed them for creating a ML classifier with 2151 radiographs in the training, 100 radiographs for validation, and 1666 radiographs in testing datasets (772 radiographs with clavicle fracture, 894 without clavicle fracture). The network identified clavicle fracture with 90 % sensitivity, 87 % specificity, and 88 % accuracy with AUC of 0.95 (confidence interval 0.94-0.96). CONCLUSION: A NML platform can help physicians create and test machine learning models from multicenter imaging datasets such as the one in our study for classifying radiographs based on the presence of clavicle fracture.


Assuntos
Clavícula , Fraturas Ósseas , Aprendizado de Máquina , Humanos , Clavícula/lesões , Clavícula/diagnóstico por imagem , Fraturas Ósseas/diagnóstico por imagem , Fraturas Ósseas/classificação , Feminino , Pessoa de Meia-Idade , Masculino , Estudos Retrospectivos , Sensibilidade e Especificidade , Adulto , Radiografia/métodos
18.
J Am Coll Radiol ; 2024 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-38960083

RESUMO

PURPOSE: We compared the performance of generative AI (G-AI, ATARI) and natural language processing (NLP) tools for identifying laterality errors in radiology reports and images. METHODS: We used an NLP-based (mPower) tool to identify radiology reports flagged for laterality errors in its QA Dashboard. The NLP model detects and highlights laterality mismatches in radiology reports. From an initial pool of 1124 radiology reports flagged by the NLP for laterality errors, we selected and evaluated 898 reports that encompassed radiography, CT, MRI, and ultrasound modalities to ensure comprehensive coverage. A radiologist reviewed each radiology report to assess if the flagged laterality errors were present (reporting error - true positive) or absent (NLP error - false positive). Next, we applied ATARI to 237 radiology reports and images with consecutive NLP true positive (118 reports) and false positive (119 reports) laterality errors. We estimated accuracy of NLP and G-AI tools to identify overall and modality-wise laterality errors. RESULTS: Among the 898 NLP-flagged laterality errors, 64% (574/898) had NLP errors and 36% (324/898) were reporting errors. The text query ATARI feature correctly identified the absence of laterality mismatch (NLP false positives) with a 97.4% accuracy (115/118 reports; 95% CI = 96.5% - 98.3%). Combined Vision and text query resulted in 98.3% accuracy (116/118 reports/images; 95% CI = 97.6% - 99.0%) query alone had a 98.3% accuracy (116/118 images; 95% CI = 97.6% - 99.0%). CONCLUSION: The generative AI-empowered ATARI prototype outperformed the assessed NLP tool for determining true and false laterality errors in radiology reports while enabling an image-based laterality determination. Underlying errors in ATARI text query in complex radiology reports emphasize the need for further improvement in the technology.

19.
J Am Coll Radiol ; 21(2): 329-340, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-37196818

RESUMO

PURPOSE: To evaluate the real-world performance of two FDA-approved artificial intelligence (AI)-based computer-aided triage and notification (CADt) detection devices and compare them with the manufacturer-reported performance testing in the instructions for use. MATERIALS AND METHODS: Clinical performance of two FDA-cleared CADt large-vessel occlusion (LVO) devices was retrospectively evaluated at two separate stroke centers. Consecutive "code stroke" CT angiography examinations were included and assessed for patient demographics, scanner manufacturer, presence or absence of CADt result, CADt result, and LVO in the internal carotid artery (ICA), horizontal middle cerebral artery (MCA) segment (M1), Sylvian MCA segments after the bifurcation (M2), precommunicating part of cerebral artery, postcommunicating part of the cerebral artery, vertebral artery, basilar artery vessel segments. The original radiology report served as the reference standard, and a study radiologist extracted the above data elements from the imaging examination and radiology report. RESULTS: At hospital A, the CADt algorithm manufacturer reports assessment of intracranial ICA and MCA with sensitivity of 97% and specificity of 95.6%. Real-world performance of 704 cases included 79 in which no CADt result was available. Sensitivity and specificity in ICA and M1 segments were 85.3% and 91.9%. Sensitivity decreased to 68.5% when M2 segments were included and to 59.9% when all proximal vessel segments were included. At hospital B the CADt algorithm manufacturer reports sensitivity of 87.8% and specificity of 89.6%, without specifying the vessel segments. Real-world performance of 642 cases included 20 cases in which no CADt result was available. Sensitivity and specificity in ICA and M1 segments were 90.7% and 97.9%. Sensitivity decreased to 76.4% when M2 segments were included and to 59.4% when all proximal vessel segments are included. DISCUSSION: Real-world testing of two CADt LVO detection algorithms identified gaps in the detection and communication of potentially treatable LVOs when considering vessels beyond the intracranial ICA and M1 segments and in cases with absent and uninterpretable data.


Assuntos
Inteligência Artificial , Acidente Vascular Cerebral , Humanos , Triagem , Estudos Retrospectivos , Acidente Vascular Cerebral/diagnóstico por imagem , Algoritmos , Computadores
20.
Artigo em Inglês | MEDLINE | ID: mdl-38806239

RESUMO

BACKGROUND AND PURPOSE: Mass effect and vasogenic edema are critical findings on CT of the head. This study compared the accuracy of an artificial intelligence model (Annalise Enterprise CTB) to consensus neuroradiologist interpretations in detecting mass effect and vasogenic edema. MATERIALS AND METHODS: A retrospective standalone performance assessment was conducted on datasets of non-contrast CT head cases acquired between 2016 and 2022 for each finding. The cases were obtained from patients aged 18 years or older from five hospitals in the United States. The positive cases were selected consecutively based on the original clinical reports using natural language processing and manual confirmation. The negative cases were selected by taking the next negative case acquired from the same CT scanner after positive cases. Each case was interpreted independently by up to three neuroradiologists to establish consensus interpretations. Each case was then interpreted by the AI model for the presence of the relevant finding. The neuroradiologists were provided with the entire CT study. The AI model separately received thin (≤1.5mm) and/or thick (>1.5 and ≤5mm) axial series. RESULTS: The two cohorts included 818 cases for mass effect and 310 cases for vasogenic edema. The AI model identified mass effect with sensitivity 96.6% (95% CI, 94.9-98.2) and specificity 89.8% (95% CI, 84.7-94.2) for the thin series, and 95.3% (95% CI, 93.5-96.8) and 93.1% (95% CI, 89.1-96.6) for the thick series. It identified vasogenic edema with sensitivity 90.2% (95% CI, 82.0-96.7) and specificity 93.5% (95% CI, 88.9-97.2) for the thin series, and 90.0% (95% CI, 84.0-96.0) and 95.5% (95% CI, 92.5-98.0) for the thick series. The corresponding areas under the curve were at least 0.980. CONCLUSIONS: The assessed AI model accurately identified mass effect and vasogenic edema in this CT dataset. It could assist the clinical workflow by prioritizing interpretation of abnormal cases, which could benefit patients through earlier identification and subsequent treatment. ABBREVIATIONS: AI = artificial intelligence; AUC = area under the curve; CADt = computer assisted triage devices; FDA = Food and Drug Administration; NPV = negative predictive value; PPV = positive predictive value; SD = standard deviation.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA