RESUMO
BACKGROUND: Respiratory syncytial virus (RSV) is a common cause of lower respiratory tract disease (LRTD) among adults and can lead to serious morbidity and mortality; however, evidence on the magnitude of the public health and economic burden of adult RSV-LRTD is limited. This study was undertaken to project annual clinical outcomes and economic costs of medically attended RSV-LRTD among US adults, and to identify subgroups responsible for a disproportionate share of disease burden. METHODS: Clinical outcomes of RSV-LRTD were projected for subgroups of US adults defined by age and comorbidity profile (with vs. without chronic/immunocompromising medical conditions) based on corresponding population sizes, episode (disease) rates, and case-fatality rates. Economic costs comprised medical (i.e., direct) costs and non-medical (i.e., indirect) costs of RSV-LRTD, and were generated based on numbers of episodes and unit costs in relation to setting of care, age, and comorbidity profile. RESULTS: Among 265 million US adults aged ≥18 years in 2023, 6.5 million medically attended episodes of RSV-LRTD were projected to occur including 349,260 requiring hospitalization, 357,892 requiring an emergency department visit (not leading to hospitalization), and 5.8 million requiring other ambulatory care. Direct costs ($15.2 billion) and indirect costs ($9.7 billion) were projected to total $25.0 billion. Persons aged 60-99 years accounted for 31 % of the adult population and over 50 % of the economic burden of RSV-LRTD, while adults aged <60 years with chronic/immunocompromising medical conditions accounted for 10 % of the population and 27 % of the economic burden. CONCLUSIONS: Annual burden of RSV-LRTD among US adults-especially older adults and those of all ages with underlying medical conditions-is substantial. Preventive measures, such as recently approved RSV vaccines, have the potential to yield important improvements in public and patient health, and to reduce the economic burden of RSV-LRTD from the US healthcare system and societal perspectives.
RESUMO
OBJECTIVES: To systematically review the accuracy of artificial intelligence (AI)-based systems for grading of fundus images in diabetic retinopathy (DR) screening. METHODS: We searched MEDLINE, EMBASE, the Cochrane Library and the ClinicalTrials.gov from 1st January 2000 to 27th August 2021. Accuracy studies published in English were included if they met the pre-specified inclusion criteria. Selection of studies for inclusion, data extraction and quality assessment were conducted by one author with a second reviewer independently screening and checking 20% of titles. Results were analysed narratively. RESULTS: Forty-three studies evaluating 15 deep learning (DL) and 4 machine learning (ML) systems were included. Nine systems were evaluated in a single study each. Most studies were judged to be at high or unclear risk of bias in at least one QUADAS-2 domain. Sensitivity for referable DR and higher grades was ≥85% while specificity varied and was <80% for all ML systems and in 6/31 studies evaluating DL systems. Studies reported high accuracy for detection of ungradable images, but the latter were analysed and reported inconsistently. Seven studies reported that AI was more sensitive but less specific than human graders. CONCLUSIONS: AI-based systems are more sensitive than human graders and could be safe to use in clinical practice but have variable specificity. However, for many systems evidence is limited, at high risk of bias and may not generalise across settings. Therefore, pre-implementation assessment in the target clinical pathway is essential to obtain reliable and applicable accuracy estimates.
Assuntos
Diabetes Mellitus , Retinopatia Diabética , Humanos , Inteligência Artificial , Retinopatia Diabética/diagnóstico por imagem , Detecção Precoce de Câncer , Programas de Rastreamento/métodosRESUMO
Rigorous evaluation of artificial intelligence (AI) systems for image classification is essential before deployment into health-care settings, such as screening programmes, so that adoption is effective and safe. A key step in the evaluation process is the external validation of diagnostic performance using a test set of images. We conducted a rapid literature review on methods to develop test sets, published from 2012 to 2020, in English. Using thematic analysis, we mapped themes and coded the principles using the Population, Intervention, and Comparator or Reference standard, Outcome, and Study design framework. A group of screening and AI experts assessed the evidence-based principles for completeness and provided further considerations. From the final 15 principles recommended here, five affect population, one intervention, two comparator, one reference standard, and one both reference standard and comparator. Finally, four are appliable to outcome and one to study design. Principles from the literature were useful to address biases from AI; however, they did not account for screening specific biases, which we now incorporate. The principles set out here should be used to support the development and use of test sets for studies that assess the accuracy of AI within screening programmes, to ensure they are fit for purpose and minimise bias.
Assuntos
Inteligência Artificial , Diagnóstico por Imagem , Programas de RastreamentoRESUMO
Artificial intelligence (AI) could have the potential to accurately classify mammograms according to the presence or absence of radiological signs of breast cancer, replacing or supplementing human readers (radiologists). The UK National Screening Committee's assessments of the use of AI systems to examine screening mammograms continues to focus on maximising benefits and minimising harms to women screened, when deciding whether to recommend the implementation of AI into the Breast Screening Programme in the UK. Maintaining or improving programme specificity is important to minimise anxiety from false positive results. When considering cancer detection, AI test sensitivity alone is not sufficiently informative, and additional information on the spectrum of disease detected and interval cancers is crucial to better understand the benefits and harms of screening. Although large retrospective studies might provide useful evidence by directly comparing test accuracy and spectrum of disease detected between different AI systems and by population subgroup, most retrospective studies are biased due to differential verification (ie, the use of different reference standards to verify the target condition among study participants). Enriched, multiple-reader, multiple-case, test set laboratory studies are also biased due to the laboratory effect (ie, radiologists' performance in retrospective, laboratory, observer studies is substantially different to their performance in a clinical environment). Therefore, assessment of the effect of incorporating any AI system into the breast screening pathway in prospective studies is required as it will provide key evidence for the effect of the interaction of medical staff with AI, and the impact on women's outcomes.