RESUMEN
BACKGROUND: Systematic reviews (SRs) are time-consuming and labor-intensive to perform. With the growing number of scientific publications, the SR development process becomes even more laborious. This is problematic because timely SR evidence is essential for decision-making in evidence-based healthcare and policymaking. Numerous methods and tools that accelerate SR development have recently emerged. To date, no scoping review has been conducted to provide a comprehensive summary of methods and ready-to-use tools to improve efficiency in SR production. OBJECTIVE: To present an overview of primary studies that evaluated the use of ready-to-use applications of tools or review methods to improve efficiency in the review process. METHODS: We conducted a scoping review. An information specialist performed a systematic literature search in four databases, supplemented with citation-based and grey literature searching. We included studies reporting the performance of methods and ready-to-use tools for improving efficiency when producing or updating a SR in the health field. We performed dual, independent title and abstract screening, full-text selection, and data extraction. The results were analyzed descriptively and presented narratively. RESULTS: We included 103 studies: 51 studies reported on methods, 54 studies on tools, and 2 studies reported on both methods and tools to make SR production more efficient. A total of 72 studies evaluated the validity (n = 69) or usability (n = 3) of one method (n = 33) or tool (n = 39), and 31 studies performed comparative analyses of different methods (n = 15) or tools (n = 16). 20 studies conducted prospective evaluations in real-time workflows. Most studies evaluated methods or tools that aimed at screening titles and abstracts (n = 42) and literature searching (n = 24), while for other steps of the SR process, only a few studies were found. Regarding the outcomes included, most studies reported on validity outcomes (n = 84), while outcomes such as impact on results (n = 23), time-saving (n = 24), usability (n = 13), and cost-saving (n = 3) were less often evaluated. CONCLUSION: For title and abstract screening and literature searching, various evaluated methods and tools are available that aim at improving the efficiency of SR production. However, only few studies have addressed the influence of these methods and tools in real-world workflows. Few studies exist that evaluate methods or tools supporting the remaining tasks. Additionally, while validity outcomes are frequently reported, there is a lack of evaluation regarding other outcomes.
Asunto(s)
Revisiones Sistemáticas como Asunto , Humanos , Proyectos de Investigación , Revisiones Sistemáticas como Asunto/métodosRESUMEN
BACKGROUND: Diagnosing people with a SARS-CoV-2 infection played a critical role in managing the COVID-19 pandemic and remains a priority for the transition to long-term management of COVID-19. Initial shortages of extraction and reverse transcription polymerase chain reaction (RT-PCR) reagents impaired the desired upscaling of testing in many countries, which led to the search for alternatives to RNA extraction/purification and RT-PCR testing. Reference standard methods for diagnosing the presence of SARS-CoV-2 infection rely primarily on real-time reverse transcription-polymerase chain reaction (RT-PCR). Alternatives to RT-PCR could, if sufficiently accurate, have a positive impact by expanding the range of diagnostic tools available for the timely identification of people infected by SARS-CoV-2, access to testing and the use of resources. OBJECTIVES: To assess the diagnostic accuracy of alternative (to RT-PCR assays) laboratory-based molecular tests for diagnosing SARS-CoV-2 infection. SEARCH METHODS: We searched the COVID-19 Open Access Project living evidence database from the University of Bern until 30 September 2020 and the WHO COVID-19 Research Database until 31 October 2022. We did not apply language restrictions. SELECTION CRITERIA: We included studies of people with suspected or known SARS-CoV-2 infection, or where tests were used to screen for infection, and studies evaluating commercially developed laboratory-based molecular tests for the diagnosis of SARS-CoV-2 infection considered as alternatives to RT-PCR testing. We also included all reference standards to define the presence or absence of SARS-CoV-2, including RT-PCR tests and established clinical diagnostic criteria. DATA COLLECTION AND ANALYSIS: Two authors independently screened studies and resolved disagreements by discussing them with a third author. Two authors independently extracted data and assessed the risk of bias and applicability of the studies using the QUADAS-2 tool. We presented sensitivity and specificity, with 95% confidence intervals (CIs), for each test using paired forest plots and summarised results using average sensitivity and specificity using a bivariate random-effects meta-analysis. We illustrated the findings per index test category and assay brand compared to the WHO's acceptable sensitivity and specificity threshold for diagnosing SARS-CoV-2 infection using nucleic acid tests. MAIN RESULTS: We included data from 64 studies reporting 94 cohorts of participants and 105 index test evaluations, with 74,753 samples and 7517 confirmed SARS-CoV-2 cases. We did not identify any published or preprint reports of accuracy for a considerable number of commercially produced NAAT assays. Most cohorts were judged at unclear or high risk of bias in more than three QUADAS-2 domains. Around half of the cohorts were considered at high risk of selection bias because of recruitment based on COVID status. Three quarters of 94 cohorts were at high risk of bias in the reference standard domain because of reliance on a single RT-PCR result to determine the absence of SARS-CoV-2 infection or were at unclear risk of bias due to a lack of clarity about the time interval between the index test assessment and the reference standard, the number of missing results, or the absence of a participant flow diagram. For index tests categories with four or more evaluations and when summary estimations were possible, we found that: a) For RT-PCR assays designed to omit/adapt RNA extraction/purification, the average sensitivity was 95.1% (95% CI 91.1% to 97.3%), and the average specificity was 99.7% (95% CI 98.5% to 99.9%; based on 27 evaluations, 2834 samples and 1178 SARS-CoV-2 cases); b) For RT-LAMP assays, the average sensitivity was 88.4% (95% CI 83.1% to 92.2%), and the average specificity was 99.7% (95% CI 98.7% to 99.9%; 24 evaluations, 29,496 samples and 2255 SARS-CoV-2 cases); c) for TMA assays, the average sensitivity was 97.6% (95% CI 95.2% to 98.8%), and the average specificity was 99.4% (95% CI 94.9% to 99.9%; 14 evaluations, 2196 samples and 942 SARS-CoV-2 cases); d) for digital PCR assays, the average sensitivity was 98.5% (95% CI 95.2% to 99.5%), and the average specificity was 91.4% (95% CI 60.4% to 98.7%; five evaluations, 703 samples and 354 SARS-CoV-2 cases); e) for RT-LAMP assays omitting/adapting RNA extraction, the average sensitivity was 73.1% (95% CI 58.4% to 84%), and the average specificity was 100% (95% CI 98% to 100%; 24 evaluations, 14,342 samples and 1502 SARS-CoV-2 cases). Only two index test categories fulfil the WHO-acceptable sensitivity and specificity requirements for SARS-CoV-2 nucleic acid tests: RT-PCR assays designed to omit/adapt RNA extraction/purification and TMA assays. In addition, WHO-acceptable performance criteria were met for two assays out of 35 when tests were used according to manufacturer instructions. At 5% prevalence using a cohort of 1000 people suspected of SARS-CoV-2 infection, the positive predictive value of RT-PCR assays omitting/adapting RNA extraction/purification will be 94%, with three in 51 positive results being false positives, and around two missed cases. For TMA assays, the positive predictive value of RT-PCR assays will be 89%, with 6 in 55 positive results being false positives, and around one missed case. AUTHORS' CONCLUSIONS: Alternative laboratory-based molecular tests aim to enhance testing capacity in different ways, such as reducing the time, steps and resources needed to obtain valid results. Several index test technologies with these potential advantages have not been evaluated or have been assessed by only a few studies of limited methodological quality, so the performance of these kits was undetermined. Only two index test categories with enough evaluations for meta-analysis fulfil the WHO set of acceptable accuracy standards for SARS-CoV-2 nucleic acid tests: RT-PCR assays designed to omit/adapt RNA extraction/purification and TMA assays. These assays might prove to be suitable alternatives to RT-PCR for identifying people infected by SARS-CoV-2, especially when the alternative would be not having access to testing. However, these findings need to be interpreted and used with caution because of several limitations in the evidence, including reliance on retrospective samples without information about the symptom status of participants and the timing of assessment. No extrapolation of found accuracy data for these two alternatives to any test brands using the same techniques can be made as, for both groups, one test brand with high accuracy was overrepresented with 21/26 and 12/14 included studies, respectively. Although we used a comprehensive search and had broad eligibility criteria to include a wide range of tests that could be alternatives to RT-PCR methods, further research is needed to assess the performance of alternative COVID-19 tests and their role in pandemic management.
Asunto(s)
Prueba de Ácido Nucleico para COVID-19 , COVID-19 , ARN Viral , SARS-CoV-2 , Sensibilidad y Especificidad , Humanos , Sesgo , COVID-19/diagnóstico , Prueba de Ácido Nucleico para COVID-19/métodos , Reacciones Falso Negativas , Reacciones Falso Positivas , Pandemias , Reacción en Cadena en Tiempo Real de la Polimerasa/métodos , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa/métodos , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa/normas , ARN Viral/análisis , SARS-CoV-2/genética , SARS-CoV-2/aislamiento & purificaciónRESUMEN
BACKGROUND: Identifying patients with COVID-19 disease who will deteriorate can be useful to assess whether they should receive intensive care, or whether they can be treated in a less intensive way or through outpatient care. In clinical care, routine laboratory markers, such as C-reactive protein, are used to assess a person's health status. OBJECTIVES: To assess the accuracy of routine blood-based laboratory tests to predict mortality and deterioration to severe or critical (from mild or moderate) COVID-19 in people with SARS-CoV-2. SEARCH METHODS: On 25 August 2022, we searched the Cochrane COVID-19 Study Register, encompassing searches of various databases such as MEDLINE via PubMed, CENTRAL, Embase, medRxiv, and ClinicalTrials.gov. We did not apply any language restrictions. SELECTION CRITERIA: We included studies of all designs that produced estimates of prognostic accuracy in participants who presented to outpatient services, or were admitted to general hospital wards with confirmed SARS-CoV-2 infection, and studies that were based on serum banks of samples from people. All routine blood-based laboratory tests performed during the first encounter were included. We included any reference standard used to define deterioration to severe or critical disease that was provided by the authors. DATA COLLECTION AND ANALYSIS: Two review authors independently extracted data from each included study, and independently assessed the methodological quality using the Quality Assessment of Prognostic Accuracy Studies tool. As studies reported different thresholds for the same test, we used the Hierarchical Summary Receiver Operator Curve model for meta-analyses to estimate summary curves in SAS 9.4. We estimated the sensitivity at points on the SROC curves that corresponded to the median and interquartile range boundaries of specificities in the included studies. Direct and indirect comparisons were exclusively conducted for biomarkers with an estimated sensitivity and 95% CI of ≥ 50% at a specificity of ≥ 50%. The relative diagnostic odds ratio was calculated as a summary of the relative accuracy of these biomarkers. MAIN RESULTS: We identified a total of 64 studies, including 71,170 participants, of which 8169 participants died, and 4031 participants deteriorated to severe/critical condition. The studies assessed 53 different laboratory tests. For some tests, both increases and decreases relative to the normal range were included. There was important heterogeneity between tests and their cut-off values. None of the included studies had a low risk of bias or low concern for applicability for all domains. None of the tests included in this review demonstrated high sensitivity or specificity, or both. The five tests with summary sensitivity and specificity above 50% were: C-reactive protein increase, neutrophil-to-lymphocyte ratio increase, lymphocyte count decrease, d-dimer increase, and lactate dehydrogenase increase. Inflammation For mortality, summary sensitivity of a C-reactive protein increase was 76% (95% CI 73% to 79%) at median specificity, 59% (low-certainty evidence). For deterioration, summary sensitivity was 78% (95% CI 67% to 86%) at median specificity, 72% (very low-certainty evidence). For the combined outcome of mortality or deterioration, or both, summary sensitivity was 70% (95% CI 49% to 85%) at median specificity, 60% (very low-certainty evidence). For mortality, summary sensitivity of an increase in neutrophil-to-lymphocyte ratio was 69% (95% CI 66% to 72%) at median specificity, 63% (very low-certainty evidence). For deterioration, summary sensitivity was 75% (95% CI 59% to 87%) at median specificity, 71% (very low-certainty evidence). For mortality, summary sensitivity of a decrease in lymphocyte count was 67% (95% CI 56% to 77%) at median specificity, 61% (very low-certainty evidence). For deterioration, summary sensitivity of a decrease in lymphocyte count was 69% (95% CI 60% to 76%) at median specificity, 67% (very low-certainty evidence). For the combined outcome, summary sensitivity was 83% (95% CI 67% to 92%) at median specificity, 29% (very low-certainty evidence). For mortality, summary sensitivity of a lactate dehydrogenase increase was 82% (95% CI 66% to 91%) at median specificity, 60% (very low-certainty evidence). For deterioration, summary sensitivity of a lactate dehydrogenase increase was 79% (95% CI 76% to 82%) at median specificity, 66% (low-certainty evidence). For the combined outcome, summary sensitivity was 69% (95% CI 51% to 82%) at median specificity, 62% (very low-certainty evidence). Hypercoagulability For mortality, summary sensitivity of a d-dimer increase was 70% (95% CI 64% to 76%) at median specificity of 56% (very low-certainty evidence). For deterioration, summary sensitivity was 65% (95% CI 56% to 74%) at median specificity of 63% (very low-certainty evidence). For the combined outcome, summary sensitivity was 65% (95% CI 52% to 76%) at median specificity of 54% (very low-certainty evidence). To predict mortality, neutrophil-to-lymphocyte ratio increase had higher accuracy compared to d-dimer increase (RDOR (diagnostic Odds Ratio) 2.05, 95% CI 1.30 to 3.24), C-reactive protein increase (RDOR 2.64, 95% CI 2.09 to 3.33), and lymphocyte count decrease (RDOR 2.63, 95% CI 1.55 to 4.46). D-dimer increase had higher accuracy compared to lymphocyte count decrease (RDOR 1.49, 95% CI 1.23 to 1.80), C-reactive protein increase (RDOR 1.31, 95% CI 1.03 to 1.65), and lactate dehydrogenase increase (RDOR 1.42, 95% CI 1.05 to 1.90). Additionally, lactate dehydrogenase increase had higher accuracy compared to lymphocyte count decrease (RDOR 1.30, 95% CI 1.13 to 1.49). To predict deterioration to severe disease, C-reactive protein increase had higher accuracy compared to d-dimer increase (RDOR 1.76, 95% CI 1.25 to 2.50). The neutrophil-to-lymphocyte ratio increase had higher accuracy compared to d-dimer increase (RDOR 2.77, 95% CI 1.58 to 4.84). Lastly, lymphocyte count decrease had higher accuracy compared to d-dimer increase (RDOR 2.10, 95% CI 1.44 to 3.07) and lactate dehydrogenase increase (RDOR 2.22, 95% CI 1.52 to 3.26). AUTHORS' CONCLUSIONS: Laboratory tests, associated with hypercoagulability and hyperinflammatory response, were better at predicting severe disease and mortality in patients with SARS-CoV-2 compared to other laboratory tests. However, to safely rule out severe disease, tests should have high sensitivity (> 90%), and none of the identified laboratory tests met this criterion. In clinical practice, a more comprehensive assessment of a patient's health status is usually required by, for example, incorporating these laboratory tests into clinical prediction rules together with clinical symptoms, radiological findings, and patient's characteristics.
Asunto(s)
Proteína C-Reactiva , COVID-19 , SARS-CoV-2 , Humanos , COVID-19/mortalidad , COVID-19/sangre , COVID-19/diagnóstico , Proteína C-Reactiva/análisis , Biomarcadores/sangre , Pronóstico , Deterioro Clínico , Sesgo , Pandemias , Sensibilidad y Especificidad , Índice de Severidad de la Enfermedad , Prueba de COVID-19/métodosRESUMEN
BACKGROUND: Inappropriate antimicrobial usage is a key driver of antimicrobial resistance (AMR). Low- and middle-income countries (LMICs) are disproportionately burdened by AMR and young children are especially vulnerable to infections with AMR-bearing pathogens. The impact of antibiotics on the microbiome, selection, persistence, and horizontal spread of AMR genes is insufficiently characterized and understood in children in LMICs. This systematic review aims to collate and evaluate the available literature describing the impact of antibiotics on the infant gut microbiome and resistome in LMICs. METHODS AND FINDINGS: In this systematic review, we searched the online databases MEDLINE (1946 to 28 January 2023), EMBASE (1947 to 28 January 2023), SCOPUS (1945 to 29 January 2023), WHO Global Index Medicus (searched up to 29 January 2023), and SciELO (searched up to 29 January 2023). A total of 4,369 articles were retrieved across the databases. Duplicates were removed resulting in 2,748 unique articles. Screening by title and abstract excluded 2,666 articles, 92 articles were assessed based on the full text, and 10 studies met the eligibility criteria that included human studies conducted in LMICs among children below the age of 2 that reported gut microbiome composition and/or resistome composition (AMR genes) following antibiotic usage. The included studies were all randomized control trials (RCTs) and were assessed for risk of bias using the Cochrane risk-of-bias for randomized studies tool. Overall, antibiotics reduced gut microbiome diversity and increased antibiotic-specific resistance gene abundance in antibiotic treatment groups as compared to the placebo. The most widely tested antibiotic was azithromycin that decreased the diversity of the gut microbiome and significantly increased macrolide resistance as early as 5 days posttreatment. A major limitation of this study was paucity of available studies that cover this subject area. Specifically, the range of antibiotics assessed did not include the most commonly used antibiotics in LMIC populations. CONCLUSION: In this study, we observed that antibiotics significantly reduce the diversity and alter the composition of the infant gut microbiome in LMICs, while concomitantly selecting for resistance genes whose persistence can last for months following treatment. Considerable heterogeneity in study methodology, timing and duration of sampling, and sequencing methodology in currently available research limit insights into antibiotic impacts on the microbiome and resistome in children in LMICs. More research is urgently needed to fill this gap in order to better understand whether antibiotic-driven reductions in microbiome diversity and selection of AMR genes place LMIC children at risk for adverse health outcomes, including infections with AMR-bearing pathogens.
Asunto(s)
Antibacterianos , Microbioma Gastrointestinal , Lactante , Niño , Humanos , Preescolar , Antibacterianos/efectos adversos , Países en Desarrollo , Microbioma Gastrointestinal/genética , Azitromicina , Farmacorresistencia Microbiana/genéticaRESUMEN
OBJECTIVE: Asthma control is generally monitored by assessing symptoms and lung function. However, optimal treatment is also dependent on the type and extent of airway inflammation. Fraction of exhaled Nitric Oxide (FeNO) is a noninvasive biomarker of type 2 airway inflammation, but its effectiveness in guiding asthma treatment remains disputed. We performed a systematic review and meta-analysis to obtain summary estimates of the effectiveness of FeNO-guided asthma treatment. DESIGN: We updated a Cochrane systematic review from 2016. Cochrane Risk of Bias tool was used to assess risk of bias. Inverse-variance random-effects meta-analysis was performed. Certainty of evidence was assessed using GRADE. Subgroup analyses were performed based on asthma severity, asthma control, allergy/atopy, pregnancy and obesity. DATA SOURCES: The Cochrane Airways Group Trials Register was searched on 9 May 2023. ELIGIBILITY CRITERIA: We included randomized controlled trials (RCTs) comparing the effectiveness of a FeNO-guided treatment versus usual (symptom-guided) treatment in adult asthma patients. RESULTS: We included 12 RCTs (2,116 patients), all showing high or unclear risk of bias in at least one domain. Five RCTs reported support from a FeNO manufacturer. FeNO-guided treatment probably reduces the number of patients having ≥1 exacerbation (OR = 0.61; 95%CI 0.44 to 0.83; six RCTs; GRADE moderate certainty) and exacerbation rate (RR = 0.67; 95%CI 0.54 to 0.82; six RCTs; moderate certainty), and may slightly improve Asthma Control Questionnaire score (MD = -0.10; 95%CI -0.18 to -0.02, six RCTs; low certainty), however, this change is unlikely to be clinically important. An effect on severe exacerbations, quality of life, FEV1, treatment dosage and FeNO values could not be demonstrated. There were no indications that effectiveness is different in subgroups of patients, although evidence for subgroup analysis was limited. CONCLUSIONS: FeNO-guided asthma treatment probably results in fewer exacerbations but may not have clinically important effects on other asthma outcomes.
Asunto(s)
Asma , Femenino , Embarazo , Adulto , Humanos , Asma/diagnóstico , Asma/tratamiento farmacológico , Óxido Nítrico , InflamaciónRESUMEN
BACKGROUND: In patients with interstitial lung diseases (ILD), histopathological input is often required to obtain a diagnosis. Surgical lung biopsy (SLB) is considered the reference standard, but many patients are clinically unfit to undergo this invasive procedure, and adverse events, length of hospitalisation and costs are considerable. This European Respiratory Society (ERS) guideline provides evidence-based clinical practice recommendations for the role of transbronchial lung cryobiopsy (TBLC) in obtaining tissue-based diagnosis in patients with undiagnosed ILD. METHODS: The ERS Task Force consisted of clinical experts in the field of ILD and/or TBLC and methodological experts. Four PICO (Patient, Intervention, Comparator, Outcomes) questions and two narrative questions were formulated. Systematic literature searches were performed in MEDLINE and Embase (up to June 2021). GRADE (Grading, Recommendation, Assessment, Development and Evaluation) methodology was applied. RESULTS: In patients with undiagnosed ILD and an indication to obtain histopathological data: 1) TBLC is suggested as a replacement test in patients considered eligible to undergo SLB, 2) TBLC is suggested in patients not considered eligible to undergo SLB, 3) SLB is suggested as an add-on test in patients with a non-informative TBLC, 4) no recommendation is made for or against a second TBLC in patients with a non-informative TBLC and 5) TBLC operators should undergo training, but no recommendation is made for the type of training required. CONCLUSIONS: TBLC provides important diagnostic information in patients with undiagnosed ILD. Diagnostic yield is lower compared to SLB, at reduced serious adverse events and length of hospitalisation. Certainty of the evidence is mostly "very low".
Asunto(s)
Criocirugía , Enfermedades Pulmonares Intersticiales , Humanos , Biopsia/métodos , Broncoscopía/métodos , Criocirugía/efectos adversos , Criocirugía/métodos , Pulmón/patología , Enfermedades Pulmonares Intersticiales/patologíaRESUMEN
BACKGROUND: Accurate rapid diagnostic tests for SARS-CoV-2 infection would be a useful tool to help manage the COVID-19 pandemic. Testing strategies that use rapid antigen tests to detect current infection have the potential to increase access to testing, speed detection of infection, and inform clinical and public health management decisions to reduce transmission. This is the second update of this review, which was first published in 2020. OBJECTIVES: To assess the diagnostic accuracy of rapid, point-of-care antigen tests for diagnosis of SARS-CoV-2 infection. We consider accuracy separately in symptomatic and asymptomatic population groups. Sources of heterogeneity investigated included setting and indication for testing, assay format, sample site, viral load, age, timing of test, and study design. SEARCH METHODS: We searched the COVID-19 Open Access Project living evidence database from the University of Bern (which includes daily updates from PubMed and Embase and preprints from medRxiv and bioRxiv) on 08 March 2021. We included independent evaluations from national reference laboratories, FIND and the Diagnostics Global Health website. We did not apply language restrictions. SELECTION CRITERIA: We included studies of people with either suspected SARS-CoV-2 infection, known SARS-CoV-2 infection or known absence of infection, or those who were being screened for infection. We included test accuracy studies of any design that evaluated commercially produced, rapid antigen tests. We included evaluations of single applications of a test (one test result reported per person) and evaluations of serial testing (repeated antigen testing over time). Reference standards for presence or absence of infection were any laboratory-based molecular test (primarily reverse transcription polymerase chain reaction (RT-PCR)) or pre-pandemic respiratory sample. DATA COLLECTION AND ANALYSIS: We used standard screening procedures with three people. Two people independently carried out quality assessment (using the QUADAS-2 tool) and extracted study results. Other study characteristics were extracted by one review author and checked by a second. We present sensitivity and specificity with 95% confidence intervals (CIs) for each test, and pooled data using the bivariate model. We investigated heterogeneity by including indicator variables in the random-effects logistic regression models. We tabulated results by test manufacturer and compliance with manufacturer instructions for use and according to symptom status. MAIN RESULTS: We included 155 study cohorts (described in 166 study reports, with 24 as preprints). The main results relate to 152 evaluations of single test applications including 100,462 unique samples (16,822 with confirmed SARS-CoV-2). Studies were mainly conducted in Europe (101/152, 66%), and evaluated 49 different commercial antigen assays. Only 23 studies compared two or more brands of test. Risk of bias was high because of participant selection (40, 26%); interpretation of the index test (6, 4%); weaknesses in the reference standard for absence of infection (119, 78%); and participant flow and timing 41 (27%). Characteristics of participants (45, 30%) and index test delivery (47, 31%) differed from the way in which and in whom the test was intended to be used. Nearly all studies (91%) used a single RT-PCR result to define presence or absence of infection. The 152 studies of single test applications reported 228 evaluations of antigen tests. Estimates of sensitivity varied considerably between studies, with consistently high specificities. Average sensitivity was higher in symptomatic (73.0%, 95% CI 69.3% to 76.4%; 109 evaluations; 50,574 samples, 11,662 cases) compared to asymptomatic participants (54.7%, 95% CI 47.7% to 61.6%; 50 evaluations; 40,956 samples, 2641 cases). Average sensitivity was higher in the first week after symptom onset (80.9%, 95% CI 76.9% to 84.4%; 30 evaluations, 2408 cases) than in the second week of symptoms (53.8%, 95% CI 48.0% to 59.6%; 40 evaluations, 1119 cases). For those who were asymptomatic at the time of testing, sensitivity was higher when an epidemiological exposure to SARS-CoV-2 was suspected (64.3%, 95% CI 54.6% to 73.0%; 16 evaluations; 7677 samples, 703 cases) compared to where COVID-19 testing was reported to be widely available to anyone on presentation for testing (49.6%, 95% CI 42.1% to 57.1%; 26 evaluations; 31,904 samples, 1758 cases). Average specificity was similarly high for symptomatic (99.1%) or asymptomatic (99.7%) participants. We observed a steady decline in summary sensitivities as measures of sample viral load decreased. Sensitivity varied between brands. When tests were used according to manufacturer instructions, average sensitivities by brand ranged from 34.3% to 91.3% in symptomatic participants (20 assays with eligible data) and from 28.6% to 77.8% for asymptomatic participants (12 assays). For symptomatic participants, summary sensitivities for seven assays were 80% or more (meeting acceptable criteria set by the World Health Organization (WHO)). The WHO acceptable performance criterion of 97% specificity was met by 17 of 20 assays when tests were used according to manufacturer instructions, 12 of which demonstrated specificities above 99%. For asymptomatic participants the sensitivities of only two assays approached but did not meet WHO acceptable performance standards in one study each; specificities for asymptomatic participants were in a similar range to those observed for symptomatic people. At 5% prevalence using summary data in symptomatic people during the first week after symptom onset, the positive predictive value (PPV) of 89% means that 1 in 10 positive results will be a false positive, and around 1 in 5 cases will be missed. At 0.5% prevalence using summary data for asymptomatic people, where testing was widely available and where epidemiological exposure to COVID-19 was suspected, resulting PPVs would be 38% to 52%, meaning that between 2 in 5 and 1 in 2 positive results will be false positives, and between 1 in 2 and 1 in 3 cases will be missed. AUTHORS' CONCLUSIONS: Antigen tests vary in sensitivity. In people with signs and symptoms of COVID-19, sensitivities are highest in the first week of illness when viral loads are higher. Assays that meet appropriate performance standards, such as those set by WHO, could replace laboratory-based RT-PCR when immediate decisions about patient care must be made, or where RT-PCR cannot be delivered in a timely manner. However, they are more suitable for use as triage to RT-PCR testing. The variable sensitivity of antigen tests means that people who test negative may still be infected. Many commercially available rapid antigen tests have not been evaluated in independent validation studies. Evidence for testing in asymptomatic cohorts has increased, however sensitivity is lower and there is a paucity of evidence for testing in different settings. Questions remain about the use of antigen test-based repeat testing strategies. Further research is needed to evaluate the effectiveness of screening programmes at reducing transmission of infection, whether mass screening or targeted approaches including schools, healthcare setting and traveller screening.
Asunto(s)
COVID-19 , COVID-19/diagnóstico , Prueba de COVID-19 , Humanos , Pandemias , Sistemas de Atención de Punto , SARS-CoV-2 , Sensibilidad y EspecificidadRESUMEN
BACKGROUND: COVID-19 illness is highly variable, ranging from infection with no symptoms through to pneumonia and life-threatening consequences. Symptoms such as fever, cough, or loss of sense of smell (anosmia) or taste (ageusia), can help flag early on if the disease is present. Such information could be used either to rule out COVID-19 disease, or to identify people who need to go for COVID-19 diagnostic tests. This is the second update of this review, which was first published in 2020. OBJECTIVES: To assess the diagnostic accuracy of signs and symptoms to determine if a person presenting in primary care or to hospital outpatient settings, such as the emergency department or dedicated COVID-19 clinics, has COVID-19. SEARCH METHODS: We undertook electronic searches up to 10 June 2021 in the University of Bern living search database. In addition, we checked repositories of COVID-19 publications. We used artificial intelligence text analysis to conduct an initial classification of documents. We did not apply any language restrictions. SELECTION CRITERIA: Studies were eligible if they included people with clinically suspected COVID-19, or recruited known cases with COVID-19 and also controls without COVID-19 from a single-gate cohort. Studies were eligible when they recruited people presenting to primary care or hospital outpatient settings. Studies that included people who contracted SARS-CoV-2 infection while admitted to hospital were not eligible. The minimum eligible sample size of studies was 10 participants. All signs and symptoms were eligible for this review, including individual signs and symptoms or combinations. We accepted a range of reference standards. DATA COLLECTION AND ANALYSIS: Pairs of review authors independently selected all studies, at both title and abstract, and full-text stage. They resolved any disagreements by discussion with a third review author. Two review authors independently extracted data and assessed risk of bias using the QUADAS-2 checklist, and resolved disagreements by discussion with a third review author. Analyses were restricted to prospective studies only. We presented sensitivity and specificity in paired forest plots, in receiver operating characteristic (ROC) space and in dumbbell plots. We estimated summary parameters using a bivariate random-effects meta-analysis whenever five or more primary prospective studies were available, and whenever heterogeneity across studies was deemed acceptable. MAIN RESULTS: We identified 90 studies; for this update we focused on the results of 42 prospective studies with 52,608 participants. Prevalence of COVID-19 disease varied from 3.7% to 60.6% with a median of 27.4%. Thirty-five studies were set in emergency departments or outpatient test centres (46,878 participants), three in primary care settings (1230 participants), two in a mixed population of in- and outpatients in a paediatric hospital setting (493 participants), and two overlapping studies in nursing homes (4007 participants). The studies did not clearly distinguish mild COVID-19 disease from COVID-19 pneumonia, so we present the results for both conditions together. Twelve studies had a high risk of bias for selection of participants because they used a high level of preselection to decide whether reverse transcription polymerase chain reaction (RT-PCR) testing was needed, or because they enrolled a non-consecutive sample, or because they excluded individuals while they were part of the study base. We rated 36 of the 42 studies as high risk of bias for the index tests because there was little or no detail on how, by whom and when, the symptoms were measured. For most studies, eligibility for testing was dependent on the local case definition and testing criteria that were in effect at the time of the study, meaning most people who were included in studies had already been referred to health services based on the symptoms that we are evaluating in this review. The applicability of the results of this review iteration improved in comparison with the previous reviews. This version has more studies of people presenting to ambulatory settings, which is where the majority of assessments for COVID-19 take place. Only three studies presented any data on children separately, and only one focused specifically on older adults. We found data on 96 symptoms or combinations of signs and symptoms. Evidence on individual signs as diagnostic tests was rarely reported, so this review reports mainly on the diagnostic value of symptoms. Results were highly variable across studies. Most had very low sensitivity and high specificity. RT-PCR was the most often used reference standard (40/42 studies). Only cough (11 studies) had a summary sensitivity above 50% (62.4%, 95% CI 50.6% to 72.9%)); its specificity was low (45.4%, 95% CI 33.5% to 57.9%)). Presence of fever had a sensitivity of 37.6% (95% CI 23.4% to 54.3%) and a specificity of 75.2% (95% CI 56.3% to 87.8%). The summary positive likelihood ratio of cough was 1.14 (95% CI 1.04 to 1.25) and that of fever 1.52 (95% CI 1.10 to 2.10). Sore throat had a summary positive likelihood ratio of 0.814 (95% CI 0.714 to 0.929), which means that its presence increases the probability of having an infectious disease other than COVID-19. Dyspnoea (12 studies) and fatigue (8 studies) had a sensitivity of 23.3% (95% CI 16.4% to 31.9%) and 40.2% (95% CI 19.4% to 65.1%) respectively. Their specificity was 75.7% (95% CI 65.2% to 83.9%) and 73.6% (95% CI 48.4% to 89.3%). The summary positive likelihood ratio of dyspnoea was 0.96 (95% CI 0.83 to 1.11) and that of fatigue 1.52 (95% CI 1.21 to 1.91), which means that the presence of fatigue slightly increases the probability of having COVID-19. Anosmia alone (7 studies), ageusia alone (5 studies), and anosmia or ageusia (6 studies) had summary sensitivities below 50% but summary specificities over 90%. Anosmia had a summary sensitivity of 26.4% (95% CI 13.8% to 44.6%) and a specificity of 94.2% (95% CI 90.6% to 96.5%). Ageusia had a summary sensitivity of 23.2% (95% CI 10.6% to 43.3%) and a specificity of 92.6% (95% CI 83.1% to 97.0%). Anosmia or ageusia had a summary sensitivity of 39.2% (95% CI 26.5% to 53.6%) and a specificity of 92.1% (95% CI 84.5% to 96.2%). The summary positive likelihood ratios of anosmia alone and anosmia or ageusia were 4.55 (95% CI 3.46 to 5.97) and 4.99 (95% CI 3.22 to 7.75) respectively, which is just below our arbitrary definition of a 'red flag', that is, a positive likelihood ratio of at least 5. The summary positive likelihood ratio of ageusia alone was 3.14 (95% CI 1.79 to 5.51). Twenty-four studies assessed combinations of different signs and symptoms, mostly combining olfactory symptoms. By combining symptoms with other information such as contact or travel history, age, gender, and a local recent case detection rate, some multivariable prediction scores reached a sensitivity as high as 90%. AUTHORS' CONCLUSIONS: Most individual symptoms included in this review have poor diagnostic accuracy. Neither absence nor presence of symptoms are accurate enough to rule in or rule out the disease. The presence of anosmia or ageusia may be useful as a red flag for the presence of COVID-19. The presence of cough also supports further testing. There is currently no evidence to support further testing with PCR in any individuals presenting only with upper respiratory symptoms such as sore throat, coryza or rhinorrhoea. Combinations of symptoms with other readily available information such as contact or travel history, or the local recent case detection rate may prove more useful and should be further investigated in an unselected population presenting to primary care or hospital outpatient settings. The diagnostic accuracy of symptoms for COVID-19 is moderate to low and any testing strategy using symptoms as selection mechanism will result in both large numbers of missed cases and large numbers of people requiring testing. Which one of these is minimised, is determined by the goal of COVID-19 testing strategies, that is, controlling the epidemic by isolating every possible case versus identifying those with clinically important disease so that they can be monitored or treated to optimise their prognosis. The former will require a testing strategy that uses very few symptoms as entry criterion for testing, the latter could focus on more specific symptoms such as fever and anosmia.
Asunto(s)
Ageusia , COVID-19 , Faringitis , Anciano , Ageusia/complicaciones , Anosmia/diagnóstico , Anosmia/etiología , Inteligencia Artificial , COVID-19/diagnóstico , COVID-19/epidemiología , Prueba de COVID-19 , Niño , Tos/etiología , Disnea , Fatiga/etiología , Fiebre/diagnóstico , Fiebre/etiología , Hospitales , Humanos , Pacientes Ambulatorios , Atención Primaria de Salud , Estudios Prospectivos , SARS-CoV-2 , Sensibilidad y EspecificidadRESUMEN
BACKGROUND: The diagnostic challenges associated with the COVID-19 pandemic resulted in rapid development of diagnostic test methods for detecting SARS-CoV-2 infection. Serology tests to detect the presence of antibodies to SARS-CoV-2 enable detection of past infection and may detect cases of SARS-CoV-2 infection that were missed by earlier diagnostic tests. Understanding the diagnostic accuracy of serology tests for SARS-CoV-2 infection may enable development of effective diagnostic and management pathways, inform public health management decisions and understanding of SARS-CoV-2 epidemiology. OBJECTIVES: To assess the accuracy of antibody tests, firstly, to determine if a person presenting in the community, or in primary or secondary care has current SARS-CoV-2 infection according to time after onset of infection and, secondly, to determine if a person has previously been infected with SARS-CoV-2. Sources of heterogeneity investigated included: timing of test, test method, SARS-CoV-2 antigen used, test brand, and reference standard for non-SARS-CoV-2 cases. SEARCH METHODS: The COVID-19 Open Access Project living evidence database from the University of Bern (which includes daily updates from PubMed and Embase and preprints from medRxiv and bioRxiv) was searched on 30 September 2020. We included additional publications from the Evidence for Policy and Practice Information and Co-ordinating Centre (EPPI-Centre) 'COVID-19: Living map of the evidence' and the Norwegian Institute of Public Health 'NIPH systematic and living map on COVID-19 evidence'. We did not apply language restrictions. SELECTION CRITERIA: We included test accuracy studies of any design that evaluated commercially produced serology tests, targeting IgG, IgM, IgA alone, or in combination. Studies must have provided data for sensitivity, that could be allocated to a predefined time period after onset of symptoms, or after a positive RT-PCR test. Small studies with fewer than 25 SARS-CoV-2 infection cases were excluded. We included any reference standard to define the presence or absence of SARS-CoV-2 (including reverse transcription polymerase chain reaction tests (RT-PCR), clinical diagnostic criteria, and pre-pandemic samples). DATA COLLECTION AND ANALYSIS: We use standard screening procedures with three reviewers. Quality assessment (using the QUADAS-2 tool) and numeric study results were extracted independently by two people. Other study characteristics were extracted by one reviewer and checked by a second. We present sensitivity and specificity with 95% confidence intervals (CIs) for each test and, for meta-analysis, we fitted univariate random-effects logistic regression models for sensitivity by eligible time period and for specificity by reference standard group. Heterogeneity was investigated by including indicator variables in the random-effects logistic regression models. We tabulated results by test manufacturer and summarised results for tests that were evaluated in 200 or more samples and that met a modification of UK Medicines and Healthcare products Regulatory Agency (MHRA) target performance criteria. MAIN RESULTS: We included 178 separate studies (described in 177 study reports, with 45 as pre-prints) providing 527 test evaluations. The studies included 64,688 samples including 25,724 from people with confirmed SARS-CoV-2; most compared the accuracy of two or more assays (102/178, 57%). Participants with confirmed SARS-CoV-2 infection were most commonly hospital inpatients (78/178, 44%), and pre-pandemic samples were used by 45% (81/178) to estimate specificity. Over two-thirds of studies recruited participants based on known SARS-CoV-2 infection status (123/178, 69%). All studies were conducted prior to the introduction of SARS-CoV-2 vaccines and present data for naturally acquired antibody responses. Seventy-nine percent (141/178) of studies reported sensitivity by week after symptom onset and 66% (117/178) for convalescent phase infection. Studies evaluated enzyme-linked immunosorbent assays (ELISA) (165/527; 31%), chemiluminescent assays (CLIA) (167/527; 32%) or lateral flow assays (LFA) (188/527; 36%). Risk of bias was high because of participant selection (172, 97%); application and interpretation of the index test (35, 20%); weaknesses in the reference standard (38, 21%); and issues related to participant flow and timing (148, 82%). We judged that there were high concerns about the applicability of the evidence related to participants in 170 (96%) studies, and about the applicability of the reference standard in 162 (91%) studies. Average sensitivities for current SARS-CoV-2 infection increased by week after onset for all target antibodies. Average sensitivity for the combination of either IgG or IgM was 41.1% in week one (95% CI 38.1 to 44.2; 103 evaluations; 3881 samples, 1593 cases), 74.9% in week two (95% CI 72.4 to 77.3; 96 evaluations, 3948 samples, 2904 cases) and 88.0% by week three after onset of symptoms (95% CI 86.3 to 89.5; 103 evaluations, 2929 samples, 2571 cases). Average sensitivity during the convalescent phase of infection (up to a maximum of 100 days since onset of symptoms, where reported) was 89.8% for IgG (95% CI 88.5 to 90.9; 253 evaluations, 16,846 samples, 14,183 cases), 92.9% for IgG or IgM combined (95% CI 91.0 to 94.4; 108 evaluations, 3571 samples, 3206 cases) and 94.3% for total antibodies (95% CI 92.8 to 95.5; 58 evaluations, 7063 samples, 6652 cases). Average sensitivities for IgM alone followed a similar pattern but were of a lower test accuracy in every time slot. Average specificities were consistently high and precise, particularly for pre-pandemic samples which provide the least biased estimates of specificity (ranging from 98.6% for IgM to 99.8% for total antibodies). Subgroup analyses suggested small differences in sensitivity and specificity by test technology however heterogeneity in study results, timing of sample collection, and smaller sample numbers in some groups made comparisons difficult. For IgG, CLIAs were the most sensitive (convalescent-phase infection) and specific (pre-pandemic samples) compared to both ELISAs and LFAs (P < 0.001 for differences across test methods). The antigen(s) used (whether from the Spike-protein or nucleocapsid) appeared to have some effect on average sensitivity in the first weeks after onset but there was no clear evidence of an effect during convalescent-phase infection. Investigations of test performance by brand showed considerable variation in sensitivity between tests, and in results between studies evaluating the same test. For tests that were evaluated in 200 or more samples, the lower bound of the 95% CI for sensitivity was 90% or more for only a small number of tests (IgG, n = 5; IgG or IgM, n = 1; total antibodies, n = 4). More test brands met the MHRA minimum criteria for specificity of 98% or above (IgG, n = 16; IgG or IgM, n = 5; total antibodies, n = 7). Seven assays met the specified criteria for both sensitivity and specificity. In a low-prevalence (2%) setting, where antibody testing is used to diagnose COVID-19 in people with symptoms but who have had a negative PCR test, we would anticipate that 1 (1 to 2) case would be missed and 8 (5 to 15) would be falsely positive in 1000 people undergoing IgG or IgM testing in week three after onset of SARS-CoV-2 infection. In a seroprevalence survey, where prevalence of prior infection is 50%, we would anticipate that 51 (46 to 58) cases would be missed and 6 (5 to 7) would be falsely positive in 1000 people having IgG tests during the convalescent phase (21 to 100 days post-symptom onset or post-positive PCR) of SARS-CoV-2 infection. AUTHORS' CONCLUSIONS: Some antibody tests could be a useful diagnostic tool for those in whom molecular- or antigen-based tests have failed to detect the SARS-CoV-2 virus, including in those with ongoing symptoms of acute infection (from week three onwards) or those presenting with post-acute sequelae of COVID-19. However, antibody tests have an increasing likelihood of detecting an immune response to infection as time since onset of infection progresses and have demonstrated adequate performance for detection of prior infection for sero-epidemiological purposes. The applicability of results for detection of vaccination-induced antibodies is uncertain.
Asunto(s)
COVID-19 , SARS-CoV-2 , Humanos , COVID-19/diagnóstico , COVID-19/epidemiología , Anticuerpos Antivirales , Inmunoglobulina G , Vacunas contra la COVID-19 , Pandemias , Estudios Seroepidemiológicos , Inmunoglobulina MRESUMEN
BACKGROUND: Our March 2021 edition of this review showed thoracic imaging computed tomography (CT) to be sensitive and moderately specific in diagnosing COVID-19 pneumonia. This new edition is an update of the review. OBJECTIVES: Our objectives were to evaluate the diagnostic accuracy of thoracic imaging in people with suspected COVID-19; assess the rate of positive imaging in people who had an initial reverse transcriptase polymerase chain reaction (RT-PCR) negative result and a positive RT-PCR result on follow-up; and evaluate the accuracy of thoracic imaging for screening COVID-19 in asymptomatic individuals. The secondary objective was to assess threshold effects of index test positivity on accuracy. SEARCH METHODS: We searched the COVID-19 Living Evidence Database from the University of Bern, the Cochrane COVID-19 Study Register, The Stephen B. Thacker CDC Library, and repositories of COVID-19 publications through to 17 February 2021. We did not apply any language restrictions. SELECTION CRITERIA: We included diagnostic accuracy studies of all designs, except for case-control, that recruited participants of any age group suspected to have COVID-19. Studies had to assess chest CT, chest X-ray, or ultrasound of the lungs for the diagnosis of COVID-19, use a reference standard that included RT-PCR, and report estimates of test accuracy or provide data from which we could compute estimates. We excluded studies that used imaging as part of the reference standard and studies that excluded participants with normal index test results. DATA COLLECTION AND ANALYSIS: The review authors independently and in duplicate screened articles, extracted data and assessed risk of bias and applicability concerns using QUADAS-2. We presented sensitivity and specificity per study on paired forest plots, and summarized pooled estimates in tables. We used a bivariate meta-analysis model where appropriate. MAIN RESULTS: We included 98 studies in this review. Of these, 94 were included for evaluating the diagnostic accuracy of thoracic imaging in the evaluation of people with suspected COVID-19. Eight studies were included for assessing the rate of positive imaging in individuals with initial RT-PCR negative results and positive RT-PCR results on follow-up, and 10 studies were included for evaluating the accuracy of thoracic imaging for imagining asymptomatic individuals. For all 98 included studies, risk of bias was high or unclear in 52 (53%) studies with respect to participant selection, in 64 (65%) studies with respect to reference standard, in 46 (47%) studies with respect to index test, and in 48 (49%) studies with respect to flow and timing. Concerns about the applicability of the evidence to: participants were high or unclear in eight (8%) studies; index test were high or unclear in seven (7%) studies; and reference standard were high or unclear in seven (7%) studies. Imaging in people with suspected COVID-19 We included 94 studies. Eighty-seven studies evaluated one imaging modality, and seven studies evaluated two imaging modalities. All studies used RT-PCR alone or in combination with other criteria (for example, clinical signs and symptoms, positive contacts) as the reference standard for the diagnosis of COVID-19. For chest CT (69 studies, 28285 participants, 14,342 (51%) cases), sensitivities ranged from 45% to 100%, and specificities from 10% to 99%. The pooled sensitivity of chest CT was 86.9% (95% confidence interval (CI) 83.6 to 89.6), and pooled specificity was 78.3% (95% CI 73.7 to 82.3). Definition for index test positivity was a source of heterogeneity for sensitivity, but not specificity. Reference standard was not a source of heterogeneity. For chest X-ray (17 studies, 8529 participants, 5303 (62%) cases), the sensitivity ranged from 44% to 94% and specificity from 24 to 93%. The pooled sensitivity of chest X-ray was 73.1% (95% CI 64. to -80.5), and pooled specificity was 73.3% (95% CI 61.9 to 82.2). Definition for index test positivity was not found to be a source of heterogeneity. Definition for index test positivity and reference standard were not found to be sources of heterogeneity. For ultrasound of the lungs (15 studies, 2410 participants, 1158 (48%) cases), the sensitivity ranged from 73% to 94% and the specificity ranged from 21% to 98%. The pooled sensitivity of ultrasound was 88.9% (95% CI 84.9 to 92.0), and the pooled specificity was 72.2% (95% CI 58.8 to 82.5). Definition for index test positivity and reference standard were not found to be sources of heterogeneity. Indirect comparisons of modalities evaluated across all 94 studies indicated that chest CT and ultrasound gave higher sensitivity estimates than X-ray (P = 0.0003 and P = 0.001, respectively). Chest CT and ultrasound gave similar sensitivities (P=0.42). All modalities had similar specificities (CT versus X-ray P = 0.36; CT versus ultrasound P = 0.32; X-ray versus ultrasound P = 0.89). Imaging in PCR-negative people who subsequently became positive For rate of positive imaging in individuals with initial RT-PCR negative results, we included 8 studies (7 CT, 1 ultrasound) with a total of 198 participants suspected of having COVID-19, all of whom had a final diagnosis of COVID-19. Most studies (7/8) evaluated CT. Of 177 participants with initially negative RT-PCR who had positive RT-PCR results on follow-up testing, 75.8% (95% CI 45.3 to 92.2) had positive CT findings. Imaging in asymptomatic PCR-positive people For imaging asymptomatic individuals, we included 10 studies (7 CT, 1 X-ray, 2 ultrasound) with a total of 3548 asymptomatic participants, of whom 364 (10%) had a final diagnosis of COVID-19. For chest CT (7 studies, 3134 participants, 315 (10%) cases), the pooled sensitivity was 55.7% (95% CI 35.4 to 74.3) and the pooled specificity was 91.1% (95% CI 82.6 to 95.7). AUTHORS' CONCLUSIONS: Chest CT and ultrasound of the lungs are sensitive and moderately specific in diagnosing COVID-19. Chest X-ray is moderately sensitive and moderately specific in diagnosing COVID-19. Thus, chest CT and ultrasound may have more utility for ruling out COVID-19 than for differentiating SARS-CoV-2 infection from other causes of respiratory illness. The uncertainty resulting from high or unclear risk of bias and the heterogeneity of included studies limit our ability to confidently draw conclusions based on our results.
Asunto(s)
COVID-19 , COVID-19/diagnóstico por imagen , Humanos , SARS-CoV-2 , Sensibilidad y Especificidad , Tomografía Computarizada por Rayos X , UltrasonografíaRESUMEN
Cryptococcal antigen (CrAg) detection could direct the timely initiation of antifungal therapy. We searched MEDLINE and Embase for studies where CrAg detection in serum/cerebrospinal fluid (CSF) and CSF fungal culture were done on adults living with human immunodeficiency virus (HIV) who had suspected cryptococcal meningitis (CM). With Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2), we evaluated the risk of bias in 11 included studies with 3600 participants, and used a random-effects meta-analysis to obtain summary sensitivity and specificity of serum and CSF CrAg, as well as agreement between CSF CrAg and CSF culture. Summary sensitivity and specificity of serum CrAg were 99.7% (97.4-100) and 94.1% (88.3-98.1), respectively, and summary sensitivity and specificity of CSF CrAg were 98.8% (96.2-99.6) and 99.3% (96.7-99.9), respectively. Agreement between CSF CrAg and CSF culture was 98% (97-99). In adults living with HIV who have CM symptoms, serum CrAg negativity may rule out CM, while positivity should prompt induction antifungal therapy if lumbar puncture is not feasible. In a first episode of CM, CSF CrAg positivity is diagnostic.
Asunto(s)
Infecciones Oportunistas Relacionadas con el SIDA , Cryptococcus , Infecciones por VIH , Meningitis Criptocócica , Infecciones Oportunistas Relacionadas con el SIDA/diagnóstico , Adulto , Antígenos Fúngicos , Pruebas Diagnósticas de Rutina , VIH , Infecciones por VIH/complicaciones , Humanos , Meningitis Criptocócica/diagnósticoRESUMEN
BACKGROUND: A large proportion of proton pump inhibitor (PPI) prescriptions, including those for stress ulcer prophylaxis (SUP), are inappropriate. Our study purpose was to systematically review the effectiveness of de-implementation strategies aimed at reducing inappropriate PPI use for SUP in hospitalized, non-intensive care unit (non-ICU) patients. METHODS: We searched MEDLINE and Embase databases (from inception to January 2020). Two authors independently screened references, performed data extraction, and critical appraisal. Randomized trials and comparative observational studies were eligible for inclusion. Criteria developed by the Cochrane Effective Practice and Organisation of Care (EPOC) group were used for critical appraisal. Besides the primary outcome (inappropriate PPI prescription or use), secondary outcomes included (adverse) pharmaceutical effects and healthcare use. RESULTS: We included ten studies in this review. Most de-implementation strategies contained an educational component (meetings and/or materials), combined with either clinical guideline implementation (n = 5), audit feedback (n = 3), organizational culture (n = 4), or reminders (n = 1). One study evaluating the de-implementation strategy effectiveness showed a significant reduction (RR 0.14; 95% CI 0.03-0.55) of new inappropriate PPI prescriptions. Out of five studies evaluating the effectiveness of de-implementing inappropriate PPI use, four found a significant reduction (RR 0.21; 95% CI 0.18-0.26 to RR 0.76; 95% CI 0.68-0.86). No significant differences in the occurrence of pharmaceutical effects (n = 1) and in length of stay (n = 3) were observed. Adverse pharmaceutical effects were reported in two studies and five studies reported on PPI or total drug costs. No pooled effect estimates were calculated because of large statistical heterogeneity between studies. DISCUSSION: All identified studies reported mainly educational interventions in combination with one or multiple other intervention strategies and all interventions were targeted at providers. Most studies found a small to moderate reduction of (inappropriate) PPI prescriptions or use.
Asunto(s)
Inhibidores de la Bomba de Protones , Úlcera , Enfermedad Aguda , Humanos , Prescripción Inadecuada/prevención & controlRESUMEN
BACKGROUND & AIMS: Fibrosis is the strongest predictor for long-term clinical outcomes among patients with non-alcoholic fatty liver disease (NAFLD). There is growing interest in employing non-invasive methods for risk stratification based on prognosis. FIB-4, NFS and APRI are models commonly used for detecting fibrosis among NAFLD patients. We aimed to synthesize existing literature on the ability of these models in prognosticating NAFLD-related events. METHODS: A sensitive search was conducted in two medical databases to retrieve studies evaluating the prognostic accuracy of FIB-4, NFS and APRI among NAFLD patients. Target events were change in fibrosis, liver-related event and mortality. Two reviewers independently performed reference screening, data extraction and quality assessment (QUAPAS tool). RESULTS: A total of 13 studies (FIB-4:12, NFS: 11, APRI: 10), published between 2013 and 2019, were retrieved. All studies were conducted in a secondary or tertiary care setting, with follow-up ranging from 1 to 20 years. All three markers showed consistently good prognostication of liver-related events (AUC from 0.69 to 0.92). For mortality, FIB-4 (AUC of 0.67-0.82) and NFS (AUC of 0.70-0.83) outperformed APRI (AUC of 0.52-0.73) in all studies. All markers had inconsistent performance for predicting change in fibrosis stage. CONCLUSIONS: FIB-4, NFS, and APRI have demonstrated ability to risk stratify patients for liver-related morbidity and mortality, with comparable performance to a liver biopsy, although more head-to-head studies are needed to validate this. More refined models to prognosticate NAFLD-events may further enhance performance and clinical utility of non-invasive markers.
Asunto(s)
Enfermedad del Hígado Graso no Alcohólico , Biopsia , Humanos , Cirrosis Hepática/diagnóstico , Enfermedad del Hígado Graso no Alcohólico/complicaciones , Enfermedad del Hígado Graso no Alcohólico/diagnóstico , Pronóstico , Estudios Retrospectivos , Índice de Severidad de la EnfermedadRESUMEN
BACKGROUND: The clinical implications of SARS-CoV-2 infection are highly variable. Some people with SARS-CoV-2 infection remain asymptomatic, whilst the infection can cause mild to moderate COVID-19 and COVID-19 pneumonia in others. This can lead to some people requiring intensive care support and, in some cases, to death, especially in older adults. Symptoms such as fever, cough, or loss of smell or taste, and signs such as oxygen saturation are the first and most readily available diagnostic information. Such information could be used to either rule out COVID-19, or select patients for further testing. This is an update of this review, the first version of which published in July 2020. OBJECTIVES: To assess the diagnostic accuracy of signs and symptoms to determine if a person presenting in primary care or to hospital outpatient settings, such as the emergency department or dedicated COVID-19 clinics, has COVID-19. SEARCH METHODS: For this review iteration we undertook electronic searches up to 15 July 2020 in the Cochrane COVID-19 Study Register and the University of Bern living search database. In addition, we checked repositories of COVID-19 publications. We did not apply any language restrictions. SELECTION CRITERIA: Studies were eligible if they included patients with clinically suspected COVID-19, or if they recruited known cases with COVID-19 and controls without COVID-19. Studies were eligible when they recruited patients presenting to primary care or hospital outpatient settings. Studies in hospitalised patients were only included if symptoms and signs were recorded on admission or at presentation. Studies including patients who contracted SARS-CoV-2 infection while admitted to hospital were not eligible. The minimum eligible sample size of studies was 10 participants. All signs and symptoms were eligible for this review, including individual signs and symptoms or combinations. We accepted a range of reference standards. DATA COLLECTION AND ANALYSIS: Pairs of review authors independently selected all studies, at both title and abstract stage and full-text stage. They resolved any disagreements by discussion with a third review author. Two review authors independently extracted data and resolved disagreements by discussion with a third review author. Two review authors independently assessed risk of bias using the Quality Assessment tool for Diagnostic Accuracy Studies (QUADAS-2) checklist. We presented sensitivity and specificity in paired forest plots, in receiver operating characteristic space and in dumbbell plots. We estimated summary parameters using a bivariate random-effects meta-analysis whenever five or more primary studies were available, and whenever heterogeneity across studies was deemed acceptable. MAIN RESULTS: We identified 44 studies including 26,884 participants in total. Prevalence of COVID-19 varied from 3% to 71% with a median of 21%. There were three studies from primary care settings (1824 participants), nine studies from outpatient testing centres (10,717 participants), 12 studies performed in hospital outpatient wards (5061 participants), seven studies in hospitalised patients (1048 participants), 10 studies in the emergency department (3173 participants), and three studies in which the setting was not specified (5061 participants). The studies did not clearly distinguish mild from severe COVID-19, so we present the results for all disease severities together. Fifteen studies had a high risk of bias for selection of participants because inclusion in the studies depended on the applicable testing and referral protocols, which included many of the signs and symptoms under study in this review. This may have especially influenced the sensitivity of those features used in referral protocols, such as fever and cough. Five studies only included participants with pneumonia on imaging, suggesting that this is a highly selected population. In an additional 12 studies, we were unable to assess the risk for selection bias. This makes it very difficult to judge the validity of the diagnostic accuracy of the signs and symptoms from these included studies. The applicability of the results of this review update improved in comparison with the original review. A greater proportion of studies included participants who presented to outpatient settings, which is where the majority of clinical assessments for COVID-19 take place. However, still none of the studies presented any data on children separately, and only one focused specifically on older adults. We found data on 84 signs and symptoms. Results were highly variable across studies. Most had very low sensitivity and high specificity. Only cough (25 studies) and fever (7 studies) had a pooled sensitivity of at least 50% but specificities were moderate to low. Cough had a sensitivity of 67.4% (95% confidence interval (CI) 59.8% to 74.1%) and specificity of 35.0% (95% CI 28.7% to 41.9%). Fever had a sensitivity of 53.8% (95% CI 35.0% to 71.7%) and a specificity of 67.4% (95% CI 53.3% to 78.9%). The pooled positive likelihood ratio of cough was only 1.04 (95% CI 0.97 to 1.11) and that of fever 1.65 (95% CI 1.41 to 1.93). Anosmia alone (11 studies), ageusia alone (6 studies), and anosmia or ageusia (6 studies) had sensitivities below 50% but specificities over 90%. Anosmia had a pooled sensitivity of 28.0% (95% CI 17.7% to 41.3%) and a specificity of 93.4% (95% CI 88.3% to 96.4%). Ageusia had a pooled sensitivity of 24.8% (95% CI 12.4% to 43.5%) and a specificity of 91.4% (95% CI 81.3% to 96.3%). Anosmia or ageusia had a pooled sensitivity of 41.0% (95% CI 27.0% to 56.6%) and a specificity of 90.5% (95% CI 81.2% to 95.4%). The pooled positive likelihood ratios of anosmia alone and anosmia or ageusia were 4.25 (95% CI 3.17 to 5.71) and 4.31 (95% CI 3.00 to 6.18) respectively, which is just below our arbitrary definition of a 'red flag', that is, a positive likelihood ratio of at least 5. The pooled positive likelihood ratio of ageusia alone was only 2.88 (95% CI 2.02 to 4.09). Only two studies assessed combinations of different signs and symptoms, mostly combining fever and cough with other symptoms. These combinations had a specificity above 80%, but at the cost of very low sensitivity (< 30%). AUTHORS' CONCLUSIONS: The majority of individual signs and symptoms included in this review appear to have very poor diagnostic accuracy, although this should be interpreted in the context of selection bias and heterogeneity between studies. Based on currently available data, neither absence nor presence of signs or symptoms are accurate enough to rule in or rule out COVID-19. The presence of anosmia or ageusia may be useful as a red flag for COVID-19. The presence of fever or cough, given their high sensitivities, may also be useful to identify people for further testing. Prospective studies in an unselected population presenting to primary care or hospital outpatient settings, examining combinations of signs and symptoms to evaluate the syndromic presentation of COVID-19, are still urgently needed. Results from such studies could inform subsequent management decisions.
Asunto(s)
Atención Ambulatoria , COVID-19/diagnóstico , Atención Primaria de Salud , SARS-CoV-2 , Evaluación de Síntomas , Ageusia/diagnóstico , Ageusia/etiología , Anosmia/diagnóstico , Anosmia/etiología , Artralgia/diagnóstico , Artralgia/etiología , Sesgo , COVID-19/complicaciones , COVID-19/epidemiología , Tos/diagnóstico , Tos/etiología , Diarrea/diagnóstico , Diarrea/etiología , Disnea/diagnóstico , Disnea/etiología , Fatiga/diagnóstico , Fatiga/etiología , Fiebre/diagnóstico , Fiebre/etiología , Cefalea/diagnóstico , Cefalea/etiología , Humanos , Mialgia/diagnóstico , Mialgia/etiología , Servicio Ambulatorio en Hospital/estadística & datos numéricos , Pandemias , Examen Físico , Sesgo de Selección , Evaluación de Síntomas/clasificación , Evaluación de Síntomas/estadística & datos numéricosRESUMEN
BACKGROUND: Accurate rapid diagnostic tests for SARS-CoV-2 infection could contribute to clinical and public health strategies to manage the COVID-19 pandemic. Point-of-care antigen and molecular tests to detect current infection could increase access to testing and early confirmation of cases, and expediate clinical and public health management decisions that may reduce transmission. OBJECTIVES: To assess the diagnostic accuracy of point-of-care antigen and molecular-based tests for diagnosis of SARS-CoV-2 infection. We consider accuracy separately in symptomatic and asymptomatic population groups. SEARCH METHODS: Electronic searches of the Cochrane COVID-19 Study Register and the COVID-19 Living Evidence Database from the University of Bern (which includes daily updates from PubMed and Embase and preprints from medRxiv and bioRxiv) were undertaken on 30 Sept 2020. We checked repositories of COVID-19 publications and included independent evaluations from national reference laboratories, the Foundation for Innovative New Diagnostics and the Diagnostics Global Health website to 16 Nov 2020. We did not apply language restrictions. SELECTION CRITERIA: We included studies of people with either suspected SARS-CoV-2 infection, known SARS-CoV-2 infection or known absence of infection, or those who were being screened for infection. We included test accuracy studies of any design that evaluated commercially produced, rapid antigen or molecular tests suitable for a point-of-care setting (minimal equipment, sample preparation, and biosafety requirements, with results within two hours of sample collection). We included all reference standards that define the presence or absence of SARS-CoV-2 (including reverse transcription polymerase chain reaction (RT-PCR) tests and established diagnostic criteria). DATA COLLECTION AND ANALYSIS: Studies were screened independently in duplicate with disagreements resolved by discussion with a third author. Study characteristics were extracted by one author and checked by a second; extraction of study results and assessments of risk of bias and applicability (made using the QUADAS-2 tool) were undertaken independently in duplicate. We present sensitivity and specificity with 95% confidence intervals (CIs) for each test and pooled data using the bivariate model separately for antigen and molecular-based tests. We tabulated results by test manufacturer and compliance with manufacturer instructions for use and according to symptom status. MAIN RESULTS: Seventy-eight study cohorts were included (described in 64 study reports, including 20 pre-prints), reporting results for 24,087 samples (7,415 with confirmed SARS-CoV-2). Studies were mainly from Europe (n = 39) or North America (n = 20), and evaluated 16 antigen and five molecular assays. We considered risk of bias to be high in 29 (50%) studies because of participant selection; in 66 (85%) because of weaknesses in the reference standard for absence of infection; and in 29 (45%) for participant flow and timing. Studies of antigen tests were of a higher methodological quality compared to studies of molecular tests, particularly regarding the risk of bias for participant selection and the index test. Characteristics of participants in 35 (45%) studies differed from those in whom the test was intended to be used and the delivery of the index test in 39 (50%) studies differed from the way in which the test was intended to be used. Nearly all studies (97%) defined the presence or absence of SARS-CoV-2 based on a single RT-PCR result, and none included participants meeting case definitions for probable COVID-19. Antigen tests Forty-eight studies reported 58 evaluations of antigen tests. Estimates of sensitivity varied considerably between studies. There were differences between symptomatic (72.0%, 95% CI 63.7% to 79.0%; 37 evaluations; 15530 samples, 4410 cases) and asymptomatic participants (58.1%, 95% CI 40.2% to 74.1%; 12 evaluations; 1581 samples, 295 cases). Average sensitivity was higher in the first week after symptom onset (78.3%, 95% CI 71.1% to 84.1%; 26 evaluations; 5769 samples, 2320 cases) than in the second week of symptoms (51.0%, 95% CI 40.8% to 61.0%; 22 evaluations; 935 samples, 692 cases). Sensitivity was high in those with cycle threshold (Ct) values on PCR ≤25 (94.5%, 95% CI 91.0% to 96.7%; 36 evaluations; 2613 cases) compared to those with Ct values >25 (40.7%, 95% CI 31.8% to 50.3%; 36 evaluations; 2632 cases). Sensitivity varied between brands. Using data from instructions for use (IFU) compliant evaluations in symptomatic participants, summary sensitivities ranged from 34.1% (95% CI 29.7% to 38.8%; Coris Bioconcept) to 88.1% (95% CI 84.2% to 91.1%; SD Biosensor STANDARD Q). Average specificities were high in symptomatic and asymptomatic participants, and for most brands (overall summary specificity 99.6%, 95% CI 99.0% to 99.8%). At 5% prevalence using data for the most sensitive assays in symptomatic people (SD Biosensor STANDARD Q and Abbott Panbio), positive predictive values (PPVs) of 84% to 90% mean that between 1 in 10 and 1 in 6 positive results will be a false positive, and between 1 in 4 and 1 in 8 cases will be missed. At 0.5% prevalence applying the same tests in asymptomatic people would result in PPVs of 11% to 28% meaning that between 7 in 10 and 9 in 10 positive results will be false positives, and between 1 in 2 and 1 in 3 cases will be missed. No studies assessed the accuracy of repeated lateral flow testing or self-testing. Rapid molecular assays Thirty studies reported 33 evaluations of five different rapid molecular tests. Sensitivities varied according to test brand. Most of the data relate to the ID NOW and Xpert Xpress assays. Using data from evaluations following the manufacturer's instructions for use, the average sensitivity of ID NOW was 73.0% (95% CI 66.8% to 78.4%) and average specificity 99.7% (95% CI 98.7% to 99.9%; 4 evaluations; 812 samples, 222 cases). For Xpert Xpress, the average sensitivity was 100% (95% CI 88.1% to 100%) and average specificity 97.2% (95% CI 89.4% to 99.3%; 2 evaluations; 100 samples, 29 cases). Insufficient data were available to investigate the effect of symptom status or time after symptom onset. AUTHORS' CONCLUSIONS: Antigen tests vary in sensitivity. In people with signs and symptoms of COVID-19, sensitivities are highest in the first week of illness when viral loads are higher. The assays shown to meet appropriate criteria, such as WHO's priority target product profiles for COVID-19 diagnostics ('acceptable' sensitivity ≥ 80% and specificity ≥ 97%), can be considered as a replacement for laboratory-based RT-PCR when immediate decisions about patient care must be made, or where RT-PCR cannot be delivered in a timely manner. Positive predictive values suggest that confirmatory testing of those with positive results may be considered in low prevalence settings. Due to the variable sensitivity of antigen tests, people who test negative may still be infected. Evidence for testing in asymptomatic cohorts was limited. Test accuracy studies cannot adequately assess the ability of antigen tests to differentiate those who are infectious and require isolation from those who pose no risk, as there is no reference standard for infectiousness. A small number of molecular tests showed high accuracy and may be suitable alternatives to RT-PCR. However, further evaluations of the tests in settings as they are intended to be used are required to fully establish performance in practice. Several important studies in asymptomatic individuals have been reported since the close of our search and will be incorporated at the next update of this review. Comparative studies of antigen tests in their intended use settings and according to test operator (including self-testing) are required.
Asunto(s)
Antígenos Virales/análisis , Prueba Serológica para COVID-19/métodos , COVID-19/diagnóstico , Técnicas de Diagnóstico Molecular/métodos , Sistemas de Atención de Punto , SARS-CoV-2/inmunología , Adulto , Infecciones Asintomáticas , Sesgo , Prueba de Ácido Nucleico para COVID-19 , Prueba Serológica para COVID-19/normas , Niño , Estudios de Cohortes , Reacciones Falso Negativas , Reacciones Falso Positivas , Humanos , Técnicas de Diagnóstico Molecular/normas , Valor Predictivo de las Pruebas , Estándares de Referencia , Sensibilidad y EspecificidadRESUMEN
BACKGROUND: The respiratory illness caused by SARS-CoV-2 infection continues to present diagnostic challenges. Our 2020 edition of this review showed thoracic (chest) imaging to be sensitive and moderately specific in the diagnosis of coronavirus disease 2019 (COVID-19). In this update, we include new relevant studies, and have removed studies with case-control designs, and those not intended to be diagnostic test accuracy studies. OBJECTIVES: To evaluate the diagnostic accuracy of thoracic imaging (computed tomography (CT), X-ray and ultrasound) in people with suspected COVID-19. SEARCH METHODS: We searched the COVID-19 Living Evidence Database from the University of Bern, the Cochrane COVID-19 Study Register, The Stephen B. Thacker CDC Library, and repositories of COVID-19 publications through to 30 September 2020. We did not apply any language restrictions. SELECTION CRITERIA: We included studies of all designs, except for case-control, that recruited participants of any age group suspected to have COVID-19 and that reported estimates of test accuracy or provided data from which we could compute estimates. DATA COLLECTION AND ANALYSIS: The review authors independently and in duplicate screened articles, extracted data and assessed risk of bias and applicability concerns using the QUADAS-2 domain-list. We presented the results of estimated sensitivity and specificity using paired forest plots, and we summarised pooled estimates in tables. We used a bivariate meta-analysis model where appropriate. We presented the uncertainty of accuracy estimates using 95% confidence intervals (CIs). MAIN RESULTS: We included 51 studies with 19,775 participants suspected of having COVID-19, of whom 10,155 (51%) had a final diagnosis of COVID-19. Forty-seven studies evaluated one imaging modality each, and four studies evaluated two imaging modalities each. All studies used RT-PCR as the reference standard for the diagnosis of COVID-19, with 47 studies using only RT-PCR and four studies using a combination of RT-PCR and other criteria (such as clinical signs, imaging tests, positive contacts, and follow-up phone calls) as the reference standard. Studies were conducted in Europe (33), Asia (13), North America (3) and South America (2); including only adults (26), all ages (21), children only (1), adults over 70 years (1), and unclear (2); in inpatients (2), outpatients (32), and setting unclear (17). Risk of bias was high or unclear in thirty-two (63%) studies with respect to participant selection, 40 (78%) studies with respect to reference standard, 30 (59%) studies with respect to index test, and 24 (47%) studies with respect to participant flow. For chest CT (41 studies, 16,133 participants, 8110 (50%) cases), the sensitivity ranged from 56.3% to 100%, and specificity ranged from 25.4% to 97.4%. The pooled sensitivity of chest CT was 87.9% (95% CI 84.6 to 90.6) and the pooled specificity was 80.0% (95% CI 74.9 to 84.3). There was no statistical evidence indicating that reference standard conduct and definition for index test positivity were sources of heterogeneity for CT studies. Nine chest CT studies (2807 participants, 1139 (41%) cases) used the COVID-19 Reporting and Data System (CO-RADS) scoring system, which has five thresholds to define index test positivity. At a CO-RADS threshold of 5 (7 studies), the sensitivity ranged from 41.5% to 77.9% and the pooled sensitivity was 67.0% (95% CI 56.4 to 76.2); the specificity ranged from 83.5% to 96.2%; and the pooled specificity was 91.3% (95% CI 87.6 to 94.0). At a CO-RADS threshold of 4 (7 studies), the sensitivity ranged from 56.3% to 92.9% and the pooled sensitivity was 83.5% (95% CI 74.4 to 89.7); the specificity ranged from 77.2% to 90.4% and the pooled specificity was 83.6% (95% CI 80.5 to 86.4). For chest X-ray (9 studies, 3694 participants, 2111 (57%) cases) the sensitivity ranged from 51.9% to 94.4% and specificity ranged from 40.4% to 88.9%. The pooled sensitivity of chest X-ray was 80.6% (95% CI 69.1 to 88.6) and the pooled specificity was 71.5% (95% CI 59.8 to 80.8). For ultrasound of the lungs (5 studies, 446 participants, 211 (47%) cases) the sensitivity ranged from 68.2% to 96.8% and specificity ranged from 21.3% to 78.9%. The pooled sensitivity of ultrasound was 86.4% (95% CI 72.7 to 93.9) and the pooled specificity was 54.6% (95% CI 35.3 to 72.6). Based on an indirect comparison using all included studies, chest CT had a higher specificity than ultrasound. For indirect comparisons of chest CT and chest X-ray, or chest X-ray and ultrasound, the data did not show differences in specificity or sensitivity. AUTHORS' CONCLUSIONS: Our findings indicate that chest CT is sensitive and moderately specific for the diagnosis of COVID-19. Chest X-ray is moderately sensitive and moderately specific for the diagnosis of COVID-19. Ultrasound is sensitive but not specific for the diagnosis of COVID-19. Thus, chest CT and ultrasound may have more utility for excluding COVID-19 than for differentiating SARS-CoV-2 infection from other causes of respiratory illness. Future diagnostic accuracy studies should pre-define positive imaging findings, include direct comparisons of the various modalities of interest in the same participant population, and implement improved reporting practices.
Asunto(s)
COVID-19/diagnóstico por imagen , Radiografía Torácica , Tomografía Computarizada por Rayos X , Ultrasonografía , Adolescente , Adulto , Anciano , Sesgo , Prueba de Ácido Nucleico para COVID-19/normas , Niño , Intervalos de Confianza , Humanos , Pulmón/diagnóstico por imagen , Persona de Mediana Edad , Radiografía Torácica/normas , Radiografía Torácica/estadística & datos numéricos , Estándares de Referencia , Sensibilidad y Especificidad , Tomografía Computarizada por Rayos X/normas , Tomografía Computarizada por Rayos X/estadística & datos numéricos , Ultrasonografía/normas , Ultrasonografía/estadística & datos numéricos , Adulto JovenRESUMEN
BACKGROUND & AIMS: The enhanced liver fibrosis (ELF) test has been proposed for the non-invasive assessment of advanced fibrosis in patients with non-alcoholic fatty liver disease (NAFLD). We performed a systematic review to estimate the accuracy of this test against biopsy. METHODS: In this systematic review, we searched MEDLINE, Embase, Web of Science and the Cochrane Library for studies that included patients with NAFLD and that used both liver biopsy (as the reference standard) and the ELF test. Two authors independently screened the references, extracted the data and assessed the quality of included studies. Due to the variation in reported thresholds, we used a multiple thresholds random effects model for meta-analysis (diagmeta R-package). RESULTS: The meta-analysis of 11 studies reporting advanced fibrosis and 5 studies reporting significant fibrosis showed that the ELF test had a sensitivity of >0.90 for excluding fibrosis at a threshold of 7.7. However, as a diagnostic test at high thresholds, the test only achieved specificity and positive predictive value >0.80 in very high prevalence settings (>50%). To achieve a specificity of 0.90 for advanced and significant fibrosis, thresholds of 10.18 (sensitivity: 0.57) and 9.86 (sensitivity: 0.55) were required, respectively. CONCLUSION: The ELF test showed high sensitivity but limited specificity to exclude advanced and significant fibrosis at low cut-offs. The diagnostic performance of the test at higher thresholds was found to be more limited in low-prevalence settings. We conclude that clinicians should carefully consider the likely disease prevalence in their practice setting and adopt suitable test thresholds to achieve the desired performance. LAY SUMMARY: The enhanced liver fibrosis test has been suggested as a non-invasive blood test to aid the diagnosis of severe liver fibrosis in patients with non-alcoholic fatty liver disease (NAFLD). Our study results showed that the test has a high negative predictive value, especially in populations with low disease prevalence (likely encountered in primary care); so, it can exclude advanced fibrosis in patients with NAFLD. However, when prevalence is low, the positive predictive value of the enhanced liver fibrosis test is low, suggesting that additional strategies may be needed to make a positive diagnosis in such settings.
Asunto(s)
Ácido Hialurónico/sangre , Cirrosis Hepática , Hígado , Enfermedad del Hígado Graso no Alcohólico , Fragmentos de Péptidos/sangre , Procolágeno/sangre , Inhibidor Tisular de Metaloproteinasa-1/sangre , Algoritmos , Biomarcadores/sangre , Biopsia/métodos , Progresión de la Enfermedad , Humanos , Hígado/diagnóstico por imagen , Hígado/patología , Cirrosis Hepática/sangre , Cirrosis Hepática/diagnóstico , Cirrosis Hepática/etiología , Enfermedad del Hígado Graso no Alcohólico/complicaciones , Enfermedad del Hígado Graso no Alcohólico/diagnóstico , Valor Predictivo de las Pruebas , Estándares de ReferenciaRESUMEN
BACKGROUND: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the resulting COVID-19 pandemic present important diagnostic challenges. Several diagnostic strategies are available to identify or rule out current infection, identify people in need of care escalation, or to test for past infection and immune response. Point-of-care antigen and molecular tests to detect current SARS-CoV-2 infection have the potential to allow earlier detection and isolation of confirmed cases compared to laboratory-based diagnostic methods, with the aim of reducing household and community transmission. OBJECTIVES: To assess the diagnostic accuracy of point-of-care antigen and molecular-based tests to determine if a person presenting in the community or in primary or secondary care has current SARS-CoV-2 infection. SEARCH METHODS: On 25 May 2020 we undertook electronic searches in the Cochrane COVID-19 Study Register and the COVID-19 Living Evidence Database from the University of Bern, which is updated daily with published articles from PubMed and Embase and with preprints from medRxiv and bioRxiv. In addition, we checked repositories of COVID-19 publications. We did not apply any language restrictions. SELECTION CRITERIA: We included studies of people with suspected current SARS-CoV-2 infection, known to have, or not to have SARS-CoV-2 infection, or where tests were used to screen for infection. We included test accuracy studies of any design that evaluated antigen or molecular tests suitable for a point-of-care setting (minimal equipment, sample preparation, and biosafety requirements, with results available within two hours of sample collection). We included all reference standards to define the presence or absence of SARS-CoV-2 (including reverse transcription polymerase chain reaction (RT-PCR) tests and established clinical diagnostic criteria). DATA COLLECTION AND ANALYSIS: Two review authors independently screened studies and resolved any disagreements by discussion with a third review author. One review author independently extracted study characteristics, which were checked by a second review author. Two review authors independently extracted 2x2 contingency table data and assessed risk of bias and applicability of the studies using the QUADAS-2 tool. We present sensitivity and specificity, with 95% confidence intervals (CIs), for each test using paired forest plots. We pooled data using the bivariate hierarchical model separately for antigen and molecular-based tests, with simplifications when few studies were available. We tabulated available data by test manufacturer. MAIN RESULTS: We included 22 publications reporting on a total of 18 study cohorts with 3198 unique samples, of which 1775 had confirmed SARS-CoV-2 infection. Ten studies took place in North America, two in South America, four in Europe, one in China and one was conducted internationally. We identified data for eight commercial tests (four antigen and four molecular) and one in-house antigen test. Five of the studies included were only available as preprints. We did not find any studies at low risk of bias for all quality domains and had concerns about applicability of results across all studies. We judged patient selection to be at high risk of bias in 50% of the studies because of deliberate over-sampling of samples with confirmed COVID-19 infection and unclear in seven out of 18 studies because of poor reporting. Sixteen (89%) studies used only a single, negative RT-PCR to confirm the absence of COVID-19 infection, risking missing infection. There was a lack of information on blinding of index test (n = 11), and around participant exclusions from analyses (n = 10). We did not observe differences in methodological quality between antigen and molecular test evaluations. Antigen tests Sensitivity varied considerably across studies (from 0% to 94%): the average sensitivity was 56.2% (95% CI 29.5 to 79.8%) and average specificity was 99.5% (95% CI 98.1% to 99.9%; based on 8 evaluations in 5 studies on 943 samples). Data for individual antigen tests were limited with no more than two studies for any test. Rapid molecular assays Sensitivity showed less variation compared to antigen tests (from 68% to 100%), average sensitivity was 95.2% (95% CI 86.7% to 98.3%) and specificity 98.9% (95% CI 97.3% to 99.5%) based on 13 evaluations in 11 studies of on 2255 samples. Predicted values based on a hypothetical cohort of 1000 people with suspected COVID-19 infection (with a prevalence of 10%) result in 105 positive test results including 10 false positives (positive predictive value 90%), and 895 negative results including 5 false negatives (negative predictive value 99%). Individual tests We calculated pooled results of individual tests for ID NOW (Abbott Laboratories) (5 evaluations) and Xpert Xpress (Cepheid Inc) (6 evaluations). Summary sensitivity for the Xpert Xpress assay (99.4%, 95% CI 98.0% to 99.8%) was 22.6 (95% CI 18.8 to 26.3) percentage points higher than that of ID NOW (76.8%, (95% CI 72.9% to 80.3%), whilst the specificity of Xpert Xpress (96.8%, 95% CI 90.6% to 99.0%) was marginally lower than ID NOW (99.6%, 95% CI 98.4% to 99.9%; a difference of -2.8% (95% CI -6.4 to 0.8)) AUTHORS' CONCLUSIONS: This review identifies early-stage evaluations of point-of-care tests for detecting SARS-CoV-2 infection, largely based on remnant laboratory samples. The findings currently have limited applicability, as we are uncertain whether tests will perform in the same way in clinical practice, and according to symptoms of COVID-19, duration of symptoms, or in asymptomatic people. Rapid tests have the potential to be used to inform triage of RT-PCR use, allowing earlier detection of those testing positive, but the evidence currently is not strong enough to determine how useful they are in clinical practice. Prospective and comparative evaluations of rapid tests for COVID-19 infection in clinically relevant settings are urgently needed. Studies should recruit consecutive series of eligible participants, including both those presenting for testing due to symptoms and asymptomatic people who may have come into contact with confirmed cases. Studies should clearly describe symptomatic status and document time from symptom onset or time since exposure. Point-of-care tests must be conducted on samples according to manufacturer instructions for use and be conducted at the point of care. Any future research study report should conform to the Standards for Reporting of Diagnostic Accuracy (STARD) guideline.
Asunto(s)
Betacoronavirus , Técnicas de Laboratorio Clínico/métodos , Infecciones por Coronavirus/diagnóstico , Neumonía Viral/diagnóstico , Sistemas de Atención de Punto , Antígenos Virales/análisis , COVID-19 , Prueba de COVID-19 , Infecciones por Coronavirus/epidemiología , Reacciones Falso Negativas , Reacciones Falso Positivas , Humanos , Pandemias , Neumonía Viral/epidemiología , SARS-CoV-2 , Sensibilidad y EspecificidadRESUMEN
BACKGROUND: Specific diagnostic tests to detect severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and resulting COVID-19 disease are not always available and take time to obtain results. Routine laboratory markers such as white blood cell count, measures of anticoagulation, C-reactive protein (CRP) and procalcitonin, are used to assess the clinical status of a patient. These laboratory tests may be useful for the triage of people with potential COVID-19 to prioritize them for different levels of treatment, especially in situations where time and resources are limited. OBJECTIVES: To assess the diagnostic accuracy of routine laboratory testing as a triage test to determine if a person has COVID-19. SEARCH METHODS: On 4 May 2020 we undertook electronic searches in the Cochrane COVID-19 Study Register and the COVID-19 Living Evidence Database from the University of Bern, which is updated daily with published articles from PubMed and Embase and with preprints from medRxiv and bioRxiv. In addition, we checked repositories of COVID-19 publications. We did not apply any language restrictions. SELECTION CRITERIA: We included both case-control designs and consecutive series of patients that assessed the diagnostic accuracy of routine laboratory testing as a triage test to determine if a person has COVID-19. The reference standard could be reverse transcriptase polymerase chain reaction (RT-PCR) alone; RT-PCR plus clinical expertise or and imaging; repeated RT-PCR several days apart or from different samples; WHO and other case definitions; and any other reference standard used by the study authors. DATA COLLECTION AND ANALYSIS: Two review authors independently extracted data from each included study. They also assessed the methodological quality of the studies, using QUADAS-2. We used the 'NLMIXED' procedure in SAS 9.4 for the hierarchical summary receiver operating characteristic (HSROC) meta-analyses of tests for which we included four or more studies. To facilitate interpretation of results, for each meta-analysis we estimated summary sensitivity at the points on the SROC curve that corresponded to the median and interquartile range boundaries of specificities in the included studies. MAIN RESULTS: We included 21 studies in this review, including 14,126 COVID-19 patients and 56,585 non-COVID-19 patients in total. Studies evaluated a total of 67 different laboratory tests. Although we were interested in the diagnotic accuracy of routine tests for COVID-19, the included studies used detection of SARS-CoV-2 infection through RT-PCR as reference standard. There was considerable heterogeneity between tests, threshold values and the settings in which they were applied. For some tests a positive result was defined as a decrease compared to normal vaues, for other tests a positive result was defined as an increase, and for some tests both increase and decrease may have indicated test positivity. None of the studies had either low risk of bias on all domains or low concerns for applicability for all domains. Only three of the tests evaluated had a summary sensitivity and specificity over 50%. These were: increase in interleukin-6, increase in C-reactive protein and lymphocyte count decrease. Blood count Eleven studies evaluated a decrease in white blood cell count, with a median specificity of 93% and a summary sensitivity of 25% (95% CI 8.0% to 27%; very low-certainty evidence). The 15 studies that evaluated an increase in white blood cell count had a lower median specificity and a lower corresponding sensitivity. Four studies evaluated a decrease in neutrophil count. Their median specificity was 93%, corresponding to a summary sensitivity of 10% (95% CI 1.0% to 56%; low-certainty evidence). The 11 studies that evaluated an increase in neutrophil count had a lower median specificity and a lower corresponding sensitivity. The summary sensitivity of an increase in neutrophil percentage (4 studies) was 59% (95% CI 1.0% to 100%) at median specificity (38%; very low-certainty evidence). The summary sensitivity of an increase in monocyte count (4 studies) was 13% (95% CI 6.0% to 26%) at median specificity (73%; very low-certainty evidence). The summary sensitivity of a decrease in lymphocyte count (13 studies) was 64% (95% CI 28% to 89%) at median specificity (53%; low-certainty evidence). Four studies that evaluated a decrease in lymphocyte percentage showed a lower median specificity and lower corresponding sensitivity. The summary sensitivity of a decrease in platelets (4 studies) was 19% (95% CI 10% to 32%) at median specificity (88%; low-certainty evidence). Liver function tests The summary sensitivity of an increase in alanine aminotransferase (9 studies) was 12% (95% CI 3% to 34%) at median specificity (92%; low-certainty evidence). The summary sensitivity of an increase in aspartate aminotransferase (7 studies) was 29% (95% CI 17% to 45%) at median specificity (81%) (low-certainty evidence). The summary sensitivity of a decrease in albumin (4 studies) was 21% (95% CI 3% to 67%) at median specificity (66%; low-certainty evidence). The summary sensitivity of an increase in total bilirubin (4 studies) was 12% (95% CI 3.0% to 34%) at median specificity (92%; very low-certainty evidence). Markers of inflammation The summary sensitivity of an increase in CRP (14 studies) was 66% (95% CI 55% to 75%) at median specificity (44%; very low-certainty evidence). The summary sensitivity of an increase in procalcitonin (6 studies) was 3% (95% CI 1% to 19%) at median specificity (86%; very low-certainty evidence). The summary sensitivity of an increase in IL-6 (four studies) was 73% (95% CI 36% to 93%) at median specificity (58%) (very low-certainty evidence). Other biomarkers The summary sensitivity of an increase in creatine kinase (5 studies) was 11% (95% CI 6% to 19%) at median specificity (94%) (low-certainty evidence). The summary sensitivity of an increase in serum creatinine (four studies) was 7% (95% CI 1% to 37%) at median specificity (91%; low-certainty evidence). The summary sensitivity of an increase in lactate dehydrogenase (4 studies) was 25% (95% CI 15% to 38%) at median specificity (72%; very low-certainty evidence). AUTHORS' CONCLUSIONS: Although these tests give an indication about the general health status of patients and some tests may be specific indicators for inflammatory processes, none of the tests we investigated are useful for accurately ruling in or ruling out COVID-19 on their own. Studies were done in specific hospitalized populations, and future studies should consider non-hospital settings to evaluate how these tests would perform in people with milder symptoms.
Asunto(s)
Prueba de COVID-19/métodos , COVID-19/diagnóstico , Pruebas Diagnósticas de Rutina/métodos , SARS-CoV-2/aislamiento & purificación , Sesgo , Biomarcadores/sangre , Proteína C-Reactiva/análisis , COVID-19/sangre , COVID-19/epidemiología , Prueba de COVID-19/normas , Creatina Quinasa/sangre , Creatinina/sangre , Pruebas Diagnósticas de Rutina/normas , Humanos , Interleucina-6/sangre , L-Lactato Deshidrogenasa/sangre , Recuento de Leucocitos , Pruebas de Función Hepática , Recuento de Linfocitos , Pandemias , Recuento de Plaquetas , Curva ROC , Valores de Referencia , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa/normas , Sensibilidad y Especificidad , TriajeRESUMEN
BACKGROUND: Some people with SARS-CoV-2 infection remain asymptomatic, whilst in others the infection can cause mild to moderate COVID-19 disease and COVID-19 pneumonia, leading some patients to require intensive care support and, in some cases, to death, especially in older adults. Symptoms such as fever or cough, and signs such as oxygen saturation or lung auscultation findings, are the first and most readily available diagnostic information. Such information could be used to either rule out COVID-19 disease, or select patients for further diagnostic testing. OBJECTIVES: To assess the diagnostic accuracy of signs and symptoms to determine if a person presenting in primary care or to hospital outpatient settings, such as the emergency department or dedicated COVID-19 clinics, has COVID-19 disease or COVID-19 pneumonia. SEARCH METHODS: On 27 April 2020, we undertook electronic searches in the Cochrane COVID-19 Study Register and the University of Bern living search database, which is updated daily with published articles from PubMed and Embase and with preprints from medRxiv and bioRxiv. In addition, we checked repositories of COVID-19 publications. We did not apply any language restrictions. SELECTION CRITERIA: Studies were eligible if they included patients with suspected COVID-19 disease, or if they recruited known cases with COVID-19 disease and controls without COVID-19. Studies were eligible when they recruited patients presenting to primary care or hospital outpatient settings. Studies including patients who contracted SARS-CoV-2 infection while admitted to hospital were not eligible. The minimum eligible sample size of studies was 10 participants. All signs and symptoms were eligible for this review, including individual signs and symptoms or combinations. We accepted a range of reference standards including reverse transcription polymerase chain reaction (RT-PCR), clinical expertise, imaging, serology tests and World Health Organization (WHO) or other definitions of COVID-19. DATA COLLECTION AND ANALYSIS: Pairs of review authors independently selected all studies, at both title and abstract stage and full-text stage. They resolved any disagreements by discussion with a third review author. Two review authors independently extracted data and resolved disagreements by discussion with a third review author. Two review authors independently assessed risk of bias using the QUADAS-2 checklist. Analyses were descriptive, presenting sensitivity and specificity in paired forest plots, in ROC (receiver operating characteristic) space and in dumbbell plots. We did not attempt meta-analysis due to the small number of studies, heterogeneity across studies and the high risk of bias. MAIN RESULTS: We identified 16 studies including 7706 participants in total. Prevalence of COVID-19 disease varied from 5% to 38% with a median of 17%. There were no studies from primary care settings, although we did find seven studies in outpatient clinics (2172 participants), and four studies in the emergency department (1401 participants). We found data on 27 signs and symptoms, which fall into four different categories: systemic, respiratory, gastrointestinal and cardiovascular. No studies assessed combinations of different signs and symptoms and results were highly variable across studies. Most had very low sensitivity and high specificity; only six symptoms had a sensitivity of at least 50% in at least one study: cough, sore throat, fever, myalgia or arthralgia, fatigue, and headache. Of these, fever, myalgia or arthralgia, fatigue, and headache could be considered red flags (defined as having a positive likelihood ratio of at least 5) for COVID-19 as their specificity was above 90%, meaning that they substantially increase the likelihood of COVID-19 disease when present. Seven studies carried a high risk of bias for selection of participants because inclusion in the studies depended on the applicable testing and referral protocols, which included many of the signs and symptoms under study in this review. Five studies only included participants with pneumonia on imaging, suggesting that this is a highly selected population. In an additional four studies, we were unable to assess the risk for selection bias. These factors make it very difficult to determine the diagnostic properties of these signs and symptoms from the included studies. We also had concerns about the applicability of these results, since most studies included participants who were already admitted to hospital or presenting to hospital settings. This makes these findings less applicable to people presenting to primary care, who may have less severe illness and a lower prevalence of COVID-19 disease. None of the studies included any data on children, and only one focused specifically on older adults. We hope that future updates of this review will be able to provide more information about the diagnostic properties of signs and symptoms in different settings and age groups. AUTHORS' CONCLUSIONS: The individual signs and symptoms included in this review appear to have very poor diagnostic properties, although this should be interpreted in the context of selection bias and heterogeneity between studies. Based on currently available data, neither absence nor presence of signs or symptoms are accurate enough to rule in or rule out disease. Prospective studies in an unselected population presenting to primary care or hospital outpatient settings, examining combinations of signs and symptoms to evaluate the syndromic presentation of COVID-19 disease, are urgently needed. Results from such studies could inform subsequent management decisions such as self-isolation or selecting patients for further diagnostic testing. We also need data on potentially more specific symptoms such as loss of sense of smell. Studies in older adults are especially important.