RESUMO
Individuals entering incarceration are at high risk for infectious diseases, other ill conditions, and risky behavior. Typically, the status of active pulmonary tuberculosis (PTB) is not known at the time of admission. Early detection and treatment are essential for effective TB control. So far, no study has compared the diagnostic accuracy of various TB screening tools in detention using a network meta-analysis (NMA). We aimed to investigate the diagnostic accuracy of active PTB screening tests upon detention admission. We searched PubMed, Global Index Medicus, the Cochrane Library electronic databases, and grey literature for publications reporting detention TB entry screening in March 2022 and January 2024. Inclusion was non-restrictive regarding time, language, location, reference standards, or screening tests. Eligible study designs comprised comparative, observational, and diagnostic studies. Publications had to report TB screening of individuals entering confinement and provide data for diagnostic accuracy calculations. The QUADAS-2 tool was designed to assess the quality of primary diagnostic accuracy studies. This systematic review was registered with PROSPERO (CRD42022307863) and conducted without external funding. We screened a total of 2,455 records. Despite extensive searching, no studies met our inclusion criteria. However, we identified evidence revealing key differences in screening algorithm application. In conclusion, more diagnostic accuracy data on TB screening algorithms for detention admission worldwide needs to be collected. We recommend that global TB initiatives set up multi-site studies to investigate the diagnostic accuracy of TB screening on admission in low- and high-prevalence criminal justice systems. Further network meta-analyses of these studies could inform policymakers and public health experts to establish or fine-tune TB control in detention settings.
Assuntos
Programas de Rastreamento , Tuberculose Pulmonar , Humanos , Tuberculose Pulmonar/diagnóstico , Programas de Rastreamento/métodos , PrisioneirosRESUMO
BACKGROUND/OBJECTIVES: Algorithms for quantifying liver fat content based on the ultrasound attenuation coefficient (AC) are currently available; however, little is known about whether their accuracy increases by applying quality criteria such as the interquartile range-to-median ratio (IQR/M) or whether the median or average AC value should be used. METHODS: AC measurements were performed with the Aplio i800 ultrasound system using the attenuation imaging (ATI) algorithm (Canon Medical Systems, Otawara, Tochigi, Japan). Magnetic resonance imaging proton density fat fraction (MRI-PDFF) was the reference standard. The diagnostic performance of the AC median value of 5 measurements (AC-M) was compared to that of AC average value (AC-A) of 5 or 3 acquisitions and different levels of IQR/M for median values or standard deviation/average (SD/A) for average values were also analyzed. Concordance between AC-5M, AC-5A, and AC3A was evaluated with concordance correlation coefficient (CCC). RESULTS: A total of 182 individuals (94 females; mean age, 51.2y [SD: 15]) were evaluated. A total of 77 (42.3%) individuals had S0 steatosis (MRI-PDFF < 6%), 75 (41.2%) S1 (MRI-PDFF 6-17%), 10 (5.5%) S2 (MRI-PDFF 17.1-22%), and 20 (11%) S3 (MRI-PDFF ≥ 22.1%). Concordance of AC-5A and AC-3A with AC-5M was excellent (CCC: 0.99 and 0.96, respectively). The correlation with MRI-PDFF was almost perfect. Diagnostic accuracy of AC-5M, AC-5A, and AC3A was not significantly affected by different levels of IQR/M or SD/A. CONCLUSIONS: The accuracy of AC in quantifying liver fat content was not affected by reducing the number of acquisitions (from five to three), by using the mean instead of the median, or by reducing the IQR/M or SD/A to ≤5%.
RESUMO
The development of methods for the meta-analysis of diagnostic test accuracy (DTA) studies is still an active area of research. While methods for the standard case where each study reports a single pair of sensitivity and specificity are nearly routinely applied nowadays, methods to meta-analyze receiver operating characteristic (ROC) curves are not widely used. This situation is more complex, as each primary DTA study may report on several pairs of sensitivity and specificity, each corresponding to a different threshold. In a case study published earlier, we applied a number of methods for meta-analyzing DTA studies with multiple thresholds to a real-world data example (Zapf et al., Biometrical Journal. 2021; 63(4): 699-711). To date, no simulation study exists that systematically compares different approaches with respect to their performance in various scenarios when the truth is known. In this article, we aim to fill this gap and present the results of a simulation study that compares three frequentist approaches for the meta-analysis of ROC curves. We performed a systematic simulation study, motivated by an example from medical research. In the simulations, all three approaches worked partially well. The approach by Hoyer and colleagues was slightly superior in most scenarios and is recommended in practice.
Assuntos
Biometria , Metanálise como Assunto , Curva ROC , Biometria/métodos , Testes Diagnósticos de Rotina/métodos , Humanos , Simulação por ComputadorRESUMO
The development of new statistical models for the meta-analysis of diagnostic test accuracy studies is still an ongoing field of research, especially with respect to summary receiver operating characteristic (ROC) curves. In the recently published updated version of the "Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy", the authors point to the challenges of this kind of meta-analysis and propose two approaches. However, both of them come with some disadvantages, such as the nonstraightforward choice of priors in Bayesian models or the requirement of a two-step approach where parameters are estimated for the individual studies, followed by summarizing the results. As an alternative, we propose a novel model by applying methods from time-to-event analysis. To this task we use the discrete proportional hazard approach to treat the different diagnostic thresholds, that provide means to estimate sensitivity and specificity and are reported by the single studies, as categorical variables in a generalized linear mixed model, using both the logit- and the asymmetric cloglog-link. This leads to a model specification with threshold-specific discrete hazards, avoiding a linear dependency between thresholds, discrete hazard, and sensitivity/specificity and thus increasing model flexibility. We compare the resulting models to approaches from the literature in a simulation study. While the estimated area under the summary ROC curve is estimated comparably well in most approaches, the results depict substantial differences in the estimated sensitivities and specificities. We also show the practical applicability of the models to data from a meta-analysis for the screening of type 2 diabetes.
RESUMO
The Preferred Reporting Items for Diagnostic Accuracy Studies in Endodontics (PRIDASE) 2024 guidelines are based on the Standards for Reporting of Diagnostic Accuracy Studies (STARD) 2015 guidelines and the Clinical and Laboratory Images in Publications (CLIP) principles, with the addition of items specifically related to endodontics. The use of the PRIDASE 2024 guidelines by authors and their application by journals during the peer review process will reduce the possibility of bias and enhance the quality of future diagnostic accuracy studies. The PRIDASE 2024 guidelines consist of a checklist containing 11 domains and 66 individual items. The purpose of the current document is to provide an explanation for each item on the PRIDASE 2024 checklist, along with examples from the literature to help readers understand their importance and offer advice to those developing manuscripts. A link to the PRIDASE 2024 explanation and elaboration document is available on the Preferred Reporting Items for study Designs in Endodontology (PRIDE) website (https://pride-endodonticguidelines.org/pridase/) and on the International Endodontic Journal website (https://onlinelibrary.wiley.com/page/journal/13652591/homepage/pride-guidelines.htm).
RESUMO
BACKGROUND: Quality assessment of diagnostic accuracy studies (QUADAS), and more recently QUADAS-2, were developed to aid the evaluation of methodological quality within primary diagnostic accuracy studies. However, its current form, QUADAS-2 does not address the unique considerations raised by artificial intelligence (AI)-centered diagnostic systems. The rapid progression of the AI diagnostics field mandates suitable quality assessment tools to determine the risk of bias and applicability, and subsequently evaluate translational potential for clinical practice. OBJECTIVE: We aim to develop an AI-specific QUADAS (QUADAS-AI) tool that addresses the specific challenges associated with the appraisal of AI diagnostic accuracy studies. This paper describes the processes and methods that will be used to develop QUADAS-AI. METHODS: The development of QUADAS-AI can be distilled into 3 broad stages. Stage 1-a project organization phase had been undertaken, during which a project team and a steering committee were established. The steering committee consists of a panel of international experts representing diverse stakeholder groups. Following this, the scope of the project was finalized. Stage 2-an item generation process will be completed following (1) a mapping review, (2) a meta-research study, (3) a scoping survey of international experts, and (4) a patient and public involvement and engagement exercise. Candidate items will then be put forward to the international Delphi panel to achieve consensus for inclusion in the revised tool. A modified Delphi consensus methodology involving multiple online rounds and a final consensus meeting will be carried out to refine the tool, following which the initial QUADAS-AI tool will be drafted. A piloting phase will be carried out to identify components that are considered to be either ambiguous or missing. Stage 3-once the steering committee has finalized the QUADAS-AI tool, specific dissemination strategies will be aimed toward academic, policy, regulatory, industry, and public stakeholders, respectively. RESULTS: As of July 2024, the project organization phase, as well as the mapping review and meta-research study, have been completed. We aim to complete the item generation, including the Delphi consensus, and finalize the tool by the end of 2024. Therefore, QUADAS-AI will be able to provide a consensus-derived platform upon which stakeholders may systematically appraise the methodological quality associated with AI diagnostic accuracy studies by the beginning of 2025. CONCLUSIONS: AI-driven systems comprise an increasingly significant proportion of research in clinical diagnostics. Through this process, QUADAS-AI will aid the evaluation of studies in this domain in order to identify bias and applicability concerns. As such, QUADAS-AI may form a key part of clinical, governmental, and regulatory evaluation frameworks for AI diagnostic systems globally. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): DERR1-10.2196/58202.
Assuntos
Inteligência Artificial , Pesquisa Qualitativa , Humanos , Projetos de Pesquisa/normas , Garantia da Qualidade dos Cuidados de Saúde/métodos , Técnica DelphiRESUMO
Studies investigating the accuracy of diagnostic tests should provide data on how effectively they identify or exclude disease in order to inform clinicians responsible for managing patients. This consensus-based project was undertaken to develop reporting guidelines for authors submitting manuscripts, which describe studies that have evaluated the accuracy of diagnostic tests in endodontics. These guidelines are known as the Preferred Reporting Items for Diagnostic Accuracy Studies in Endodontics (PRIDASE) 2024 guidelines. A nine-member steering committee created an initial checklist by integrating and modifying items from the Standards for Reporting of Diagnostic Accuracy (STARD) 2015 checklist and the Clinical and Laboratory Images in Publications (CLIP) principles, as well as adding a number of new items specific to the specialty of endodontics. Thereafter, the steering committee formed the PRIDASE Delphi Group (PDG) and the PRIDASE Online Meeting Group (POMG) in order to collect expert feedback on the preliminary draft checklist. Members of the Delphi group engaged in an online Delphi process to reach consensus on the clarity and suitability of the items in the checklist. The online meeting group then held an in-depth discussion on the online Delphi-generated items via the Zoom platform on 20 October 2023. According to the feedback obtained, the steering committee revised the PRIDASE checklist, which was then piloted by several authors when preparing manuscripts describing diagnostic accuracy studies in endodontics. Feedback from this process resulted in the final version of the PRIDASE 2024 checklist, which has 11 sections and 66 items. Authors are encouraged to use the PRIDASE 2024 guidelines when developing manuscripts on diagnostic accuracy in endodontics in order to improve the quality of reporting in this area. Editors of relevant journals will be invited to include these guidelines in their instructions to authors.
Assuntos
Lista de Checagem , Consenso , Técnica Delphi , Endodontia , Humanos , Endodontia/normas , Testes Diagnósticos de Rotina/normasRESUMO
PURPOSE: Shear-wave elastography (SWE) provides quantitative and absolute metrics for analyzing the elasticity of soft tissues. Despite the anterior scalene muscle (AS) is a key structure in patients with neck pain and nerve compressive syndromes, the majority of SWE studies only included asymptomatic subjects. This study aimed to analyze the Young's modulus and shear wave speed test-retest reliability in a sample of patients with neck pain symptoms to characterize the AS stiffness. METHODS: A diagnostic accuracy study acquiring a set of ultrasound images at C7 level in 42 patients with mechanical neck pain by one experienced examiner. After blinding the participants' identity, trial and side, the Young's modulus and shear wave velocity were measured by an independent experienced rater in randomized order. Intra-class correlation coefficients (ICC), standard error of measurement (SEM), minimal detectable changes (MDC) and coefficient of variation (CV%) were calculated. RESULTS: The sample reported moderate pain intensity (5.9/10 points) and disability (17.38/100 points). AS stiffness metrics assessed showed no significant differences between males and females, left and right side nor painful and non-painful side. (all, p > 0.05). Intra-examiner reliability was excellent for calculating the Young's modulus for shear wave speed (ICC > 0.90). CONCLUSION: The results support the use of this procedure for assessing the AS stiffness in populations with mechanical neck pain as excellent reliability estimates were obtained. However, future research should analyze case-control differences and the association between SWE metrics with clinical severity indicators.
Assuntos
Módulo de Elasticidade , Técnicas de Imagem por Elasticidade , Cervicalgia , Humanos , Técnicas de Imagem por Elasticidade/métodos , Masculino , Feminino , Adulto , Cervicalgia/diagnóstico por imagem , Cervicalgia/fisiopatologia , Pessoa de Meia-Idade , Adulto Jovem , Reprodutibilidade dos TestesRESUMO
BACKGROUND: Ancillary diagnostic tests are frequent in dermatopathology practice. Publications on their accuracy influence their utilization. The transparency and completeness of these publications are unknown. METHODS: We performed a cross-sectional study on diagnostic accuracy studies in dermatopathology published between 2020 and 2022 for compliance with Standards for Reporting of Diagnostic Accuracy Studies (STARD) and the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2). RESULTS: 14.67 ± 3.02 STARD items were reported in 62 publications (range, 9.5-23.5 out of the recommended total of 30). More items were reported in high-impact factor journals (16.01 vs. 13.32, p = 0.0002) and journals that endorsed STARD in their author instructions (17.22 vs. 14.11, p = 0.0039). Less than 10% of publications reported quantifiable hypotheses, sample size calculations, flow diagrams, or study registrations. The risk of bias by our analysis of QUADAS-2 criteria was high or uncertain for index test interpretation (36/62, 58%) and patient selection (44/62, 71%). CONCLUSIONS: Publications on dermatopathology tests are exploratory studies without prespecified hypotheses or study designs. They do not meet the criteria for transparent reporting. We suggest that medical journal leadership should consider updating their instructions with more explicit guidance on recommended manuscript elements.
Assuntos
Projetos de Pesquisa , Humanos , Estudos Transversais , Padrões de ReferênciaRESUMO
OBJECTIVE: Ultrasound (US) imaging is an essential tool for clinicians because of its cost-effectiveness and accessibility for assessing multiple muscle metrics including muscle quality, size and shape. Although previous studies highlighted the importance of the anterior scalene muscle (AS) in patients with neck pain, studies analyzing the reliability of US measurements for this muscle are lacking. This study aimed to develop a protocol for assessing AS muscle shape and quality measured with US and investigating its intra- and inter-examiner reliability. METHODS: Through use of a linear transducer, B-mode images of the anterolateral neck region at the C7 level were acquired in 28 healthy volunteers by two examiners (one experienced and one novel). Cross-sectional area, perimeter, shape descriptors and mean echo-intensity were measured twice by each examiner in randomized order. Intra-class correlation coefficients (ICCs), standard errors of measurement and minimal detectable changes were calculated. RESULTS: Results indicated no muscle side-to-side asymmetries (p > 0.05). Gender differences were found for muscle size (p < 0.01), but muscle shape and brightness were comparable (p > 0.05). Intra-examiner reliability was good to excellent for all metrics for the experienced and (ICC >0.846) and novel (ICC >780) examiners. Although inter-examiner reliability was good for most of the metrics (ICC >0.709), the estimates for assessing solidity and circularity were unacceptable (ICC <0.70). CONCLUSION: This study found that the described ultrasound procedure for locating and measuring anterior scalene muscle morphology and quality is highly reliable in asymptomatic individuals.
Assuntos
Cervicalgia , Humanos , Reprodutibilidade dos Testes , Ultrassonografia/métodosRESUMO
PURPOSE: To investigate the accuracy of cutoff values of the morning serum cortisol (MSC) using the cortisol stimulus test (CST) insulin tolerance test (ITT) and 250 mcg short Synacthen test (SST) as the reference standard tests, to better define its clinical role as a tool in the diagnostic investigation of adrenal insufficiency (AI) AI. METHODS: An observational study was conducted with a retrospective analysis of MSC in adult patients who had been submitted to a CST to investigate AI between January 2014 and December 2020. The normal cortisol response (NR) to stimulation was defined based on the cortisol assay. RESULTS: 371 patients underwent CST for suspected AI, 121/371 patients (32.6%) were diagnosed with AI. ROC curve analysis showed an area under the curve (AUC) for MSC of 0.75 (95% CI 0.69 - 0.80). The best MSC cutoff values to confirm AI were < 3.65, < 2.35 and < 1.5 mcg/dL with specificity of 98%, 99%, and 100%, respectively. MSC > 12.35, > 14.2 and > 14.5 mcg/dL had sensitivity of 98%, 99%, and 100%, respectively, being the best cutoff values to exclude AI. Almost 25% of patients undergoing CST for possible AI had MSC values between < 3.65 mcg/dL (6.7% of patients) and > 12.35 mcg/dL (17.5% of patients), making the formal CST testing unnecessary if we consider these cutoff values. CONCLUSION: With the most modern cortisol assays, MSC could be used as a diagnostic tool, with high accuracy to confirm or exclude AI, avoiding unnecessary CST; thus, reducing expenses and safety risks during AI investigation.
Assuntos
Insuficiência Adrenal , Hidrocortisona , Adulto , Humanos , Estudos Retrospectivos , Insuficiência Adrenal/diagnóstico , Curva ROC , Fatores de TempoRESUMO
Ultrasound imaging (US) is a biosensing technique that is widely used in several healthcare disciplines (including physiotherapy) for assessing multiple muscle metrics, such as muscle morphology and quality. Since all biosensors need to be tested in order to demonstrate their reliability, accuracy, sensitivity, and specificity, identifying factors that affect their diagnostic accuracy is essential. Since previous studies analyzed the impact of sociodemographic but not body composition characteristics in US errors, this study aimed to assess whether body composition metrics are associated with ultrasound measurement errors. B-mode images of the lumbar multifidus muscle at the L5 level were acquired and analyzed in 47 healthy volunteers by two examiners (one experienced and one novice). The cross-sectional area, muscle perimeter, and mean echo intensity were calculated bilaterally. A correlation analysis and a multivariate linear regression model were used for assessing the inter-examiner differences with respect to body composition metrics. The results demonstrated good-to-excellent reliability estimates for the cross-sectional area, muscle perimeter, aspect ratio, roundness, circularity, and mean brightness metrics (all ICC > 0.85). However, solidity showed unacceptable reliability (ICC < 0.7). Age, height, total lean mass, trunk lean mass, and water volume were associated with inter-examiner disagreement on mean echo intensity. Cross-sectional area, perimeter, and roundness measurement errors were associated with lean mass and water volume.
Assuntos
Região Lombossacral , Músculos Paraespinais , Humanos , Músculos Paraespinais/diagnóstico por imagem , Músculos Paraespinais/anatomia & histologia , Reprodutibilidade dos Testes , Ultrassonografia/métodos , Região Lombossacral/diagnóstico por imagem , Composição CorporalRESUMO
Background: Modic changes (MC) have been proposed as a cause of low back pain (LBP). However, the proposition remains controversial. There is uncertainty over the existence or degree of association between the two and whether, if there is an association, it is a causal relationship. Previous systematic reviews of the evidence have had methodological flaws. Aims: The aim was to synthesize the current evidence to test the hypothesis that there is an association between MC and LBP and if there is, to evaluate the strength of the association. Methods: MEDLINE, EMBASE, and CINAHL were searched for all studies up to 31 December 2018 for cohort, case-control, and cross-sectional studies. Screening, quality assessment, and data extraction were conducted by two independent reviewers. Quality was assessed using the Joanna Biggs Institute tools for observational studies. The clinical heterogeneity among these studies ruled out pooling so a narrative review was undertaken. Results: Fifteen studies met the inclusion criteria, varying in patient characteristics, characteristics of MC, coexisting spinal conditions, and outcomes. The quality of evidence was poor in six and moderate in seven. There was wide clinical heterogeneity amongst the studies. The inclusion ages varied from early teens to over 65s, pain duration varied from under 6 weeks to over three months, and characteristics of the MC chosen for the studies also differed. The results were inconsistent across the studies: the odds ratios varied from showing an inverse relationship [Kovacs] with an odds ratio 0.31 (95% confidence interval, 0.1-0.95) to a very strong positive association 121.4 (11.21-1315.08) [Nakamae]. There was no consistency in associations for: type of MC, lumbar levels affected, position in relation to the vertebra, and presence of co-existing spinal conditions. The associations were possibly spurious arising from potential biases suggested by incomplete reporting: publication bias, selective reporting, and post hoc analysis. Only one study at low risk of bias found a substantial association but it was a small study of a narrow group meaning its results may not be generalizable. Discussion: The inconsistency of findings and the possibility that they were spurious means that no conclusions can be drawn about an association between MC and LBP. Future research should be designed as prospective cohort studies with adherence to reporting guidelines pertaining to observational studies and to MRI. Currently, clinicians should not look for the presence or absence of MC to guide their management of patients with LBP.
RESUMO
BACKGROUND: Decision rules (eg, Canadian computed tomography head rule [CCHR] for adults and Pediatric Emergency Care Applied Research Network [PECARN] rule for children/adolescents) are used in emergency settings (emergency room [ER] rules) to assess traumatic brain injuries (TBIs). The rules have a high-sensitivity and near-perfect negative predictive value that help to rule out more severe TBI. CLINICAL QUESTION: Which criteria should be added to the Sport Concussion Assessment Tool 5 (SCAT5) to reach the sensitivity of the ER rules and improve the utility of the SCAT5 for screening for higher-severity head and brain injuries? KEY RESULTS: We performed a comparative analysis of the SCAT5 with the CCHR and PECARN rules. We compared the presence (yes or no) and comparative "face value" sensitivity (lower, identical, or higher) of the SCAT5 criteria to the ER rules criteria. Loss of consciousness, vomiting, severe/increasing headache, and seizure are SCAT "red flags" with similar or higher sensitivity compared to ER rules criteria. Five criteria had lower sensitivity or were absent from the SCAT. Emergency room rules include any abnormality on the Glasgow Coma Scale (GCS<15), but only a "deterioration of the state of consciousness" is considered a "red flag" in the SCAT5. Persistent retrograde amnesia for more than 30 minutes, age>65, severity of the mechanism of injury, and signs of skull fractures are not mentioned in the SCAT5. CLINICAL APPLICATION: We identified 5 criteria that could inform the evolution the SCAT5 to improve its ability to rule out more severe TBI in a sideline assessment context. J Orthop Sports Phys Ther 2023;53(3):113-119. Epub: 9 December 2022. doi:10.2519/jospt.2022.11301.
Assuntos
Concussão Encefálica , Lesões Encefálicas Traumáticas , Traumatismos Craniocerebrais , Adulto , Adolescente , Humanos , Criança , Idoso , Traumatismos Craniocerebrais/diagnóstico , Sensibilidade e Especificidade , Canadá , Concussão Encefálica/diagnóstico por imagem , Lesões Encefálicas Traumáticas/diagnóstico por imagem , Serviço Hospitalar de EmergênciaRESUMO
OBJECTIVE: To develop a clinical practice guideline covering the assessment, management, and return to work of adults with rotator cuff disorders. DESIGN: Clinical practice guideline. METHODS: Using systematic reviews, appraisal of the literature, and an iterative approach to obtain consensus from key stakeholders, clinical recommendations and algorithms were developed in the context of the health care system and work environment of the province of Quebec (Canada). RESULTS: Recommendations (n = 73) and clinical decision algorithms (n = 3) were developed to match the objectives. The initial assessment should include the patient's history, a subjective assessment, and a physical examination. Diagnostic imaging is only necessary in select circumstances. Acetaminophen, nonsteroidal anti-inflammatory drugs, and injection therapies may be useful to reduce pain in the short term. Clinicians should prescribe an active and task-oriented rehabilitation program (exercises and education) to reduce pain and disability in adults with rotator cuff disorders. Subacromial decompression is not recommended to treat rotator cuff tendinopathy. Surgery is appropriate for selected patients with a full-thickness rotator cuff tear. A return-to-work plan should be developed early, in collaboration with the worker and other stakeholders, and must combine multiple strategies to promote return to work. CONCLUSION: This clinical practice guideline was developed to assist the multidisciplinary team of clinicians who provide health care for adults with a rotator cuff disorder. The CPG guides clinical decisionmaking for diagnosis and treatment, and planning for successful return to work. J Orthop Sports Phys Ther 2022;52(10):647-664. Epub: 27 July 2022. doi:10.2519/jospt.2022.11306.
Assuntos
Lesões do Manguito Rotador , Manguito Rotador , Acetaminofen , Adulto , Anti-Inflamatórios , Humanos , Retorno ao Trabalho , Lesões do Manguito Rotador/cirurgia , Lesões do Manguito Rotador/terapia , Dor de Ombro/terapiaRESUMO
OBJECTIVE: To estimate the screening performances of the most important provocation tests for diagnosing carpal tunnel syndrome (CTS). DESIGN: Diagnostic test accuracy systematic review with meta-analysis. LITERATURE SEARCH: We systematically searched the MEDLINE, Scopus, Web of Science, and Cochrane databases from inception to November 2020. STUDY SELECTION CRITERIA: Observational studies comparing the accuracies of the Durkan test (DT), the hand elevation test (HET), the Phalen test (PT), the Tinel test (TT), and the upper-limb neurodynamic test specific to the median nerve (ULNT1) with electrodiagnosis for screening for CTS. DATA SYNTHESIS: Random-effects models for the diagnostic odds ratio (dOR) values computed by Moses' constant for a linear model and 95% confidence intervals (CIs) were used to calculate the accuracy of these tests. Hierarchical summary receiver operating characteristic curve analyses were used to summarize the overall test performance. RESULTS: Thirty-seven studies were included in the meta-analysis, with a total sample of 2662 wrists for DT, 864 wrists for HET, 6361 wrists for PT, 6094 wrists for TT, and 571 wrists for ULNT1. The pooled dORs for screening for CTS were 15.84 (95% CI: 3.78, 66.38) for DT, 128.63 (95% CI: 40.64, 407.12) for HET, 7.23 (95% CI: 4.06, 12.86) for PT, 5.31 (95% CI: 3.49, 8.09) for TT, and 1.78 (95% CI: 0.61, 5.19) for ULNT1. CONCLUSION: HET has the best clinical performance for detecting CTS and should be considered the first screening test of choice during the physical examination. The most common tests (DT, PT, and TT) have good accuracies for screening for CTS. J Orthop Sports Phys Ther 2022;52(8):522-531. Epub: 19 June 2022. doi:10.2519/jospt.2022.10828.
Assuntos
Síndrome do Túnel Carpal , Programas de Rastreamento , Síndrome do Túnel Carpal/diagnóstico , Humanos , Programas de Rastreamento/métodos , Estudos Observacionais como Assunto , Reprodutibilidade dos TestesRESUMO
The accuracy of a diagnostic test is often expressed using a pair of measures: sensitivity (proportion of test positives among all individuals with target condition) and specificity (proportion of test negatives among all individuals without target condition). If the outcome of a diagnostic test is binary, results from different studies can easily be summarized in a meta-analysis. However, if the diagnostic test is based on a discrete or continuous measure (e.g., a biomarker), several cut-offs within one study as well as among different studies are published. Instead of taking all information of the cut-offs into account in the meta-analysis, a single cut-off per study is often selected arbitrarily for the analysis, even though there are statistical methods for the incorporation of several cut-offs. For these methods, distributional assumptions have to be met and/or the models may not converge when specific data structures occur. We propose a semiparametric approach to overcome both problems. Our simulation study shows that the diagnostic accuracy is under-estimated, although this underestimation in sensitivity and specificity is relatively small. The comparative approach of Steinhauser et al. is better in terms of coverage probability, but may lead to convergence problems. In addition to the simulation results, we illustrate the application of the semiparametric approach using a published meta-analysis for a diagnostic test differentiating between bacterial and viral meningitis in children.
Assuntos
Sensibilidade e Especificidade , Criança , Simulação por Computador , Humanos , ProbabilidadeRESUMO
BACKGROUND: Preferentially expressed antigen in melanoma (PRAME) is a tumor-associated antigen that is frequently expressed in cutaneous melanoma and can be evaluated by immunohistochemistry. Earlier studies on PRAME utilized case-control study designs that may misestimate diagnostic accuracy and lack generalizability. METHODS: Using retrospective cohort selection, a cross-sectional study of diagnostic accuracy of PRAME was conducted according to standards for reporting diagnostic accuracy studies requirements. RESULTS: Mean PRAME positive fraction was higher in 42 malignant melanocytic lesions than 101 benign melanocytic lesions (0.71 ± 0.30 vs. 0.13 ± 0.20, p < 0.01). Receiver operating characteristic curve showed the test was effective (area under the curve = 0.90). Global PRAME 4+ scores (>75%) were associated with sensitivity of 0.63, specificity of 0.97, accuracy of 0.87, and excellent interrater concordance (Kappa = 0.83). Lower cutoffs for PRAME of 2+ (>25%) and 3+ (>50%) produced higher joint sensitivity/specificity (Youden index) than PRAME 4+, but lower accuracy. CONCLUSION: PRAME as it is used in clinical practice is an effective test for melanoma. PRAME is best used as an ordinal variable to calculate the posttest probability of melanoma. PRAME ≤25% (0/1+) favors nevus, PRAME 26%-75% (2/3+) is noncontributory, and PRAME >75% (4+) favors melanoma.
Assuntos
Melanoma , Neoplasias Cutâneas , Antígenos de Neoplasias , Estudos de Casos e Controles , Estudos Transversais , Humanos , Imuno-Histoquímica , Melanoma/diagnóstico , Melanoma/patologia , Estudos Retrospectivos , Neoplasias Cutâneas/diagnóstico , Neoplasias Cutâneas/patologiaRESUMO
Objective: The present meta-analysis was conducted to determine the diagnostic accuracy of the bipolarity index (BI) and Rapid Mode Screener (RMS) as compared with the Bipolar Spectrum Diagnostic Scale (BSDS), the Hypomania Checklist (HCL-32), and the Mood Disorder Questionnaire (MDQ) in people with bipolar disorder (BD). Methods: We systematically searched five databases using standard search terms, and relevant articles published between May 1990 and November 30, 2021 were collected and reviewed. Results: Ninety-three original studies were included (n=62,291). At the recommended cutoffs for the BI, HCL-32, BSDS, MDQ, and RMS, the pooled sensitivities were 0.82, 0.75, 0.71, 0.71, and 0.78, respectively, while the corresponding pooled specificities were 0.73, 0.63, 0.73, 0.77, and 0.72, respectively. However, there was evidence that the accuracy of the BI was superior to that of the other tests, with a relative diagnostic odds ratio (RDOR) of 1.22 (0.98-1.52, p < 0.0001). The RMS was significantly more accurate than the other tests, with an RDOR (95%CI) of 0.79 (0.67-0.92, p < 0.0001) for the detection of BD type I (BD-I). However, there was evidence that the accuracy of the MDQ was superior to that of the other tests, with an RDOR of 1.93 (0.89-2.79, p = 0.0019), for the detection of BD type II (BD-II). Conclusion: The psychometric properties of two new instruments, the BI and RMS, in people with BD were consistent with considerably higher diagnostic accuracy than the HCL-32, BSDS, and MDQ. However, a positive screening should be confirmed by a clinical diagnostic evaluation for BD.