RESUMO
Background There is variable evidence and no randomized trials on the benefit of US elastography-guided fine-needle aspiration cytology (FNAC) over conventional US-guided FNAC alone for thyroid nodules. Purpose To compare the efficacy of US elastography-guided FNAC versus US-guided FNAC in reducing nondiagnostic rates for thyroid nodules. Materials and Methods A pragmatic, multicenter randomized controlled trial was performed at 18 secondary and tertiary hospitals across England between February 2015 and September 2018. Eligible adults with single or multiple thyroid nodules who had not previously undergone FNAC were randomized (1:1 ratio) to US elastography FNAC (intervention) or conventional US FNAC (control). The primary outcome was the proportion of patients who have a nondiagnostic cytologic Thy1 (British Thyroid Association system) result following the first FNAC. Results A total of 982 participants (mean age, 51.3 years ± 15 [SD] [IQR, 39-63]; male-to-female ratio, 1:4) were randomized. Of the 493 participants who underwent US elastography, 467 (94.7%) were examined with strain US elastography. There was no difference between the two arms in the nondiagnostic (Thy1) rate following the first FNAC (19% vs 16%; risk difference [RD], 0.03 [95% CI: -0.01, 0.07]; P = .11) or in the median time to reach the final definitive diagnosis (3.3 months [IQR, 1.5-6.4] for US elastography FNAC vs 3.4 months [IQR, 1.5-6.2] for US FNAC). All sensitivity analyses supported the primary analysis. Fewer participants in the US elastography FNAC arm underwent diagnostic hemithyroidectomy than in the US FNAC arm (183 of 493 [37%] vs 196 of 489 [40%]), but this was not statistically significant (adjusted RD, 0.02 [95% CI: -0.06, 0.01]; P = 0.15). There was no evidence of a difference in malignancy rates between the two arms: 70 of 493 (14%) in US elastography FNAC arm versus 79 of 489 (16%) in US FNAC arm (P = .39). There was also no difference in the rate of benign histologic findings between the groups (RD, -0.01 [95% CI: -0.04, 0.03]; P = .7). Conclusion Strain US elastography does not appear to have additional benefit over conventional US FNAC in the diagnosis of malignancy in thyroid nodules. Clinical trial registration no. ISRCTN18261857 Published under a CC BY 4.0 license. Supplemental material is available for this article. See also the editorial by Isikbay and Harwin in this issue.
Assuntos
Técnicas de Imagem por Elasticidade , Nódulo da Glândula Tireoide , Humanos , Técnicas de Imagem por Elasticidade/métodos , Masculino , Feminino , Nódulo da Glândula Tireoide/diagnóstico por imagem , Nódulo da Glândula Tireoide/patologia , Nódulo da Glândula Tireoide/cirurgia , Pessoa de Meia-Idade , Adulto , Biópsia por Agulha Fina , Biópsia Guiada por Imagem/métodos , Glândula Tireoide/diagnóstico por imagem , Glândula Tireoide/patologia , Ultrassonografia de Intervenção/métodosRESUMO
INTRODUCTION: ElaTION is a large multi-centre pragmatic randomised controlled trial, performed in 18 secondary/tertiary hospitals across England, comparing elastography ultrasound-guided fine needle aspiration cytology (EUS-FNAC) with ultrasound-guided FNAC (US-FNAC) alone in the diagnostic assessment of thyroid nodules. Secondary trial outcomes, reported here, assessed the accuracy of ultrasound-alone (US) compared with US-guided FNAC to inform and update current practice guidelines. METHODS: Adults with single or multiple thyroid nodules who had not undergone previous FNAC were eligible. Radiologists assessed all thyroid nodules using US alone, thereby enabling assessment of its accuracy (sensitivity and specificity) versus US-FNAC. RESULTS: Of the 982 participants, a final definitive diagnosis was obtained in 688, who were included in the final analyses. The sensitivity of US-alone was the same as US-FNAC (0.91, [95% CI 0.85, 0.97] vs 0.87 [95%CI 0.80-0.95], p=0.37). US alone had statistically significant lower specificity than US-FNAC alone (0.48 vs 0.67 respectively, p<0.0001). The malignancy rate on histology in a nodule classified as benign on ultrasound (U2) was 9/263 (3.42%) and on cytology (Thy2) was 15/353 (4.25%), whereas the malignancy rate in a nodule that was benign on both (U2, Thy2) was 3/210 (1.43%). Malignancy risk for U3, U4, and U5 nodules was 68/304 (22.4%), 43/83 (51.8%), and 29/38 (76.3%) respectively (p<0.0001). Yet 80/982 (8%) patients were discharged despite having U3-U5 scans with Thy1 (non-diagnostic) FNAC and no definitive diagnosis.Malignancy risk was higher in smaller nodules: <10mm 23/60 (38.3%), 10-20mm 46/162 (28.4 %), and >20mm 80/466 (17.2%) (p<0.0001). Nodules with indeterminate cytology with atypical features (Thy3a) carried a similar malignancy risk to those with indeterminate cytology (Thy3/3f): 27/95 (28.4%) versus 42/113 (37.2%) respectively (p=0.18). CONCLUSION: Ultrasound alone appears to be an effective diagnostic modality in thyroid nodules, confirming the recommendations of recent guidelines and the BTA classification. However, findings also suggest caution regarding existing recommendations for conservative management of non-diagnostic (Thy1/Bethesda I) and atypical (Thy3a/Bethesda III) nodules. In those cases, ultrasound (U3-5) features may help identify high-risk subgroups for more proactive management.
RESUMO
BACKGROUND: Multiple risk-prediction models are used in clinical practice to triage patients as being at low risk or high risk of ovarian cancer. In the ROCkeTS study, we aimed to identify the best diagnostic test for ovarian cancer in symptomatic patients, through head-to-head comparisons of risk-prediction models, in a real-world setting. Here, we report the results for the postmenopausal cohort. METHODS: In this multicentre, prospective diagnostic accuracy study, we recruited newly presenting female patients aged 16-90 years with non-specific symptoms and raised CA125 or abnormal ultrasound results (or both) who had been referred via rapid access, elective clinics, or emergency presentations from 23 hospitals in the UK. Patients with normal CA125 and simple ovarian cysts of smaller than 5 cm in diameter, active non-ovarian malignancy, or previous ovarian malignancy, or those who were pregnant or declined a transvaginal scan, were ineligible. In this analysis, only postmenopausal participants were included. Participants completed a symptom questionnaire, gave a blood sample, and had transabdominal and transvaginal ultrasounds performed by International Ovarian Tumour Analysis consortium (IOTA)-certified sonographers. Index tests were Risk of Malignancy 1 (RMI1) at a threshold of 200, Risk of Malignancy Algorithm (ROMA) at multiple thresholds, IOTA Assessment of Different Neoplasias in the Adnexa (ADNEX) at thresholds of 3% and 10%, IOTA SRRisk model at thresholds of 3% and 10%, IOTA Simple Rules (malignant vs benign, or inconclusive), and CA125 at 35 IU/mL. In a post-hoc analysis, the Ovarian Adnexal and Reporting Data System (ORADS) at 10% was derived from IOTA ultrasound variables using established methods since ORADS was described after completion of recruitment. Index tests were conducted by study staff masked to the results of the reference standard. The comparator was RMI1 at the 250 threshold (the current UK National Health Service standard of care). The reference standard was surgical or biopsy tissue histology or cytology within 3 months, or a self-reported diagnosis of ovarian cancer at 12 month follow-up. The primary outcome was diagnostic accuracy at predicting primary invasive ovarian cancer versus benign or normal histology, assessed by analysing the sensitivity, specificity, C-index, area under receiver operating characteristic curve, positive and negative predictive values, and calibration plots in participants with conclusive reference standard results and available index test data. This study is registered with the International Standard Randomised Controlled Trial Number registry (ISRCTN17160843). FINDINGS: Between July 13, 2015, and Nov 30, 2018, 1242 postmenopausal patients were recruited, of whom 215 (17%) had primary ovarian cancer. 166 participants had missing, inconclusive, or other reference standard results; therefore, data from a maximum of 1076 participants were used to assess the index tests for the primary outcome. Compared with RMI1 at 250 (sensitivity 82·9% [95% CI 76·7 to 88·0], specificity 87·4% [84·9 to 89·6]), IOTA ADNEX at 10% was more sensitive (difference of -13·9% [-20·2 to -7·6], p<0·0001) but less specific (difference of 28·5% [24·7 to 32·3], p<0·0001). ROMA at 29·9 had similar sensitivity (difference of -3·6% [-9·1 to 1·9], p=0·24) but lower specificity (difference of 5·2% [2·5 to 8·0], p=0·0001). RMI1 at 200 had similar sensitivity (difference of -2·1% [-4·7 to 0·5], p=0·13) but lower specificity (difference of 3·0% [1·7 to 4·3], p<0·0001). IOTA SRRisk model at 10% had similar sensitivity (difference of -4·3% [-11·0 to -2·3], p=0·23) but lower specificity (difference of 16·2% [12·6 to 19·8], p<0·0001). IOTA Simple Rules had similar sensitivity (difference of -1·6% [-9·3 to 6·2], p=0·82) and specificity (difference of -2·2% [-5·1 to 0·6], p=0·14). CA125 at 35 IU/mL had similar sensitivity (difference of -2·1% [-6·6 to 2·3], p=0·42) but higher specificity (difference of 6·7% [4·3 to 9·1], p<0·0001). In a post-hoc analysis, when compared with RMI1 at 250, ORADS achieved similar sensitivity (difference of -2·1%, 95% CI -8·6 to 4·3, p=0·60) and lower specificity (difference of 10·2%, 95% CI 6·8 to 13·6, p<0·0001). INTERPRETATION: In view of its higher sensitivity than RMI1 at 250, despite some loss in specificity, we recommend that IOTA ADNEX at 10% should be considered as the new standard-of-care diagnostic in ovarian cancer for postmenopausal patients. FUNDING: UK National Institute of Heath Research.
Assuntos
Antígeno Ca-125 , Neoplasias Ovarianas , Pós-Menopausa , Humanos , Feminino , Pessoa de Meia-Idade , Neoplasias Ovarianas/diagnóstico , Neoplasias Ovarianas/sangue , Neoplasias Ovarianas/epidemiologia , Neoplasias Ovarianas/patologia , Neoplasias Ovarianas/diagnóstico por imagem , Idoso , Estudos Prospectivos , Adulto , Reino Unido/epidemiologia , Medição de Risco , Idoso de 80 Anos ou mais , Antígeno Ca-125/sangue , Adolescente , Adulto Jovem , Valor Preditivo dos Testes , Ultrassonografia , Fatores de RiscoRESUMO
OBJECTIVE: Symptom-triggered testing for ovarian cancer was introduced to the UK whereby symptomatic women undergo an ultrasound scan and serum CA125, and are referred to hospital within 2 weeks if these are abnormal. The potential value of symptom-triggered testing in the detection of early-stage disease or low tumor burden remains unclear in women with high grade serous ovarian cancer. In this descriptive study, we report on the International Federation of Gynecology and Obstetrics (FIGO) stage, disease distribution, and complete cytoreduction rates in women presenting via the fast-track pathway and who were diagnosed with high grade serous ovarian cancer. METHODS: We analyzed the dataset from Refining Ovarian Cancer Test accuracy Scores (ROCkeTS), a single-arm prospective diagnostic test accuracy study recruiting from 24 hospitals in the UK. The aim of ROCkeTS is to validate risk prediction models in symptomatic women. We undertook an opportunistic analysis for women recruited between June 2015 to July 2022 and who were diagnosed with high grade serous ovarian cancer via the fast-track pathway. Women presenting with symptoms suspicious for ovarian cancer receive a CA125 blood test and an ultrasound scan if the CA125 level is abnormal. If either of these is abnormal, women are referred to secondary care within 2 weeks. Histology details were available on all women who underwent surgery or biopsy within 3 months of recruitment. Women who did not undergo surgery or biopsy at 3 months were followed up for 12 months as per the national guidelines in the UK. In this descriptive study, we report on patient demographics (age and menopausal status), WHO performance status, FIGO stage at diagnosis, disease distribution (low/pelvic confined, moderate/extending to mid-abdomen, high/extending to upper abdomen) and complete cytoreduction rates in women who underwent surgery. RESULTS: Of 1741 participants recruited via the fast-track pathway, 119 (6.8%) were diagnosed with high grade serous ovarian cancer. The median age was 63 years (range 32-89). Of these, 112 (94.1%) patients had a performance status of 0 and 1, 30 (25.2%) were diagnosed with stages I/II, and the disease distribution was low-to-moderate in 77 (64.7%). Complete and optimal cytoreduction were achieved in 73 (61.3%) and 18 (15.1%). The extent of disease was low in 43 of 119 (36.1%), moderate in 34 of 119 (28.6%), high in 32 of 119 (26.9%), and not available in 10 of 119 (8.4%). Nearly two thirds, that is 78 of 119 (65.5%) women with high grade serous ovarian cancer, underwent primary debulking surgery, 36 of 119 (30.3%) received neoadjuvant chemotherapy followed by interval debulking surgery, and 5 of 119 (4.2%) women did not undergo surgery. CONCLUSION: Our results demonstrate that one in four women identified with high grade serous ovarian cancer through the fast-track pathway following symptom-triggered testing was diagnosed with early-stage disease. Symptom-triggered testing may help identify women with a low disease burden, potentially contributing to high complete cytoreduction rates.
RESUMO
Background: Estimation of glomerular filtration rate using equations based on creatinine is widely used to manage chronic kidney disease. In the UK, the Chronic Kidney Disease Epidemiology Collaboration creatinine equation is recommended. Other published equations using cystatin C, an alternative marker of kidney function, have not gained widespread clinical acceptance. Given higher cost of cystatin C, its clinical utility should be validated before widespread introduction into the NHS. Objectives: Primary objectives were to: (1) compare accuracy of glomerular filtration rate equations at baseline and longitudinally in people with stage 3 chronic kidney disease, and test whether accuracy is affected by ethnicity, diabetes, albuminuria and other characteristics; (2) establish the reference change value for significant glomerular filtration rate changes; (3) model disease progression; and (4) explore comparative cost-effectiveness of kidney disease monitoring strategies. Design: A longitudinal, prospective study was designed to: (1) assess accuracy of glomerular filtration rate equations at baseline (nâ =â 1167) and their ability to detect change over 3 years (nâ =â 875); (2) model disease progression predictors in 278 individuals who received additional measurements; (3) quantify glomerular filtration rate variability components (nâ =â 20); and (4) develop a measurement model analysis to compare different monitoring strategy costs (nâ =â 875). Setting: Primary, secondary and tertiary care. Participants: Adults (≥ 18 years) with stage 3 chronic kidney disease. Interventions: Estimated glomerular filtration rate using the Chronic Kidney Disease Epidemiology Collaboration and Modification of Diet in Renal Disease equations. Main outcome measures: Measured glomerular filtration rate was the reference against which estimating equations were compared with accuracy being expressed as P30 (percentage of values within 30% of reference) and progression (variously defined) studied as sensitivity/specificity. A regression model of disease progression was developed and differences for risk factors estimated. Biological variation components were measured and the reference change value calculated. Comparative costs of monitoring with different estimating equations modelled over 10 years were calculated. Results: Accuracy (P30) of all equations was ≥ 89.5%: the combined creatinine-cystatin equation (94.9%) was superior (pâ <â 0.001) to other equations. Within each equation, no differences in P30 were seen across categories of age, gender, diabetes, albuminuria, body mass index, kidney function level and ethnicity. All equations showed poor (< 63%) sensitivity for detecting patients showing kidney function decline crossing clinically significant thresholds (e.g. a 25% decline in function). Consequently, the additional cost of monitoring kidney function annually using a cystatin C-based equation could not be justified (incremental cost per patient over 10 yearsâ =â £43.32). Modelling data showed association between higher albuminuria and faster decline in measured and creatinine-estimated glomerular filtration rate. Reference change values for measured glomerular filtration rate (%, positive/negative) were 21.5/-17.7, with lower reference change values for estimated glomerular filtration rate. Limitations: Recruitment of people from South Asian and African-Caribbean backgrounds was below the study target. Future work: Prospective studies of the value of cystatin C as a risk marker in chronic kidney disease should be undertaken. Conclusions: Inclusion of cystatin C in glomerular filtration rate-estimating equations marginally improved accuracy but not detection of disease progression. Our data do not support cystatin C use for monitoring of glomerular filtration rate in stage 3 chronic kidney disease. Trial registration: This trial is registered as ISRCTN42955626. Funding: This award was funded by the National Institute for Health and Care Research (NIHR) Health Technology Assessment programme (NIHR award ref: 11/103/01) and is published in full in Health Technology Assessment; Vol. 28, No. 35. See the NIHR Funding and Awards website for further award information.
Chronic kidney disease, which affects approximately 14% of the adult population, often has no symptoms but, in some people, may later develop into kidney failure. Kidney disease is most often detected using a blood test called creatinine. Creatinine does not identify everyone with kidney disease, or those most likely to develop more serious kidney disease. An alternative blood test called cystatin C may be more accurate, but it is more expensive than the creatinine test. We compared the accuracy of these two tests in more than 1000 people with moderate kidney disease. Participants were tested over 3 years to see if the tests differed in their ability to detect worsening kidney function. We also wanted to identify risk factors associated with loss of kidney function, and how much the tests normally vary to better understand what results mean. We compared the accuracy and costs of monitoring people with the two markers. Cystatin C was found slightly more accurate than the creatinine test at estimating kidney function when comparing the baseline single measurements (95% accurate compared to 90%), but not at detecting worsening function over time. This means that the additional cost of monitoring people over time with cystatin C to detect kidney disease progression could not be justified. Kidney test results could vary by up to 20% between tests without necessarily implying changes in underlying kidney function this is the normal level of individual variation. Cystatin C marginally improved accuracy of kidney function testing but not ability to detect worsening kidney function. Cystatin C improves identification of moderate chronic kidney disease, but our results do not support its use for routine monitoring of kidney function in such patients.
Assuntos
Creatinina , Cistatina C , Progressão da Doença , Taxa de Filtração Glomerular , Insuficiência Renal Crônica , Humanos , Cistatina C/sangue , Creatinina/sangue , Masculino , Feminino , Insuficiência Renal Crônica/fisiopatologia , Pessoa de Meia-Idade , Idoso , Estudos Prospectivos , Estudos Longitudinais , Biomarcadores , Análise Custo-Benefício , Adulto , Reino Unido , AlbuminúriaRESUMO
OBJECTIVE: To investigate psychological correlates in women referred with suspected ovarian cancer via the fast-track pathway, explore how anxiety and distress levels change at 12 months post-testing, and report cancer conversion rates by age and referral pathway. DESIGN: Single-arm prospective cohort study. SETTING: Multicentre. Secondary care including outpatient clinics and emergency admissions. POPULATION: A cohort of 2596 newly presenting symptomatic women with a raised CA125 level, abnormal imaging or both. METHODS: Women completed anxiety and distress questionnaires at recruitment and at 12 months for those who had not undergone surgery or a biopsy within 3 months of recruitment. MAIN OUTCOME MEASURES: Anxiety and distress levels measured using a six-item short form of the State-Trait Anxiety Inventory (STAI-6) and the Impact of Event Scale - Revised (IES-r) questionnaire. Ovarian cancer (OC) conversion rates by age, menopausal status and referral pathway. RESULTS: Overall, 1355/2596 (52.1%) and 1781/2596 (68.6%) experienced moderate-to-severe distress and anxiety, respectively, at recruitment. Younger age and emergency presentations had higher distress levels. The clinical category for anxiety and distress remained unchanged/worsened in 76% of respondents at 12 months, despite a non-cancer diagnosis. The OC rates by age were 1.6% (95% CI 0.5%-5.9%) for age <40 years and 10.9% (95% CI 8.7%-13.6%) for age ≥40 years. In women referred through fast-track pathways, 3.3% (95% CI 1.9%-5.7%) of pre- and 18.5% (95% CI 16.1%-21.0%) of postmenopausal women were diagnosed with OC. CONCLUSIONS: Women undergoing diagnostic testing display severe anxiety and distress. Younger women are especially vulnerable and should be targeted for support. Women under the age of 40 years have low conversion rates and we advocate reducing testing in this group to reduce the harms of testing.
Assuntos
Ansiedade , Neoplasias Ovarianas , Humanos , Feminino , Neoplasias Ovarianas/psicologia , Estudos Prospectivos , Pessoa de Meia-Idade , Ansiedade/etiologia , Ansiedade/epidemiologia , Adulto , Idoso , Inquéritos e Questionários , Encaminhamento e Consulta/estatística & dados numéricos , Detecção Precoce de Câncer/psicologia , Antígeno Ca-125/sangue , Angústia Psicológica , Estresse Psicológico/etiologia , Estresse Psicológico/epidemiologiaRESUMO
OBJECTIVES: To review the methodology of interobserver variability studies; including current practice and quality of conducting and reporting studies. METHODS: Interobserver variability studies between January 2019 and January 2020 were included; extracted data comprised of study characteristics, populations, variability measures, key results, and conclusions. Risk of bias was assessed using the COSMIN tool for assessing reliability and measurement error. RESULTS: Seventy-nine full-text studies were included covering various imaging tests and clinical areas. The median number of patients was 47 (IQR:23-88), and observers were 4 (IQR:2-7), with sample size justified in 12 (15%) studies. Most studies used static images (n = 75, 95%), where all observers interpreted images for all patients (n = 67, 85%). Intraclass correlation coefficients (ICC) (n = 41, 52%), Kappa (κ) statistics (n = 31, 39%) and percentage agreement (n = 15, 19%) were most commonly used. Interpretation of variability estimates often did not correspond with study conclusions. The COSMIN risk of bias tool gave a very good/adequate rating for 52 studies (66%) including any studies that used variability measures listed in the tool. For studies using static images, some study design standards were not applicable and did not contribute to the overall rating. CONCLUSIONS: Interobserver variability studies have diverse study designs and methods, the impact of which requires further evaluation. Sample size for patients and observers was often small without justification. Most studies report ICC and κ values, which did not always coincide with the study conclusion. High ratings were assigned to many studies using the COSMIN risk of bias tool, with certain standards scored 'not applicable' when static images were used. ADVANCES IN KNOWLEDGE: The sample size for both patients and observers was often small without justification. For most studies, observers interpreted static images and did not evaluate the process of acquiring the imaging test, meaning it was not possible to assess many COSMIN risk of bias standards for studies with this design. Most studies reported intraclass correlation coefficient and κ statistics; study conclusions often did not correspond with results.
Assuntos
Diagnóstico por Imagem , Projetos de Pesquisa , Humanos , Variações Dependentes do Observador , Reprodutibilidade dos TestesRESUMO
OBJECTIVES: To identify which international health technology assessment (HTA) agencies are undertaking evaluations of medical tests, summarize commonalities and differences in methodological approach, and highlight examples of good practice. METHODS: A methodological review incorporating: systematic identification of HTA guidance documents mentioning evaluation of tests; identification of key contributing organizations and abstraction of approaches to all essential HTA steps; summary of similarities and differences between organizations; and identification of important emergent themes which define the current state of the art and frontiers where further development is needed. RESULTS: Seven key organizations were identified from 216 screened. The main themes were: elucidation of claims of test benefits; attitude to direct and indirect evidence of clinical effectiveness (including evidence linkage); searching; quality assessment; and health economic evaluation. With the exception of dealing with test accuracy data, approaches were largely based on general approaches to HTA with few test-specific modifications. Elucidation of test claims and attitude to direct and indirect evidence are where we identified the biggest dissimilarities in approach. CONCLUSIONS: There is consensus on some aspects of HTA of tests, such as dealing with test accuracy, and examples of good practice which HTA organizations new to test evaluation can emulate. The focus on test accuracy contrasts with universal acknowledgment that it is not a sufficient evidence base for test evaluation. There are frontiers where methodological development is urgently required, notably integrating direct and indirect evidence and standardizing approaches to evidence linkage.
Assuntos
Atitude , Avaliação da Tecnologia Biomédica , Análise Custo-Benefício , Consenso , Agências InternacionaisRESUMO
Background: Ovarian cancer (OC) is a diagnostic challenge, with the majority diagnosed at late stages. Existing systematic reviews of diagnostic models either use inappropriate meta-analytic methods or do not conduct statistical comparisons of models or stratify test performance by menopausal status. Methods: We searched CENTRAL, MEDLINE, EMBASE, CINAHL, CDSR, DARE, Health Technology Assessment Database and SCI Science Citation Index, trials registers, conference proceedings from 1991 to June 2019. Cochrane collaboration review methods included QUADAS-2 quality assessment and meta-analysis using hierarchical modelling. RMI, ROMA or ADNEX at any test positivity threshold were investigated. Histology or clinical follow-up was the reference standard. We excluded screening studies, studies restricted to pregnancy, recurrent or metastatic OC. 2 × 2 diagnostic tables were extracted separately for pre- and post-menopausal women. Results: We included 58 studies (30,121 patients, 9061 cases of ovarian cancer). Prevalence of OC ranged from 16 to 55% in studies. For premenopausal women, ROMA at a threshold of 13.1 (+/−2) and ADNEX at a threshold of 10% demonstrated significantly higher sensitivity compared to RMI I at 200 (p < 0.0001) 77.8 (72.5, 82.4), 94.9 (92.5, 96.6), and 57.1% (50.6 to 63.4) but lower specificity (p < 0.002), 92.5 (90.0, 94.4), 84.3 (81.3, 86.8), and 78.2 (75.8, 80.4). For postmenopausal women, ROMA at a threshold of 27.7 (+/−2) and AdNEX at a threshold of 10% demonstrated significantly higher sensitivity compared to RMI I at a threshold of 200 (p < 0.001) 90.4 (87.4, 92.7), 97.6 (96.2, 98.5), and 78.7 (74.3, 82.5), specificity of ROMA was comparable, whilst ADneX was lower, 85.5 (81.3, 88.9), 81.3 (76.9, 85.0) (p = 0.155), compared to RMI 55.2 (51.2, 59.1) (p < 0.001). Conclusions: In pre-menopausal women, ROMA and ADNEX offer significantly higher sensitivity but significantly decreased specificity. In post-menopausal women, ROMA demonstrates significantly higher sensitivity and comparable specificity to RMI I, ADNEX has the highest sensitivity of all models, but with significantly reduced specificity. RMI I has poor sensitivity compared to ROMA or ADNEX. Choice between ROMA and ADNEX as a replacement test will depend on cost effectiveness and resource implications.
RESUMO
BACKGROUND: Transvaginal ultrasound and serum CA125 are routinely used for differential diagnosis of pelvic adnexal mass. Use of human epididymis 4 was approved in the United States in 2011. However, there is scarcity of studies evaluating the additional value of human epididymis 4. OBJECTIVE: The objective of the study was to evaluate the performance characteristics of transvaginal ultrasound, CA125, and human epididymis 4 for differential diagnosis of ovarian cancer in postmenopausal women with adnexal masses. STUDY DESIGN: This was a cohort study nested within the screen arms of the multicenter randomized controlled trial, United Kingdom Collaborative Trial of Ovarian Cancer Screening, based in England, Wales, and Northern Ireland. In United Kingdom Collaborative Trial of Ovarian Cancer Screening, 48,230 women randomized to transvaginal ultrasound screening and 50,078 to multimodal screening (serum CA125 interpreted by Risk of Ovarian Cancer Algorithm with second line transvaginal ultrasound) underwent the first (prevalence) screen. Women with adnexal lesions and/or persistently elevated risk were clinically assessed and underwent surgery or follow-up for a median of 10.9 years. Banked samples taken within 6 months of transvaginal ultrasound from all clinically assessed women were assayed for human epididymis 4 and CA125. Area under the curve and sensitivity for diagnosing ovarian cancer of multiple penalized logistic regression models incorporating logCA125, log human epididymis 4, age, and simple ultrasound features of the adnexal mass were compared. RESULTS: Of 1590 (158 multimodal, 1432 ultrasound) women with adnexal masses, 78 were diagnosed with ovarian cancer (48 invasive epithelial ovarian, 14 type I, 34 type II; 24 borderline epithelial; 6 nonepithelial) within 1 year of scan. The area under the curve (0.893 vs 0.896; P = .453) and sensitivity (74.4% vs 75.6% ;P = .564) at fixed specificity of 90% of the model incorporating age, ultrasound, and CA125 were similar to that also including human epididymis 4. Both models had high sensitivity for invasive epithelial ovarian (89.6%) and type II (>91%) cancers. CONCLUSION: Our population cohort study suggests that human epididymis 4 adds little value to concurrent use of CA125 and transvaginal ultrasound in the differential diagnosis of adnexal masses in postmenopausal women.
Assuntos
Antígeno Ca-125/metabolismo , Carcinoma Epitelial do Ovário/diagnóstico , Proteínas de Membrana/metabolismo , Neoplasias Ovarianas/diagnóstico , Proteína 2 do Domínio Central WAP de Quatro Dissulfetos/metabolismo , Idoso , Carcinoma Epitelial do Ovário/diagnóstico por imagem , Carcinoma Epitelial do Ovário/metabolismo , Estudos de Coortes , Diagnóstico Diferencial , Feminino , Humanos , Modelos Logísticos , Pessoa de Meia-Idade , Neoplasias Ovarianas/diagnóstico por imagem , Neoplasias Ovarianas/metabolismo , Pós-Menopausa , Ensaios Clínicos Controlados Aleatórios como Assunto , Sensibilidade e Especificidade , UltrassonografiaRESUMO
The aim of diagnostic accuracy studies is to evaluate how accurately a diagnostic test can distinguish diseased from nondiseased individuals. Depending on the research question, different study designs and accuracy measures are appropriate. As the prior knowledge in the planning phase is often very limited, modifications of design aspects such as the sample size during the ongoing trial could increase the efficiency of diagnostic trials. In intervention studies, group sequential and adaptive designs are well established. Such designs are characterized by preplanned interim analyses, giving the opportunity to stop early for efficacy or futility or to modify elements of the study design. In contrast, in diagnostic accuracy studies, such flexible designs are less common, even if they are as important as for intervention studies. However, diagnostic accuracy studies have specific features, which may require adaptations of the statistical methods or may lead to specific advantages or limitations of sequential and adaptive designs. In this article, we summarize the current status of methodological research and applications of flexible designs in diagnostic accuracy research. Furthermore, we indicate and advocate future development of adaptive design methodology and their use in diagnostic accuracy trials from an interdisciplinary viewpoint. The term "interdisciplinary viewpoint" describes the collaboration of experts of the academic and nonacademic research.
Assuntos
Futilidade Médica , Projetos de Pesquisa , Humanos , Tamanho da AmostraRESUMO
When assessing changes in glomerular filtration rate (GFR) it is important to differentiate pathological change from intrinsic biological and analytical variation. GFR is measured using complex reference methods (e.g., iohexol clearance). In clinical practice measurement of creatinine and cystatin C are used in the Modification of Diet in Renal Disease [MDRD] or Chronic Kidney Disease Epidemiology Collaboration [CKD-EPI] equations to provide estimated GFR. Here we studied the biological variability of measured and estimated GFR in twenty nephrology outpatients (10 male, 10 female; median age 71, range 50-80 years) with moderate CKD (GFR 30-59 ml/min per 1.73 m2). Patients underwent weekly GFR measurement by iohexol clearance over four consecutive weeks. Simultaneously, GFR was estimated using the MDRD, CKD-EPIcreatinine, CKD-EPIcystatinC and CKD-EPIcreatinine+cystatinC equations. Within-subject biological variation expressed as a percentage [95% confidence interval] for the MDRD (5.0% [4.3-6.1]), CKD-EPIcreatinine (5.3% [4.5-6.4]), CKD-EPIcystatinC (5.3% [4.5-6.5]), and CKD-EPIcreatinine+cystatinC (5.0% [4.3-6.2]) equations were broadly equivalent. The within-subject biological variation for MDRD and CKD- EPIcreatinine+cystatinC estimated GFR were each significantly lower than that of the measured GFR (6.7% [5.6-8.2]). Reference change values, the point at which a true change in a biomarker in an individual can be inferred to have occurred with 95% probability were calculated. By the MDRD equation, positive and negative reference change values were 15.1% and 13.1% respectively. If an individual's baseline MDRD estimated GFR (ml/min per 1.73 m2) was 59, significant increases or decreases would be to values over 68 or under 51 respectively. Within-subject variability of estimated GFR was lower than measured GFR. Reference change values can be used to understand GFR changes in clinical practice. Thus, estimates of GFR are at least as reliable as measured GFR for monitoring patients over time.
Assuntos
Taxa de Filtração Glomerular , Insuficiência Renal Crônica/fisiopatologia , Idoso , Idoso de 80 Anos ou mais , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Padrões de ReferênciaRESUMO
Background: Studies evaluating the impact of Xpert MTB/RIF testing for tuberculosis (TB) have demonstrated varied effects on health outcomes with many studies showing inconclusive results. We explored perceptions among diverse stakeholders about studies evaluating the impact of TB diagnostic tests, and identified suggestions for improving these studies. Methods: We used purposive sampling with consideration for differing expertise and geographical balance and conducted in depth semi-structured interviews. We interviewed English-speaking participants, including TB patients, and others involved in research, care or decision-making about TB diagnostics. We used the thematic approach to code and analyse the interview transcripts. Results: We interviewed 31 participants. Our study showed that stakeholders had different expectations with regard to test impact and how it is measured. TB test impact studies were perceived to be important for supporting implementation of tests but there were concerns about the unrealistic expectations placed on tests to improve outcomes in health systems with many influencing factors. To improve TB test impact studies, respondents suggested conducting health system assessments prior to the study; developing clear guidance on the study methodology and interpretation; improving study design by describing questions and interventions that consider the influences of the health-care ecosystem on the diagnostic test; selecting the target population at the health-care level most likely to benefit from the test; setting realistic targets for effect sizes in the sample size calculations; and interpreting study results carefully and avoiding categorisation and interpretation of results based on statistical significance alone. Researchers should involve multiple stakeholders in the design of studies. Advocating for more funding to support robust studies is essential. Conclusion: TB test impact studies were perceived to be important to support implementation of tests but there were concerns about their complexity. Process evaluations of their health system context and guidance for their design and interpretation are recommended.
RESUMO
Background: Most studies evaluating the effect of Xpert MTB/RIF testing for tuberculosis (TB) concluded that it did not reduce overall mortality compared to usual care. We conducted a systematic review to assess whether key study design and execution features contributed to earlier identification of patients with TB and decreased pre-treatment loss to follow-up, thereby reducing the potential impact of Xpert MTB/RIF testing. Methods: We searched the Cochrane Central Register of Controlled Trials (CENTRAL), MEDLINE, and Scopus for literature published from 1 st January 2009 to February 2019. We included all primary intervention studies that had evaluated the effect of Xpert MTB/RIF on mortality compared to usual care in participants with presumptive pulmonary TB. We critically reviewed features of included studies across: Study setting and context, Study population, Participant recruitment and enrolment, Study procedures, and Study follow-up. Results: We included seven randomised and one non-randomised study. All included studies demonstrated relative reductions in overall mortality in the Xpert MTB/RIF arm ranging from 6% to 40%. However, mortality reduction was reported to be statistically significant in two studies. Study features that could explain the lack of observed effect on mortality included: the higher quality of care at study sites; inclusion of patients with a higher pre-test probability of TB leading to higher than expected empirical rates; performance of additional diagnostic testing not done in usual care leading to increased TB diagnosis or empiric treatment initiation; the recruitment of participants likely to return for follow-up; and involvement of study staff in ensuring adherence with care and follow-up. Conclusion: Most studies of Xpert MTB/RIF were designed and conducted in a manner that resulted in more patients being diagnosed and treated for TB, minimising the potential difference in mortality Xpert MTB/RIF testing could have achieved compared to usual care.
RESUMO
BACKGROUND AND AIMS: Eustachian tube dysfunction (ETD) is a commonly diagnosed disorder of Eustachian tube opening and closure, which may be associated with severe symptoms and middle ear disease. Currently the diagnosis of obstructive and patulous forms of ETD is primarily based on non-specific symptoms or examination findings, rather than measurement of the underlying function of the Eustachian tube. This has proved problematic when selecting patients for treatment, and when designing trial inclusion criteria and outcomes. This study aims to determine the correlation and diagnostic value of various tests of ET opening and patient reported outcome measures (PROMs), in order to generate a recommended diagnostic pathway for ETD. METHODS: Index tests included two PROMs and 14 tests of ET opening (nine for obstructive, five for patulous ETD). In the absence of an accepted reference standard two methods were adopted to establish index test accuracy: expert panel diagnosis and latent class analysis. Index test results were assessed with Pearson correlation and principle component analysis, and test accuracy was determined. Logistic regression models assessed the predictive value of grouped test results. RESULTS: The expert panel diagnosis and PROMs results correlated with each other, but not with ET function measured by tests of ET opening. All index tests were found to be feasible in clinic, and acceptable to patients. PROMs had very poor specificity, and no diagnostic value. Combining the results of tests of ET function appeared beneficial. The latent class model suggested tympanometry, sonotubometry and tubomanometry have the best diagnostic performance for obstructive ETD, and these are included in a proposed diagnostic pathway. CONCLUSIONS: ETD should be diagnosed on the basis of clinical assessment and tests of ET opening, as PROMs have no diagnostic value. Currently diagnostic uncertainty exists for some patients who appear to have intermittent ETD clinically, but have negative index test results.
Assuntos
Otopatias/diagnóstico , Tuba Auditiva/fisiopatologia , Testes Auditivos/métodos , Adulto , Idoso , Feminino , Testes Auditivos/normas , Humanos , Modelos Logísticos , Masculino , Pessoa de Meia-Idade , Razão de Chances , Medidas de Resultados Relatados pelo Paciente , Valor Preditivo dos Testes , Análise de Componente Principal , Padrões de ReferênciaRESUMO
BACKGROUND: Pancreatic cancer diagnosis and staging can be difficult in 10-20% of patients. Positron emission tomography (PET)/computed tomography (CT) adds precise anatomical localisation to functional data. The use of PET/CT may add further value to the diagnosis and staging of pancreatic cancer. OBJECTIVE: To determine the incremental diagnostic accuracy and impact of PET/CT in addition to standard diagnostic work-up in patients with suspected pancreatic cancer. DESIGN: A multicentre prospective diagnostic accuracy and clinical value study of PET/CT in suspected pancreatic malignancy. PARTICIPANTS: Patients with suspected pancreatic malignancy. INTERVENTIONS: All patients to undergo PET/CT following standard diagnostic work-up. MAIN OUTCOME MEASURES: The primary outcome was the incremental diagnostic value of PET/CT in addition to standard diagnostic work-up with multidetector computed tomography (MDCT). Secondary outcomes were (1) changes in patients' diagnosis, staging and management as a result of PET/CT; (2) changes in the costs and effectiveness of patient management as a result of PET/CT; (3) the incremental diagnostic value of PET/CT in chronic pancreatitis; (4) the identification of groups of patients who would benefit most from PET/CT; and (5) the incremental diagnostic value of PET/CT in other pancreatic tumours. RESULTS: Between 2011 and 2013, 589 patients with suspected pancreatic cancer underwent MDCT and PET/CT, with 550 patients having complete data and in-range PET/CT. Sensitivity and specificity for the diagnosis of pancreatic cancer were 88.5% and 70.6%, respectively, for MDCT and 92.7% and 75.8%, respectively, for PET/CT. The maximum standardised uptake value (SUVmax.) for a pancreatic cancer diagnosis was 7.5. PET/CT demonstrated a significant improvement in relative sensitivity (p = 0.01) and specificity (p = 0.023) compared with MDCT. Incremental likelihood ratios demonstrated that PET/CT significantly improved diagnostic accuracy in all scenarios (p < 0.0002). PET/CT correctly changed the staging of pancreatic cancer in 56 patients (p = 0.001). PET/CT influenced management in 250 (45%) patients. PET/CT stopped resection in 58 (20%) patients who were due to have surgery. The benefit of PET/CT was limited in patients with chronic pancreatitis or other pancreatic tumours. PET/CT was associated with a gain in quality-adjusted life-years of 0.0157 (95% confidence interval -0.0101 to 0.0430). In the base-case model PET/CT was seen to dominate MDCT alone and is thus highly likely to be cost-effective for the UK NHS. PET/CT was seen to be most cost-effective for the subgroup of patients with suspected pancreatic cancer who were thought to be resectable. CONCLUSION: PET/CT provided a significant incremental diagnostic benefit in the diagnosis of pancreatic cancer and significantly influenced the staging and management of patients. PET/CT had limited utility in chronic pancreatitis and other pancreatic tumours. PET/CT is likely to be cost-effective at current reimbursement rates for PET/CT to the UK NHS. This was not a randomised controlled trial and therefore we do not have any information from patients who would have undergone MDCT only for comparison. In addition, there were issues in estimating costs for PET/CT. Future work should evaluate the role of PET/CT in intraductal papillary mucinous neoplasm and prognosis and response to therapy in patients with pancreatic cancer. STUDY REGISTRATION: Current Controlled Trials ISRCTN73852054 and UKCRN 8166. FUNDING: The National Institute for Health Research Health Technology Assessment programme.
Assuntos
Carcinoma Ductal Pancreático/diagnóstico , Carcinoma Ductal Pancreático/patologia , Neoplasias Pancreáticas/diagnóstico , Neoplasias Pancreáticas/patologia , Tomografia por Emissão de Pósitrons combinada à Tomografia Computadorizada/economia , Tomografia por Emissão de Pósitrons combinada à Tomografia Computadorizada/métodos , Adulto , Idoso , Idoso de 80 Anos ou mais , Carcinoma Ductal Pancreático/diagnóstico por imagem , Análise Custo-Benefício , Feminino , Fluordesoxiglucose F18 , Humanos , Masculino , Pessoa de Meia-Idade , Modelos Econométricos , Tomografia Computadorizada Multidetectores/economia , Tomografia Computadorizada Multidetectores/métodos , Estadiamento de Neoplasias , Neoplasias Pancreáticas/diagnóstico por imagem , Neoplasias Pancreáticas/terapia , Pancreatite Crônica/diagnóstico , Pancreatite Crônica/patologia , Estudos Prospectivos , Anos de Vida Ajustados por Qualidade de Vida , Sensibilidade e Especificidade , Medicina Estatal , Reino Unido , Adulto JovemRESUMO
BACKGROUND: Detection and treatment of heart failure (HF) can improve quality of life and reduce premature mortality. However, symptoms such as breathlessness are common in primary care, have a variety of causes and not all patients require cardiac imaging. In systems where healthcare resources are limited, ensuring those patients who are likely to have HF undergo appropriate and timely investigation is vital. DESIGN: A decision tree was developed to assess the cost-effectiveness of using the MICE (Male, Infarction, Crepitations, Edema) decision rule compared to other diagnostic strategies to identify HF patients presenting to primary care. METHODS: Data from REFER (REFer for EchocaRdiogram), a HF diagnostic accuracy study, was used to determine which patients received the correct diagnosis decision. The model adopted a UK National Health Service (NHS) perspective. RESULTS: The current recommended National Institute for Health and Care Excellence (NICE) guidelines for identifying patients with HF was the most cost-effective option with a cost of £4400 per quality adjusted life year (QALY) gained compared to a "do nothing" strategy. That is, patients presenting with symptoms suggestive of HF should be referred straight for echocardiography if they had a history of myocardial infarction or if their NT-proBNP level was ≥400pg/ml. The MICE rule was more expensive and less effective than the other comparators. Base-case results were robust to sensitivity analyses. CONCLUSIONS: This represents the first cost-utility analysis comparing HF diagnostic strategies for symptomatic patients. Current guidelines in England were the most cost-effective option for identifying patients for confirmatory HF diagnosis. The low number of HF with Reduced Ejection Fraction patients (12%) in the REFER patient population limited the benefits of early detection.