RESUMO
PURPOSE: Although comprehensive and widespread guidelines on how to conduct systematic reviews of outcome measurement instruments (OMIs) exist, for example from the COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) initiative, key information is often missing in published reports. This article describes the development of an extension of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guideline: PRISMA-COSMIN for OMIs 2024. METHODS: The development process followed the Enhancing the QUAlity and Transparency Of health Research (EQUATOR) guidelines and included a literature search, expert consultations, a Delphi study, a hybrid workgroup meeting, pilot testing, and an end-of-project meeting, with integrated patient/public involvement. RESULTS: From the literature and expert consultation, 49 potentially relevant reporting items were identified. Round 1 of the Delphi study was completed by 103 panelists, whereas round 2 and 3 were completed by 78 panelists. After 3 rounds, agreement (≥ 67%) on inclusion and wording was reached for 44 items. Eleven items without consensus for inclusion and/or wording were discussed at a workgroup meeting attended by 24 participants. Agreement was reached for the inclusion and wording of 10 items, and the deletion of 1 item. Pilot testing with 65 authors of OMI systematic reviews further improved the guideline through minor changes in wording and structure, finalized during the end-of-project meeting. The final checklist to facilitate the reporting of full systematic review reports contains 54 (sub)items addressing the review's title, abstract, plain language summary, open science, introduction, methods, results, and discussion. Thirteen items pertaining to the title and abstract are also included in a separate abstract checklist, guiding authors in reporting for example conference abstracts. CONCLUSION: PRISMA-COSMIN for OMIs 2024 consists of two checklists (full reports; abstracts), their corresponding explanation and elaboration documents detailing the rationale and examples for each item, and a data flow diagram. PRISMA-COSMIN for OMIs 2024 can improve the reporting of systematic reviews of OMIs, fostering their reproducibility and allowing end-users to appraise the quality of OMIs and select the most appropriate OMI for a specific application. NOTE: In order to encourage its wide dissemination this article is freely accessible on the web sites of the journals: Health and Quality of Life Outcomes; Journal of Clinical Epidemiology; Journal of Patient-Reported Outcomes; Quality of Life Research.
Assuntos
Técnica Delphi , Avaliação de Resultados em Cuidados de Saúde , Revisões Sistemáticas como Assunto , Humanos , Guias como Assunto , Lista de Checagem , Projetos de Pesquisa/normas , ConsensoRESUMO
PURPOSE: The minimal important change (MIC) is defined as the smallest within-individual change in a patient-reported outcome measure (PROM) that patients on average perceive as important. We describe a method to estimate this value based on longitudinal confirmatory factor analysis (LCFA). The method is evaluated and compared with a recently published method based on longitudinal item response theory (LIRT) in simulated and real data. We also examined the effect of sample size on bias and precision of the estimate. METHODS: We simulated 108 samples with various characteristics in which the true MIC was simulated as the mean of individual MICs, and estimated MICs based on LCFA and LIRT. Additionally, both MICs were estimated in existing PROMIS Pain Behavior data from 909 patients. In another set of 3888 simulated samples with sample sizes of 125, 250, 500, and 1000, we estimated LCFA-based MICs. RESULTS: The MIC was equally well recovered with the LCFA-method as using the LIRT-method, but the LCFA analyses were more than 50 times faster. In the Pain Behavior data (with higher scores indicating more pain behavior), an LCFA-based MIC for improvement was estimated to be 2.85 points (on a simple sum scale ranging 14-42), whereas the LIRT-based MIC was estimated to be 2.60. The sample size simulations showed that smaller sample sizes decreased the precision of the LCFA-based MIC and increased the risk of model non-convergence. CONCLUSION: The MIC can accurately be estimated using LCFA, but sample sizes need to be preferably greater than 125.
Assuntos
Medidas de Resultados Relatados pelo Paciente , Qualidade de Vida , Humanos , Qualidade de Vida/psicologia , DorRESUMO
PURPOSE: The minimal important change (MIC) in a patient-reported outcome measure is often estimated using patient-reported transition ratings as anchor. However, transition ratings are often more heavily weighted by the follow-up state than by the baseline state, a phenomenon known as "present state bias" (PSB). It is unknown if and how PSB affects the estimation of MICs using various methods. METHODS: We simulated 3240 samples in which the true MIC was simulated as the mean of individual MICs, and PSB was created by basing transition ratings on a "weighted change", differentially weighting baseline and follow-up states. In each sample we estimated MICs based on the following methods: mean change (MC), receiver operating characteristic (ROC) analysis, predictive modeling (PM), adjusted predictive modeling (APM), longitudinal item response theory (LIRT), and longitudinal confirmatory factor analysis (LCFA). The latter two MICs were estimated with and without constraints on the transition item slope parameters (LIRT) or factor loadings (LCFA). RESULTS: PSB did not affect MIC estimates based on MC, ROC, and PM but these methods were biased by other factors. PSB caused imprecision in the MIC estimates based on APM, LIRT and LCFA with constraints, if the degree of PSB was substantial. However, the unconstrained LIRT- and LCFA-based MICs recovered the true MIC without bias and with high precision, independent of the degree of PSB. CONCLUSION: We recommend the unconstrained LIRT- and LCFA-based MIC methods to estimate anchor-based MICs, irrespective of the degree of PSB. The APM-method is a feasible alternative if PSB is limited.
RESUMO
PURPOSE: Systematic reviews of patient-reported outcome measures (PROMs) are important tools to select the most suitable PROM for a study or clinical application. Conducting these reviews is challenging, and the quality of these reviews needs to be improved. We updated the COSMIN guideline for systematic reviews of PROMs, including the COSMIN Risk of Bias checklist, and the COSMIN criteria for good measurement properties. METHODS: Adaptations to the methodology were based on our experience with applying the COSMIN guideline, through discussions among the authors, and results from two related Delphi studies. RESULTS: The updated guideline places more emphasis on key aspects that are often missing or sub optimally conducted in published systematic reviews of PROMs, such as formulating a well-defined research question and developing a comprehensive search strategy, assessing risk of bias, applying criteria for good measurement properties, summarizing results, and grading the quality of the evidence. We also stress the importance of evaluating the measurement properties of each subscale of a PROM separately and evaluating content validity of all included PROMs. CONCLUSION: The quality of systematic reviews of PROMs can be improved by using this updated version of the COSMIN guideline for systematic reviews of PROMs. Improved quality will lead to better PROM selection and increased standardization of PROM use.
Patient-reported outcome measures (PROMs) are questionnaires that measure aspects of health from the patient perspective. To measure a specific health aspect, often dozens of PROMs are available. To choose the best PROM, a systematic review of PROMs can be conducted, in which all information on the quality and feasibility of each available PROM is collected, rated, and compared. Based on such a review a choice for the most suitable PROM for a certain study or clinical application can be made. However, conducting a systematic review of PROMs is very challenging, because nine quality aspects of PROMs need to be taken into account.In this article, we present an updated step-by-step guideline for conducting systematic reviews of PROMs. Each of these steps is described in detail in an accompanying manual. This updated guideline helps researchers to conduct systematic reviews of PROMs in a systematic and transparent way. It also helps readers of systematic reviews to understand how the review was conducted and to check the conclusions about which PROMs are recommended based on their quality.
RESUMO
OBJECTIVE: Our objective was to explore whether the extension of the PROMIS item bank Ability to Participate in Social Roles and Activities (APSRA) with new items would result in more effective targeting (i.e., selecting items that are appropriate for each individual's trait level), and more reliable measurements across all latent trait levels. METHODS: A sample of 1,022 Dutch adults completed all 35 items of the original item bank plus 17 new items (in Dutch). The new items presented in this publication have been translated provisionally from Dutch into English for presentation purposes. We evaluated the basic IRT assumptions unidimensionality, local independence, and monotonicity. Furthermore, we examined the item parameters, and assessed differential item functioning (DIF) for sex, education, region, age, and ethnicity. In addition, we compared the test information functions, item parameters, and θ scores, for the original and extended item bank in order to assess whether the measurement range had improved. RESULTS: We found that the extended item bank was compatible with the basic IRT assumptions and showed good reliability. Moreover, the extended item bank improved the measurement in the lower trait range, which is important for reliably assessing functioning in clinical populations (i.e., persons reporting lower levels of participation). CONCLUSION: We extended the PROMIS-APSRA item bank and improved its psychometric quality. Our study contributes to PROMIS measurement innovation, which allows for the addition of new items to existing item banks, without changing the interpretation of the scores and while maintaining the comparability of the scores with other PROMIS instruments.
Assuntos
Psicometria , Humanos , Feminino , Masculino , Adulto , Pessoa de Meia-Idade , Inquéritos e Questionários/normas , Países Baixos , Reprodutibilidade dos Testes , Idoso , Participação Social/psicologia , Adulto Jovem , Qualidade de Vida , AdolescenteRESUMO
PURPOSE: Systematic reviews evaluating and comparing the measurement properties of outcome measurement instruments (OMIs) play an important role in OMI selection. Earlier overviews of review quality (2007, 2014) evidenced substantial concerns with regards to alignment to scientific standards. This overview aimed to investigate whether the quality of recent systematic reviews of OMIs lives up to the current scientific standards. METHODS: One hundred systematic reviews of OMIs published from June 1, 2021 onwards were randomly selected through a systematic literature search performed on March 17, 2022 in MEDLINE and EMBASE. The quality of systematic reviews was appraised by two independent reviewers. An updated data extraction form was informed by the earlier studies, and results were compared to these earlier studies' findings. RESULTS: A quarter of the reviews had an unclear research question or aim, and in 22% of the reviews the search strategy did not match the aim. Half of the reviews had an incomprehensive search strategy, because relevant search terms were not included. In 63% of the reviews (compared to 41% in 2014 and 30% in 2007) a risk of bias assessment was conducted. In 73% of the reviews (some) measurement properties were evaluated (58% in 2014 and 55% in 2007). In 60% of the reviews the data were (partly) synthesized (42% in 2014 and 7% in 2007); evaluation of measurement properties and data syntheses was not conducted separately for subscales in the majority. Certainty assessments of the quality of the total body of evidence were conducted in only 33% of reviews (not assessed in 2014 and 2007). The majority (58%) did not make any recommendations on which OMI (not) to use. CONCLUSION: Despite clear improvements in risk of bias assessments, measurement property evaluation and data synthesis, specifying the research question, conducting the search strategy and performing a certainty assessment remain poor. To ensure that systematic reviews of OMIs meet current scientific standards, more consistent conduct and reporting of systematic reviews of OMIs is needed.
Assuntos
Avaliação de Resultados em Cuidados de Saúde , Revisões Sistemáticas como Assunto , HumanosRESUMO
PURPOSE: Although comprehensive and widespread guidelines on how to conduct systematic reviews of outcome measurement instruments (OMIs) exist, for example from the COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) initiative, key information is often missing in published reports. This article describes the development of an extension of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guideline: PRISMA-COSMIN for OMIs 2024. METHODS: The development process followed the Enhancing the QUAlity and Transparency Of health Research (EQUATOR) guidelines and included a literature search, expert consultations, a Delphi study, a hybrid workgroup meeting, pilot testing, and an end-of-project meeting, with integrated patient/public involvement. RESULTS: From the literature and expert consultation, 49 potentially relevant reporting items were identified. Round 1 of the Delphi study was completed by 103 panelists, whereas round 2 and 3 were completed by 78 panelists. After 3 rounds, agreement (≥ 67%) on inclusion and wording was reached for 44 items. Eleven items without consensus for inclusion and/or wording were discussed at a workgroup meeting attended by 24 participants. Agreement was reached for the inclusion and wording of 10 items, and the deletion of 1 item. Pilot testing with 65 authors of OMI systematic reviews further improved the guideline through minor changes in wording and structure, finalized during the end-of-project meeting. The final checklist to facilitate the reporting of full systematic review reports contains 54 (sub)items addressing the review's title, abstract, plain language summary, open science, introduction, methods, results, and discussion. Thirteen items pertaining to the title and abstract are also included in a separate abstract checklist, guiding authors in reporting for example conference abstracts. CONCLUSION: PRISMA-COSMIN for OMIs 2024 consists of two checklists (full reports; abstracts), their corresponding explanation and elaboration documents detailing the rationale and examples for each item, and a data flow diagram. PRISMA-COSMIN for OMIs 2024 can improve the reporting of systematic reviews of OMIs, fostering their reproducibility and allowing end-users to appraise the quality of OMIs and select the most appropriate OMI for a specific application. NOTE: In order to encourage its wide dissemination this article is freely accessible on the web sites of the journals: Health and Quality of Life Outcomes; Journal of Clinical Epidemiology; Journal of Patient-Reported Outcomes; Quality of Life Research.
Assuntos
Técnica Delphi , Avaliação de Resultados em Cuidados de Saúde , Revisões Sistemáticas como Assunto , Humanos , Guias como Assunto , Projetos de Pesquisa/normas , Lista de ChecagemRESUMO
To investigate daily functioning, health-related quality of life (HRQoL), and overall quality of life (QoL) in young adult survivors of childhood bacterial meningitis (BM) thereby shedding light on very long-term outcomes and contributing to global meningitis burden estimates. In this cross-sectional study, the Weiss Functional Impairment Rating Scale-Self Report (WFIRS-S), the Patient-Reported Outcomes Measurement Information System Scale vGlobal Health v1.2 (PROMIS-GH), PROMIS-29 profile v2.01 and PROMIS Satisfaction with Social Roles and Activities Short Form 4a v2.0 instruments were administered to 483 survivors of childhood BM who participated in the 20|30 Dutch Postmeningitis study. Mean WFIRS-S scores were calculated as well as mean T-scores of the PROMIS instruments. PROMIS scores of survivors of childhood BM were compared with age-matched Dutch reference scores from the general population. Overall, survivors of childhood BM reported comparable or even better levels of daily functioning, HRQoL, and overall QoL (Global02) than the age-matched Dutch reference group. 10.2% of survivors of childhood BM reported poor mental health and 1.0% of survivors of childhood BM reported poor physical health. The average score of survivors of childhood BM for overall QoL (Global02) was 3.81 (on a scale from 1, indicating poor QoL, to 5, indicating excellent QoL). CONCLUSION: Young adult survivors of childhood BM reported similar and even better levels of daily functioning, HRQoL, and overall QoL than age-matched Dutch reference scores. However, clinicians and healthcare workers should be mindful of poor mental and physical health of some survivors of childhood BM because these survivors also face problems with functioning in daily life. WHAT IS KNOWN: ⢠Acute bacterial meningitis is a life-threatening infection which may lead to devastating lifelong disabilities. ⢠Health-related quality of life is decreased in school-age survivors of bacterial meningitis. WHAT IS NEW: ⢠The present study shows that HRQoL in survivors of childhood BM improves over time ⢠However, especially poor mental health in adolescent and young adult survivors is associated with impaired functioning in daily life.
Assuntos
Atividades Cotidianas , Meningites Bacterianas , Qualidade de Vida , Sobreviventes , Humanos , Masculino , Feminino , Estudos Transversais , Meningites Bacterianas/psicologia , Adulto Jovem , Adolescente , Adulto , Sobreviventes/psicologia , Países Baixos , Criança , AutorrelatoRESUMO
PURPOSE: This study aimed to calculate region and diagnosis-specific minimal important changes (MICs) of the Foot and Ankle Outcome Score (FAOS) and the Foot and Ankle Ability Measure (FAAM) in patients requiring foot and ankle surgery and to assess their variability across different foot and ankle diagnoses. METHODS: The study used routinely collected data from patients undergoing elective foot and ankle surgery. Patients had been invited to complete the FAOS and FAAM preoperatively and at 3-6 months after surgery, along with two anchor questions encompassing change in pain and daily function. Patients were categorised according to region of pathology and subsequent diagnoses. MICs were calculated using predictive modelling (MICPRED) and receiver operating characteristic curve (MICROC) method and evaluated according to strict credibility criteria. RESULTS: Substantial variability of the MICs between forefoot and ankle/hindfoot region was observed, as well as among specific foot and ankle diagnoses, with MICPRED and MICROC values ranging from 7.8 to 25.5 points and 9.4 to 27.8, respectively. Despite differences between MICROC and MICPRED estimates, both calculation methods exhibited largely consistent patterns of variation across subgroups, with forefoot conditions systematically showing smaller MICs than ankle/hindfoot conditions. Most MICs demonstrated high credibility; however, the majority of the MICs for the FAOS symptoms subscale and forefoot conditions exhibited insufficient or low credibility. CONCLUSION: The MICs of the FAOS and FAAM vary across foot and ankle diagnoses in patients undergoing elective foot and ankle surgery and should not be used as a universal fixed value, but recognised as contextual parameters. This can help clinicians and researchers in more accurate interpretation of the FAOS and FAAM change scores. LEVEL OF EVIDENCE: Level IV.
Assuntos
Procedimentos Cirúrgicos Eletivos , Humanos , Feminino , Masculino , Pessoa de Meia-Idade , Adulto , Pé/cirurgia , Tornozelo/cirurgia , Idoso , Articulação do Tornozelo/cirurgia , Doenças do Pé/cirurgia , Doenças do Pé/diagnóstico , Diferença Mínima Clinicamente ImportanteRESUMO
BACKGROUND: People with communication problems experience challenges in participation. Optimizing communicative participation for this population is an important outcome of speech and language therapy. Participation experiences are best assessed from the patient's perspective, using a patient-reported outcome measure (PROM). The Communicative Participation Item Bank (CPIB) has been developed to identify perceived problems in communicative participation. However, previous research has shown that the item bank does not cover all domains and concepts of communicative participation, contains general communication items, and is not suitable for all people with communication problems. AIMS: To develop a comprehensive PROM aimed at measuring communicative participation for all adults with communication problems and test its content validity. METHODS & PROCEDURES: An initial pool of 242 items was based on three sources: relevant content from an earlier literature review and concept elicitation study, and the translation of the items of the CPIB. This item pool was pilot-tested for comprehensibility and comprehensiveness using cognitive debriefing interviews in adults with different communication problems (speech, language, voice, hearing and cognitive communication disorders). This resulted in a second version of the item bank. The content validity of this version was tested in a group of professionals. The last (third) version of 133 items was tested for content validity in a second, new group of adults with various communication problems. OUTCOMES & RESULTS: The initial pool of 242 items was developed and pilot-tested in 27 adults with different communication problems. After modifications, 161 items remained, which were tested in a content validity study with 25 people with different communication problems and five professionals (in the field of speech and language therapy and speech and language research). The professionals considered 91.9% of the item bank relevant and comprehensible, and comprehensive by adding two items. The item bank was then considered relevant, comprehensible and comprehensive by the group of adults with communication problems. A total of 133 items remained in the item bank, related to the International Classification of Functioning, Disability and Health (ICF) Activities and Participation domains 'mobility', 'self-care', 'domestic life', 'interpersonal interactions and relationships', 'major life areas' and 'community, social and civic life'. CONCLUSIONS & IMPLICATIONS: MyCommunication-Adults is an item bank for measuring communicative participation in adults with communication difficulties. The next step is to test the psychometric properties of the item bank, in order to establish a final item bank. The refined description of the construct of communicative participation can already be used for participation-focused goal-setting between individuals with communication problems and speech and language therapists. WHAT THIS PAPER ADDS: What is already known on the subject The CPIB was developed to measure communicative participation in community-dwelling adults with communication problems. The item bank does not cover all domains and concepts of communicative participation, contains general communication items and not all questions seem to be relevant for specific target groups (e.g., hearing population). What this paper adds to the existing knowledge This study presents a refined description of the construct of communicative participation and the development of a new comprehensive PROM aimed at measuring this construct for all adults with communication difficulties. The development of the PROM is based on a previously conducted literature review and concept elicitation study. This study describes the development process of this new PROM, called MyCommunication-Adults, and the assessment of its content validity. What are the potential or actual clinical implications of this work? The refined description of the construct of communicative participation can be used for participation-focused goal-setting between individuals with communication problems and speech and language therapists.
RESUMO
Patient-reported outcomes (PROs) are valuable for shared decision making and research. Patient-reported outcome measures (PROMs) are questionnaires used to measure PROs, such as health-related quality of life (HRQL). Although core outcome sets for trials and clinical practice have been developed separately, they, as well as other initiatives, recommend different PROs and PROMs. In research and clinical practice, different PROMs are used (some generic, some disease-specific), which measure many different things. This is a threat to the validity of research and clinical findings in the field of diabetes. In this narrative review, we aim to provide recommendations for the selection of relevant PROs and psychometrically sound PROMs for people with diabetes for use in clinical practice and research. Based on a general conceptual framework of PROs, we suggest that relevant PROs to measure in people with diabetes are: disease-specific symptoms (e.g. worries about hypoglycaemia and diabetes distress), general symptoms (e.g. fatigue and depression), functional status, general health perceptions and overall quality of life. Generic PROMs such as the 36-Item Short Form Health Survey (SF-36), WHO Disability Assessment Schedule (WHODAS 2.0), or Patient-Reported Outcomes Measurement Information System (PROMIS) measures could be considered to measure commonly relevant PROs, supplemented with disease-specific PROMs where needed. However, none of the existing diabetes-specific PROM scales has been sufficiently validated, although the Diabetes Symptom Self-Care Inventory (DSSCI) for measuring diabetes-specific symptoms and the Diabetes Distress Scale (DDS) and Problem Areas in Diabetes (PAID) for measuring distress showed sufficient content validity. Standardisation and use of relevant PROs and psychometrically sound PROMs can help inform people with diabetes about the expected course of disease and treatment, for shared decision making, to monitor outcomes and to improve healthcare. We recommend further validation studies of diabetes-specific PROMs that have sufficient content validity for measuring disease-specific symptoms and consider generic item banks developed based on item response theory for measuring commonly relevant PROs.
Assuntos
Diabetes Mellitus , Qualidade de Vida , Humanos , Medidas de Resultados Relatados pelo Paciente , Inquéritos e Questionários , Inquéritos Epidemiológicos , Diabetes Mellitus/terapiaRESUMO
BACKGROUND: The Patient-Reported Outcomes Measurement Information System (PROMIS®) has been recommended for computerized adaptive testing (CAT) of health-related quality of life. This study compared the content, validity, and reliability of seven PROMIS CATs to the 12-item Short-Form Health Survey (SF-12) in patients with advanced chronic kidney disease. METHODS: Adult patients with chronic kidney disease and an estimated glomerular filtration rate under 30 mL/min/1.73 m2 who were not receiving dialysis treatment completed seven PROMIS CATs (assessing physical function, pain interference, fatigue, sleep disturbance, anxiety, depression, and the ability to participate in social roles and activities), the SF-12, and the PROMIS Pain Intensity single item and Dialysis Symptom Index at inclusion and 2 weeks. A content comparison was performed between PROMIS CATs and the SF-12. Construct validity of PROMIS CATs was assessed using Pearson's correlations. We assessed the test-retest reliability of all patient-reported outcome measures by calculating the intraclass correlation coefficient and minimal detectable change. RESULTS: In total, 207 patients participated in the study. A median of 45 items (10 minutes) were completed for PROMIS CATs. All PROMIS CATs showed evidence of sufficient construct validity. PROMIS CATs, most SF-12 domains and summary scores, and Dialysis Symptom Index showed sufficient test-retest reliability (intraclass correlation coefficient ≥ 0.70). PROMIS CATs had a lower minimal detectable change compared with the SF-12 (range, 5.7-7.4 compared with 11.3-21.7 across domains, respectively). CONCLUSION: PROMIS CATs showed sufficient construct validity and test-retest reliability in patients with advanced chronic kidney disease. PROMIS CATs required more items but showed better reliability than the SF-12. Future research is needed to investigate the feasibility of PROMIS CATs for routine nephrology care.
Assuntos
Qualidade de Vida , Insuficiência Renal Crônica , Humanos , Reprodutibilidade dos Testes , Teste Adaptativo Computadorizado , Inquéritos e Questionários , Diálise Renal , Medidas de Resultados Relatados pelo Paciente , Insuficiência Renal Crônica/diagnóstico , Insuficiência Renal Crônica/terapia , Sistemas de InformaçãoRESUMO
Psychometric properties of the v1.0 Patient-Reported Outcomes Measurement Information System (PROMIS®) sleep disturbance (27 items) and sleep-related impairment (SRI; 16 items) item banks, short forms derived from the item bank, and simulated computerised adaptive test (CAT), were assessed in a representative sample of 1,006 adults from the Dutch general population. For sleep disturbance all items fitted the item response theory model. Four items showed differential item functioning (i.e., lack of measurement invariance) for age and two for language but the impact on scores (expressed as T-scores) was small. Reliable scores (r > 0.90) were found for 92.2%-96.3% of respondents with the full bank, short forms with six and eight items, and CAT, but for only 25.6% with the four-item short form. For SRI two items did not fit the item response theory model. Four items showed differential item functioning for language but the impact on T-scores was small. Reliable scores were found for 82.1% with the full bank, for 47.8%-69.5% with short forms and CAT. T-scores of 49.7 and 49.3 represent the average score of the Dutch general population for sleep disturbance and SRI, respectively. In conclusion, sufficient structural validity, reliability, and cross-cultural validity was found for the full banks but short forms of four items are not reliable enough for clinical practice. For SRI we recommend the full item bank if this is the primary outcome.
Assuntos
Idioma , Sono , Humanos , Inquéritos e Questionários , Psicometria , Reprodutibilidade dos Testes , Valores de Referência , Medidas de Resultados Relatados pelo Paciente , Sistemas de Informação , Qualidade de VidaRESUMO
PURPOSE: Meaningful thresholds are needed to interpret patient-reported outcome measure (PROM) results. This paper introduces a new method, based on item response theory (IRT), to estimate such thresholds. The performance of the method is examined in simulated datasets and two real datasets, and compared with other methods. METHODS: The IRT method involves fitting an IRT model to the PROM items and an anchor item indicating the criterion state of interest. The difficulty parameter of the anchor item represents the meaningful threshold on the latent trait. The latent threshold is then linked to the corresponding expected PROM score. We simulated 4500 item response datasets to a 10-item PROM, and an anchor item. The datasets varied with respect to the mean and standard deviation of the latent trait, and the reliability of the anchor item. The real datasets consisted of a depression scale with a clinical depression diagnosis as anchor variable and a pain scale with a patient acceptable symptom state (PASS) question as anchor variable. RESULTS: The new IRT method recovered the true thresholds accurately across the simulated datasets. The other methods, except one, produced biased threshold estimates if the state prevalence was smaller or greater than 0.5. The adjusted predictive modeling method matched the new IRT method (also in the real datasets) but showed some residual bias if the prevalence was smaller than 0.3 or greater than 0.7. CONCLUSIONS: The new IRT method perfectly recovers meaningful (interpretational) thresholds for multi-item questionnaires, provided that the data satisfy the assumptions for IRT analysis.
Assuntos
Qualidade de Vida , Humanos , Reprodutibilidade dos Testes , Qualidade de Vida/psicologia , Inquéritos e Questionários , Psicometria/métodosRESUMO
PURPOSE: The animated activity questionnaire (AAQ) is a computer-based measure of activity limitations. To answer a question, patients choose the animation of a person performing an activity that matches their own level of limitation. The AAQ has not yet been tested for suitability to be applied as computer-adaptive test (CAT). Thus, the objective of this study was to develop and evaluate an AAQ-based CAT to facilitate the application of the AAQ in daily clinical care. METHODS: Patients (n = 1408) with hip/knee osteoarthritis from Brazil, Denmark, France, The Netherlands, Norway, Spain, and the UK responded to all 17 AAQ items. Assumptions of item-response theory (IRT) modelling were investigated. To establish item parameters for the CAT, a graded response model was estimated. To evaluate the performance of post-hoc simulated AAQ-based CATs, precision, test length, and construct validity (correlations with well-established measures of activity limitations) were evaluated. RESULTS: Unidimensionality (CFI = 0.95), measurement invariance (R2-change < 2%), and IRT item fit (S-X2 p > .003) of the AAQ were supported. Performing simulated CATs, the mean test length was more than halved (≤ 8 items), while the range of precise measurement (standard error ≤ 0.3) was comparable to the full AAQ. The correlations between original AAQ scores and three AAQ-CAT versions were ≥ 0.95. Correlations of AAQ-CAT scores with patient-reported and performance measures of activity limitations were ≥ 0.60. CONCLUSION: The almost non-verbal AAQ-CAT is an innovative and efficient tool in patients with hip/knee osteoarthritis from various countries, measuring activity limitations with lower respondent burden, but similar precision and construct validity compared to the full AAQ.
Assuntos
Osteoartrite do Quadril , Osteoartrite do Joelho , Humanos , Qualidade de Vida/psicologia , Inquéritos e Questionários , Países Baixos , Computadores , Reprodutibilidade dos Testes , PsicometriaRESUMO
PURPOSE: The added value of measuring patient-reported outcomes (PROs) for delivering patient-centered care and assessment of healthcare quality is increasingly evident. However, healthcare system wide data collection initiatives are hampered by the proliferation of patient-reported outcome measures (PROMs) and conflicting data collection standards. As part of a national initiative of the Dutch Ministry of Health, Welfare and Sport we developed a consensus-based standard set of generic PROs and PROMs to be implemented across Dutch medical specialist care. METHODS: A working group of mandated representatives of umbrella organizations involved in Dutch medical specialist care, together with PROM experts and patient organizations worked through a structured, consensus-driven co-creation process. This included literature reviews, online expert and working group meetings, and feedback from national patient- and umbrella organizations. The 'PROM-cycle' methodology was used to select feasible, valid, and reliable PROMs to obtain domain scores for each of the PROs included in the set. RESULTS: Eight PROs across different domains of health were ultimately endorsed: symptoms (pain & fatigue), functioning (physical, social/participation, mental [anxiety & depression]), and overarching (quality of life & perceived overall health). A limited number of generic PROMs was endorsed. PROMIS short forms were selected as the preferred instruments for all PROs. Several recommendations were formulated to facilitate healthcare system level adoption and implementation of the standard set. CONCLUSIONS: We developed a consensus-based standard set of Generic PROMs and a set of recommendations to facilitate healthcare system wide implementation across Dutch medical specialist care.
Assuntos
Assistência ao Paciente , Qualidade de Vida , Humanos , Qualidade de Vida/psicologia , Medidas de Resultados Relatados pelo Paciente , Coleta de Dados , Atenção à SaúdeRESUMO
Clinician-reported outcome measures (ClinROMs) are essential for assessment of vitiligo in clinical trials and daily practice. Several instruments have been developed and tested to measure, for example, vitiligo extent, repigmentation and activity. The goal of this review was to identify all introductory publications of ClinROMs for vitiligo that include at least some aspects of validation and to describe the instruments' characteristics, intention for use and practical strengths and limitations. A search strategy was conducted in PubMed, Embase and Cochrane Library (CENTRAL) from inception to July 2022. Based on the literature search (n = 2860), 10 articles were identified, describing 14 different ClinROMs. Six ClinRoms measured disease extent and/or repigmentation, seven evaluated disease activity and one was a composite score. The Vitiligo Area Scoring Index (VASI), and Vitiligo Extent Score (VES and VESplus) measure overall disease extent and/or repigmentation. The VASI relies on hand units (1% body surface area), whereas the VES and VESplus use a picture-based scoring technique. The Vitiligo Extent Score for a Target Area (VESTA) measures repigmentation percentage for target lesions. One global assessment score for extent has been validated. Vitiligo disease activity scores included a static measure of clinical activity signs (Vitiligo Signs of Activity Score [VSAS]) and two measures assessing dynamic evolution (Vitiligo Disease Activity Score [VDAS] and Vitiligo Disease Improvement Score [VDIS]). The Vitiligo European Task Force assessment tool (VETFa) is a composite score. Depending on the practical strengths and limitations as well as the research question and setting (clinical trials vs. daily practice), the choice of an appropriate ClinROM may differ. Fourteen ClinROMs in vitiligo were identified to measure vitiligo extent, repigmentation, and activity. Further research evaluating the validity, reliability, and responsiveness of each instrument and worldwide consensus on which instrument to use for a specific outcome (domain) is greatly needed.
Assuntos
Eritema Multiforme , Vitiligo , Humanos , Vitiligo/terapia , Vitiligo/tratamento farmacológico , Reprodutibilidade dos Testes , Projetos de Pesquisa , Medidas de Resultados Relatados pelo Paciente , Resultado do TratamentoRESUMO
BACKGROUND: The BODY-Q is a patient-reported outcome measure developed for use in bariatric and body contouring surgery. OBJECTIVES: The objective of this study was to examine the validity and reliability of the Dutch version of the BODY-Q. METHODS: The BODY-Q consists of 163 items in 21 independently functioning scales that measure appearance, health-related quality of life, and experience of care. The data used to validate the Dutch BODY-Q were provided by 2 prospective multicenter cohort studies across 3 hospitals in the Netherlands. The BODY-Q was administered before and after surgery at 3 or 4 months and 12 months. Rasch measurement theory (RMT) analysis was used to evaluate the BODY-Q for targeting, category threshold order, Rasch model fit, Person Separation Index, and differential item functioning by language (original English data vs Dutch data). RESULTS: Data were collected between January 2016 and May 2019. The study included 876 participants, who provided 1614 assessments. Validity was supported by 3 RMT findings: most scales showed good targeting, 160 out of 163 items (98.2%) evidenced ordered thresholds, and 142 out of 163 items (87.1%) fitted the RMT model. Reliability was high with Person Separation Index values >0.70 for 19 out of 21 scales. There was negligible influence of differential item functioning by language on person item locations and the scale scoring. CONCLUSIONS: This study provides evidence for the reliability and validity of the Dutch BODY-Q for use in bariatric and body contouring patients in the Netherlands. The Dutch BODY-Q can be used in (inter)national research and clinical practice.
Assuntos
Bariatria , Contorno Corporal , Humanos , Qualidade de Vida , Reprodutibilidade dos Testes , Estudos Prospectivos , Inquéritos e Questionários , Satisfação do Paciente , Idioma , PsicometriaRESUMO
Patient-Reported Outcome Measures (PROMs) are important tools to assess outcomes relevant to patients, with Health-Related Quality Of Life (HRQOL) as an important construct to be measured. Many different HRQOL PROMs are used in the type 2 diabetes field, however a complete overview of these PROMs is currently lacking. We therefore aimed to systematically describe and classify the content of all PROMs that have specifically been developed or validated to measure (aspects of) HRQOL in people with type 2 diabetes. A literature search was performed in PubMed and EMBASE until 31 December 2021. Studies on the development or validation of a PROM measuring HRQOL, or aspects of HRQOL, in people with type 2 diabetes were included. Title and abstract and full-text screening were conducted by two independent researchers and data extraction was performed independently by one of the researchers. Data were extracted on language in which the PROM was developed, target population, construct(s) being measured, names of (sub)scales and number of items per (sub)scale. In addition, all PROMs and subscales were classified according to specific aspects of HRQOL based on the Wilson & Cleary model (symptom status, functional status, general health perceptions) to aid researchers in PROM selection. In total 220 studies were identified that developed or validated PROMs that measure (aspects of) HRQOL in people with type 2 diabetes. Of the 116 unique HRQOL PROMs, 91 (of the subscales) measured symptom status, 60 measured functional status and 26 measured general health perceptions. In addition, 16 of the PROMs (subscales) measured global quality of life. 61 of the 116 PROMs (subscales) also include characteristics of the individual (e.g. aspects of personality, coping) or environment (e.g. social or financial support) and patient-reported experience measures (PREMs, e.g. measure of a patient's perception of their personal experience of the healthcare they have received, e.g. treatment satisfaction), which are not part of the HRQOL construct. Only 9 of the 116 PROMs measure all aspects of HRQOL based on the Wilson & Cleary model. Finally, 8 of the 116 PROMs stating to measure HRQOL, measured no HRQOL construct. In conclusion, a large number of PROMs are available for people with type 2 diabetes, which intend to measure (aspects of) HRQOL. These PROMs measure a large variety of (sub)constructs, which are not all HRQOL constructs, with a small amount of PROMs not measuring HRQOL at all. There is a need for consensus on which aspects of HRQOL should be measured in people with type 2 diabetes and which PROMs to use in research and daily practice. PROSPERO: CRD42017071012. COMET database: http://www.comet-initiative.org/studies/details/956 .
Assuntos
Diabetes Mellitus Tipo 2 , Qualidade de Vida , Humanos , Medidas de Resultados Relatados pelo PacienteRESUMO
PURPOSE OF REVIEW: We aimed to systematically evaluate the content validity of patient-reported outcome measures (PROMs) specifically developed to measure (aspects of) health-related quality of life (HRQOL) in people with type 2 diabetes. A systematic review was performed in PubMed and Embase of PROMs measuring perceived symptoms, physical function, mental function, social function/participation, and general health perceptions, and that were validated to at least some extent. Content validity (relevance, comprehensiveness, and comprehensibility) was evaluated using COSMIN methodology. RECENT FINDINGS: We identified 54 (different versions of) PROMs, containing 150 subscales. We found evidence for sufficient content validity for only 41/150 (27%) (subscales of) PROMs. The quality of evidence was generally very low. We found 66 out of 150 (44%) (subscales of) PROMs with evidence for either insufficient relevance, insufficient comprehensiveness, or insufficient comprehensibility. For measuring diabetes-specific symptoms, physical function, mental function, social function/participation, and general health perceptions, we identified one to 11 (subscales of) PROMs with sufficient content validity, although quality of the evidence was generally low. For measuring depressive symptoms, no PROM with sufficient content validity was identified. For each aspect of HRQL, we found at least one PROM with sufficient content validity, except for depressive symptoms. The quality of the evidence was mostly very low.