Pesquisa | BVS Integralidade em Saúde

1.

Combined Assessment of Function and Survival to Demonstrate the Effect of Treatment on Progressive Supranuclear Palsy.

Germani, Massimiliano; Rebollo Mesa, Irene; Buchanan, Tim J; De Bruyn, Steven; Gasalla, Teresa; Van Tricht, Hans Lieve G; Ewen, Colin; Golbe, Lawrence I; Boxer, Adam; Höglinger, Günter.

Mov Disord ; 2024 Oct 29.

Artigo em Inglês | MEDLINE | ID: mdl-39470015

RESUMO

BACKGROUND: Progressive supranuclear palsy (PSP) is a rare and fatal neurodegenerative disorder for which there are currently no disease-modifying treatments. Recent trials of potential therapies had durations of 12 months, which may be insufficient because of nonrandom missingness due to death. Longer durations, incorporating PSP Rating Scale and survival, can reduce the potential for type II error. Selecting efficacy measures more sensitive to disease modification may facilitate identification of treatment effect. OBJECTIVE: The objective of this study was to evaluate the simulated phase 3 PSP trial assessing the effect of disease-modifying intervention on a novel combined primary endpoint comprising function (PSP Rating Scale) and survival, the Combined Assessment of Function and Survival (CAFS), and to determine operating characteristics of the CAFS. METHODS: To simulate PSP progression in the trial population, we developed models of PSP Rating Scale and survival using data from published clinical studies. These models were used to define operating characteristics of the CAFS for use in a phase 3 trial. RESULTS: The sample size determined (N = 384; 1:1 randomization) would provide >80% power to detect significant treatment effects on the CAFS compared with placebo. The CAFS provides good operating characteristics and increased power to detect moderate treatment effects on the PSP Rating Scale. We propose a trial design allowing potential detection of treatment effects at a preplanned interim analysis after participants complete 12 months of treatment, with assessment of effects of treatment (≤24 months) on survival. CONCLUSIONS: Use of the CAFS could provide a comprehensive and robust estimate of the clinical benefit of future therapies. © 2024 UCB. Movement Disorders published by Wiley Periodicals LLC on behalf of International Parkinson and Movement Disorder Society.

2.

Quantitative Comparisons of Progressive Supranuclear Palsy Rating Scale Versions Using Item Response Theory.

Gewily, Mohamed; Plan, Elodie L; Yousefi, Elham; König, Franz; Posch, Martin; Hopfner, Franziska; Höglinger, Günter; Karlsson, Mats O.

Mov Disord ; 2024 Aug 29.

Artigo em Inglês | MEDLINE | ID: mdl-39206961

RESUMO

BACKGROUND: Progressive supranuclear palsy (PSP) is a neurodegenerative, late-onset disease that is challenging in terms of assessment. The Progressive Supranuclear Palsy Rating Scale (PSPRS), a 28-item clinician-reported scale, is the most established clinical outcome assessment method. Recently, the U.S. Food and Drug Administration (FDA) has proposed a subscale of 10 items as an alternative to full PSPRS. OBJECTIVES: To quantitatively evaluate and compare the properties of full PSPRS and the FDA subscale using item response theory. To develop a progression model of the disease and assess relative merits of study designs and analysis options. METHODS: Data of 979 patients from four interventional trials and two registries were available for analysis. Our investigation was divided into: (1) estimating informativeness of the 28 items; (2) estimating disease progression; and (3) comparing the scales, trial designs, and analysis options with respect to power to detect a clinically relevant treatment effect. RESULTS: PSPRS item scores had a low pairwise correlation (r = 0.17 ± 0.14) and the items irritability, sleep difficulty, and postural tremor were uncorrelated with the other items. The FDA-selected items displayed higher correlation (r = 0.35 ± 0.14) and were the basis for a longitudinal item response model including disease progression. Trial simulations indicated that identification of a disease-modifying treatment effect required less than half the study size if the analysis was based on longitudinal item information compared with total scores at end-of-treatment. CONCLUSION: A longitudinal item response model based on the FDA-selected PSPRS items is a promising tool in evaluating treatments for PSP. © 2024 The Author(s). Movement Disorders published by Wiley Periodicals LLC on behalf of International Parkinson and Movement Disorder Society.

3.

Differential Item Functioning and Clinical Utility of the Subjective Memory Complaints Questionnaire in a Multi-Ethnic Cohort.

Ochoa Lopez, Andrea Pilar; Garcia, Joshua M; Williams, Michael W; Medina, Luis D.

Dement Geriatr Cogn Disord ; : 1-11, 2024 Sep 24.

Artigo em Inglês | MEDLINE | ID: mdl-39317176

RESUMO

INTRODUCTION: This study evaluated the psychometric properties of the Subjective Memory Complaints Questionnaire (SMCQ) in a non-Hispanic White (NHW) and Mexican American (MA) sample from Texas in the USA. METHODS: Data were obtained from the Health and Aging Brain Study - Health Disparities (HABS-HD; N = 1,691, age = 66.5 ± 8.7, education = 12.4 ± 4.8, 60.6% female, 33.2% MA Spanish speaking). Unidimensionality of the SMCQ was evaluated with confirmatory factor analysis. Differential item functioning (DIF) of the SMCQ was assessed across age, sex, education, and ethnicity/language using item response theory/logistic ordinal regression. Associations of the SMCQ in relation to cognitive status, Alzheimer's disease (AD) blood-based biomarkers, and psychological distress were examined. RESULTS: The SMCQ showed excellent fit in a single-factor model (CFI = 0.97, TLI = 0.97, RMSEA [95% CI] = 0.05 [0.04, 0.05], SRMR = 0.07). Significant item-level DIF was detected by education level and ethnicity/language, but not by age or sex; when detected, DIF was not salient (i.e., adverse). The SMCQ was associated with greater psychological distress, worse Clinical Dementia Rating scores, and greater disease burden as measured by total tau and neurofilament light. CONCLUSIONS: Practically negligible item-level bias was identified across education and ethnicity/language. Detected DIF can be described as benign, indicating that some items manifested differently between groups but had minimal impact on measurement properties. These results demonstrate that the SMCQ performs appropriately across demographic variables. Our findings also provide support for the associations of SMCQ scores with self-reported mood, cognitive status, and AD blood-based biomarkers.

4.

Validation of China Health-Related Outcomes Measures-Cardiovascular Disease.

Li, Xue; Li, Rui; Xiao, Feiyi; Zhao, Kun; Zhang, Xiaolu; Wang, Xinyi; Li, Meichen; Guo, Ke; Wang, Li; Wu, Yanan; Van Spall, Harriette; Gao, Tiantian; Fu, Qiang; Xie, Feng.

Value Health ; 27(4): 490-499, 2024 04.

Artigo em Inglês | MEDLINE | ID: mdl-38244982

RESUMO

OBJECTIVES: China Health-Related Outcomes Measures (CHROME) was an initiative aimed at developing a system of preference-based health-related quality of life instruments for China. CHROME-cardiovascular disease (CVD) is a CVD-specific instrument with 14 items developed under this initiative. This study aimed to test the psychometric properties of CHROME-CVD. METHODS: This validation study was conducted using cross-sectional questionnaire survey in China. Eligible patients with CVD were recruited and asked to complete the CHROME-CVD, the EQ-5D-5L, and a CVD-specific nonpreference-based health-related quality of life instrument selected according to the confirmed diagnosis of the patients. Item evaluation, internal consistency, measurement invariance, test-retest reliability, structural validity, and construct validity were tested using classic test theory. Item response theory was used to evaluate item-level performance. RESULTS: A total of 444 patients with CVD (coronary artery disease, n = 276, heart failure, n = 104, angina, n = 33, and atrial fibrillation, n = 16) from 6 provinces in China were enrolled for the validation. Exploratory factor analysis identified 4 factors: chest pain, other symptoms, physical health, and mental and social health. Cronbach's alpha and intraclass correlation coefficient were >0.8. A total of 20 of 26 (76.9%), and 90 of 95 (94.7%) predefined hypotheses were met for convergent and discriminant validities, respectively. No important difference was identified between subgroups of gender and residency. Response options of 10 items were found overlapped based on categorical response curves, which led to modification to 4-level response options. Wording of 3 items were modified by referring wordings of reference instruments. CONCLUSION: The validation of the CHROME-CVD demonstrated generally good psychometric properties. Further validation on the modified CHROME-CVD is needed.

Assuntos

Doenças Cardiovasculares , Qualidade de Vida , Humanos , Doenças Cardiovasculares/diagnóstico , Doenças Cardiovasculares/epidemiologia , Reprodutibilidade dos Testes , Estudos Transversais , Inquéritos e Questionários , Psicometria , China/epidemiologia

5.

Exploring Effects of Race and Differential Item Functioning on PrEP Stigma Experiences: Implications for Latent Stigma Measurement in Racially and Sexually Diverse Populations.

Maragh-Bass, Allysha C; Siegler, Aaron J; Mayer, Kenneth H; Dulin, Akilah; Sales, Jessica; Stegmueller, Daniel.

AIDS Behav ; 2024 Oct 01.

Artigo em Inglês | MEDLINE | ID: mdl-39352601

RESUMO

PrEP stigma measurement remains a challenge to the validity of studies and interventions addressing HIV prevention. It may lead to inaccurate assessment of the relationship between PrEP stigma and health outcomes such as PrEP persistence and care retention in groups experiencing HIV-related inequities. The present research explored the psychometric properties of a novel IV pre-exposure prophylaxis (PrEP) stigma scale in a cohort of racially diverse men who have sex with men (MSM). Using item response theory, analyses explored presence of differential item functioning (DIF) among Black and White respondents. Participants completed baseline surveys measuring psychosocial factors, sociodemographic factors, and PrEP stigma items. The primary analysis used a machine learning approach to assess (a) the presence of DIF; and (b) compare latent stigma between Black and White respondents, after correcting for any DIF. The model identified four out of 13 scale items as having a high probability of DIF for Black respondents, which is relatively good given that the original PrEP stigma scale was neither designed nor tested for validation comparing Black and White respondents. The DIF-adjusted latent PrEP stigma measure reveals statistically and substantially significantly higher levels of stigma for Black compared to White respondents (Diff.: 1.05 +/- 0.19). While most items performed well, findings demonstrate the importance of assessing measurement error in populations where stigma is rampant and being studied or intervened upon (and in this case, where multilevel and intersectional stigma may be present).

6.

The Starkstein Apathy Scale-Italian Version: An Update.

Ilardi, Ciro R; Sannino, Maria; Federico, Giovanni; Cirillo, Mara A; Cavaliere, Carlo; Iavarone, Alessandro; Garofalo, Elisabetta.

J Geriatr Psychiatry Neurol ; 37(5): 379-386, 2024 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-38233366

RESUMO

Apathy can manifest in various neuropsychiatric conditions, as well as in individuals who experience significant stressful life events or suffer from underlying internal medical conditions. The Starkstein Apathy Scale (SAS) is recognized as a reliable screening tool, besides being endorsed by the International Parkinson and Movement Disorder Society to assess apathy in patients with Parkinson's disease. Recently, the Italian version of this scale (SAS-I) has been introduced. Furthermore, normative data have been provided on a large sample of Italian healthy individuals. Here we present the official Italian translation of the SAS, along with clarifications regarding its administration. Also, we supply details concerning the scale's factorial structure, inter-item conditional associations and item performance by using EFA, Network analysis, and IRT modelling for polytomous items.

Assuntos

Apatia , Doença de Parkinson , Humanos , Itália , Masculino , Feminino , Doença de Parkinson/psicologia , Idoso , Escalas de Graduação Psiquiátrica/normas , Psicometria/normas , Psicometria/instrumentação , Pessoa de Meia-Idade , Reprodutibilidade dos Testes , Adulto , Análise Fatorial

7.

Evaluating the quality of life impact of recurrent urinary tract infection: Validation and refinement of the Recurrent UTI Impact Questionnaire (RUTIIQ).

Newlands, Abigail F; Kramer, Melissa; Roberts, Lindsey; Maxwell, Kayleigh; Price, Jessica L; Finlay, Katherine A.

Neurourol Urodyn ; 43(4): 902-914, 2024 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-38385648

RESUMO

BACKGROUND AND AIMS: Recurrent urinary tract infection (rUTI) has significant negative consequences for a wide variety of quality of life (QoL) domains. Without adequate validation and assessment of the unique insights of people living with rUTI, clinical results cannot be fully understood. The Recurrent UTI Impact Questionnaire (RUTIIQ), a novel patient-reported outcome measure of rUTI psychosocial impact, has been robustly developed with extensive patient and clinician input to facilitate enhanced rUTI management and research. This study aimed to confirm the structural validity of the RUTIIQ, assessing its strength and bifactor model fit. METHODS: A sample of 389 adults experiencing rUTI (96.9% female, aged 18-87 years) completed an online cross-sectional survey comprising a demographic questionnaire and the RUTIIQ. A bifactor graded response model was fitted to the data, optimizing the questionnaire structure based on item fit, discrimination capability, local dependence, and differential item functioning. RESULTS: The final RUTIIQ demonstrated excellent bifactor model fit (RMSEA = 0.054, CFI = 0.99, SRMSR = 0.052), and mean-square fit indices indicated that all included items were productive for measurement (MNSQ = 0.52-1.41). The final questionnaire comprised an 18-item general "rUTI QoL impact" factor, and five subfactor domains measuring "personal wellbeing" (three items), "social wellbeing" (four items), "work and activity interference" (four items), "patient satisfaction" (four items), and "sexual wellbeing" (three items). Together, the general factor and five subfactors explained 81.6% of the common model variance. All factor loadings were greater than 0.30 and communalities greater than 0.60, indicating good model fit and structural validity. CONCLUSIONS: The 18-item RUTIIQ is a robust, patient-tested questionnaire with excellent psychometric properties, which capably assesses the patient experience of rUTI-related impact to QoL and healthcare satisfaction. Facilitating standardized patient monitoring and improved shared decision-making, the RUTIIQ delivers the unique opportunity to improve patient-centered care.

Assuntos

Qualidade de Vida , Infecções Urinárias , Adulto , Humanos , Feminino , Masculino , Qualidade de Vida/psicologia , Estudos Transversais , Inquéritos e Questionários , Medidas de Resultados Relatados pelo Paciente , Psicometria/métodos , Reprodutibilidade dos Testes

8.

Development and validation of the patient-reported outcome for older people living with HIV/AIDS in China (PROHIV-OLD).

Zhou, Rui; Zheng, Ying-Jing; Wang, Bei-Jia; Patrick, Donald L; Edwards, Todd C; Yun, Jing-Yi; Zhou, Jie; Gu, Ren-Jun; Miao, Bing-Hui; Wang, Hong-Mei.

Health Qual Life Outcomes ; 22(1): 30, 2024 Apr 01.

Artigo em Inglês | MEDLINE | ID: mdl-38561752

RESUMO

BACKGROUND: The involvement of quality of life as the UNAIDS fourth 90 target to monitor the global HIV response highlighted the development of patient-reported outcome (PRO) measures to help address the holistic needs of people living with HIV/AIDS (PLWHA) beyond viral suppression. This study developed and tested preliminary measurement properties of a new patient-reported outcome (PROHIV-OLD) measure designed specifically to capture influences of HIV on patients aged 50 and older in China. METHODS: Ninety-three older people living with HIV/AIDS (PLWHA) were interviewed to solicit items and two rounds of patient cognitive interviews were conducted to modify the content and wording of the initial items. A validation study was then conducted to refine the initial instrument and evaluate measurement properties. Patients were recruited between February 2021 and November 2021, and followed six months later after the first investigation. Classical test theory (CTT) and item response theory (IRT) were used to select items using the baseline data. The follow-up data were used to evaluate the measurement properties of the final instrument. RESULTS: A total of 600 patients were recruited at the baseline. Of the 485 patients who completed the follow-up investigation, 483 were included in the validation sample. The final scale of PROHIV-OLD contained 25 items describing five dimensions (physical symptoms, mental status, illness perception, family relationship, and treatment). All the PROHIV-OLD dimensions had satisfactory reliability with Cronbach's alpha coefficient, McDonald's ω, and composite reliability of each dimension being all higher than 0.85. Most dimensions met the test-retest reliability standard except for the physical symptoms dimension (ICC = 0.64). Confirmatory factor analysis supported the structural validity of the final scale, and the model fit index satisfied the criterion. The correlations between dimensions of PROHIV-OLD and MOS-HIV met hypotheses in general. Significant differences on scores of the PROHIV-OLD were found between demographic and clinical subgroups, supporting known-groups validity. CONCLUSIONS: The PROHIV-OLD was found to have good feasibility, reliability and validity for evaluating health outcome of Chinese older PLWHA. Other measurement properties such as responsiveness and interpretability will be further examined.

Assuntos

Síndrome da Imunodeficiência Adquirida , Qualidade de Vida , Humanos , Pessoa de Meia-Idade , Idoso , Qualidade de Vida/psicologia , Inquéritos e Questionários , Reprodutibilidade dos Testes , Medidas de Resultados Relatados pelo Paciente , China , Psicometria/métodos

9.

Psychometric properties of computerized adaptive testing for chronic obstructive pulmonary disease patient-reported outcome measurement.

Wang, Jiajia; Xie, Yang; Feng, Zhenzhen; Li, Jiansheng.

Health Qual Life Outcomes ; 22(1): 73, 2024 Sep 04.

Artigo em Inglês | MEDLINE | ID: mdl-39227972

RESUMO

BACKGROUND: Computerized adaptive testing (CAT) is an effective way to reduce time, repetitious redundancy, and respond burden, and has been used to measure outcomes in many diseases. This study aimed to develop and validate a comprehensive disease-specific CAT for chronic obstructive pulmonary disease (COPD) patient-reported outcome measurement. METHODS: The discrimination and difficulty of the items from the modified patient-reported outcome scale for COPD (mCOPD-PRO) were analyzed using item response theory. Then the initial item, item selection method, ability estimation method, and stopping criteria were further set based on Concerto platform to form the CAT. Finally, the reliability and validity were validated. RESULTS: The item discrimination ranged from 1.05 to 2.71, and the item difficulty ranged from - 3.08 to 3.65. The measurement reliability of the CAT ranged from 0.910 to 0.922 using random method, while that ranged from 0.910 to 0.924 using maximum Fisher information (MFI) method. The content validity was good. The correlation coefficient between theta of the CAT and COPD assessment test and modified Medical Research Council dyspnea scale scores using random method was 0.628 and 0.540 (P < 0.001; P < 0.001) respectively, while that using MFI method was 0.347 and 0.328 (P = 0.007; P = 0.010) respectively. About 11 items (reducing by 59.3%) on average were tested using random method, while about seven items (reducing by 74.1%) on average using MFI method. The correlation coefficient between theta of the CAT and mCOPD-PRO total scores using random method was 0.919 (P < 0.001), while that using MFI method was 0.760 (P < 0.001). CONCLUSIONS: The comprehensive disease-specific CAT for COPD patient-reported outcome measurement is well developed with good psychometric properties, which can provide an efficient, accurate, and user-friendly measurement for patient-reported outcome of COPD.

Assuntos

Medidas de Resultados Relatados pelo Paciente , Psicometria , Doença Pulmonar Obstrutiva Crônica , Humanos , Doença Pulmonar Obstrutiva Crônica/psicologia , Masculino , Feminino , Reprodutibilidade dos Testes , Pessoa de Meia-Idade , Idoso , Inquéritos e Questionários/normas , Qualidade de Vida

10.

A scoping review to create a framework for the steps in developing condition-specific preference-based instruments de novo or from an existing non-preference-based instrument: use of item response theory or Rasch analysis.

Tsui, Teresa C O; Torres, Sofia C; Bielecki, Joanna M; Mitsakakis, Nicholas; Trudeau, Maureen E; Bremner, Karen E; Davis, Aileen M; Krahn, Murray D.

Health Qual Life Outcomes ; 22(1): 38, 2024 May 14.

Artigo em Inglês | MEDLINE | ID: mdl-38745165

RESUMO

BACKGROUND: There is no widely accepted framework to guide the development of condition-specific preference-based instruments (CSPBIs) that includes both de novo and from existing non-preference-based instruments. The purpose of this study was to address this gap by reviewing the published literature on CSPBIs, with particular attention to the application of item response theory (IRT) and Rasch analysis in their development. METHODS: A scoping review of the literature covering the concepts of all phases of CSPBI development and evaluation was performed from MEDLINE, Embase, PsychInfo, CINAHL, and the Cochrane Library, from inception to December 30, 2022. RESULTS: The titles and abstracts of 1,967 unique references were reviewed. After retrieving and reviewing 154 full-text articles, data were extracted from 109 articles, representing 41 CSPBIs covering 21 diseases or conditions. The development of CSPBIs was conceptualized as a 15-step framework, covering four phases: 1) develop initial questionnaire items (when no suitable non-preference-based instrument exists), 2) establish the dimensional structure, 3) reduce items per dimension, 4) value and model health state utilities. Thirty-nine instruments used a type of Rasch model and two instruments used IRT models in phase 3. CONCLUSION: We present an expanded framework that outlines the development of CSPBIs, both from existing non-preference-based instruments and de novo when no suitable non-preference-based instrument exists, using IRT and Rasch analysis. For items that fit the Rasch model, developers selected one item per dimension and explored item response level reduction. This framework will guide researchers who are developing or assessing CSPBIs.

Assuntos

Psicometria , Humanos , Inquéritos e Questionários/normas , Preferência do Paciente , Qualidade de Vida

11.

The Patient Activation Measure-13 (PAM-13) in an oncology patient population: psychometric properties and dimensionality evaluation.

Roesel, Inka; Froehlich, Daniela; Joos, Stefanie; Valentini, Jan; Mauch, Holger; Martus, Peter.

Health Qual Life Outcomes ; 22(1): 39, 2024 May 20.

Artigo em Inglês | MEDLINE | ID: mdl-38764032

RESUMO

BACKGROUND: Accurate assessment and enhancement of health-related skills among oncology patients are pivotal for optimizing cancer care. The Patient Activation Measure (PAM-13), a questionnaire designed to reflect an individual's knowledge, skills, and confidence in self-healthcare management, has been validated across diverse countries and settings. Concerns have been raised regarding the cross-situational applicability, as patients with specific diseases and cultural backgrounds interpret questionnaire items differently. This study aimed to examine the structural validity and psychometric properties of the PAM-13 in an oncological patient cohort. METHODS: Baseline data from a longitudinal non-randomized controlled study involving cancer out-patients (n = 1,125) from Comprehensive Cancer Centres in Southern Germany were analysed. The German version of the PAM-13 was employed. With classical test and item response theory methods data quality, reliability, convergent and structural validity, as well as psychometric properties were assessed. Exploratory (EFA) and confirmatory factor analyses (CFA) were employed to investigate the postulated unidimensionality of the underlying construct. With a partial credit model (PCM) we examined item fit, targeting, local independence and differential item functioning. RESULTS: Participants were predominantly female (73.0%) with a breast cancer diagnosis (41.3%). While items were generally well-accepted, ceiling effects were observed and a high mean PAM-13 score (69.7, SD = 14.2) was noted, potentially compromising responsiveness to interventions. Reliability was adequate (Cronbach's α = 0.81), person and item separation reliability were good to excellent (0.81 and 0.99, respectively). Explorations of the unidimensionality of the construct (EFA, CFA, PCM) yielded inconclusive results, hinting towards a two-factor solution. Item difficulty rankings deviated from the original. No differential item functioning was identified, and local independence was confirmed. CONCLUSIONS: While the PAM-13 serves as a valuable instrument for comprehending and promoting health-related skills in cancer patients, the identification of ceiling effects, disordered item-difficulty rankings, and inconclusive findings regarding unidimensionality contribute to the expanding body of evidence, emphasizing the dependency of PAM-13's validity and reliability on distinctive characteristics within the population under investigation. Future research should prioritize refining or adding PAM-13 items to better capture the specific health-related challenges within diverse populations, paving the way for more effective patient engagement strategies in oncology. TRIAL REGISTRATION NUMBER: DRKS00021779.

Assuntos

Neoplasias , Participação do Paciente , Psicometria , Humanos , Feminino , Masculino , Pessoa de Meia-Idade , Alemanha , Reprodutibilidade dos Testes , Inquéritos e Questionários/normas , Neoplasias/psicologia , Idoso , Adulto , Participação do Paciente/psicologia , Análise Fatorial , Estudos Longitudinais , Autocuidado/psicologia

12.

Validation of the cognitive section of the Penn computerized adaptive test for neurocognitive and clinical psychopathology assessment (CAT-CCNB).

Di Sandro, Akira; Moore, Tyler M; Zoupou, Eirini; Kennedy, Kelly P; Lopez, Katherine C; Ruparel, Kosha; Njokweni, Lucky J; Rush, Sage; Daryoush, Tarlan; Franco, Olivia; Gorgone, Alesandra; Savino, Andrew; Didier, Paige; Wolf, Daniel H; Calkins, Monica E; Cobb Scott, J; Gur, Raquel E; Gur, Ruben C.

Brain Cogn ; 174: 106117, 2024 02.

Artigo em Inglês | MEDLINE | ID: mdl-38128447

RESUMO

BACKGROUND: The Penn Computerized Neurocognitive Battery is an efficient tool for assessing brain-behavior domains, and its efficiency was augmented via computerized adaptive testing (CAT). This battery requires validation in a separate sample to establish psychometric properties. METHODS: In a mixed community/clinical sample of N = 307 18-to-35-year-olds, we tested the relationships of the CAT tests with the full-form tests. We compared discriminability among recruitment groups (psychosis, mood, control) and examined how their scores relate to demographics. CAT-Full relationships were evaluated based on a minimum inter-test correlation of 0.70 or an inter-test correlation within at least 0.10 of the full-form correlation with a previous administration of the full battery. Differences in criterion relationships were tested via mixed models. RESULTS: Most tests (15/17) met the minimum criteria for replacing the full-form with the updated CAT version (mean r = 0.67; range = 0.53-0.80) when compared to relationships of the full-forms with previous administrations of the full-forms (mean r = 0.68; range = 0.50-0.85). Most (16/17) CAT-based relationships with diagnostics and other validity criteria were indistinguishable (interaction p > 0.05) from their full-form counterparts. CONCLUSIONS: The updated CNB shows psychometric properties acceptable for research. The full-forms of some tests should be retained due to insufficient time savings to justify the loss in precision.

Assuntos

Teste Adaptativo Computadorizado , Transtornos Mentais , Humanos , Encéfalo , Psicometria , Cognição , Reprodutibilidade dos Testes

13.

Pre-natal and early life lead exposure and childhood inhibitory control: an item response theory approach to improve measurement precision of inhibitory control.

Liu, Shelley H; Chen, Yitong; Bellinger, David; de Water, Erik; Horton, Megan; Téllez-Rojo, Martha M; Wright, Robert.

Environ Health ; 23(1): 71, 2024 Sep 05.

Artigo em Inglês | MEDLINE | ID: mdl-39232724

RESUMO

BACKGROUND: Neurodevelopmental performance tasks are often separately analyzed, even when they tap into a similar construct. This may yield mixed findings for associations of an exposure-neurobehavioral outcome. We develop an item response theory (IRT) approach to integrate multiple task variables together to improve measurement precision of the underlying construct. We apply this approach to create an integrative measure of childhood inhibitory control, and study impacts of pre/post-natal lead exposure. METHODS: Using data from a prospective cohort based in Mexico (N = 533), we created an inhibitory control scale that integrates accuracy and reaction time information from four inhibitory control tasks (Go/NoGo Letter, Go/NoGo Neutral, Go/NoGo Happy, Delis-Kaplan Executive Function System (D-KEFS) Color-Word Interference Test, Condition 3). Using a generalized partial credit item response theory model, we estimated an inhibitory control index for each participant. We then assessed adjusted associations between umbilical cord blood and 4-year lead and childhood inhibitory control. We developed a resampling approach to incorporate error estimates from the inhibitory control variable to confirm the consistency of the lead-inhibitory control associations. We modeled time-varying associations of lead with each inhibitory control measure separately. RESULTS: Participants had a median age of 9 years; 51.4% were males. Umbilical cord blood [-0.06 (95% CI: -0.11, -0.01)] and 4-year lead [-0.07 (95% CI: -0.12, -0.02)] were associated with inhibitory control index at 8-10 years. A resampling approach confirmed that 4-year lead was consistently associated with childhood inhibitory control index. Umbilical cord blood and 4-year lead were each associated with 3 out of 8 measures in separate models. CONCLUSION: This is the first application of IRT in environmental epidemiology to create a latent variable for inhibitory control that integrates accuracy and reaction time information from multiple, related tasks. This framework can be applied to other correlated neurobehavioral assessments or other phenotype data.

Assuntos

Função Executiva , Inibição Psicológica , Chumbo , Humanos , Chumbo/sangue , Masculino , Feminino , México , Pré-Escolar , Gravidez , Efeitos Tardios da Exposição Pré-Natal , Poluentes Ambientais/sangue , Estudos Prospectivos , Criança , Exposição Ambiental/análise

14.

Psychometric evaluation of the WHOQOL-BREF and its shorter versions for general Thai population: confirmatory factor analysis and Rasch analysis.

Kangwanrattanakul, Krittaphas; Krägeloh, Christian U.

Qual Life Res ; 33(2): 335-348, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-37906345

RESUMO

PURPOSE: Rasch analysis was employed to validate the World Health Organization Quality of Life-BREF (WHOQOL-BREF) and its existing shorter versions in the general Thai population. METHODS: 1200 respondents were randomly selected to complete the questionnaire. Confirmatory factor analysis (CFA) was employed to test the structure of the WHOQOL-BREF and its shorter versions with the random sub-sample of 900 respondents, while Rasch analysis was performed with a random sub-sample of 300 respondents. RESULTS: The CFA confirmed the factor structure of WHOQOL-BREF and its shorter versions. The Rasch analysis revealed that the WHOQOL-BREF, when a four-domain structure was tested using a subtest approach, achieved acceptable model fit to the Rasch model and met the expectations of unidimensionality with high reliability (PSI = 0.87). Individual domain models were also unidimensional, but reliability of the 3-item social domain was inadequate. While the 8-item EUROHIS-QOL-8 and 5-item WHOQOL-5 achieved an overall acceptable fit and met the expectations of unidimensionality, the reliability of the WHOQOL-5 was below the acceptable threshold (PSI = 0.66). Reliability of the EUROHIS-QOL-8 was satisfactory (PSI = 0.79). CONCLUSIONS: The WHOQOL-BREF is a valid instrument for use in the Thai general population, both as a total score as well as individual subscales. Rasch analysis also supports the use of EUROHIS-QOL-8, but the WHOQOL-5 lacks good reliability. While the reliability of the EUROHIS-QOL-8 is sufficiently high for between-group analysis, the Thai WHOQOL-BREF total score can also be used for within-participant analyses. Rasch investigation with a more varied health conditions of general Thai samples or patient groups is encouraged for future studies.

Assuntos

Qualidade de Vida , Humanos , Psicometria/métodos , Qualidade de Vida/psicologia , Reprodutibilidade dos Testes , Tailândia , Inquéritos e Questionários , Organização Mundial da Saúde , Análise Fatorial

15.

A random item effects generalized partial credit model with a multiple imputation-based scoring procedure.

Huang, Sijia; Chung, Seungwon; Cai, Li.

Qual Life Res ; 33(3): 637-651, 2024 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-37950818

RESUMO

PURPOSE: Random item effects item response theory (IRT) models have received much attention for more than a decade. However, more research is needed on random item effects IRT models for polytomous data. Additionally, to improve the utility of this new class of IRT models, the scoring issue must be addressed. METHODS: We proposed a new random item effects generalized partial credit model (GPCM), which considers both random person and random item and category-specific effects. In addition, we introduced a multiple imputation (MI)-based scoring procedure that applies to various random item effects IRT models. To evaluate the proposed model and scoring procedure, we analyzed data from a Quality of Life (QoL) scale for the Chronically Mentally III and conducted a preliminary simulation study. RESULTS: In the empirical data analysis, we found that patient scores generated based on the proposed model and scoring procedure were almost identical to those obtained through the conventional GPCM and scoring method. However, the standard errors (SEs) associated with the scores were slightly larger when the proposed approach was utilized. In the simulation study, we observed adequate recovery of the model parameters and patient scores. CONCLUSION: The proposed model and MI-based scoring procedure contribute to the literature. The proposed model substantially reduces the number of free parameters in comparison to a conventional GPCM, which can be desired when sample sizes are small, e.g., special populations. In addition, the MI-based scoring procedure addresses the scoring issue and can be easily extended for scoring with other random item effects IRT models.

Assuntos

Qualidade de Vida , Projetos de Pesquisa , Humanos , Qualidade de Vida/psicologia , Tamanho da Amostra , Simulação por Computador , Inquéritos e Questionários

16.

When better is the enemy of good: two cautionary tales of conceptual validity versus parsimony in clinical psychometric research.

Schwartz, Carolyn E; Borowiec, Katrina; Rapkin, Bruce D.

Qual Life Res ; 33(6): 1493-1500, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38457054

RESUMO

This paper presents an empirical challenge to the assumption that an item-response theory analysis always yields a better measure of a clinical construct. We summarize results from two measurement development studies that showed that such an analysis lost important content reflecting the conceptual model ("conceptual validity"). The cost of parsimony may thus be too high. Conceptual models that form the foundation of QOL measurement reflect the patient's experience. This experience may include concepts and items that are psychometrically "redundant" but capture distinct features of the concept. Good measurement is likely a balance between relying on IRT's quantitative metrics and recognizing the importance of conceptual validity and clinical utility.

Assuntos

Psicometria , Qualidade de Vida , Humanos , Reprodutibilidade dos Testes , Inquéritos e Questionários/normas

17.

Measuring patient activation: the utility of the Patient Activation Measure administered in an interview setting.

Holter, Magdalena; Avian, Alexander; Weger, Martin; Strini, Sanja; Michelitsch, Monja; Brenk-Franz, Katja; Wedrich, Andreas; Berghold, Andrea.

Qual Life Res ; 33(5): 1389-1400, 2024 May.

Artigo em Inglês | MEDLINE | ID: mdl-38388807

RESUMO

BACKGROUND: Patient activation is an emerging field in healthcare research concerning knowledge, skills, and confidence of patients in managing their health. This is particularly important for patients with chronic diseases, who often require more complex care management and self-care skills. However, due to temporary or longer-lasting visual impairments, certain patient groups cannot answer a questionnaire independently. The main objective is to investigate the psychometric properties of the German Patient Activation Measure® (PAM) survey in an everyday clinical setting where it has to be read aloud. METHODS: Outpatients with macular edema participated in this questionnaire-based cross-sectional study. The study assessed patient activation by the PAM® survey, self-rated health, self-efficacy, quality of life, and general mood. Interviewers read questionnaires aloud to patients. Psychometric properties of the PAM® survey were investigated by item response theory (IRT), Cronbach's α and trait-trait correlations. RESULTS: The analysis included N = 554 patients. Median age was 69 (IQR 62.0-76.0) years and mean overall activation score 74.1 (SD 13.7). All items showed ceiling effects. Empirical reliability from the IRT model and Cronbach's α were 0.75. The PAM® survey showed a Spearman correlation of 0.54 with self-efficacy, 0.51 with quality of life and 0.34 with general mood. CONCLUSION: The read-aloud PAM® survey has been shown to provide to adequate measurement precision and convergent validity to be used as a screening tool in an everyday clinical setting. Objective assessment in an interview setting with the PAM® survey is possible. PAM® items are good in distinguishing lower to middle activated patients, but not patients with high activation. Further, issues with structural validity need more investigation.

Assuntos

Participação do Paciente , Psicometria , Qualidade de Vida , Humanos , Masculino , Feminino , Pessoa de Meia-Idade , Idoso , Inquéritos e Questionários/normas , Estudos Transversais , Reprodutibilidade dos Testes , Participação do Paciente/psicologia , Edema Macular/psicologia , Alemanha , Autoeficácia , Entrevistas como Assunto , Autocuidado

18.

Introduction and validation of the open symptom framework: a public domain modular framework for patient-reported measurement of symptoms related to cancer and its treatment.

Gibbons, C; Brown, G; Lu, S C; Elrick, A; Tang, Y; Kaufman, M; Williams, M; Xu, C; Harrison, C; Swisher, C.

Qual Life Res ; 33(9): 2349-2358, 2024 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-39023734

RESUMO

PURPOSE: We provide an initial description and validation of some public domain patient-reported outcome (PRO) items to assess cancer symptom burden to address immediate barriers to symptom assessment use in clinical practice and facilitate future research. METHODS: We created the Open Symptom Framework (OSF), a flexible tool for clinical cancer-related symptom assessment. The items comprise six components: recall period, concept, symptom, qualifier(s), a definition, and a 5-point Likert-type response. We recruited patients receiving cancer therapy in the United States and United Kingdom. We assessed external construct validity by comparing OSF scores to the PRO-CTCAE measure and assessed reliability, scalability, dimensionality, and item ordering within a non-parametric item response theory framework. We tested differential item functioning for country, age, gender, and level of education. RESULTS: We developed a framework alongside clinical and psychometric experts and debrieifed with 10 patients. For validation, we recruited 331patients. All items correlated with the PRO-CTCAE equivalents (r = 0.55-0.96, all p < 0.01). Mokken analysis confirmed the scalability and unidimensionality of all symptom scales with multiple items at the scale (Ho = 0.61-0.75) and item level (Hi = 0.60-0.76). Items are interpreted consistently between demographic groups (Crit = 0 for all groups). CONCLUSION: The public domain OSF has excellent psychometric properties including face, content, and criterion validity and can facilitate the development of flexible, robust measurements to fulfil stakeholder need. The OSF was designed specifically to support clinical assessment but will function well for research. Further work is planned to increase the number of symptoms and number of questions per symptom within the framework.

Assuntos

Neoplasias , Medidas de Resultados Relatados pelo Paciente , Psicometria , Humanos , Neoplasias/psicologia , Masculino , Feminino , Pessoa de Meia-Idade , Reino Unido , Reprodutibilidade dos Testes , Idoso , Estados Unidos , Adulto , Inquéritos e Questionários , Avaliação de Sintomas , Qualidade de Vida

19.

Estimating anchor-based minimal important change using longitudinal confirmatory factor analysis.

Terluin, Berend; Trigg, Andrew; Fromy, Piper; Schuller, Wouter; Terwee, Caroline B; Bjorner, Jakob B.

Qual Life Res ; 33(4): 963-973, 2024 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-38151593

RESUMO

PURPOSE: The minimal important change (MIC) is defined as the smallest within-individual change in a patient-reported outcome measure (PROM) that patients on average perceive as important. We describe a method to estimate this value based on longitudinal confirmatory factor analysis (LCFA). The method is evaluated and compared with a recently published method based on longitudinal item response theory (LIRT) in simulated and real data. We also examined the effect of sample size on bias and precision of the estimate. METHODS: We simulated 108 samples with various characteristics in which the true MIC was simulated as the mean of individual MICs, and estimated MICs based on LCFA and LIRT. Additionally, both MICs were estimated in existing PROMIS Pain Behavior data from 909 patients. In another set of 3888 simulated samples with sample sizes of 125, 250, 500, and 1000, we estimated LCFA-based MICs. RESULTS: The MIC was equally well recovered with the LCFA-method as using the LIRT-method, but the LCFA analyses were more than 50 times faster. In the Pain Behavior data (with higher scores indicating more pain behavior), an LCFA-based MIC for improvement was estimated to be 2.85 points (on a simple sum scale ranging 14-42), whereas the LIRT-based MIC was estimated to be 2.60. The sample size simulations showed that smaller sample sizes decreased the precision of the LCFA-based MIC and increased the risk of model non-convergence. CONCLUSION: The MIC can accurately be estimated using LCFA, but sample sizes need to be preferably greater than 125.

Assuntos

Medidas de Resultados Relatados pelo Paciente , Qualidade de Vida , Humanos , Qualidade de Vida/psicologia , Dor

20.

Scoring the EQ-HWB-S: can we do it without value sets? A non-parametric item response theory analysis.

Feng, You-Shan; Kohlmann, Thomas; Peasgood, Tessa; Engel, Lidia; Mulhern, Brendan; Pickard, A Simon.

Qual Life Res ; 33(5): 1211-1222, 2024 May.

Artigo em Inglês | MEDLINE | ID: mdl-38381281

RESUMO

BACKGROUND: Only one pilot value set (UK) is currently available for the EQ Health and Wellbeing Instrument short version (EQ-HWB-S). As an alternative to preference-weighted scoring, we examined whether a level summary score (LSS) is appropriate for the EQ-HWB-S using Mokken scaling analyses. METHODS: Data from patients, carers and the general population collected during the developmental phase of the EQ-HWB-S in Australia, US and UK were used, noting 3 of 9 items have since undergone revision. EQ-HWB-S data fit was examined using R package Mokken scaling's monotone homogeneity model, utilizing the automated item selection procedure (AISP) as well as Loevinger's scaling coefficients for items and the scale (HS). Manifest monotonicity was assessed by examining whether the cumulative probability for responses at or above each response level did not decrease across the summary score. RESULTS: EQ-HWB-S data were available for 3340 respondents: US = 903, Australia = 514 and UK = 1923. Mean age was 50 ± 18 and 1841 (55%) were female. AISP placed all 9 items of the EQ-HWB-S on a single scale when the lower bound was set to < 0.448. Strong scalability (HS = 0.561) was found for the EQ-HWB-S as a single scale. Stronger scales were formed by separating the psychosocial items (n = 6, HS = 0.683) and physical sensation items (n = 3, HS = 0.713). No violations of monotonicity were found except for the items mobility and daily activities for the subgroups with long-term conditions and UK subjects, respectively. DISCUSSION: As EQ-HWB-S items formed a strong scale and subscales based on Mokken analysis, LSS is a promising weighting-free approach to scoring.

Assuntos

Psicometria , Qualidade de Vida , Humanos , Feminino , Masculino , Inquéritos e Questionários , Pessoa de Meia-Idade , Austrália , Adulto , Reino Unido , Idoso , Estados Unidos

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

Detalhe da pesquisa