Pesquisa | BVS - MINISTÉRIO DA SAÚDE

1.

Establishing language and ethnic equivalence for health-related quality of life item banks and testing their efficiency via computerised adaptive testing simulations.

Kwan, Yu Heng; Fenwick, Eva; Leung, Ying Ying; Fong, Warren; Woon, Ting Hui; Xiang, Ling; Lamoureux, Ecosse; Thumboo, Julian.

PLoS One ; 19(2): e0298141, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38394320

RESUMO

PURPOSE: We aimed to (1) establish linguistic and ethnic equivalence (i.e. lack of bias) for the items in the English and Chinese versions of the Singapore Health and Well Being (SHAWS) Physical Functioning (PF), Positive Mindset (PM) and Social Relationship (SR) item banks (IBs); and (2) evaluate the preliminary efficiency of these IBs using Computer Adaptive Testing (CAT) simulations. METHODS: In this cross-sectional study, 671, 670, and 672 subjects answered 55, 48 and 30 items of the PF, PM, and SR IBs, respectively. Rasch analysis was conducted to assess each IB's psychometric properties, particularly the presence of differential item functioning (DIF) for language and ethnicity. A set of performance criteria related to removing items that displayed notable DIF were employed. CAT simulations determined the mean number of items for high, moderate, and moderate-low measurement precisions (stopping rule: SEM 0.300, 0.387. 0.521, respectively). RESULTS: Half of subjects were >50 years old (40.9% PF, 42.1% PM, 41.4% SR), Chinese (50.7% PF, 51.0% PM, 50.6% SR) and female (50.0% PF. 49.4% PM, 52.8% SR) respectively. Rasch analysis revealed 4 items with DIF for the PF IB, 9 items with DIF for the PM IB and 2 items with DIF for the SR IB. In CAT simulations, the mean number of items administered was 8.5, 21.6 and 14.5 for the PF, PM and SR IBs, respectively (SEM 0.300), 5.1, 13.0, 8.0 for PF, PM and SR IBs, respectively (SEM 0.387) and 3.1, 5.3 and 4.1 for PF, PM and SR IBs, respectively (SEM 0.521). CONCLUSION: The PF, PM and SR IBs to measure health-related quality of life revealed minimal DIF for language and ethnicity after remedial efforts. CAT simulations demonstrated that these IBs were efficient, especially when the stopping rule was set at moderate precision, and support the implementation of the SHAWS IBs into routine clinical care.

Assuntos

Teste Adaptativo Computadorizado , Qualidade de Vida , Humanos , Feminino , Pessoa de Meia-Idade , Estudos Transversais , Inquéritos e Questionários , Psicometria , Idioma , Reprodutibilidade dos Testes

2.

Computerized adaptive testing to screen pre-school children for emotional and behavioral problems.

Theunissen, Meinou H C; Eekhout, Iris; Reijneveld, Sijmen A.

Eur J Pediatr ; 183(4): 1777-1787, 2024 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-38252308

RESUMO

Questionnaires to detect emotional and behavioral (EB) problems in preventive child healthcare (PCH) should be short; this potentially affects their validity and reliability. Computerized adaptive testing (CAT) could overcome this weakness. The aim of this study was to (1) develop a CAT to measure EB problems among pre-school children and (2) assess the efficiency and validity of this CAT. We used a Dutch national dataset obtained from parents of pre-school children undergoing a well-child care assessment by PCH (n = 2192, response 70%). Data regarded 197 items on EB problems, based on four questionnaires, the Strengths and Difficulties Questionnaire (SDQ), the Child Behavior Checklist (CBCL), the Ages and Stages Questionnaire: Social Emotional (ASQ:SE), and the Brief Infant-Toddler Social and Emotional Assessment (BITSEA). Using 80% of the sample, we calculated item parameters necessary for a CAT and defined a cutoff for EB problems. With the remaining part of the sample, we used simulation techniques to determine the validity and efficiency of this CAT, using as criterion a total clinical score on the CBCL. Item criteria were met by 193 items. This CAT needed, on average, 16 items to identify children with EB problems. Sensitivity and specificity compared to a clinical score on the CBCL were 0.89 and 0.91, respectively, for total problems; 0.80 and 0.93 for emotional problems; and 0.94 and 0.91 for behavioral problems. Conclusion: A CAT is very promising for the identification of EB problems in pre-school children, as it seems to yield an efficient, yet high-quality identification. This conclusion should be confirmed by real-life administration of this CAT. What is Known: â¢ Studies indicate the validity of using computerized adaptive test (CAT) applications to identify emotional and behavioral problems in school-aged children. â¢ Evidence is as yet limited on whether CAT applications can also be used with pre-school children. What is New: â¢ The results of this study show that a computerized adaptive test is very promising for the identification of emotional and behavior problems in pre-school children, as it appears to yield an efficient and high-quality identification.

Assuntos

Transtornos do Comportamento Infantil , Comportamento Problema , Lactente , Criança , Humanos , Pré-Escolar , Transtornos do Comportamento Infantil/diagnóstico , Reprodutibilidade dos Testes , Teste Adaptativo Computadorizado , Emoções , Inquéritos e Questionários

3.

Item selection methods in multidimensional computerized adaptive testing for forced-choice items using Thurstonian IRT model.

Wang, Qin; Zheng, Yi; Liu, Kai; Cai, Yan; Peng, Siwei; Tu, Dongbo.

Behav Res Methods ; 56(2): 600-614, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-36750522

RESUMO

Multidimensional computerized adaptive testing for forced-choice items (MFC-CAT) combines the benefits of multidimensional forced-choice (MFC) items and computerized adaptive testing (CAT) in that it eliminates response biases and reduces administration time. Previous studies that explored designs of MFC-CAT only discussed item selection methods based on the Fisher information (FI), which is known to perform unstably at early stages of CAT. This study proposes a set of new item selection methods based on the KL information for MFC-CAT (namely MFC-KI, MFC-KB, and MFC-KLP) based on the Thurstonian IRT (TIRT) model. Three simulation studies, including one based on real data, were conducted to compare the performance of the proposed KL-based item selection methods against the existing FI-based methods in three- and five-dimensional MFC-CAT scenarios with various test lengths and inter-trait correlations. Results demonstrate that the proposed KL-based item selection methods are feasible for MFC-CAT and generate acceptable trait estimation accuracy and uniformity of item pool usage. Among the three proposed methods, MFC-KB and MFC-KLP outperformed the existing FI-based item selection methods and resulted in the most accurate trait estimation and relatively even utilization of the item pool.

Assuntos

Teste Adaptativo Computadorizado , Humanos , Simulação por Computador

4.

Validation of the cognitive section of the Penn computerized adaptive test for neurocognitive and clinical psychopathology assessment (CAT-CCNB).

Di Sandro, Akira; Moore, Tyler M; Zoupou, Eirini; Kennedy, Kelly P; Lopez, Katherine C; Ruparel, Kosha; Njokweni, Lucky J; Rush, Sage; Daryoush, Tarlan; Franco, Olivia; Gorgone, Alesandra; Savino, Andrew; Didier, Paige; Wolf, Daniel H; Calkins, Monica E; Cobb Scott, J; Gur, Raquel E; Gur, Ruben C.

Brain Cogn ; 174: 106117, 2024 02.

Artigo em Inglês | MEDLINE | ID: mdl-38128447

RESUMO

BACKGROUND: The Penn Computerized Neurocognitive Battery is an efficient tool for assessing brain-behavior domains, and its efficiency was augmented via computerized adaptive testing (CAT). This battery requires validation in a separate sample to establish psychometric properties. METHODS: In a mixed community/clinical sample of N = 307 18-to-35-year-olds, we tested the relationships of the CAT tests with the full-form tests. We compared discriminability among recruitment groups (psychosis, mood, control) and examined how their scores relate to demographics. CAT-Full relationships were evaluated based on a minimum inter-test correlation of 0.70 or an inter-test correlation within at least 0.10 of the full-form correlation with a previous administration of the full battery. Differences in criterion relationships were tested via mixed models. RESULTS: Most tests (15/17) met the minimum criteria for replacing the full-form with the updated CAT version (mean r = 0.67; range = 0.53-0.80) when compared to relationships of the full-forms with previous administrations of the full-forms (mean r = 0.68; range = 0.50-0.85). Most (16/17) CAT-based relationships with diagnostics and other validity criteria were indistinguishable (interaction p > 0.05) from their full-form counterparts. CONCLUSIONS: The updated CNB shows psychometric properties acceptable for research. The full-forms of some tests should be retained due to insufficient time savings to justify the loss in precision.

Assuntos

Teste Adaptativo Computadorizado , Transtornos Mentais , Humanos , Encéfalo , Psicometria , Cognição , Reprodutibilidade dos Testes

5.

Developing Computerized Adaptive Testing for a National Health Professionals Exam: An Attempt from Psychometric Simulations.

Xu, Lingling; Jiang, Zhehan; Han, Yuting; Liang, Haiying; Ouyang, Jinying.

Perspect Med Educ ; 12(1): 462-471, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37929203

RESUMO

Introduction: The accurate assessment of health professionals' competence is critical for ensuring public health safety and quality of care. Computerized Adaptive Testing (CAT) based on the Item Response Theory (IRT) has the potential to improve measurement accuracy and reduce respondent burden. In this study, we conducted psychometric simulations to develop a CAT for evaluating the candidates' competence of health professionals. Methods: The initial CAT item bank was sourced from the Standardized Competence Test for Clinical Medicine Undergraduates (SCTCMU), a nationwide summative test in China, consisting of 300 multiple-choice items. We randomly selected response data from 2000 Chinese clinical medicine undergraduates for analysis. Two types of analyses were performed: first, evaluating the psychometric properties of all items to meet the requirements of CAT; and second, conducting multiple CAT simulations using both simulated and real response data. Results: The final CAT item bank consisted of 121 items, for which item parameters were calculated using a two-parameter logistic model (2PLM). The CAT simulations, based on both simulated and real data, revealed sufficient marginal reliability (coefficient of marginal reliability above 0.750) and criterion-related validity (Pearson's correlations between CAT scores and aggregate scores of the SCTCMU exceeding 0.850). Discussion: In national-level medical education assessment, there is an increasing need for concise yet valid evaluations of candidates' competence of health professionals. The CAT developed in this study demonstrated satisfactory reliability and validity, offering a more efficient assessment of candidates' competence of health professionals. The psychometric properties of the CAT could lead to shorter test durations, reduced information loss, and a decreased testing burden for participants.

Assuntos

Teste Adaptativo Computadorizado , Pessoal de Saúde , Humanos , Psicometria , Reprodutibilidade dos Testes , Estudantes

6.

Development of the PREMIUM computerized adaptive testing for measuring the access and care coordination for patients with severe mental illness.

Boyer, Laurent; Fernandes, Sara; Brousse, Yann; Zendjidjian, Xavier; Cano, Delphine; Riedberger, Jeremie; Llorca, Pierre-Michel; Samalin, Ludovic; Dassa, Daniel; Trichard, Christian; Laprevote, Vincent; Sauvaget, Anne; Abbar, Mocrane; Misdrahi, David; Berna, Fabrice; Lancon, Christophe; Coulon, Nathalie; El-Hage, Wissam; Rozier, Pierre-Emmanuel; Benoit, Michel; Giordana, Bruno; Caqueo-Urizar, Alejandra; Yon, Dong Keon; Tran, Bach; Auquier, Pascal; Fond, Guillaume.

Psychiatry Res ; 328: 115444, 2023 10.

Artigo em Inglês | MEDLINE | ID: mdl-37677894

RESUMO

Severe mental illness (SMI) patients often have complex health needs, which makes it difficult to access and coordinate their care. This study aimed to develop a computerized adaptive testing (CAT) tool, PREMIUM CAT-ACC, to measure SMI patients' experience with access and care coordination. This multicenter and cross-sectional study included 496 adult in- and out-patients with SMI (i.e., schizophrenia, bipolar disorder, or major depressive disorder). Psychometric analysis of the 13-item bank showed adequate properties, with preliminary evidence of external validity and no substantial differential item functioning for sex, age, care setting, and diagnosis, making it suitable for CAT administration. A post-hoc CAT simulation demonstrated that the tool was efficient and accurate, with an average of seven items, compared to the full item bank administration. Its use by clinicians can contribute to optimizing patient care pathways and transitioning towards more person-centered healthcare.

Assuntos

Transtorno Bipolar , Transtorno Depressivo Maior , Transtornos Mentais , Adulto , Humanos , Estudos Transversais , Teste Adaptativo Computadorizado , Transtornos Mentais/diagnóstico , Transtornos Mentais/terapia , Transtorno Bipolar/diagnóstico , Psicometria

7.

Applying computerized adaptive testing to the Desires for Speed Questionnaire in the Chinese population: A simulation study.

Liu, Siyang; Guo, Xiaorong; Ma, Shihao; Chen, Qian; Wang, Weijun.

Psychol Assess ; 35(9): 740-750, 2023 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-37470987

RESUMO

The current methods for measuring patient-reported outcomes for amphetamine (speed) craving have limitation ability to adapt to the needs of individual patients while maintaining consistency in their scores. This study aimed to investigate whether the 40-item Desires for Speed Questionnaire (DSQ) could be improved for assessing clinical subjects using computerized adaptive testing (CAT). A sample of 677 participants from four drug addiction treatment centers in China was utilized in the study. Two types of analysis were conducted using the response data. First, the psychometric properties of all items were evaluated to meet the requirements of CAT. Second, multiple CAT simulations were carried out using real response data. The results indicated that the CAT method, which only required a small number of items (50%-75%), produced results that were only slightly different from the full DSQ assessment in terms of measuring amphetamine craving and criterion validity. In conclusion, this study suggests that developing a DSQ CAT for clinical subjects is useful as it leads to more efficient measurement without compromising the reliability of the test outcomes. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

Assuntos

Teste Adaptativo Computadorizado , População do Leste Asiático , Humanos , Simulação por Computador , Psicometria/métodos , Reprodutibilidade dos Testes , Inquéritos e Questionários , Anfetamina , Medidas de Resultados Relatados pelo Paciente , Transtornos Relacionados ao Uso de Anfetaminas/diagnóstico , Transtornos Relacionados ao Uso de Anfetaminas/psicologia , Fissura , China

8.

Applying multidimensional computerized adaptive testing to the MSQOL-54: a simulation study.

Giordano, Andrea; Testa, Silvia; Bassi, Marta; Cilia, Sabina; Bertolotto, Antonio; Quartuccio, Maria Esmeralda; Pietrolongo, Erika; Falautano, Monica; Grobberio, Monica; Niccolai, Claudia; Allegri, Beatrice; Viterbo, Rosa Gemma; Confalonieri, Paolo; Giovannetti, Ambra Mara; Cocco, Eleonora; Grasso, Maria Grazia; Lugaresi, Alessandra; Ferriani, Elisa; Nocentini, Ugo; Zaffaroni, Mauro; De Livera, Alysha; Jelinek, George; Solari, Alessandra; Rosato, Rosalba.

Health Qual Life Outcomes ; 21(1): 61, 2023 Jun 25.

Artigo em Inglês | MEDLINE | ID: mdl-37357308

RESUMO

BACKGROUND: The Multiple Sclerosis Quality of Life-54 (MSQOL-54) is one of the most commonly-used MS-specific health-related quality of life (HRQOL) measures. It is a multidimensional, MS-specific HRQOL inventory, which includes the generic SF-36 core items, supplemented with 18 MS-targeted items. Availability of an adaptive short version providing immediate item scoring may improve instrument usability and validity. However, multidimensional computerized adaptive testing (MCAT) has not been previously applied to MSQOL-54 items. We thus aimed to apply MCAT to the MSQOL-54 and assess its performance. METHODS: Responses from a large international sample of 3669 MS patients were assessed. We calibrated 52 (of the 54) items using bifactor graded response model (10 group factors and one general HRQOL factor). Then, eight simulations were run with different termination criteria: standard errors (SE) for the general factor and group factors set to different values, and change in factor estimates from one item to the next set at < 0.01 for both the general and the group factors. Performance of the MCAT was assessed by the number of administered items, root mean square difference (RMSD), and correlation. RESULTS: Eight items were removed due to local dependency. The simulation with SE set to 0.32 (general factor), and no SE thresholds (group factors) provided satisfactory performance: the median number of administered items was 24, RMSD was 0.32, and correlation was 0.94. CONCLUSIONS: Compared to the full-length MSQOL-54, the simulated MCAT required fewer items without losing precision for the general HRQOL factor. Further work is needed to add/integrate/revise MSQOL-54 items in order to make the calibration and MCAT performance efficient also on group factors, so that the MCAT version may be used in clinical practice and research.

Assuntos

Teste Adaptativo Computadorizado , Esclerose Múltipla , Qualidade de Vida , Teste Adaptativo Computadorizado/métodos , Simulação por Computador , Esclerose Múltipla/diagnóstico , Inquéritos e Questionários , Humanos , Adolescente , Adulto , Pessoa de Meia-Idade , Idoso , Idoso de 80 Anos ou mais , Psicometria

9.

Feasibility of PROMIS using computerized adaptive testing during inpatient rehabilitation.

Rafiq, Riyad Bin; Yount, Susan; Jerousek, Sara; Roth, Elliot J; Cella, David; Albert, Mark V; Heinemann, Allen W.

J Patient Rep Outcomes ; 7(1): 44, 2023 05 10.

Artigo em Inglês | MEDLINE | ID: mdl-37162607

RESUMO

BACKGROUND: There has been an increased significance on patient-reported outcomes in clinical settings. We aimed to evaluate the feasibility of administering patient-reported outcome measures by computerized adaptive testing (CAT) using a tablet computer with rehabilitation inpatients, assess workload demands on staff, and estimate the extent to which rehabilitation inpatients have elevated T-scores on six Patient Reported Outcomes Measurement Information System® (PROMIS®) measures. METHODS: Patients (N = 108) with stroke, spinal cord injury, traumatic brain injury, and other neurological disorders participated in this study. PROMIS computerized adaptive tests (CAT) were administered via a web-based platform. Summary scores were calculated for six measures: Pain Interference, Sleep Disruption, Anxiety, Depression, Illness Impact Positive, and Illness Impact Negative. We calculated the percent of patients with T-scores equivalent to 2 standard deviations or greater above the mean. RESULTS: During the first phase, we collected data from 19 of 49 patients; of the remainder, 61% were not available or had cognitive or expressive language impairments. In the second phase of the study, 40 of 59 patients participated to complete the assessment. The mean PROMIS T-scores were in the low 50 s, indicating an average symptom level, but 19-31% of patients had elevated T-scores where the patients needed clinical action. CONCLUSIONS: The study demonstrated that PROMIS assessment using a CAT administration during an inpatient rehabilitation setting is feasible with the presence of a research staff member to complete PROMIS assessment.

Assuntos

Teste Adaptativo Computadorizado , Pacientes Internados , Humanos , Estudos de Viabilidade , Dor/psicologia

10.

Computerized adaptive testing for the patient evaluation measure (PEM) in patients undergoing cubital tunnel syndrome surgery.

Teunissen, Joris S; Hovius, Steven E R; Ulrich, Dietmar J O; Issa, Fadi; Rodrigues, Jeremy N; Harrison, Conrad J.

J Hand Surg Eur Vol ; 48(10): 1042-1047, 2023 11.

Artigo em Inglês | MEDLINE | ID: mdl-37066610

RESUMO

In outcome measures, item response theory (IRT) validation can deliver interval-scaled high-quality measurement that can be harnessed using computerized adaptive tests (CATs) to pose fewer questions to patients. We aimed to develop a CAT by developing an IRT model for the Patient Evaluation Measure (PEM) for patients undergoing cubital tunnel syndrome (CuTS) surgery. Nine hundred and seventy-nine completed PEM responses of patients with CuTS in the United Kingdom Hand Registry were used to develop and calibrate the CAT. Its performance was then evaluated in a simulated cohort of 1000 patients. The CAT reduced the original PEM length from ten to a median of two questions (range two to four), while preserving a high level of precision (median standard error of measurement of 0.27). The mean error between the CAT score and full-length score was 0.08%. A Bland-Altman analysis showed good agreement with no signs of bias. The CAT version of the PEM can substantially reduce patient burden while enhancing construct validity by harnessing IRT for patients undergoing CuTS surgery.

Assuntos

Síndrome do Túnel Ulnar , Humanos , Síndrome do Túnel Ulnar/diagnóstico , Síndrome do Túnel Ulnar/cirurgia , Teste Adaptativo Computadorizado , Inquéritos e Questionários , Avaliação de Resultados em Cuidados de Saúde , Extremidade Superior

11.

The Development, Deployment, and Evaluation of the CLEFT-Q Computerized Adaptive Test: A Multimethods Approach Contributing to Personalized, Person-Centered Health Assessments in Plastic Surgery.

Harrison, Conrad; Apon, Inge; Ardouin, Kenny; Sidey-Gibbons, Chris; Klassen, Anne; Cano, Stefan; Wong Riff, Karen; Pusic, Andrea; Versnel, Sarah; Koudstaal, Maarten; Allori, Alexander C; Rogers-Vizena, Carolyn; Swan, Marc C; Furniss, Dominic; Rodrigues, Jeremy.

J Med Internet Res ; 25: e41870, 2023 04 27.

Artigo em Inglês | MEDLINE | ID: mdl-37104031

RESUMO

BACKGROUND: Routine use of patient-reported outcome measures (PROMs) and computerized adaptive tests (CATs) may improve care in a range of surgical conditions. However, most available CATs are neither condition-specific nor coproduced with patients and lack clinically relevant score interpretation. Recently, a PROM called the CLEFT-Q has been developed for use in the treatment of cleft lip or palate (CL/P), but the assessment burden may be limiting its uptake into clinical practice. OBJECTIVE: We aimed to develop a CAT for the CLEFT-Q, which could facilitate the uptake of the CLEFT-Q PROM internationally. We aimed to conduct this work with a novel patient-centered approach and make source code available as an open-source framework for CAT development in other surgical conditions. METHODS: CATs were developed with the Rasch measurement theory, using full-length CLEFT-Q responses collected during the CLEFT-Q field test (this included 2434 patients across 12 countries). These algorithms were validated in Monte Carlo simulations involving full-length CLEFT-Q responses collected from 536 patients. In these simulations, the CAT algorithms approximated full-length CLEFT-Q scores iteratively, using progressively fewer items from the full-length PROM. Agreement between full-length CLEFT-Q score and CAT score at different assessment lengths was measured using the Pearson correlation coefficient, root-mean-square error (RMSE), and 95% limits of agreement. CAT settings, including the number of items to be included in the final assessments, were determined in a multistakeholder workshop that included patients and health care professionals. A user interface was developed for the platform, and it was prospectively piloted in the United Kingdom and the Netherlands. Interviews were conducted with 6 patients and 4 clinicians to explore end-user experience. RESULTS: The length of all 8 CLEFT-Q scales in the International Consortium for Health Outcomes Measurement (ICHOM) Standard Set combined was reduced from 76 to 59 items, and at this length, CAT assessments reproduced full-length CLEFT-Q scores accurately (with correlations between full-length CLEFT-Q score and CAT score exceeding 0.97, and the RMSE ranging from 2 to 5 out of 100). Workshop stakeholders considered this the optimal balance between accuracy and assessment burden. The platform was perceived to improve clinical communication and facilitate shared decision-making. CONCLUSIONS: Our platform is likely to facilitate routine CLEFT-Q uptake, and this may have a positive impact on clinical care. Our free source code enables other researchers to rapidly and economically reproduce this work for other PROMs.

Assuntos

Fenda Labial , Fissura Palatina , Procedimentos de Cirurgia Plástica , Cirurgia Plástica , Humanos , Fenda Labial/cirurgia , Fissura Palatina/cirurgia , Medidas de Resultados Relatados pelo Paciente , Teste Adaptativo Computadorizado

12.

Measuring glaucoma quality of life in an Asian population using item banking: psychometric evaluation and computerized adaptive testing simulations.

Fenwick, Eva K; Lim, Belicia; Man, Ryan E K; Baskaran, Mani; Nongpiur, Monisha; Sng, Chelvin C A; Iyer, Jayant Venkatramani; Husain, Rahat; Perera, Shamira; Wong, Tina; Low, Jin Rong; Loe, Bao Sheng; Huang, Olivia Shimin; Lun, Katherine; Aung, Tin; Lamoureux, Ecosse L.

Qual Life Res ; 32(9): 2667-2679, 2023 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-37118365

RESUMO

PURPOSE: To assess the psychometric properties of glaucoma-specific health-related quality of life (HRQoL) item banks (IBs), and explore their efficiency using computerized adaptive testing (CAT) simulations. METHODS: In this cross-sectional, clinical study, 300 Asian glaucoma patients answered 221 items within seven IBs: Ocular Comfort Symptoms (OS); Activity Limitation (AL); Lighting (LT); Mobility (MB); Glaucoma Management (GM); Psychosocial (PSY); and Work (WK). Rasch analysis was conducted to assess each IB's psychometric properties (e.g., item "fit" to the construct; unidimensionality) and a set of analytic performance criteria guiding decision making relating to retaining or dropping domains and items was employed. CAT simulations determined the mean number of items for 'high' and 'moderate' measurement precision (stopping rule: SEM 0.3 and 0.387, respectively). RESULTS: Participants' mean age was 67.2 ± 9.2 years (62% male; 87% Chinese). LT, MB, and GM displayed good psychometric properties overall. To optimize AL's psychometric properties, 16 items were deleted due to poor "fit", high missing data, item bias, low discrimination and/or a low clinical/patient importance rating. To resolve multidimensionality in PSY, we rehomed 16 items into a "Concern (CN)" domain. PSY and CN required further amendment, including collapsing of response categories, and removal of poorly functioning items (N = 7). Due to poor measurement precision, low applicability and high ceiling effect, low test information indices, and low item separation index the WK IB was not considered further. In CAT simulations on the final seven IBs (n = 182 items total), an average of 12.1 and 15.7 items per IB were required for moderate and high precision measurement, respectively. CONCLUSIONS: After reengineering our seven IBs, they displayed robust psychometric properties and good efficiency in CAT simulations. Once finalized, GlauCAT™-Asian may enable comprehensive assessment of the HRQoL impact of glaucoma and associated treatments.

Assuntos

Glaucoma , Psicometria , Qualidade de Vida , Feminino , Humanos , Masculino , Teste Adaptativo Computadorizado , Estudos Transversais , Qualidade de Vida/psicologia , Reprodutibilidade dos Testes , Inquéritos e Questionários

13.

Applying Item Response Theory to the Student Adaptation to College Questionnaire: Examining Psychometric Characteristics and Developing Computerized Adaptive Testing Version.

Gao, Xuliang; Xia, Linpo; Wang, Fang; Hou, Minmin; Gong, Yi.

J Pers Assess ; 105(6): 797-806, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-36847426

RESUMO

Incoming students have many difficulties adjusting to college, and selecting appropriate measures to effectively screen them is indispensable, especially in China, where there is insufficient research in this area. To enrich domestic research, this study seeks to examine psychometric characteristics and develop a computerized adaptive version of the Student Adaptation to College Questionnaire (SACQ-CAT) based on a sample of Chinese students. Under the framework of item response theory, the item bank of student adaptation to college was formulated after uni-dimensionality testing, model comparison, item fit testing, and local independence testing. Subsequently, a CAT simulation, including three termination rules, was performed using real data to evaluate and verify the SACQ-CAT. The results showed reliability values exceeding 0.90 when participants' latent traits were between -4 and 3, covering majority of the subjects. The SACQ-CAT administered an average of fewer than 10 items to participants compared to 67 items on the original scale. The correlation coefficient between latency estimated by the SACQ-CAT and the SACQ is greater than .85, whereas the correlation coefficient with the Symptom Checklist 90 (SCL-90) scores ranges from -.33 to -.55 (p < .001). The SACQ-CAT largely reduced the number of items administered to the participants and avoided losing measurement precision.

Assuntos

Teste Adaptativo Computadorizado , Estudantes , Humanos , Psicometria/métodos , Reprodutibilidade dos Testes , Inquéritos e Questionários

14.

Validity and reliability of the Patient-Reported Outcomes Measurement Information System (PROMIS®) using computerized adaptive testing in patients with advanced chronic kidney disease.

van der Willik, Esmee M; van Breda, Fenna; van Jaarsveld, Brigit C; van de Putte, Marlon; Jetten, Isabelle W; Dekker, Friedo W; Meuleman, Yvette; van Ittersum, Frans J; Terwee, Caroline B.

Nephrol Dial Transplant ; 38(5): 1158-1169, 2023 05 04.

Artigo em Inglês | MEDLINE | ID: mdl-35913734

RESUMO

BACKGROUND: The Patient-Reported Outcomes Measurement Information System (PROMIS®) has been recommended for computerized adaptive testing (CAT) of health-related quality of life. This study compared the content, validity, and reliability of seven PROMIS CATs to the 12-item Short-Form Health Survey (SF-12) in patients with advanced chronic kidney disease. METHODS: Adult patients with chronic kidney disease and an estimated glomerular filtration rate under 30 mL/min/1.73 m2 who were not receiving dialysis treatment completed seven PROMIS CATs (assessing physical function, pain interference, fatigue, sleep disturbance, anxiety, depression, and the ability to participate in social roles and activities), the SF-12, and the PROMIS Pain Intensity single item and Dialysis Symptom Index at inclusion and 2 weeks. A content comparison was performed between PROMIS CATs and the SF-12. Construct validity of PROMIS CATs was assessed using Pearson's correlations. We assessed the test-retest reliability of all patient-reported outcome measures by calculating the intraclass correlation coefficient and minimal detectable change. RESULTS: In total, 207 patients participated in the study. A median of 45 items (10 minutes) were completed for PROMIS CATs. All PROMIS CATs showed evidence of sufficient construct validity. PROMIS CATs, most SF-12 domains and summary scores, and Dialysis Symptom Index showed sufficient test-retest reliability (intraclass correlation coefficient ≥ 0.70). PROMIS CATs had a lower minimal detectable change compared with the SF-12 (range, 5.7-7.4 compared with 11.3-21.7 across domains, respectively). CONCLUSION: PROMIS CATs showed sufficient construct validity and test-retest reliability in patients with advanced chronic kidney disease. PROMIS CATs required more items but showed better reliability than the SF-12. Future research is needed to investigate the feasibility of PROMIS CATs for routine nephrology care.

Assuntos

Qualidade de Vida , Insuficiência Renal Crônica , Humanos , Reprodutibilidade dos Testes , Teste Adaptativo Computadorizado , Inquéritos e Questionários , Diálise Renal , Medidas de Resultados Relatados pelo Paciente , Insuficiência Renal Crônica/diagnóstico , Insuficiência Renal Crônica/terapia , Sistemas de Informação

15.

The Effect of Computerized Adaptive Testing on Motivation and Anxiety: A Systematic Review and Meta-Analysis.

Akhtar, Hanif; Vekety, Boglarka; Kovacs, Kristof.

Assessment ; 30(5): 1379-1390, 2023 07.

Artigo em Inglês | MEDLINE | ID: mdl-35699428

RESUMO

Although many studies have been carried out on the psychometric aspects of computerized adaptive testing (CAT), its psychological aspects are less researched. Early studies claimed that CAT can be more motivating and induce less anxiety than traditional fixed-item tests (FIT). The purpose of this systematic review and meta-analysis was to gain a comprehensive understanding of the effects of CAT on motivation and anxiety in comparison to traditional fixed-item testing. Seven databases were examined. Articles were eligible if they employed an empirical study containing a direct comparison between CAT and FIT. Meta-analytical results showed no overall effect of test type on anxiety and motivation when comparing CAT with FIT (k = 11, g+ = 0.06, p = .28). However, easier CAT had positive effect compared with FIT (k = 2, g+ = .22, p < .001). Certain modifications in CAT administration can provide positive psychological effects for test-takers.

Assuntos

Teste Adaptativo Computadorizado , Motivação , Humanos , Transtornos de Ansiedade , Ansiedade , Psicometria

16.

Comparison of Patient-Reported Outcomes Measurement Information System Computerized Adaptive Testing Versus Fixed Short Forms in Juvenile Myositis.

Patel, Ruchi N; Esparza, Valeria G; Lai, Jin-Shei; Gray, Elizabeth L; Reeve, Bryce B; Chang, Rowland W; Cella, David; Ardalan, Kaveh.

Arthritis Care Res (Hoboken) ; 75(2): 381-390, 2023 02.

Artigo em Inglês | MEDLINE | ID: mdl-34328696

RESUMO

OBJECTIVE: Patient-Reported Outcomes Measurement Information System (PROMIS) measures can be administered via computerized adaptive testing (CAT) or fixed short forms (FSFs), but the empirical benefits of CAT versus FSFs are unknown in juvenile myositis (JM). The present study was undertaken to assess whether PROMIS CAT is feasible, precise, correlated with FSFs, and less prone to respondent burden and floor/ceiling effects than FSFs in JM. METHODS: Patients 8-17 years of age (self-report and parent proxy) and parents of patients 5-7 years of age (only parent proxy) completed PROMIS fatigue, pain interference, upper extremity function, mobility, anxiety, and depressive symptoms measures. Pearson correlations, paired t-tests, and Cohen's d were calculated between PROMIS CAT and FSFs. McNemar's test assessed floor/ceiling effects between CAT and FSFs. Precision and respondent burden were examined across the T score range. RESULTS: Data from 67 patient-parent dyads were analyzed. CAT and FSF mean scores did not significantly differ except in parent proxy anxiety and fatigue (effect size 0.23 and 0.19, respectively). CAT had less pronounced floor/ceiling effects at the less symptomatic extreme in all domains except self-report anxiety. Increased item burden and higher SEs were seen in less symptomatic scorers for CAT. Modified stopping rules limiting CAT item administration did not decrease precision. CONCLUSION: PROMIS CAT appears to be feasible and correlated with FSFs. CAT had less pronounced floor/ceiling effects, allowing detection of individual differences in less symptomatic patients. Modified stopping rules for CAT may decrease respondent burden. CAT can be considered for long-term follow-up of JM patients.

Assuntos

Dermatomiosite , Medidas de Resultados Relatados pelo Paciente , Humanos , Teste Adaptativo Computadorizado , Extremidade Superior , Sistemas de Informação

17.

Time to Completion of Pediatric PROMIS Computerized Adaptive Testing Measures and the SRS-22r in an Adolescent Idiopathic Scoliosis Population.

Paranjape, Chinmay S; de Araujo, Olivia B; Reider, Lisa M; Sponseller, Paul D; Carlini, Anthony R; McLaughlin, Kevin; Bachmann, Keith R; Mitchell, Stuart L.

J Pediatr Orthop ; 42(9): 462-466, 2022 Oct 01.

Artigo em Inglês | MEDLINE | ID: mdl-35973055

RESUMO

BACKGROUND: Patient-reported outcome measures are useful tools to quantify patients' pre-treatment and post-treatment symptoms. Historically used "legacy measures", such as the Scoliosis Research Society-22 revised questionnaire (SRS-22r), are often disease-specific and can be time-intensive. Recently developed Patient-Reported Outcomes Measurement Information System (PROMIS) computerized adaptive testing (CAT) measures may reduce administrative burdens and permit more efficient outcome collection within clinic workflows. In an era of medicine where payments are becoming tied to outcomes, we sought to assess the time to completion (TTC) of 8 pediatric PROMIS CAT measures and the SRS-22r in adolescents with idiopathic scoliosis. MATERIALS AND METHODS: Patients presenting to a large, urban tertiary referral hospital were prospectively enrolled into the study. Subjects were first-time survey respondents in various phases and types of treatment for adolescent idiopathic scoliosis. In total, 200 patients ranging from 10 to 17 years old completed 8 Pediatric PROMIS CATs and the SRS-22r. PROMIS CATs administered include Physical Activity, Mobility, Anxiety, Depressive symptoms, Peer Relationships, Physical Stress Experiences, Pain Behavior and Pain Interference. TTC was calculated using start and stop timestamps in the REDCap software. RESULTS: The mean (±SD) TTC for each PROMIS CAT was 1.1 (±0.9) minutes with physical activity, mobility, anxiety, depressive symptoms, peer relationships, physical stress experiences, pain behavior, and pain interference taking 1.2, 1.4, 1.0, 0.9, 1.2, 1.0, 1.0, and 1.2 minutes on average to complete, respectively. Mean TTC for the SRS-22r was 5.2 (±3.0) minutes. CONCLUSIONS: In this pediatric orthopaedic cohort, completion of 8 PROMIS CATs demonstrated minimal test-taker burden and time required for completion. These findings support rapid and easily integrable PROMIS CATs in clinical practice to aid in increased delivery of efficient, patient-centered care. LEVEL OF EVIDENCE: III, cross-sectional study.

Assuntos

Cifose , Escoliose , Teste Adaptativo Computadorizado , Estudos Transversais , Humanos , Dor , Medidas de Resultados Relatados pelo Paciente , Qualidade de Vida , Escoliose/diagnóstico , Escoliose/terapia

18.

Use of computerised adaptive testing to reduce the number of items in patient-reported hip and knee outcome scores: an analysis of the NHS England National Patient-Reported Outcome Measures programme.

Evans, Jonathan Peter; Gibbons, Christopher; Toms, Andrew D; Valderas, Jose Maria.

BMJ Open ; 12(7): e059415, 2022 07 20.

Artigo em Inglês | MEDLINE | ID: mdl-35858721

RESUMO

OBJECTIVE: Over 160 000 participants per year complete the 12-item Oxford Hip and Knee Scores (OHS/OKS) as part of the NHS England Patient-Reported Outcome Measures (PROMs) programme. We used a modern computational approach, known as computerised adaptive testing (CAT), to simulate individually tailored OHS and OKS assessment, with the goal of reducing the number of questions a patient must complete without compromising measurement accuracy. METHODS: We fit the 2018/2019 PROMs data to an item response theory (IRT) model. We assessed IRT model assumptions alongside reliability. We used parameters from the IRT model with data from 2017/2018 to simulate CAT assessments. Two simulations were run until a prespecified SE of measurement was met (SE=0.32 and SE=0.45). We compared the number of questions required to meet each cut-off and assessed the correlation between the full-length and CAT administration. RESULTS: We conducted IRT analysis using 40 432 OHS and 44 714 OKS observations. The OHS and OKS were both unidimensional (root mean square error of approximation 0.08 and 0.07, respectively) and marginal reliability 0.91 and 0.90. The CAT, with a precision limit of SE=0.32 and SE=0.45, required a median of four items (IQR 1) and two items (IQR 1), respectively, for the OHS, and median of four items (IQR 2) and two items (IQR 0) for the OKS. This represents a potential 82% reduction in PROM length. In the context of 160 000 yearly assessments, these methodologies could result in the omission of some 1 280 000 redundant questions per year, which equates to 40 000 hours of patient time. CONCLUSION: The application of IRT to the OHS and OKS produces an efficient and substantially reduced CAT. We have demonstrated a path to reduce the burden and potentially increase the compliance for these ubiquitous outcome measures without compromising measurement accuracy.

Assuntos

Artroplastia do Joelho , Teste Adaptativo Computadorizado , Inglaterra , Humanos , Medidas de Resultados Relatados pelo Paciente , Reprodutibilidade dos Testes , Medicina Estatal

19.

Construct Validity of Pediatric PROMIS Computerized Adaptive Testing Measures in Children With Adolescent Idiopathic Scoliosis.

Mitchell, Stuart L; McLaughlin, Kevin H; Bachmann, Keith R; Sponseller, Paul D; Reider, Lisa M.

J Pediatr Orthop ; 42(7): e720-e726, 2022 Aug 01.

Artigo em Inglês | MEDLINE | ID: mdl-35703245

RESUMO

BACKGROUND: The use of patient-reported outcome measures, especially Patient-Reported Outcomes Measurement Information System (PROMIS) measures, has increased in recent years. Given this growth, it is imperative to ensure that the measures being used are validated for the intended population(s)/disease(s). Our objective was to assess the construct validity of 8 PROMIS computer adaptive testing (CAT) measures among children with adolescent idiopathic scoliosis (AIS). METHODS: We prospectively enrolled 200 children (aged 10 to 17 y) with AIS, who completed 8 PROMIS CATs (Anxiety, Depressive Symptoms, Mobility, Pain Behavior, Pain Interference, Peer Relationships, Physical Activity, Physical Stress Experiences) and the Scoliosis Research Society-22r questionnaire (SRS-22r) electronically. Treatment categories were observation, bracing, indicated for surgery, or postoperative from posterior spinal fusion. Construct validity was evaluated using known group analysis and convergent and discriminant validity analyses. Analysis of variance was used to identify differences in PROMIS T -scores by treatment category (known groups). The Spearman rank correlation coefficient ( rs ) was calculated between corresponding PROMIS and SRS-22r domains (convergent) and between unrelated PROMIS domains (discriminant). Floor/ceiling effects were calculated. RESULTS: Among treatment categories, significant differences were found in PROMIS Mobility, Pain Behavior, Pain Interference, and Physical Stress Experiences and in all SRS-22r domains ( P <0.05) except Mental Health ( P =0.15). SRS-22r Pain was strongly correlated with PROMIS Pain Interference ( rs =-0.72) and Pain Behavior ( rs =-0.71) and moderately correlated with Physical Stress Experiences ( rs =-0.57). SRS-22r Mental Health was strongly correlated with PROMIS Depressive Symptoms ( rs =-0.72) and moderately correlated with Anxiety ( rs =-0.62). SRS-22r Function was moderately correlated with PROMIS Mobility ( rs =0.64) and weakly correlated with Physical Activity ( rs =0.34). SRS-22r Self-Image was weakly correlated with PROMIS Peer Relationships ( rs =0.33). All unrelated PROMIS CATs were weakly correlated (| rs |<0.40). PROMIS Anxiety, Mobility, Pain Behavior, and Pain Interference and SRS-22r Function, Pain, and Satisfaction displayed ceiling effects. CONCLUSIONS: Evidence supports the construct validity of 6 PROMIS CATs in evaluating AIS patients. Ceiling effects should be considered when using specific PROMIS CATs. LEVEL OF EVIDENCE: Level II, prognostic.

Assuntos

Cifose , Escoliose , Teste Adaptativo Computadorizado , Humanos , Dor , Medidas de Resultados Relatados pelo Paciente , Qualidade de Vida , Escoliose/cirurgia , Inquéritos e Questionários

20.

Psychometric Evaluation of Glaucoma Quality of Life Item Banks (GlauCAT) and Initial Assessment Using Computerized Adaptive Testing.

Man, Ryan Eyn Kidd; Fenwick, Eva K; Khadka, Jyoti; Wu, ZhiChao; Skalicky, Simon; Pesudovs, Konrad; Lamoureux, Ecosse L.

Transl Vis Sci Technol ; 11(6): 9, 2022 06 01.

Artigo em Inglês | MEDLINE | ID: mdl-35679035

RESUMO

Purpose: To evaluate the psychometric properties of glaucoma-specific quality of life (QoL) item banks (GlauCAT) and assess their performance using computerized adaptive testing (CAT) simulations. Methods: In this cross-sectional study, 293 participants with glaucoma (mean age ± SD, 70.7 ± 13.2 years; 45% female) answered 342 items in 12 QoL item banks (IBs): Activity Limitation (AL); Driving (DV); Convenience (CV); Economic (EC); Emotional (EM); General Symptoms (GS); Health Concerns (HC); Lighting (LT); Mobility (MB); Ocular Surface Symptoms (OS); Social (SC); and Visual Symptoms (VS). These IBs were assessed using Rasch analysis, and CAT simulations with 1000 simulated respondents were utilized to determine the average number of items to be administered to achieve moderate and high precision levels. Results: The AL, DV, EM, HC, LT, MB, EC, OS, SC, and VS IBs required relatively minor amendments to achieve satisfactory psychometric fit. To resolve multidimensionality, we split CV into Treatment Convenience (TCV) and General Convenience (GCV). Due to poor measurement precision, the GS IB was not pursued further. This resulted in 12 total IBs. In CAT simulations, an average of 3.7 and 7.3 items per IB were required to attain measurement at moderate and high precision, respectively. Conclusions: Following rigorous psychometric assessment, we developed 12 valid glaucoma-specific QoL domains that can obtain highly precise person measure estimates using a small number of items. Translational Relevance: GlauCAT will enable researchers and clinicians to quickly and comprehensively assess the impact of glaucoma and its associated interventions across a range of QoL domains.

Assuntos

Glaucoma , Feminino , Humanos , Masculino , Teste Adaptativo Computadorizado , Estudos Transversais , Glaucoma/diagnóstico , Psicometria/métodos , Qualidade de Vida/psicologia , Inquéritos e Questionários

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA