Results 1 - 20 of 53
1.
Arch Phys Med Rehabil ; 103(5S): S34-S42.e4, 2022 05.
Article in English | MEDLINE | ID: mdl-34678294

ABSTRACT

OBJECTIVE: To (1) characterize the agreement between patient and proxy responses on a multidimensional computerized adaptive testing measure of function, and (2) determine whether patient, proxy, or multidimensional computerized adaptive testing score characteristics identify when a proxy report can be used as a substitute for patient report in clinical decision making. DESIGN: A psychometric study of the Functional Assessment in Acute Care Multidimensional Computerized Adaptive Testing (FAMCAT) and its 3 scales (Applied Cognition, Daily Activity, and Basic Mobility). SETTING: An upper midwestern quaternary academic medical center. PARTICIPANTS: A total of 300 pairs of patients (average age 60.9 years; range, 19-89) hospitalized on general medical services or readmitted to surgical services for postoperative complications and their proxies (average age 60.5 years; range, 20-88). INTERVENTION: Not applicable. MAIN OUTCOME MEASURES: There were 3 outcomes: (1) agreement between patient and proxy scores on the FAMCAT domains, as well as age and sex, analyzed with univariate and multivariate analysis of variance (MANOVA); (2) associations of patient-proxy relationship and FAMCAT score characteristics with patient-proxy score agreement; and (3) presence of psychometrically significant intra-dyad differences in FAMCAT scores. RESULTS: The results of the MANOVA and follow-up ANOVAs indicated that there were no statistically significant differences in FAMCAT scale scores between patient and proxy estimates for either the Daily Activity or Basic Mobility scales. There were significant differences for the Applied Cognition scale (P<.005) between mean patient and proxy scores, with proxies rating patients as functioning at a higher level (mean=0.42) than patients did themselves (mean=0.00). However, psychometrically significant intra-dyadic Applied Cognition score differences occurred in only 14% of dyads, compared with 25% in the other 2 scales.
Sex and age were associated with patient-proxy agreement, but the patterns were not sufficiently consistent to permit generalizations regarding the likely validity of a proxy's scores. CONCLUSIONS: Patient and proxy FAMCAT Daily Activity and Basic Mobility scores did not differ significantly, and proxy reporting offers a creditable surrogate for patient report on these domains. Low rates of psychometrically significant intra-dyadic score differences suggest that proxy report may serve as a low-resolution screen for functional deficits in all FAMCAT domains. Approximately half the proxies provided multi-domain profile ratings on the 3 scales that did not differ significantly from those of the associated patients, but more research is needed to identify situations in which proxy profiles could be used in place of those provided by patients.


Subject(s)
Proxy , Quality of Life , Activities of Daily Living , Humans , Middle Aged , Patients , Psychometrics
2.
Arch Phys Med Rehabil ; 103(5S): S43-S52, 2022 05.
Article in English | MEDLINE | ID: mdl-34606759

ABSTRACT

OBJECTIVE: To describe the adaptive measurement of change (AMC) as a means to identify psychometrically significant change in reported function of hospitalized patients and to reduce respondent burden on follow-up assessments. DESIGN: The AMC method uses multivariate computerized adaptive testing (CAT) and psychometric hypothesis tests based in item response theory to more efficiently measure intra-individual change using the responses of a single patient over 2 or more testing occasions. Illustrations of the utility of AMC in clinical care and estimates of AMC-based item reduction are provided using the Functional Assessment in Acute Care Multidimensional Computerized Adaptive Test (FAMCAT), a newly developed functional multidimensional CAT-based measurement of basic mobility, daily activities, and applied cognition. SETTING: Two quaternary hospitals in the Upper Midwest. PARTICIPANTS: Four hundred ninety-five hospitalized patients who completed the FAMCAT on 2 to 4 occasions during their hospital stay. INTERVENTION: N/A. RESULTS: Of the 495 patients who completed more than 1 FAMCAT, 72% completed 2 sessions, 13% completed 3, and 15% completed 4, with 22.1%, 23.4%, and 23.0%, respectively, exhibiting significant multivariate change. Use of the AMC in conjunction with the FAMCAT reduced respondent burden from that of the FAMCAT alone for follow-up assessments. On average, when used without the AMC, 22.7 items (range, 20.4-24.4) were administered during FAMCAT sessions. Post hoc analyses determined that when the AMC was used with the FAMCAT a mean±standard deviation reduction in FAMCAT number of items of 13.6 (11.1), 13.1 (9.8), and 18.1 (10.8) would occur during the second, third, and fourth sessions, respectively, which corresponded to a reduction in test duration of 3.0 (2.4), 3.0 (2.8), and 4.7 (2.6) minutes. Analysis showed that the AMC requires no assumptions about the nature of change and provides data that are potentially actionable for patient care. 
Various patterns of significant univariate and multivariate change are illustrated. CONCLUSIONS: The AMC method is an effective and parsimonious approach to identifying significant change in patients' measured CAT scores. The AMC approach reduced FAMCAT sessions by an average of 12.6 items (55%) and 2.9 minutes (53%) among patients with psychometrically significant score changes.


Subject(s)
Health Services , Patient Reported Outcome Measures , Humans , Psychometrics , Research Design , Surveys and Questionnaires
3.
Arch Phys Med Rehabil ; 103(5S): S53-S58, 2022 05.
Article in English | MEDLINE | ID: mdl-34670134

ABSTRACT

OBJECTIVE: To characterize the ability of the patient-reported Functional Assessment in Acute Care Multidimensional Computerized Adaptive Test (FAMCAT) domains to predict discharge disposition when administered during acute care stays. DESIGN: Cohort study. Logistic regression models were estimated to identify the ability of FAMCAT domains to predict discharge to an institution for postacute care (PAC). SETTING: Academic medical center. PARTICIPANTS: Patients admitted to general medicine services from June 2016 to June 2019 (n = 4240). INTERVENTIONS: Not applicable. MAIN OUTCOME MEASURE(S): Discharge to an institution. RESULTS: In this sample, 10.5% of patients were discharged to an institution for rehabilitation versus home. FAMCAT domain scores were highly predictive of discharge to institutional PAC. Daily Activity and Basic Mobility domains had excellent discriminative ability for discharge to an institution (c-statistic, 0.83 and 0.87, respectively). In best fit models accounting for additional characteristics, discrimination was outstanding for Daily Activity (c-statistic, 0.91; 95% confidence interval, 0.89-0.94) and Basic Mobility (c-statistic, 0.92; 95% confidence interval, 0.89-0.94). CONCLUSIONS: The FAMCAT Daily Activity and Basic Mobility domains demonstrated excellent discrimination for identifying patients who were discharged to an institutional setting for rehabilitation and outstanding discrimination when adjusted for salient patient factors associated with discharge disposition. Estimates obtained in this investigation are comparable to the best discrimination achieved with clinician-rated measures to identify patients who would require institutional PAC.
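
The c-statistic reported in this abstract is the probability that a randomly chosen patient who was discharged to an institution received a higher predicted risk than a randomly chosen patient who went home. A minimal sketch of that calculation follows; the function name and the four-patient example are hypothetical illustrations, not the study's data or code.

```python
def c_statistic(scores, outcomes):
    """Concordance (c) statistic for a binary outcome.

    scores   -- predicted risks, e.g. from a logistic regression (assumed input)
    outcomes -- 1 if the event occurred (institutional discharge), else 0
    Ties between an event and a non-event count as half-concordant.
    """
    events = [s for s, y in zip(scores, outcomes) if y == 1]
    non_events = [s for s, y in zip(scores, outcomes) if y == 0]
    if not events or not non_events:
        raise ValueError("need at least one event and one non-event")

    concordant = 0.0
    for e in events:
        for n in non_events:
            if e > n:
                concordant += 1.0
            elif e == n:
                concordant += 0.5
    return concordant / (len(events) * len(non_events))


# Hypothetical predicted risks for four patients; the first two were
# discharged to an institution. Three of the four event/non-event pairs
# are ordered correctly.
example = c_statistic([0.9, 0.4, 0.6, 0.2], [1, 1, 0, 0])
```

A value of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of the two groups.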


Subject(s)
Patient Discharge , Subacute Care , Activities of Daily Living , Cohort Studies , Humans , Outcome Assessment, Health Care/methods , Retrospective Studies
4.
Arch Phys Med Rehabil ; 103(5S): S59-S66.e3, 2022 05.
Article in English | MEDLINE | ID: mdl-34606758

ABSTRACT

OBJECTIVE: To determine whether a multidimensional computerized adaptive test, the Functional Assessment in Acute Care Multidimensional Computerized Adaptive Test (FAMCAT), could be administered to hospitalized patients via a tablet computer rather than being orally administered by an interviewer. DESIGN: A randomized comparison of the responses of hospitalized patients to interviewer vs tablet delivery of the FAMCAT and its assessment of applied cognition, daily activity, and basic mobility. SETTING: Two quaternary teaching hospitals in the Upper Midwest. PARTICIPANTS: A total of 300 patients (127 men, 165 women), average age 61.2 (range, 18-97) hospitalized on medical services or rehospitalized on surgical services were randomly assigned to either a tablet (150) or an interview (150) group. INTERVENTION: Electronic tablet vs interview. MAIN OUTCOME MEASURES: Item response theory point estimates of the FAMCAT latent scales, their psychometric standard errors, number of items administered per domain, the determinant (an indicator of overall precision of the latent trait vector), as well as the time that patients required to complete their FAMCAT sessions. RESULTS: Of the 300 patients, 292 completed their assessments. The assessments of 4 individuals in each group were interrupted by clinical care and were not included in the analyses. A significant (P=.009) mode effect (ie, interview vs tablet) was identified when all outcome variables were considered simultaneously. However, the only outcome that was affected by the administration mode was test duration: tablet administration reduced the roughly 6-minute test time required by both approaches by only 20 seconds, which, though statistically significant, was clinically insignificant. CONCLUSIONS: The results of a FAMCAT assessment, at least for this cohort of hospitalized patients, are independent of administration via tablet computer or interview.


Subject(s)
Activities of Daily Living , Computers, Handheld , Cohort Studies , Female , Humans , Male , Middle Aged , Patient Reported Outcome Measures , Psychometrics
5.
Arch Phys Med Rehabil ; 103(5S): S84-S107.e38, 2022 05.
Article in English | MEDLINE | ID: mdl-34146534

ABSTRACT

OBJECTIVE: To assess differential item functioning (DIF) in an item pool measuring the mobility of hospitalized patients across educational, age, and sex groups. DESIGN: Measurement evaluation cohort study. Content experts generated DIF hypotheses to guide the interpretation. The graded response item response theory (IRT) model was used. Primary DIF tests were Wald statistics; sensitivity analyses were conducted using the IRT ordinal logistic regression procedure. Magnitude and impact were evaluated by examining group differences in expected item and scale score functions. SETTING: Hospital-based rehabilitation. PARTICIPANTS: Hospitalized patients (N=2216). INTERVENTIONS: Not applicable. MAIN OUTCOME MEASURES: A total of 111 self-reported mobility items. RESULTS: Two linking items among those used to set the metric across forms evidenced DIF for sex and age: "difficulty climbing stairs step-over-step without a handrail (alternating feet)" and "difficulty climbing 3-5 steps without a handrail." Conditional on the mobility state, the items were more difficult for women and older people (aged ≥65y). An additional 18 items were identified with DIF. Items with both high DIF magnitude and hypotheses related to age were difficulty "crossing road at a 4-lane traffic light with curbs," "jumping/landing on one leg," "strenuous activities," and "descending 3-5 steps with no handrail." Although DIF of higher magnitude was observed for several items, the scale-level effect was relatively small and the exposure rate for the most problematic items was low (0.35, 0.27, and 0.20). CONCLUSIONS: This was the first study to evaluate measurement equivalence of the hospital-based rehabilitation mobility item bank. Although 20 items evidenced high magnitude DIF, 5 of which were related to stairs, the scale-level effect was minimal; however, it is recommended that such items be avoided in the development of short-form measures. 
No items with salient DIF were removed from calibrations, supporting the use of the item bank across groups differing in education, age, and sex. The bank may thus be useful to assist clinical assessment and decision-making regarding risk for specific mobility restrictions at discharge as well as identifying mobility-related functions targeted for postdischarge interventions. Additionally, with the goal of avoiding long and burdensome assessments for patients and clinical staff, these results could be informative for those using the item bank to construct short forms.


Subject(s)
Aftercare , Patient Discharge , Aged , Cohort Studies , Female , Humans , Physical Therapy Modalities , Psychometrics/methods , Self Report , Surveys and Questionnaires
6.
Arch Phys Med Rehabil ; 103(5S): S3-S14, 2022 05.
Article in English | MEDLINE | ID: mdl-35090886

ABSTRACT

OBJECTIVE: To develop and evaluate an efficient and precise variable-length functional assessment of applied cognition, daily activity, and mobility to inform mobility preservation and rehabilitation service delivery among hospitalized patients. DESIGN: A multidimensional item bank tapping into these dimensions was developed, with all items calibrated using a multidimensional graded response model. The items were adaptively selected from the item banks to maximize the test information, and the test ended when a joint stopping rule was satisfied. A simulation study was conducted based on the completed instrument, the Functional Assessment in Acute Care Multidimensional Computerized Adaptive Test (FAMCAT), to compare its measurement precision and efficiency capabilities relative to conventional unidimensional computerized adaptive testing. Precision was measured by the bias and root mean squared error between the estimated and true (ie, simulated) θ estimates, whereas efficiency was measured by average test length. Data were collected by an interviewer reading questions from a tablet computer and entering patients' responses. SETTING: A large Midwestern hospital. PARTICIPANTS: A total of 4143 patients hospitalized with medical diagnosis and/or surgical complications, with 2060 in the calibration sample and 2083 in the validation cohort. INTERVENTION: Not applicable. RESULTS: Among the 2083 patients in the validation sample, FAMCAT administration required an average of 6 (SD=3.11) minutes. Ninety-six percent had their tests terminated by the standard error rule after responding to an average of 22.05 (SD=7.98) items, whereas 15 were terminated by the change in θ rule, with an average test length of 45.27 (SD=11.49). The remaining 76 responded until reaching the maximum test length of 60 items. 
CONCLUSIONS: The FAMCAT has the potential to satisfy the need for structured, frequent, and precise assessment of functional domains among hospitalized patients with medical diagnosis and/or surgical complications. The results are promising and may be informative for others who wish to develop similar instruments when concurrent assessment of correlated domains is required.
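
The joint stopping rule described in this abstract combines a standard error rule, a change-in-θ rule, and a maximum test length. A minimal sketch of how such a rule might be checked after each item follows. Only the 60-item maximum comes from the abstract; the function name, the thresholds `se_cut` and `delta_cut`, and the order in which the rules are checked are assumptions made for illustration.

```python
def stopping_reason(se_by_domain, theta_prev, theta_curr, n_items,
                    se_cut=0.3, delta_cut=0.01, max_items=60):
    """Return the name of the rule that ends the test, or None to continue.

    se_by_domain -- current standard error of each domain's theta estimate
    theta_prev   -- theta estimates before the latest item (None on first item)
    theta_curr   -- theta estimates after the latest item
    se_cut, delta_cut -- hypothetical thresholds, not values from the paper
    """
    # Standard error rule: every domain is measured precisely enough.
    if all(se <= se_cut for se in se_by_domain):
        return "standard_error"

    # Change-in-theta rule: the latest item barely moved any estimate.
    if theta_prev is not None:
        largest_shift = max(abs(c - p) for c, p in zip(theta_curr, theta_prev))
        if largest_shift < delta_cut:
            return "theta_change"

    # Maximum test length (60 items in the abstract).
    if n_items >= max_items:
        return "max_length"
    return None
```

In an adaptive session this check would run after every response, with the next item selected only while the function returns `None`.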


Subject(s)
Activities of Daily Living , Cognition , Bias , Computer Simulation , Humans , Psychometrics/methods , Surveys and Questionnaires
7.
Multivariate Behav Res ; 56(3): 459-475, 2021.
Article in English | MEDLINE | ID: mdl-32124648

ABSTRACT

In psychological and educational measurement, it is often of interest to assess change in an individual. The current study expanded on previous research by introducing methods that can evaluate individual change on multiple latent traits measured on multiple occasions. The four methods considered are the likelihood ratio test (LRT), the multivariate Wald test (MWT), the modified multivariate Wald test (MMWT), and the score test (ST). Simulation studies were conducted to examine the true positive rate (TPR) and the false positive rate (FPR) of the new methods under a conventional fixed-form test and a computerized adaptive test (CAT). Manipulated variables included the number of occasions, change magnitudes, patterns of change, and correlations between latent traits. Results revealed that, in terms of FPR, all methods except MWT had close adherence to the nominal significance level. Among the three methods, the LRT is recommended as it provided a balance between FPR and TPR. Larger change magnitude yielded higher TPR, regardless of the remaining factors. With the same test length, a CAT yielded higher TPR than a conventional test. Real-data examples are provided of identifying psychometrically significant change across two to four occasions using a multivariate adaptive self-report medical outcomes measure from hospitalized patients. The detection of significant change among the three methods agreed highly, and those patients identified as having significant change exhibited large profile differences, which provided support for the valid performance of the proposed methods.
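
The likelihood ratio test recommended in this abstract compares a model in which the person's trait is the same on every occasion with one in which it is free to differ. The sketch below is a deliberately simplified illustration, assuming a single latent trait, dichotomous two-parameter logistic items, two occasions, and a grid search for the maximum likelihood estimate; the paper itself addresses multiple traits. All names and item parameters are hypothetical.

```python
import math

# Grid of candidate theta values from -4 to 4 in steps of 0.01 (assumption).
GRID = [i / 100.0 for i in range(-400, 401)]


def log_lik(theta, items, responses):
    """Log-likelihood of 0/1 responses under a 2PL model.

    items -- list of (discrimination a, difficulty b) tuples
    """
    total = 0.0
    for (a, b), u in zip(items, responses):
        p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
        total += math.log(p) if u == 1 else math.log(1.0 - p)
    return total


def max_log_lik(items, responses):
    """Maximized log-likelihood over the theta grid."""
    return max(log_lik(t, items, responses) for t in GRID)


def lrt_change(items1, resp1, items2, resp2, critical=3.841):
    """Likelihood ratio test of no change between two occasions.

    critical -- chi-square .05 critical value for 1 degree of freedom
                (one trait, two occasions).
    Returns (test statistic, True if change is flagged as significant).
    """
    ll_no_change = max_log_lik(items1 + items2, resp1 + resp2)  # one shared theta
    ll_change = max_log_lik(items1, resp1) + max_log_lik(items2, resp2)
    stat = max(0.0, -2.0 * (ll_no_change - ll_change))
    return stat, stat > critical
```

With more traits or occasions the reference distribution would need more degrees of freedom, so the default critical value applies only to the one-trait, two-occasion case sketched here.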


Subject(s)
Educational Measurement , Research Design , Computer Simulation , Humans
8.
Multivariate Behav Res ; 56(5): 703-723, 2021.
Article in English | MEDLINE | ID: mdl-32598188

ABSTRACT

Normality of latent traits is a common assumption made when estimating parameters for item response theory (IRT) models, but this assumption may be violated. The purpose of this research was to present a new Markov chain Monte Carlo (MCMC) method for ordinal items with flexible latent trait distributions (i.e., skewed and bimodal). Specifically, the Davidian curve (DC) was used to approximate the distribution of latent traits. The performance of the proposed MCMC algorithm with DCs was evaluated via a simulation study and compared with an EM method using DCs that is available in the "mirt" package (Chalmers, 2012). The manipulated factors included the number of response categories, sample size, and the shape of the latent trait distribution. The Hannan-Quinn (HQ) criterion was used to choose the best DC order. Results indicated that when informative priors were used, the MCMC algorithm with DCs could fit a flexible distribution well and the method provided good parameter estimates which, under some circumstances, had lower bias and RMSE than the EM method.
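
The Hannan-Quinn criterion penalizes the maximized log-likelihood by 2k ln(ln n), where k is the number of free parameters and n the sample size, and the model with the smallest value is preferred. A minimal sketch of using it to pick a Davidian curve order follows; the function names and the log-likelihoods in the example are hypothetical and do not come from the study.

```python
import math


def hannan_quinn(log_lik, n_params, n_obs):
    """HQ = -2 * logL + 2 * k * ln(ln(n)); requires n_obs > 1."""
    return -2.0 * log_lik + 2.0 * n_params * math.log(math.log(n_obs))


def best_order(fits, n_obs):
    """Pick the order with the smallest HQ value.

    fits -- dict mapping order -> (maximized log-likelihood, free parameters)
    """
    return min(fits, key=lambda order: hannan_quinn(*fits[order], n_obs))


# Hypothetical fits for Davidian curve orders 1 to 3 with 1000 respondents.
fits = {1: (-5210.0, 40), 2: (-5195.0, 41), 3: (-5194.5, 42)}
chosen = best_order(fits, 1000)
```

In this made-up example the extra parameter of order 3 improves the log-likelihood by too little to offset its penalty, so order 2 is selected.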


Subject(s)
Algorithms , Bayes Theorem , Computer Simulation , Markov Chains , Monte Carlo Method
9.
Stud Hist Philos Sci ; 90: 10-14, 2021 12.
Article in English | MEDLINE | ID: mdl-34508955

ABSTRACT

We have each spent more than 50 years doing research that has had little impact. Even more lamentable is that our field, judgment and decision making (JDM), has on the whole had little impact during that span. We attribute that failure to the use of methodologies that emphasize testing models rather than looking for differences in behavior. The "cognitive revolution" led the field astray, toward the goal of studying model fit rather than comparing observable results. With modeling as the goal, experimentation was stultified. Simple tasks became dominant. Although a poor metaphor for real decision making, the gambling paradigm has lasted forever because the inputs to the decision are known to the researcher and thus easily modeled.


Subject(s)
Decision Making , Gambling , Gambling/psychology , Humans , Judgment , Medical Futility , Motivation
10.
Multivariate Behav Res ; 53(3): 403-418, 2018.
Article in English | MEDLINE | ID: mdl-29624093

ABSTRACT

A central assumption that is implicit in estimating item parameters in item response theory (IRT) models is the normality of the latent trait distribution, whereas a similar assumption made in categorical confirmatory factor analysis (CCFA) models is the multivariate normality of the latent response variables. Violation of the normality assumption can lead to biased parameter estimates. Although previous studies have focused primarily on unidimensional IRT models, this study extended the literature by considering a multidimensional IRT model for polytomous responses, namely the multidimensional graded response model. Moreover, this study is one of few studies that specifically compared the performance of full-information maximum likelihood (FIML) estimation versus robust weighted least squares (WLS) estimation when the normality assumption is violated. The research also manipulated the number of nonnormal latent trait dimensions. Results showed that FIML consistently outperformed WLS when there were one or multiple skewed latent trait distributions. More interestingly, the bias of the discrimination parameters was non-ignorable only when the corresponding factor was skewed. Having other skewed factors did not further exacerbate the bias, whereas biases of boundary parameters increased as more nonnormal factors were added. The item parameter standard errors recovered well with both estimation algorithms regardless of the number of nonnormal dimensions.
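
For orientation, the multidimensional graded response model named in this abstract is commonly written in slope-intercept form as below. The notation is an assumption chosen for illustration: a_j is the vector of discrimination parameters for item j, d_jk its boundary (intercept) parameters, θ_i the latent trait vector of person i, and response categories run from 0 to K_j - 1.

```latex
% Cumulative (boundary) response probability for item j, category k
P(Y_{ij} \ge k \mid \boldsymbol{\theta}_i)
  = \frac{1}{1 + \exp\!\left[-\left(\mathbf{a}_j^{\top}\boldsymbol{\theta}_i + d_{jk}\right)\right]},
  \qquad k = 1, \dots, K_j - 1

% Category probability as a difference of adjacent boundaries,
% with P(Y_{ij} \ge 0) = 1 and P(Y_{ij} \ge K_j) = 0
P(Y_{ij} = k \mid \boldsymbol{\theta}_i)
  = P(Y_{ij} \ge k \mid \boldsymbol{\theta}_i)
  - P(Y_{ij} \ge k + 1 \mid \boldsymbol{\theta}_i)
```

Item parameter estimation typically assumes that θ_i follows a multivariate normal distribution, which is the assumption whose violation the study examines.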


Subject(s)
Models, Statistical , Multivariate Analysis , Algorithms , Computer Simulation , Data Interpretation, Statistical , Factor Analysis, Statistical , Least-Squares Analysis
11.
Annu Rev Clin Psychol ; 12: 83-104, 2016.
Article in English | MEDLINE | ID: mdl-26651865

ABSTRACT

In this review we explore recent developments in computerized adaptive diagnostic screening and computerized adaptive testing for the presence and severity of mental health disorders such as depression, anxiety, and mania. The statistical methodology is unique in that it is based on multidimensional item response theory (severity) and random forests (diagnosis) instead of traditional mental health measurement based on classical test theory (a simple total score) or unidimensional item response theory. We show that the information contained in large item banks consisting of hundreds of symptom items can be efficiently calibrated using multidimensional item response theory, and the information contained in these large item banks can be precisely extracted using adaptive administration of a small set of items for each individual. In terms of diagnosis, computerized adaptive diagnostic screening can accurately track an hour-long face-to-face clinician diagnostic interview for major depressive disorder (as an example) in less than a minute using an average of four questions with unprecedented high sensitivity and specificity. Directions for future research and applications are discussed.


Subject(s)
Diagnosis, Computer-Assisted/methods , Mental Disorders/diagnosis , Models, Statistical , Humans
12.
Behav Brain Sci ; 37(4): 380, 2014 Aug.
Article in English | MEDLINE | ID: mdl-25162858

ABSTRACT

Drawing inferences about the decision utilities of suicide terrorists from their final action is tempting, but hazardous. Direct elicitation of those utilities would be more informative, but is infeasible. Substituting examination of archival materials for elicitation makes the assumption that leaders and bombers have similar utilities. Insight regarding the beliefs of terrorist leaders might be available from observations of recruitment strategies.


Subject(s)
Suicide/psychology , Terrorism/psychology , Female , Humans , Male
13.
Behav Brain Sci ; 36(3): 306-7, 2013 Jun.
Article in English | MEDLINE | ID: mdl-23673053

ABSTRACT

Pothos & Busemeyer (P&B) argue that classical probability (CP) fails to describe human decision processes accurately and should be supplanted by quantum probability. We accept the premise, but reject P&B's conclusion. CP is a prescriptive framework that has inspired a great deal of valuable research. Also, because CP is used across the sciences, it is a cornerstone of interdisciplinary collaboration.


Subject(s)
Cognition , Models, Psychological , Probability Theory , Quantum Theory , Humans
14.
Educ Psychol Meas ; 82(4): 643-677, 2022 Aug.
Article in English | MEDLINE | ID: mdl-35754618

ABSTRACT

Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests (a Z test, likelihood ratio test, and score ratio index) have demonstrated desirable statistical properties in this context, including low false positive rates and high true positive rates. However, the extant AMC research has assumed that the item parameter values in the simulated item banks were devoid of estimation error. This assumption is unrealistic for applied testing settings, where item parameters are estimated from a calibration sample before test administration. Using Monte Carlo simulation, this study evaluated the robustness of the common AMC hypothesis tests to the presence of item parameter estimation error when measuring omnibus change across four testing occasions. Results indicated that item parameter estimation error had at most a small effect on false positive rates and latent trait change recovery, and these effects were largely explained by the computerized adaptive testing item bank information functions. Differences in AMC performance as a function of item parameter estimation error and choice of hypothesis test were generally limited to simulees with particularly low or high latent trait values, where the item bank provided relatively lower information. These simulations highlight how AMC can accurately measure intra-individual change in the presence of item parameter estimation error when paired with an informative item bank. Limitations and future directions for AMC research are discussed.

15.
Appl Psychol Meas ; 46(7): 551-570, 2022 Oct.
Article in English | MEDLINE | ID: mdl-36131841

ABSTRACT

Adaptive classification testing (ACT) is a variation of computerized adaptive testing (CAT) that is developed to efficiently classify examinees into multiple groups based on predetermined cutoffs. In multidimensional multiclassification (i.e., more than two categories exist along each dimension), grid classification is proposed to classify each examinee into one of the grids encircled by cutoffs (lines/surfaces) along different dimensions so as to provide clearer information regarding an examinee's relative standing along each dimension and facilitate subsequent treatment and intervention. In this article, the sequential probability ratio test (SPRT) and confidence interval method were implemented in the grid multiclassification ACT. In addition, two new termination criteria, the grid classification generalized likelihood ratio (GGLR) and simplified grid classification generalized likelihood ratio were proposed for grid multiclassification ACT. Simulation studies, using a simulated item bank, and a real item bank with polytomous multidimensional items, show that grid multiclassification ACT is more efficient than classification based on measurement CAT that focuses on trait estimate precision. In the context of a high-quality bank, GGLR was found to most efficiently terminate the grid multiclassification ACT and classify examinees.
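
The sequential probability ratio test mentioned in this abstract can be illustrated in its simplest form: one latent trait, one cutoff, and dichotomous two-parameter logistic items. This is a reduced sketch, not the grid multiclassification procedure the article proposes; the function names, the indifference-region half-width `delta`, and the error rates are assumptions.

```python
import math


def p_correct(theta, a, b):
    """2PL probability of a keyed response."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def sprt_decision(items, responses, cutoff, delta=0.5, alpha=0.05, beta=0.05):
    """Wald's SPRT for classifying an examinee relative to one cutoff.

    Tests theta = cutoff - delta against theta = cutoff + delta.
    items -- list of (discrimination a, difficulty b) tuples administered so far
    Returns "above", "below", or "continue" (administer another item).
    """
    log_lr = 0.0
    for (a, b), u in zip(items, responses):
        p_hi = p_correct(cutoff + delta, a, b)
        p_lo = p_correct(cutoff - delta, a, b)
        if u == 1:
            log_lr += math.log(p_hi / p_lo)
        else:
            log_lr += math.log((1.0 - p_hi) / (1.0 - p_lo))

    upper = math.log((1.0 - beta) / alpha)   # accept "above" at or beyond this
    lower = math.log(beta / (1.0 - alpha))   # accept "below" at or beyond this
    if log_lr >= upper:
        return "above"
    if log_lr <= lower:
        return "below"
    return "continue"
```

A classification test built on this rule stops as soon as the decision is no longer "continue", which is why it can end earlier than a test that waits for a precise trait estimate.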

16.
Am J Pathol ; 177(3): 1388-96, 2010 Sep.
Article in English | MEDLINE | ID: mdl-20696780

ABSTRACT

In this study, a chronic yet synchronized version of the K/BxN mouse, the KRN-cell transfer model (KRN-CTM), was developed and extensively characterized. The transfer of purified splenic KRN T cells into T cell-deficient B6.TCR.Calpha(-/-)H-2(b/g7) mice induced anti-glucose 6-phosphate isomerase antibody-dependent chronic arthritis in 100% of the mice with uniform onset of disease 7 days after T cell transfer. Cellular infiltrations were assessed by whole-ankle transcript microarray, cytokine and chemokine levels, and microscopic and immunohistochemical analyses 7 through 42 days after T cell transfer. Transcripts identified an influx of monocytes/macrophages and neutrophils into the ankles and identified temporal progression of cartilage damage and bone resorption. In both serum and ankle tissue there was a significant elevation in interleukin-6, whereas macrophage inflammatory protein-1 alpha and monocyte chemotactic protein-1 were only elevated in tissue. Microscopic and immunohistochemical analyses revealed a time course for edema, synovial hypertrophy and hyperplasia, infiltration of F4/80-positive monocytes/macrophages and myeloperoxidase-positive neutrophils, destruction of articular cartilage, pannus invasion, bone resorption, extra-articular fibroplasia, and joint ankylosis. The KRN cell transfer model replicates many features of chronic rheumatoid arthritis in humans in a synchronized manner and lends itself to manipulation of adoptively transferred T cells and characterizing specific genes and T cell subsets responsible for rheumatoid arthritis pathogenesis and progression.


Subject(s)
Arthritis, Rheumatoid/pathology , Disease Models, Animal , Joints/pathology , T-Lymphocytes/pathology , T-Lymphocytes/transplantation , Animals , Arthritis, Rheumatoid/etiology , Arthritis, Rheumatoid/metabolism , Disease Progression , Enzyme-Linked Immunosorbent Assay , Flow Cytometry , Immunohistochemistry , Inflammation , Joints/metabolism , Macrophages/metabolism , Macrophages/pathology , Mice , Mice, Transgenic , Monocytes/metabolism , Monocytes/pathology , T-Lymphocytes/metabolism
17.
Educ Psychol Meas ; 81(3): 491-522, 2021 Jun.
Article in English | MEDLINE | ID: mdl-33994561

ABSTRACT

S-χ² is a popular item fit index that is available in commercial software packages such as flexMIRT. However, no research has systematically examined the performance of S-χ² for detecting item misfit within the context of the multidimensional graded response model (MGRM). The primary goal of this study was to evaluate the performance of S-χ² under two practical misfit scenarios: first, all items are misfitting due to model misspecification, and second, a small subset of items violate the underlying assumptions of the MGRM. Simulation studies showed that caution should be exercised when reporting item fit results of polytomous items using S-χ² within the context of the MGRM, because of its inflated false positive rates (FPRs), especially with a small sample size and a long test. S-χ² performed well when detecting overall model misfit as well as item misfit for a small subset of items when the ordinality assumption was violated. However, under a number of conditions of model misspecification or items violating the homogeneous discrimination assumption, even though true positive rates (TPRs) of S-χ² were high when a small sample size was coupled with a long test, the inflated FPRs were generally directly related to increasing TPRs. There was also a suggestion that performance of S-χ² was affected by the magnitude of misfit within an item. There was no evidence that FPRs for fitting items were exacerbated by the presence of a small percentage of misfitting items among them.

18.
Psychometrika ; 86(3): 674-711, 2021 09.
Article in English | MEDLINE | ID: mdl-34251615

ABSTRACT

Several methods used to examine differential item functioning (DIF) in Patient-Reported Outcomes Measurement Information System (PROMIS®) measures are presented, including effect size estimation. A summary of factors that may affect DIF detection and challenges encountered in PROMIS DIF analyses, e.g., anchor item selection, is provided. An issue in PROMIS was the potential for inadequately modeled multidimensionality to result in false DIF detection. Section 1 is a presentation of the unidimensional models used by most PROMIS investigators for DIF detection, as well as their multidimensional expansions. Section 2 is an illustration that builds on previous unidimensional analyses of depression and anxiety short-forms to examine DIF detection using a multidimensional item response theory (MIRT) model. The Item Response Theory-Log-likelihood Ratio Test (IRT-LRT) method was used for a real data illustration with gender as the grouping variable. The IRT-LRT DIF detection method is a flexible approach to handle group differences in trait distributions, known as impact in the DIF literature, and was studied with both real data and in simulations to compare the performance of the IRT-LRT method within the unidimensional IRT (UIRT) and MIRT contexts. Additionally, different effect size measures were compared for the data presented in Section 2. A finding from the real data illustration was that using the IRT-LRT method within a MIRT context resulted in more flagged items as compared to using the IRT-LRT method within a UIRT context. The simulations provided some evidence that while unidimensional and multidimensional approaches were similar in terms of Type I error rates, power for DIF detection was greater for the multidimensional approach. Effect size measures presented in Section 1 and applied in Section 2 varied in terms of estimation methods, choice of density function, methods of equating, and anchor item selection. 
Despite these differences, there was considerable consistency in results, especially for the items showing the largest values. Future work is needed to examine DIF detection in the context of polytomous, multidimensional data. PROMIS standards included incorporation of effect size measures in determining salient DIF. Integrated methods for examining effect size measures in the context of IRT-based DIF detection procedures are still in early stages of development.
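As a reading aid, the likelihood-ratio comparison at the core of an IRT-LRT DIF test can be sketched as below. This is a minimal illustration and not the authors' code: the function names, the log-likelihood values, and the choice of 2 degrees of freedom (one slope and one location parameter freed for the studied item) are all assumptions. The constrained model holds the studied item's parameters equal across groups; the free model lets them differ.

```python
import math

def chi2_sf(x: float, df: int) -> float:
    """Upper-tail probability of a chi-square variable with integer df."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 0:
        # Even df: exp(-x/2) times a finite power series in x/2.
        term, total = 1.0, 1.0
        for i in range(1, df // 2):
            term *= half / i
            total += term
        return math.exp(-half) * total
    # Odd df: two-sided normal tail plus a finite correction series.
    root = math.sqrt(x)
    total = math.erfc(root / math.sqrt(2.0))
    term = math.sqrt(2.0 / math.pi) * math.exp(-half) * root
    for r in range(1, (df - 1) // 2 + 1):
        total += term
        term *= x / (2 * r + 1)
    return total

def lrt_dif(loglik_constrained: float, loglik_free: float, df: int):
    """Return (G2, p) for the constrained-vs-free model comparison."""
    g2 = -2.0 * (loglik_constrained - loglik_free)
    return g2, chi2_sf(g2, df)

# Hypothetical log-likelihoods, for illustration only.
g2, p = lrt_dif(loglik_constrained=-5123.4, loglik_free=-5119.9, df=2)
print(f"G2 = {g2:.2f}, p = {p:.4f}")
```

A small p-value flags the item for DIF. As the abstract notes, the PROMIS standards also call for an effect size measure before DIF is treated as salient.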


Subject(s)
Anxiety, Patient Reported Outcome Measures, Humans, Information Systems, Psychometrics
19.
Public Health Genomics ; 24(5-6): 291-303, 2021.
Article in English | MEDLINE | ID: mdl-34058740

ABSTRACT

BACKGROUND: Genomic testing is increasingly employed in clinical, research, educational, and commercial contexts. Genomic literacy is a prerequisite for the effective application of genomic testing, creating a corresponding need for validated tools to assess genomics knowledge. We sought to develop a reliable measure of genomics knowledge that incorporates modern genomic technologies and is informative for individuals with diverse backgrounds, including those with clinical/life sciences training. METHODS: We developed the GKnowM Genomics Knowledge Scale to assess the knowledge needed to make an informed decision for genomic testing, appropriately apply genomic technologies and participate in civic decision-making. We administered the 30-item draft measure to a calibration cohort (n = 1,234) and subsequent participants to create a combined validation cohort (n = 2,405). We performed a multistage psychometric calibration and validation using classical test theory and item response theory (IRT) and conducted a post-hoc simulation study to evaluate the suitability of a computerized adaptive testing (CAT) implementation. RESULTS: Based on exploratory factor analysis, we removed 4 of the 30 draft items. The resulting 26-item GKnowM measure has a single dominant factor. The scale internal consistency is α = 0.85, and the IRT 3-PL model demonstrated good overall and item fit. Validity is demonstrated with significant correlation (r = 0.61) with an existing genomics knowledge measure and significantly higher scores for individuals with adequate health literacy and healthcare providers (HCPs), including HCPs who work with genomic testing. The item bank is well suited to CAT, achieving high accuracy (r = 0.97 with the full measure) while administering a mean of 13.5 items. 
CONCLUSION: GKnowM is an updated, broadly relevant, rigorously validated 26-item measure for assessing genomics knowledge that we anticipate will be useful for assessing population genomic literacy and evaluating the effectiveness of genomics educational interventions.
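For readers unfamiliar with the 3-PL model and CAT, the sketch below shows the standard three-parameter logistic response function, its Fisher information, and a maximum-information rule for choosing the next item. It is an illustration under stated assumptions: the item parameters are invented, and the abstract does not say which item-selection rule the simulation used, so maximum information is only a common default.

```python
import math

def p_3pl(theta: float, a: float, b: float, c: float) -> float:
    """3-PL probability of a correct response.

    a: discrimination, b: difficulty, c: lower asymptote (guessing).
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

def info_3pl(theta: float, a: float, b: float, c: float) -> float:
    """Fisher information of a 3-PL item at ability theta."""
    p = p_3pl(theta, a, b, c)
    q = 1.0 - p
    return a * a * (q / p) * ((p - c) / (1.0 - c)) ** 2

def next_item(theta: float, items, administered) -> int:
    """Index of the unadministered item with maximum information at theta."""
    candidates = [i for i in range(len(items)) if i not in administered]
    return max(candidates, key=lambda i: info_3pl(theta, *items[i]))

# Hypothetical (a, b, c) parameters, not taken from the GKnowM item bank.
bank = [(1.0, -1.0, 0.2), (1.5, 0.0, 0.2), (0.8, 2.0, 0.25)]
print(next_item(theta=0.0, items=bank, administered=set()))
```

In a full CAT loop the ability estimate would be updated after each response and the selection repeated until a stopping rule (for example, a target standard error) is met.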


Subject(s)
Health Literacy, Factor Analysis, Genomics, Humans, Psychometrics/methods, Reproducibility of Results, Surveys and Questionnaires
20.
Arch Rehabil Res Clin Transl ; 3(2): 100112, 2021 Jun.
Article in English | MEDLINE | ID: mdl-34179750

ABSTRACT

OBJECTIVE: To (1) develop a patient-reported, multidomain functional assessment tool focused on medically ill patients in acute care settings; (2) characterize the measure's psychometric performance; and (3) establish clinically actionable score strata that link to easily implemented mobility preservation plans. DESIGN: This article describes the approach that our team pursued to develop and characterize this tool, the Functional Assessment in Acute Care Multidimensional Computer Adaptive Test (FAMCAT). Development involved a multistep process that included (1) expanding and refining existing item banks to optimize their salience for hospitalized patients; (2) administering candidate items to a calibration cohort; (3) estimating multidimensional item response theory models; (4) calibrating the item banks; (5) evaluating potential multidimensional computerized adaptive testing (MCAT) enhancements; (6) parameterizing the MCAT; (7) administering it to patients in a validation cohort; and (8) estimating its predictive and psychometric characteristics. SETTING: A large (2000-bed) Midwestern medical center. PARTICIPANTS: The overall sample included 4495 adults (2341 in a calibration cohort, 2154 in a validation cohort) who were admitted either to medical services with at least 1 chronic condition or to surgical/medical services if they required readmission after a hospitalization for surgery. INTERVENTION: Not applicable. MAIN OUTCOME MEASURES: Not applicable. RESULTS: The FAMCAT is an instrument designed to permit the efficient, precise, low-burden, multidomain functional assessment of hospitalized patients.
We tried to optimize the FAMCAT's efficiency and precision, as well as its ability to perform multiple assessments during a hospital stay, by applying cutting-edge methods such as the adaptive measure of change (AMC), differential item functioning computerized adaptive testing, and integration of collateral test-taking information, particularly item response times. Evaluation of these candidate methods suggested that all may enhance MCAT performance, but none were integrated into initial MCAT parameterization. CONCLUSIONS: The FAMCAT has the potential to address a longstanding need for structured, frequent, and accurate functional assessment among patients hospitalized with medical diagnoses and complications of surgery.
