Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Psicothema ; 33(4): 631-638, 2021 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-34668479

RESUMO

BACKGROUND: The objective of the study was to design and validate the Psychosocial Trauma Scale (ETAPS) for assessing psychosocial consequences of collective violence. This instrument proposed the following dimensions: Pre-traumatic Situation, Destruction of Fundamental Beliefs, Intergroup Emotions, and Family and Community Destruction. METHOD: A total of 382 people participated who had been affected by political violence: civil war in El Salvador, forced displacement from Colombia and state violence from Chile. The study had three phases: (1) content validity of the items evaluated by experts; (2) exploratory factor analysis to study the structure of ETAPS, reducing the number of items; (3) convergent (post-traumatic stress symptomatology) and divergent (psychological and social well-being) validity. RESULTS: The EFA showed that ETAPS had a slightly different internal structure from that proposed. The dimensions found were Pre-traumatic Situation and Intergroup Emotions along with two new emerging dimensions: Destruction of Sociality and Personal and Collective Self-Efficacy. Divergent and convergent validity gave expected results except for the pre-traumatic situation. CONCLUSIONS: The ETAPS dimensions show that the effects of violence are broader than the symptoms measured by clinical scales. An instrument with adequate psychometric properties was obtained which will be useful for future studies in the area.


Assuntos
Emoções , Violência , Chile , Análise Fatorial , Humanos , Psicometria , Reprodutibilidade dos Testes
2.
Span J Psychol ; 21: E62, 2018 Dec 03.
Artigo em Inglês | MEDLINE | ID: mdl-30501646

RESUMO

This study analyses the extent to which cheating occurs in a real selection setting. A two-stage, unproctored and proctored, test administration was considered. Test score inconsistencies were concluded by applying a verification test (Guo and Drasgow Z-test). An initial simulation study showed that the Z-test has adequate Type I error and power rates in the specific selection settings explored. A second study applied the Z-test statistic verification procedure to a sample of 954 employment candidates. Additional external evidence based on item time response to the verification items was gathered. The results revealed a good performance of the Z-test statistic and a relatively low, but non-negligible, number of suspected cheaters that showed higher distorted ability estimates. The study with real data provided additional information on the presence of suspected cheating in unproctored applications and the viability of using item response times as an additional evidence of cheating. In the verification test, suspected cheaters spent 5.78 seconds per item more than expected considering the item difficulty and their assumed ability in the unproctored stage. We found that the percentage of suspected cheaters in the empirical study could be estimated at 13.84%. In summary, the study provides evidence of the usefulness of the Z-test in the detection of cheating in a specific setting, in which a computerized adaptive test for assessing English grammar knowledge was used for personnel selection.


Assuntos
Enganação , Avaliação Educacional/normas , Internet , Seleção de Pessoal/normas , Adulto , Feminino , Humanos , Masculino
3.
Psychol Assess ; 30(12): 1678-1690, 2018 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-30160497

RESUMO

Multidimensional computerized adaptive testing based on the bifactor model (MCAT-B) can provide efficient assessments of multifaceted constructs. In this study, MCAT-B was compared with a short fixed-length scale and computerized adaptive testing based on unidimensional (UCAT) and multidimensional (correlated-factors) models (MCAT) to measure the Big Five model of personality. The sample comprised 826 respondents who completed a pool with 360 personality items measuring the Big Five domains and facets. The dimensionality of the Big Five domains was also tested. With only 12 items per domain, the MCAT and MCAT-B procedures were more efficient to assess highly multidimensional constructs (e.g., Agreeableness), whereas no differences were found with UCAT and the short scale with traits that were essentially unidimensional (e.g., Extraversion). Furthermore, the study showed that MCAT and MCAT-B provide better content-balance of the pool because, for each Big Five domain, items from all the facets are selected in similar proportions. (PsycINFO Database Record (c) 2018 APA, all rights reserved).


Assuntos
Diagnóstico por Computador/métodos , Determinação da Personalidade , Transtornos da Personalidade/diagnóstico , Transtornos da Personalidade/psicologia , Adolescente , Adulto , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Reprodutibilidade dos Testes , Estudantes/psicologia , Adulto Jovem
4.
Psicothema ; 29(3): 390-395, 2017 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-28693712

RESUMO

BACKGROUND: Even though the Five Factor Model (FFM) has been the dominant paradigm in personality research for the past two decades, very few studies have measured the FFM adaptively. Thus, the purpose of this research was the building of a new item pool to develop a computerized adaptive test (CAT) for personality assessment. METHOD: A pool of 480 items that measured the FFM facets was developed and applied to 826 participants. Facets were calibrated separately and item selection was performed being mindful of the preservation of unidimensionality of each facet. Then, a post-hoc simulation study was carried out to test the performance of separate CATs to measure the facets. RESULTS: The final item pool was composed of 360 items with good psychometric properties. Findings reveal that a CAT administration of four items per facet (total length of 120 items) provides accurate facets scores, while maintaining the factor structure of the FFM. CONCLUSIONS: An item pool with good psychometric properties was obtained and a CAT simulation study demonstrated that the FFM facets could be measured with precision using a third of the items in the pool.


Assuntos
Testes de Personalidade , Adolescente , Adulto , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Adulto Jovem
5.
Hum Brain Mapp ; 38(2): 803-816, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-27726264

RESUMO

Neuroimaging research involves analyses of huge amounts of biological data that might or might not be related with cognition. This relationship is usually approached using univariate methods, and, therefore, correction methods are mandatory for reducing false positives. Nevertheless, the probability of false negatives is also increased. Multivariate frameworks have been proposed for helping to alleviate this balance. Here we apply multivariate distance matrix regression for the simultaneous analysis of biological and cognitive data, namely, structural connections among 82 brain regions and several latent factors estimating cognitive performance. We tested whether cognitive differences predict distances among individuals regarding their connectivity pattern. Beginning with 3,321 connections among regions, the 36 edges better predicted by the individuals' cognitive scores were selected. Cognitive scores were related to connectivity distances in both the full (3,321) and reduced (36) connectivity patterns. The selected edges connect regions distributed across the entire brain and the network defined by these edges supports high-order cognitive processes such as (a) (fluid) executive control, (b) (crystallized) recognition, learning, and language processing, and (c) visuospatial processing. This multivariate study suggests that one widespread, but limited number, of regions in the human brain, supports high-level cognitive ability differences. Hum Brain Mapp 38:803-816, 2017. © 2016 Wiley Periodicals, Inc.


Assuntos
Encéfalo/diagnóstico por imagem , Encéfalo/fisiologia , Cognição/fisiologia , Análise Multivariada , Análise de Regressão , Adolescente , Mapeamento Encefálico , Feminino , Humanos , Imageamento Tridimensional , Imageamento por Ressonância Magnética , Masculino , Vias Neurais/diagnóstico por imagem , Vias Neurais/fisiologia , Testes Neuropsicológicos , Reprodutibilidade dos Testes , Adulto Jovem
6.
Appl Psychol Meas ; 41(8): 614-631, 2017 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-29882533

RESUMO

Research related to the fit evaluation at the item level involving cognitive diagnosis models (CDMs) has been scarce. According to the parsimony principle, balancing goodness of fit against model complexity is necessary. General CDMs require a larger sample size to be estimated reliably, and can lead to worse attribute classification accuracy than the appropriate reduced models when the sample size is small and the item quality is poor, which is typically the case in many empirical applications. The main purpose of this study was to systematically examine the statistical properties of four inferential item-fit statistics: S-X2 , the likelihood ratio (LR) test, the Wald (W) test, and the Lagrange multiplier (LM) test. To evaluate the performance of the statistics, a comprehensive set of factors, namely, sample size, correlational structure, test length, item quality, and generating model, is systematically manipulated using Monte Carlo methods. Results show that the S-X2 statistic has unacceptable power. Type I error and power comparisons favor LR and W tests over the LM test. However, all the statistics are highly affected by the item quality. With a few exceptions, their performance is only acceptable when the item quality is high. In some cases, this effect can be ameliorated by an increase in sample size and test length. This implies that using the above statistics to assess item fit in practical settings when the item quality is low remains a challenge.

7.
Psicothema ; 28(3): 346-52, 2016 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-27448271

RESUMO

BACKGROUND: Multistage adaptive testing has recently emerged as an alternative to the computerized adaptive test. The current study details a new multistage test to assess fluid intelligence. METHOD: An item pool of progressive matrices with constructed response format was developed, and divided into six subtests. The subtests were applied to a sample of 724 college students and their psychometric properties were studied (i.e., reliability, dimensionality and validity evidence). The item pool was calibrated under the graded response model, and two multistage structures were developed, based on the automatic test assembly principles. Finally, the test information provided by each structure was compared in order to select the most appropriate one. RESULTS: The item pool showed adequate psychometric properties. From the two compared multistage structures, the simplest structure (i.e., routing test and two modules in the next stages) were more informative across the latent trait continuum and were therefore kept. DISCUSSION: Taken together, the results of the two studies support the application of the FIMT (Fluid Intelligence Multistage Test), a multistage test to assess fluid intelligence accurately and innovatively.


Assuntos
Testes de Inteligência , Adolescente , Adulto , Feminino , Humanos , Masculino , Psicometria , Adulto Jovem
8.
Span J Psychol ; 17: E48, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25012203

RESUMO

Test security can be a major problem in computerized adaptive testing, as examinees can share information about the items they receive. Of the different item selection rules proposed to alleviate this risk, stratified methods are among those that have received most attention. In these methods, only low discriminative items can be presented at the beginning of the test and the mean information of the items increases as the test goes on. To do so, the item bank must be divided into several strata according to the information of the items. To date, there is no clear guidance about the optimal number of strata into which the item bank should be split. In this study, we will simulate conditions with different numbers of strata, from 1 (no stratification) to a number of strata equal to test length (maximum level of stratification) while manipulating the maximum exposure rate that no item should surpass (r max ) in its whole domain. In this way, we can plot the relation between test security and accuracy, making it possible to determine the number of strata that leads to better security while holding constant measurement accuracy. Our data indicates that the best option is to stratify into as many strata as possible.


Assuntos
Metodologias Computacionais , Avaliação Educacional/normas , Psicometria/normas , Avaliação Educacional/métodos , Humanos , Psicometria/métodos
9.
Psicothema ; 26(3): 372-7, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25069557

RESUMO

BACKGROUND: Profiling of jobs in terms of competency requirements has increasingly been applied in many organizational settings. Testing these competencies through situational judgment tests (SJTs) leads to validity problems because it is not usually clear which constructs SJTs measure. The primary purpose of this paper is to evaluate whether the application of cognitive diagnosis models (CDM) to competency-based SJTs can ascertain the underlying competencies measured by the items, and whether these competencies can be estimated precisely. METHOD: The generalized deterministic inputs, noisy "and" gate (G-DINA) model was applied to 26 situational judgment items measuring professional competencies based on the great eight model. These items were applied to 485 employees of a Spanish financial company. The fit of the model to the data and the convergent validity between the estimated competencies and personality dimensions were examined. RESULTS: The G-DINA showed a good fit to the data and the estimated competency factors, adapting and coping and interacting and presenting were positively related to emotional stability and extraversion, respectively. CONCLUSIONS: This work indicates that CDM can be a useful tool when measuring professional competencies through SJTs. CDM can clarify the competencies being measured and provide precise estimates of these competencies.


Assuntos
Cognição , Modelos Psicológicos , Competência Profissional , Adulto , Feminino , Humanos , Julgamento , Masculino , Testes Psicológicos
10.
Psicothema ; 26(3): 395-400, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25069561

RESUMO

BACKGROUND: The Exploratory Factor Analysis (EFA) procedure is one of the most commonly used in social and behavioral sciences. However, it is also one of the most criticized due to the poor management researchers usually display. The main goal is to examine the relationship between practices usually considered more appropriate and actual decisions made by researchers. METHOD: The use of exploratory factor analysis is examined in 117 papers published between 2011 and 2012 in 3 Spanish psychological journals with the highest impact within the previous five years. RESULTS: RESULTS show significant rates of questionable decisions in conducting EFA, based on unjustified or mistaken decisions regarding the method of extraction, retention, and rotation of factors. CONCLUSIONS: Overall, the current review provides support for some improvement guidelines regarding how to apply and report an EFA.


Assuntos
Análise Fatorial , Estudos de Validação como Assunto , Guias como Assunto
11.
Psicothema ; 25(2): 238-44, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23628540

RESUMO

BACKGROUND: Criterion-referenced interpretations of tests are highly necessary, which usually involves the difficult task of establishing cut scores. Contrasting with other Item Response Theory (IRT)-based standard setting methods, a non-judgmental approach is proposed in this study, in which Item Characteristic Curve (ICC) transformations lead to the final cut scores. METHOD: eCat-Listening, a computerized adaptive test for the evaluation of English Listening, was administered to 1,576 participants, and the proposed standard setting method was applied to classify them into the performance standards of the Common European Framework of Reference for Languages (CEFR). RESULTS: The results showed a classification closely related to relevant external measures of the English language domain, according to the CEFR. CONCLUSIONS: It is concluded that the proposed method is a practical and valid standard setting alternative for IRT-based tests interpretations.


Assuntos
Compreensão , Testes Psicológicos , Computadores , Humanos , Modelos Estatísticos , Psicometria , Reprodutibilidade dos Testes
12.
Span J Psychol ; 15(1): 424-41, 2012 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-22379731

RESUMO

This paper describes several simulation studies that examine the effects of capitalization on chance in the selection of items and the ability estimation in CAT, employing the 3-parameter logistic model. In order to generate different estimation errors for the item parameters, the calibration sample size was manipulated (N = 500, 1000 and 2000 subjects) as was the ratio of item bank size to test length (banks of 197 and 788 items, test lengths of 20 and 40 items), both in a CAT and in a random test. Results show that capitalization on chance is particularly serious in CAT, as revealed by the large positive bias found in the small sample calibration conditions. For broad ranges of theta, the overestimation of the precision (asymptotic Se) reaches levels of 40%, something that does not occur with the RMSE (theta). The problem is greater as the item bank size to test length ratio increases. Potential solutions were tested in a second study, where two exposure control methods were incorporated into the item selection algorithm. Some alternative solutions are discussed.


Assuntos
Algoritmos , Inteligência Artificial , Avaliação Educacional/estatística & dados numéricos , Modelos Estatísticos , Psicometria/estatística & dados numéricos , Humanos , Reprodutibilidade dos Testes , Design de Software
13.
Psicothema ; 23(4): 802-7, 2011 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-22047876

RESUMO

In this study, eCAT-Listening, a new computerized adaptive test for the evaluation of English Listening, is described. Item bank development, anchor design for data collection, and the study of the psychometric properties of the item bank and the adaptive test are described. The calibration sample comprised 1.576 participants. Good psychometric guarantees: the bank is unidimensional, the items are satisfactorily fitted to the 3-parameter logistic model, and an accurate estimation of the trait level is obtained. As validity evidence, a high correlation was obtained between the estimated trait level and a latent factor made up of the diverse criteria selected. The analysis of the trait level estimation by means of a simulation led us to fix the test length at 20 items, with a maximum exposure rate of .40.


Assuntos
Audição , Testes de Linguagem , Computadores , Humanos , Psicometria
14.
Span J Psychol ; 14(1): 500-8, 2011 May.
Artigo em Inglês | MEDLINE | ID: mdl-21568205

RESUMO

In computerized adaptive testing, the most commonly used valuating function is the Fisher information function. When the goal is to keep item bank security at a maximum, the valuating function that seems most convenient is the matching criterion, valuating the distance between the estimated trait level and the point where the maximum of the information function is located. Recently, it has been proposed not to keep the same valuating function constant for all the items in the test. In this study we expand the idea of combining the matching criterion with the Fisher information function. We also manipulate the number of strata into which the bank is divided. We find that the manipulation of the number of items administered with each function makes it possible to move from the pole of high accuracy and low security to the opposite pole. It is possible to greatly improve item bank security with much fewer losses in accuracy by selecting several items with the matching criterion. In general, it seems more appropriate not to stratify the bank.


Assuntos
Diagnóstico por Computador/estatística & dados numéricos , Avaliação Educacional/estatística & dados numéricos , Sistemas de Informação/estatística & dados numéricos , Testes Psicológicos/estatística & dados numéricos , Psicometria/estatística & dados numéricos , Software , Inquéritos e Questionários , Algoritmos , Inteligência Artificial , Simulação por Computador , Humanos , Linguística , Computação Matemática , Matemática , Reprodutibilidade dos Testes
15.
Psicothema ; 22(2): 340-7, 2010 May.
Artigo em Espanhol | MEDLINE | ID: mdl-20423641

RESUMO

This study describes the parameter drift analysis conducted on eCAT (a Computerized Adaptive Test to assess the written English level of Spanish speakers). The original calibration of the item bank (N = 3224) was compared to a new calibration obtained from the data provided by most eCAT operative administrations (N = 7254). A Differential Item Functioning (DIF) study was conducted between the original and the new calibrations. The impact that the new parameters have on the trait level estimates was obtained by simulation. Results show that parameter drift is found especially for a and c parameters, an important number of bank items show DIF, and the parameter change has a moderate impact on high-level-English ? estimates. It is then recommended to replace the original estimates by the new set.


Assuntos
Avaliação Educacional/métodos , Avaliação Educacional/estatística & dados numéricos , Software , Humanos , Idioma
16.
Span J Psychol ; 11(2): 618-25, 2008 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-18988447

RESUMO

If examinees were to know, beforehand, part of the content of a computerized adaptive test, their estimated trait levels would then have a marked positive bias. One of the strategies to avoid this consists of dividing a large item bank into several sub-banks and rotating the sub-bank employed (Ariel, Veldkamp & van der Linden, 2004). This strategy permits substantial improvements in exposure control at little cost to measurement accuracy, However, we do not know whether this option provides better results than using the master bank with greater restriction in the maximum exposure rates (Sympson & Hetter, 1985). In order to investigate this issue, we worked with several simulated banks of 2100 items, comparing them, for RMSE and overlap rate, with the same banks divided in two, three... up to seven sub-banks. By means of extensive manipulation of the maximum exposure rate in each bank, we found that the option of rotating banks slightly outperformed the option of restricting maximum exposure rate of the master bank by means of the Sympson-Hetter method.


Assuntos
Testes de Aptidão/estatística & dados numéricos , Caráter , Bases de Dados como Assunto , Diagnóstico por Computador/estatística & dados numéricos , Avaliação Educacional/estatística & dados numéricos , Design de Software , Viés , Metodologias Computacionais , Humanos , Computação Matemática , Psicometria/estatística & dados numéricos
17.
Br J Math Stat Psychol ; 61(Pt 2): 493-513, 2008 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-17681109

RESUMO

The most commonly employed item selection rule in a computerized adaptive test (CAT) is that of selecting the item with the maximum Fisher information for the estimated trait level. This means a highly unbalanced distribution of item-exposure rates, a high overlap rate among examinees and, for item bank management, strong pressure to replace items with a high discrimination parameter in the bank. An alternative for mitigating these problems involves, at the beginning of the test, basing item selection mainly on randomness. As the test progresses, the weight of information in the selection increases. In the present work we study, for two selection rules, the progressive methods (Revuelta & Ponsoda, 1998) and the proportional method (Segall, 2004a), different functions that define the weight of the random component according to the position in the test of the item to be administered. The functions were tested in simulated item banks and in an operative bank. We found that both the progressive and the proportional methods tolerate a high weight of the random component with minimal or zero loss of accuracy, while bank security and maintenance are improved.


Assuntos
Desenho Assistido por Computador , Modelos Psicológicos , Distribuição Aleatória , Humanos , Psicologia/métodos , Psicologia/estatística & dados numéricos
18.
Psicothema ; 18(1): 156-9, 2006 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-17296025

RESUMO

The proposal for increasing the security in Computerized Adaptive Tests that has received most attention in recent years is the a-stratified method (AS - Chang and Ying, 1999): at the beginning of the test only items with low discrimination parameters ( a ) can be administered, with the values of the a parameters increasing as the test goes on. With this method, distribution of the exposure rates of the items is less skewed, while efficiency is maintained in trait-level estimation. The pseudo-guessing parameter ( c ), present in the three-parameter logistic model, is considered irrelevant, and is not used in the AS method. The Maximum Information Stratified (MIS) model incorporates the c parameter in the stratification of the bank and in the item-selection rule, improving accuracy by comparison with the AS, for item banks with a and b parameters correlated and uncorrelated. For both kinds of banks, the blocking b methods (Chang, Qian and Ying, 2001) improve the security of the item bank.


Assuntos
Inteligência Artificial , Microcomputadores , Testes Psicológicos , Design de Software , Algoritmos , Distribuição de Qui-Quadrado , Humanos , Funções Verossimilhança
19.
Psicothema ; 18(4): 828-34, 2006 Nov.
Artigo em Espanhol | MEDLINE | ID: mdl-17296125

RESUMO

Item selection rules in a Computerized Adaptive Test for the assessment of written English. e-CAT is a Computerized Adaptive Test for the evaluation of written English knowledge, using the item selection rule most commonly employed: the maximum Fisher information criterion. Some of the problems of this criterion have a negative impact in the estimation accuracy and in the item bank security. In this study, the performance of this item selection rule is compared, by means of simulation, with two other rules: selecting the item with maximum Fisher information in an interval (Veerkamp y Berger, 1997) and a new criterion, called "maximum Fisher information in an interval with geometric mean". In general, this new rule shows smaller measurement error and smaller item overlap rates. It seems, thus, recommendable, as it allows the simultaneous improvement of estimation accuracy and the maintenance of the item bank security of e-CAT.


Assuntos
Compreensão , Metodologias Computacionais , Testes de Linguagem , Leitura , Algoritmos , Simulação por Computador , Avaliação Educacional , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA