Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Más filtros












Base de datos
Intervalo de año de publicación
1.
Sci Prog ; 107(2): 368504241238773, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38614464

RESUMEN

In alignment with the distributional hypothesis of language, the work "Quantum Projections on Conceptual Subspaces" (Martínez-Mingo A, Jorge-Botana G, Martinez-Huertas JÁ, et al. Quantum projections on conceptual subspaces. Cogn Syst Res 2023; 82: 101154) proposed a methodology for generating conceptual subspaces from textual information based on previous work (Martinez-Mingo A, Jorge-Botana G and Olmos R. Quantum approach for similarity evaluation in LSA vector space models. 2020). These subspaces enable the utilization of the quantum model of similarity put forth by Pothos and Busemeyer (Pothos E, Busemeyer J. A quantum probability explanation for violations of symmetry in similarity judgments. In Proceedings of the annual meeting of the cognitive science society, 2011, Vol. 33, No. 33), allowing for the empirical examination of the violations of assumptions concerning symmetry and triangular inequality (Tversky A. Features of similarity. Psychol Rev 1977; 84: 327-352; Yearsley JM, Barque-Duran A, Scerrati E, et al. The triangle inequality constraint in similarity judgments. Prog Biophys Mol Biol 2017; 130: 26-32), as well as the diagnosticity effect (Tversky A. Features of similarity. Psychol Rev 1977; 84: 327-352; Yearsley JM, Pothos EM, Barque-Duran A, et al. Context effects in similarity judgments. J Exp Psychol Gen 2022; 151: 711-717), within a data-driven environment. These psychological biases, deeply studied by authors such as Tversky and Kahneman, inform us about the limitations of modeling psychological similarity measures using tools from classical geometry. This commentary aims to offer methodological clarifications, discuss theoretical and practical implications, and speculate on future directions in this field of research. Concretely, it aims to propose the use of different contours (conceptual or contextual) to generate the subspaces, which lead to subspaces of terms or contexts. Once these contours are defined, a differentiation is proposed between Aggregated Terms Subspaces (ATSs), Aggregated Contexts Subspaces (ACSs), and Aggregated Features Subspaces (AFSs) depending on whether we define the subspaces by grouping the terms or contexts within the contour, or from the latent dimensions of the semantic space obtained in the contour window. Finally, new data is provided on the violation of the triangular inequality assumption through the application of the quantum similarity model to ATSs.

2.
Cogn Emot ; : 1-9, 2023 Nov 21.
Artículo en Inglés | MEDLINE | ID: mdl-37987756

RESUMEN

In this study, we analyzed the relationship between the amodal (semantic) development of words and two popular emotional norms (emotional valence and arousal) in English and Spanish languages. To do so, we combined the strengths of semantics from vector space models (vector length, semantic diversity, and word maturity measures), and feature-based models of emotions. First, we generated a common vector space representing the meaning of words at different developmental stages (five and four developmental stages for English and Spanish, respectively) using the Word Maturity methodology to align different vector spaces. Second, we analyzed the amodal development of words through mixed-effects models with crossed random effects for words and variables using a continuous time metric. Third, the emotional norms were included as covariates in the statistical models. We evaluated more than 23,000 words, whose emotional norms were available for more than 10,000 words, in each language separately. Results showed a curve of amodal development with an increasing linear effect and a small quadratic deceleration. A relevant influence on the amodal development of words was found only for emotional valence (not for arousal), suggesting that positive words have an earlier amodal development and a less pronounced semantic change across early lifespan.

3.
Behav Res Methods ; 54(5): 2579-2601, 2022 10.
Artículo en Inglés | MEDLINE | ID: mdl-35018609

RESUMEN

In this paper, we highlight the importance of distilling the computational assessments of constructed responses to validate the indicators/proxies of constructs/trins using an empirical illustration in automated summary evaluation. We present the validation of the Inbuilt Rubric (IR) method that maps rubrics into vector spaces for concepts' assessment. Specifically, we improved and validated its scores' performance using latent variables, a common approach in psychometrics. We also validated a new hierarchical vector space, namely a bifactor IR. 205 Spanish undergraduate students produced 615 summaries of three different texts that were evaluated by human raters and different versions of the IR method using latent semantic analysis (LSA). The computational scores were validated using multiple linear regressions and different latent variable models like CFAs or SEMs. Convergent and discriminant validity was found for the IR scores using human rater scores as validity criteria. While this study was conducted in the Spanish language, the proposed scheme is language-independent and applicable to any language. We highlight four main conclusions: (1) Accurate performance can be observed in topic-detection tasks without hundreds/thousands of pre-scored samples required in supervised models. (2) Convergent/discriminant validity can be improved using measurement models for computational scores as they adjust for measurement errors. (3) Nouns embedded in fragments of instructional text can be an affordable alternative to use the IR method. (4) Hierarchical models, like the bifactor IR, can increase the validity of computational assessments evaluating general and specific knowledge in vector space models. R code is provided to apply the classic and bifactor IR method.


Asunto(s)
Lenguaje , Semántica , Humanos , Psicometría/métodos , Estudiantes , Simulación del Espacio , Reproducibilidad de los Resultados
4.
Cogn Sci ; 45(7): e13026, 2021 07.
Artículo en Inglés | MEDLINE | ID: mdl-34288038

RESUMEN

We present a longitudinal computational study on the connection between emotional and amodal word representations from a developmental perspective. In this study, children's and adult word representations were generated using the latent semantic analysis (LSA) vector space model and Word Maturity methodology. Some children's word representations were used to set a mapping function between amodal and emotional word representations with a neural network model using ratings from 9-year-old children. The neural network was trained and validated in the child semantic space. Then, the resulting neural network was tested with adult word representations using ratings from an adult data set. Samples of 1210 and 5315 words were used in the child and the adult semantic spaces, respectively. Results suggested that the emotional valence of words can be predicted from amodal vector representations even at the child stage, and accurate emotional propagation was found in the adult word vector representations. In this way, different propagative processes were observed in the adult semantic space. These findings highlight a potential mechanism for early verbal emotional anchoring. Moreover, different multiple linear regression and mixed-effect models revealed moderation effects for the performance of the longitudinal computational model. First, words with early maturation and subsequent semantic definition promoted emotional propagation. Second, an interaction effect between age of acquisition and abstractness was found to explain model performance. The theoretical and methodological implications are discussed.


Asunto(s)
Emociones , Semántica , Niño , Humanos
5.
Mem Cognit ; 49(2): 219-234, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-32820469

RESUMEN

Some proposals claim that language acts as a link to propagate emotional and other modal information. Thus, there is an eminently amodal path of emotional propagation in the mental lexicon. Following these proposals, we present a computational model that emulates a linking mechanism (mapping function) between emotional and amodal representations of words using vector space models, emotional feature-based models, and neural networks. We analyzed three central concepts within the embodiment debate (redundancy, isomorphism, and propagative mechanisms) comparing two alternative hypotheses: semantic neighborhood hypothesis versus specific dimensionality hypothesis. Univariate and multivariate neural networks were trained for dimensional (N = 11,357) and discrete emotions (N = 2,266), and later we analyzed its predictions in a test set (N = 4,167 and N = 875, respectively). We showed how this computational model could propagate emotional responses to words without a direct emotional experience via amodal propagation, but no direct relations were found between emotional rates and amodal distances. Thereby, we found that there were clear redundancy and propagative mechanisms, but no isomorphism should be assumed. Results suggested that it was necessary to establish complex links to go beyond amodal distances of vector spaces. In this way, although the emotional rates of semantic neighborhoods could predict the emotional rates of target words, the mapping function of specific amodal features seemed to simulate emotional responses better. Thus, both hypotheses would not be mutually exclusive. We also showed that discrete emotions could have simpler relations between modal and amodal representations than dimensional emotions. All these results and their theoretical implications are discussed.


Asunto(s)
Emociones , Humanos , Lenguaje , Semántica
6.
Cogn Process ; 21(1): 1-21, 2020 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-31555943

RESUMEN

In recent years, latent semantic analysis (LSA) has reached a level of maturity at which its presence is ubiquitous in technology as well as in simulation of cognitive processes. In spite of this, in recent years there has been a trend of subjecting LSA to some criticisms, usually because it is compared to other models in very specific tasks and conditions and sometimes without having good knowledge of what the semantic representation of LSA means, and without exploiting all the possibilities of which LSA is capable other than the cosine. This paper provides a critical review to clarify some of the misunderstandings regarding LSA and other space models. The historical stability of the predecessors of LSA, the representational structure of word meaning and the multiple topologies that could arise from a semantic space, the computation of similarity, the myth that LSA dimensions have no meaning, the computational and algorithm plausibility to account for meaning acquisition in LSA (in contrast to others models based on online mechanisms), the possibilities of spatial models to substantiate recent proposals, and, in general, the characteristics of classic vector models and their ease and flexibility to simulate some cognitive phenomena will be reviewed. The review highlights the similarity between LSA and other techniques and proposes using long LSA experiences in other models, especially in predicting models such as word2vec. In sum, it emphasizes the lessons that can be learned from comparing LSA-based models to other models, rather than making statements about "the best."


Asunto(s)
Aprendizaje , Semántica , Algoritmos , Humanos , Conocimiento , Modelos Teóricos
7.
Artículo en Inglés | MEDLINE | ID: mdl-29024568

RESUMEN

The aim of this paper is to describe and explain one useful computational methodology to model the semantic development of word representation: Word maturity. In particular, the methodology is based on the longitudinal word monitoring created by Kirylev and Landauer using latent semantic analysis for the representation of lexical units. The paper is divided into two parts. First, the steps required to model the development of the meaning of words are explained in detail. We describe the technical and theoretical aspects of each step. Second, we provide a simple example of application of this methodology with some simple tools that can be used by applied researchers. This paper can serve as a user-friendly guide for researchers interested in modeling changes in the semantic representations of words. Some current aspects of the technique and future directions are also discussed. WIREs Cogn Sci 2018, 9:e1457. doi: 10.1002/wcs.1457 This article is categorized under: Computer Science > Natural Language Processing Linguistics > Language Acquisition Psychology > Development and Aging.


Asunto(s)
Desarrollo del Lenguaje , Modelos Teóricos , Semántica , Vocabulario , Humanos
8.
Span J Psychol ; 12(2): 424-40, 2009 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-19899645

RESUMEN

There is currently a widespread interest in indexing and extracting taxonomic information from large text collections. An example is the automatic categorization of informally written medical or psychological diagnoses, followed by the extraction of epidemiological information or even terms and structures needed to formulate guiding questions as an heuristic tool for helping doctors. Vector space models have been successfully used to this end (Lee, Cimino, Zhu, Sable, Shanker, Ely & Yu, 2006; Pakhomov, Buntrock & Chute, 2006). In this study we use a computational model known as Latent Semantic Analysis (LSA) on a diagnostic corpus with the aim of retrieving definitions (in the form of lists of semantic neighbors) of common structures it contains (e.g. "storm phobia", "dog phobia") or less common structures that might be formed by logical combinations of categories and diagnostic symptoms (e.g. "gun personality" or "germ personality"). In the quest to bring definitions into line with the meaning of structures and make them in some way representative, various problems commonly arise while recovering content using vector space models. We propose some approaches which bypass these problems, such as Kintsch's (2001) predication algorithm and some corrections to the way lists of neighbors are obtained, which have already been tested on semantic spaces in a non-specific domain (Jorge-Botana, León, Olmos & Hassan-Montero, under review). The results support the idea that the predication algorithm may also be useful for extracting more precise meanings of certain structures from scientific corpora, and that the introduction of some corrections based on vector length may increases its efficiency on non-representative terms.


Asunto(s)
Indización y Redacción de Resúmenes , Algoritmos , Inteligencia Artificial , Almacenamiento y Recuperación de la Información , Trastornos Mentales/diagnóstico , Semántica , Terminología como Asunto , Sistemas de Administración de Bases de Datos , Eficiencia , Humanos , Procesamiento de Lenguaje Natural , Reconocimiento de Normas Patrones Automatizadas , Psicolingüística
9.
Behav Res Methods ; 41(3): 944-50, 2009 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-19587211

RESUMEN

In this study, we compared four expert graders with latent semantic analysis (LSA) to assess short summaries of an expository text. As is well known, there are technical difficulties for LSA to establish a good semantic representation when analyzing short texts. In order to improve the reliability of LSA relative to human graders, we analyzed three new algorithms by two holistic methods used in previous research (León, Olmos, Escudero, Cañas, & Salmerón, 2006). The three new algorithms were (1) the semantic common network algorithm, an adaptation of an algorithm proposed by W. Kintsch (2001, 2002) with respect to LSA as a dynamic model of semantic representation; (2) a best-dimension reduction measure of the latent semantic space, selecting those dimensions that best contribute to improving the LSA assessment of summaries (Hu, Cai, Wiemer-Hastings, Graesser, & McNamara, 2007); and (3) the Euclidean distance measure, used by Rehder et al. (1998), which incorporates at the same time vector length and the cosine measures. A total of 192 Spanish middle-grade students and 6 experts took part in this study. They read an expository text and produced a short summary. Results showed significantly higher reliability of LSA as a computerized assessment tool for expository text when it used a best-dimension algorithm rather than a standard LSA algorithm. The semantic common network algorithm also showed promising results.


Asunto(s)
Algoritmos , Investigación Conductal/métodos , Semántica , Adolescente , Adulto , Comprensión , Humanos , Modelos Estadísticos , Lectura
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...