Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 2 de 2
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Asunto de la revista
País de afiliación
Intervalo de año de publicación
1.
Behav Res Methods ; 56(5): 5178-5189, 2024 08.
Artículo en Inglés | MEDLINE | ID: mdl-38129738

RESUMEN

We present a collection of concreteness ratings for 35,979 words in Estonian. The data were collected via a web application from 2278 native Estonian speakers. Human ratings of concreteness have not been collected for Estonian beforehand. We compare our results to Aedmaa et al. (2018), who assigned concreteness ratings to 240,000 Estonian words by means of machine learning. We show that while these two datasets show reasonable correlation (R = 0.71), there are considerable differences in the distribution of the ratings, which we discuss in this paper. Furthermore, the results also raise questions about the importance of the type of scale used for collecting ratings. While most other datasets have been compiled based on questionnaires entailing five- or seven-point Likert scales, we used a continuous 0-10 scale. Comparing our rating distribution to those of other studies, we found that it is most similar to the distribution in Lahl et al. (Behavior Research Methods, 41(1), 13-19, 2009), who also used a 0-10 scale. Concreteness ratings for Estonian words are available at OSF .


Asunto(s)
Lenguaje , Humanos , Estonia , Femenino , Adulto , Masculino , Adulto Joven , Persona de Mediana Edad , Adolescente , Semántica , Aprendizaje Automático , Psicolingüística/métodos
2.
Sci Data ; 7(1): 13, 2020 01 13.
Artículo en Inglés | MEDLINE | ID: mdl-31932593

RESUMEN

Advances in computer-assisted linguistic research have been greatly influential in reshaping linguistic research. With the increasing availability of interconnected datasets created and curated by researchers, more and more interwoven questions can now be investigated. Such advances, however, are bringing high requirements in terms of rigorousness for preparing and curating datasets. Here we present CLICS, a Database of Cross-Linguistic Colexifications (CLICS). CLICS tackles interconnected interdisciplinary research questions about the colexification of words across semantic categories in the world's languages, and show-cases best practices for preparing data for cross-linguistic research. This is done by addressing shortcomings of an earlier version of the database, CLICS2, and by supplying an updated version with CLICS3, which massively increases the size and scope of the project. We provide tools and guidelines for this purpose and discuss insights resulting from organizing student tasks for database updates.


Asunto(s)
Bases de Datos Factuales , Lingüística , Humanos , Lenguaje
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA