Búsqueda | Biblioteca Virtual en Salud Odontología. Uruguay

Peekbank: An open, large-scale repository for developmental eye-tracking data of children's word recognition.

Zettersten, Martin; Yurovsky, Daniel; Xu, Tian Linger; Uner, Sarp; Tsui, Angeline Sin Mei; Schneider, Rose M; Saleh, Annissa N; Meylan, Stephan C; Marchman, Virginia A; Mankewitz, Jessica; MacDonald, Kyle; Long, Bria; Lewis, Molly; Kachergis, George; Handa, Kunal; deMayo, Benjamin; Carstensen, Alexandra; Braginsky, Mika; Boyce, Veronica; Bhatt, Naiti S; Bergey, Claire Augusta; Frank, Michael C.

Behav Res Methods ; 55(5): 2485-2500, 2023 08.

Artículo en Inglés | MEDLINE | ID: mdl-36002623

RESUMEN

The ability to rapidly recognize words and link them to referents is central to children's early language development. This ability, often called word recognition in the developmental literature, is typically studied in the looking-while-listening paradigm, which measures infants' fixation on a target object (vs. a distractor) after hearing a target label. We present a large-scale, open database of infant and toddler eye-tracking data from looking-while-listening tasks. The goal of this effort is to address theoretical and methodological challenges in measuring vocabulary development. We first present how we created the database, its features and structure, and associated tools for processing and accessing infant eye-tracking datasets. Using these tools, we then work through two illustrative examples to show how researchers can use Peekbank to interrogate theoretical and methodological questions about children's developing word recognition ability.

Asunto(s)

Tecnología de Seguimiento Ocular , Desarrollo del Lenguaje , Lactante , Humanos , Percepción Auditiva , Vocabulario

childes-db: A flexible and reproducible interface to the child language data exchange system.

Sanchez, Alessandro; Meylan, Stephan C; Braginsky, Mika; MacDonald, Kyle E; Yurovsky, Daniel; Frank, Michael C.

Behav Res Methods ; 51(4): 1928-1941, 2019 08.

Artículo en Inglés | MEDLINE | ID: mdl-30623390

RESUMEN

The Child Language Data Exchange System (CHILDES) has played a critical role in research on child language development, particularly in characterizing the early language learning environment. Access to these data can be both complex for novices and difficult to automate for advanced users, however. To address these issues, we introduce childes-db, a database-formatted mirror of CHILDES that improves data accessibility and usability by offering novel interfaces, including browsable web applications and an R application programming interface (API). Along with versioned infrastructure that facilitates reproducibility of past analyses, these interfaces lower barriers to analyzing naturalistic parent-child language, allowing for a wider range of researchers in language and cognitive development to easily leverage CHILDES in their work.

Asunto(s)

Lenguaje Infantil , Niño , Preescolar , Bases de Datos Factuales , Femenino , Humanos , Lactante , Desarrollo del Lenguaje , Masculino , Reproducibilidad de los Resultados

The Emergence of an Abstract Grammatical Category in Children's Early Speech.

Meylan, Stephan C; Frank, Michael C; Roy, Brandon C; Levy, Roger.

Psychol Sci ; 28(2): 181-192, 2017 02.

Artículo en Inglés | MEDLINE | ID: mdl-28074675

RESUMEN

How do children begin to use language to say things they have never heard before? The origins of linguistic productivity have been a subject of heated debate: Whereas generativist accounts posit that children's early language reflects the presence of syntactic abstractions, constructivist approaches instead emphasize gradual generalization derived from frequently heard forms. In the present research, we developed a Bayesian statistical model that measures the degree of abstraction implicit in children's early use of the determiners "a" and "the." Our work revealed that many previously used corpora are too small to allow researchers to judge between these theoretical positions. However, several data sets, including the Speechome corpus-a new ultra-dense data set for one child-showed evidence of low initial levels of productivity and higher levels later in development. These findings are consistent with the hypothesis that children lack rich grammatical knowledge at the outset of language learning but rapidly begin to generalize on the basis of structural regularities in their input.

Asunto(s)

Desarrollo del Lenguaje , Modelos Estadísticos , Psicolingüística , Niño , Humanos

Word Forms Reflect Trade-Offs Between Speaker Effort and Robust Listener Recognition.

Meylan, Stephan C; Griffiths, Thomas L.

Cogn Sci ; 48(7): e13478, 2024 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-38980972

RESUMEN

How do cognitive pressures shape the lexicons of natural languages? Here, we reframe George Kingsley Zipf's proposed "law of abbreviation" within a more general framework that relates it to cognitive pressures that affect speakers and listeners. In this new framework, speakers' drive to reduce effort (Zipf's proposal) is counteracted by the need for low-frequency words to have word forms that are sufficiently distinctive to allow for accurate recognition by listeners. To support this framework, we replicate and extend recent work using the prevalence of subword phonemic sequences (phonotactic probability) to measure speakers' production effort in place of Zipf's measure of length. Across languages and corpora, phonotactic probability is more strongly correlated with word frequency than word length. We also show this measure of ease of speech production (phonotactic probability) is strongly correlated with a measure of perceptual difficulty that indexes the degree of competition from alternative interpretations in word recognition. This is consistent with the claim that there must be trade-offs between these two factors, and is inconsistent with a recent proposal that phonotactic probability facilitates both perception and production. To our knowledge, this is the first work to offer an explanation why long, phonotactically improbable word forms remain in the lexicons of natural languages.

Asunto(s)

Lenguaje , Fonética , Reconocimiento en Psicología , Percepción del Habla , Humanos , Habla

How adults understand what young children say.

Meylan, Stephan C; Foushee, Ruthe; Wong, Nicole H; Bergelson, Elika; Levy, Roger P.

Nat Hum Behav ; 7(12): 2111-2125, 2023 Dec.

Artículo en Inglés | MEDLINE | ID: mdl-37884678

RESUMEN

Children's early speech often bears little resemblance to that of adults, and yet parents and other caregivers are able to interpret that speech and react accordingly. Here we investigate how adult listeners' inferences reflect sophisticated beliefs about what children are trying to communicate, as well as how children are likely to pronounce words. Using a Bayesian framework for modelling spoken word recognition, we find that computational models can replicate adult interpretations of children's speech only when they include strong, context-specific prior expectations about the messages that children will want to communicate. This points to a critical role of adult cognitive processes in supporting early communication and reveals how children can actively prompt adults to take actions on their behalf even when they have only a nascent understanding of the adult language. We discuss the wide-ranging implications of the powerful listening capabilities of adults for theories of first language acquisition.

Asunto(s)

Lenguaje , Percepción del Habla , Niño , Adulto , Humanos , Preescolar , Teorema de Bayes , Habla , Desarrollo del Lenguaje

Learning Through Processing: Toward an Integrated Approach to Early Word Learning.

Meylan, Stephan C; Bergelson, Elika.

Annu Rev Linguist ; 8: 77-99, 2022.

Artículo en Inglés | MEDLINE | ID: mdl-35481110

RESUMEN

Children's linguistic knowledge and the learning mechanisms by which they acquire it grow substantially in infancy and toddlerhood, yet theories of word learning largely fail to incorporate these shifts. Moreover, researchers' often-siloed focus on either familiar word recognition or novel word learning limits the critical consideration of how these two relate. As a step toward a mechanistic theory of language acquisition, we present a framework of "learning through processing" and relate it to the prevailing methods used to assess children's early knowledge of words. Incorporating recent empirical work, we posit a specific, testable timeline of qualitative changes in the learning process in this interval. We conclude with several challenges and avenues for building a comprehensive theory of early word learning: better characterization of the input, reconciling results across approaches, and treating lexical knowledge in the nascent grammar with sufficient sophistication to ensure generalizability across languages and development.

The Challenges of Large-Scale, Web-Based Language Datasets: Word Length and Predictability Revisited.

Meylan, Stephan C; Griffiths, Thomas L.

Cogn Sci ; 45(6): e12983, 2021 06.

Artículo en Inglés | MEDLINE | ID: mdl-34170030

RESUMEN

Language research has come to rely heavily on large-scale, web-based datasets. These datasets can present significant methodological challenges, requiring researchers to make a number of decisions about how they are collected, represented, and analyzed. These decisions often concern long-standing challenges in corpus-based language research, including determining what counts as a word, deciding which words should be analyzed, and matching sets of words across languages. We illustrate these challenges by revisiting "Word lengths are optimized for efficient communication" (Piantadosi, Tily, & Gibson, 2011), which found that word lengths in 11 languages are more strongly correlated with their average predictability (or average information content) than their frequency. Using what we argue to be best practices for large-scale corpus analyses, we find significantly attenuated support for this result and demonstrate that a stronger relationship obtains between word frequency and length for a majority of the languages in the sample. We consider the implications of the results for language research more broadly and provide several recommendations to researchers regarding best practices.

Asunto(s)

Lenguaje , Lingüística , Comunicación , Humanos , Internet

Evaluating models of robust word recognition with serial reproduction.

Meylan, Stephan C; Nair, Sathvik; Griffiths, Thomas L.

Cognition ; 210: 104553, 2021 05.

Artículo en Inglés | MEDLINE | ID: mdl-33482474

RESUMEN

Spoken communication occurs in a "noisy channel" characterized by high levels of environmental noise, variability within and between speakers, and lexical and syntactic ambiguity. Given these properties of the received linguistic input, robust spoken word recognition-and language processing more generally-relies heavily on listeners' prior knowledge to evaluate whether candidate interpretations of that input are more or less likely. Here we compare several broad-coverage probabilistic generative language models in their ability to capture human linguistic expectations. Serial reproduction, an experimental paradigm where spoken utterances are reproduced by successive participants similar to the children's game of "Telephone," is used to elicit a sample that reflects the linguistic expectations of English-speaking adults. When we evaluate a suite of probabilistic generative language models against the yielded chains of utterances, we find that those models that make use of abstract representations of preceding linguistic context (i.e., phrase structure) best predict the changes made by people in the course of serial reproduction. A logistic regression model predicting which words in an utterance are most likely to be lost or changed in the course of spoken transmission corroborates this result. We interpret these findings in light of research highlighting the interaction of memory-based constraints and representations in language processing.

Asunto(s)

Percepción del Habla , Adulto , Niño , Humanos , Lenguaje , Lingüística , Ruido , Reproducción

Zipfian frequency distributions facilitate word segmentation in context.

Kurumada, Chigusa; Meylan, Stephan C; Frank, Michael C.

Cognition ; 127(3): 439-53, 2013 Jun.

Artículo en Inglés | MEDLINE | ID: mdl-23558340

RESUMEN

Word frequencies in natural language follow a highly skewed Zipfian distribution, but the consequences of this distribution for language acquisition are only beginning to be understood. Typically, learning experiments that are meant to simulate language acquisition use uniform word frequency distributions. We examine the effects of Zipfian distributions using two artificial language paradigms-a standard forced-choice task and a new orthographic segmentation task in which participants click on the boundaries between words in contexts. Our data show that learners can identify word forms robustly across widely varying frequency distributions. In addition, although performance in recognizing individual words is predicted best by their frequency, a Zipfian distribution facilitates word segmentation in context: the presence of high-frequency words creates more chances for learners to apply their knowledge in processing new sentences. We find that computational models that implement "chunking" are more effective than "transition finding" models at reproducing this pattern of performance.

Asunto(s)

Desarrollo del Lenguaje , Lenguaje , Adulto , Algoritmos , Simulación por Computador , Femenino , Humanos , Masculino , Modelos Estadísticos , Estimulación Luminosa , Desempeño Psicomotor/fisiología , Lectura , Análisis de Regresión

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA