RESUMO
Compounds are morphologically complex words made of different linguistic parts. They are very prevalent in a number of languages such as French. Different psycholinguistic characteristics of compounds have been used in certain studies to investigate the mechanisms involved in compound processing (see Table 7). We provide psycholinguistic norms for a set of 506 French compound words. The words were normed on seven characteristics: lexeme meaning dominance, semantic transparency, sensory experience, conceptual familiarity, imageability, age of acquisition (AoA) and subjective frequency. Reliability measures were computed for the collected norms. Descriptive statistical analyses, and correlational and multiple regression analyses were performed. We also report some comparisons made between our normative data and certain norms obtained in other similar studies. The entire set of norms, which will be very useful to researchers investigating the processing of compounds, is available as Supplemental Material.
Assuntos
Idioma , Psicolinguística , Humanos , Reconhecimento Psicológico , Reprodutibilidade dos Testes , SemânticaRESUMO
This study investigated the effects of semantic transparency of Chinese disyllabic compound words on Chinese as a second language (CSL) learners' incidental learning of word meanings in sentence-level reading and passage-level reading. The accuracy of the learners' lexical inferencing was compared among various types of words (transparent, semi-transparent, and opaque words), different context lengths (sentence and passage contexts), and learners with different L1 backgrounds (with and without a Chinese character background in their L1s). In the study, ninety CSL adult learners were asked to infer the meanings of target words in the sentence context and the passage context. The results indicated that the effects of semantic transparency and context length on inferencing accuracy were significant, while the effect of L1 background was not. It was also found that there were significant interactions between transparency and context length as well as between transparency and L1 background.
Assuntos
Leitura , Semântica , Adulto , Povo Asiático , Humanos , IdiomaRESUMO
The CompLex database presents a large-scale collection of eye-movement studies on English compound-word processing. A combined total of 440 participants completed eye-tracking experiments in which they silently read unspaced English compound words (e.g., goalpost) embedded in sentence contexts (e.g., Dylan hit the goalpost when he was aiming for the net.). Three studies were conducted using participants representing the non-college-bound population (300 participants), and four studies included participants recruited from the student population (140 participants). The database comprises trial-level eye-movement data (47,763 trials), participant data (including a measure of reading experience estimated via the Author Recognition Test), and lexical characteristics for the set of 931 English compound words used as critical stimuli in the studies. One contribution of the present paper is a set of regression analyses conducted on the full database and individual experiments. We report that the most reliable and consistent main effects were those elicited by compound word length, left constituent frequency, right constituent frequency, compound frequency and semantic transparency. Separately, we also found that the effect of left frequency and compound word length is weaker among more frequent compounds. Another contribution is a power analysis, in which we determined the sample sizes required to reliably detect effect sizes that are comparable to those observed in our regression models. These sample size estimates serve as a recommendation for researchers wishing to either collect eye-movement data for compound word reading, or use the current database as a resource for the study of English compound word processing.
Assuntos
Leitura , Vocabulário , Movimentos Oculares , Humanos , Idioma , Masculino , Reconhecimento Visual de Modelos , SemânticaRESUMO
Word learning in a second language (L2) is a complex process, which is affected by learner-related (e.g., morphological awareness) and language-related (e.g., word semantic transparency) factors. Morphological awareness is learners' sensitivity to the morphological structure of printed words, and semantic transparency is the degree to which word-internal morphemes contribute to the whole words' meanings. However, little is known regarding how these morpheme-related factors jointly function in L2 word learning. Thus, this study focuses on how learners' morphological awareness and word semantic transparency affect word-meaning retention in L2 Chinese, particularly the extent to which semantic transparency shapes the contribution of learners' morphological awareness. Thirty-four L2 Chinese learners from an American university participated in this study. All the participants received a 40-min learning session and five paper-pencil measures. The main findings showed that (1) L2 learners' morphological awareness contributed to their word-meaning retention beyond L2 linguistic knowledge; (2) L2 learners performed better when recalling semantically transparent words than opaque words after learning sessions; and more importantly (3) learners with higher morphological awareness performed better than those with lower morphological awareness when recalling semantically transparent words, but the learners in both groups had similar performances when recalling opaque words.
Assuntos
Idioma , Multilinguismo , China , Humanos , Semântica , Aprendizagem VerbalRESUMO
According to the self-teaching hypothesis, children can self-teach new written words via phonological recoding-translating written words into their verbal pronunciations. Moreover, learning words in meaningful story contexts supports learning when phonological recoding is reduced (e.g., when encountering irregular words). The current study tested the self-teaching hypothesis in Chinese-a non-alphabetic writing system. Native Mandarin-speaking second graders read either 16 stories (story context; n = 33) or word lists (without context; n = 34) under the conditions of allowed phonological recoding (reading aloud) or reduced phonological recoding (viewing while repeatedly naming a meaningless letter sequence b p m f). Each story and word list included one target pseudo-character consisting of a phonetic radical and a semantic radical providing pronunciation and meaning cues, respectively, for the pseudo-character. The regularity of phonetic radicals of pseudo-characters (regular vs. irregular: providing full vs. no pronunciation cue) and the transparency of semantic radicals of pseudo-characters (transparent vs. opaque: providing complete vs. no meaning cue) were manipulated. The orthographic learning outcomes were measured using the character writing, orthographic choice, naming, and a semantic production task. Our results support a primary role of phonological recoding in orthographic learning. Moreover, regular phonetic radicals facilitate phonology-orthography association, whereas transparent semantic radicals support semantic-orthography mapping. In sum, we extended the self-teaching hypothesis to a non-alphabetic writing system and revealed the unique roles of phonetic and semantic radicals in orthographic learning.
Assuntos
Aprendizagem , Fonética , Leitura , Semântica , Criança , China , Feminino , Humanos , MasculinoRESUMO
In the present study, we provide a comprehensive analysis and a multi-dimensional dataset of semantic transparency measures for 1810 German compound words. Compound words are considered semantically transparent when the contribution of the constituents' meaning to the compound meaning is clear (as in airport), but the degree of semantic transparency varies between compounds (compare strawberry or sandman). Our dataset includes both compositional and relatedness-based semantic transparency measures, also differentiated by constituents. The measures are obtained from a computational and fully implemented semantic model based on distributional semantics. We validate the measures using data from four behavioral experiments: Explicit transparency ratings, two different lexical decision tasks using different nonwords, and an eye-tracking study. We demonstrate that different semantic effects emerge in different behavioral tasks, which can only be captured using a multi-dimensional approach to semantic transparency. We further provide the semantic transparency measures derived from the model for a dataset of 40,475 additional German compounds, as well as for 2061 novel German compounds.
Assuntos
SemânticaRESUMO
Previous studies on visual word recognition of compound words have provided evidence for the influence of lexical properties (e.g., length, frequency) and semantic transparency (the degree of relatedness in meaning between a compound word and its constituents) in morphological processing (e.g., to what extent is doorbell influenced by door and bell?). However, a number of questions in this domain, which are difficult to address with the available methodological resources, are still unresolved. We collected semantic transparency scores for 2,861 compound words at the constituent level (i.e., how strongly the overall meaning of a compound word is related to that of each constituent) and analyzed their effects on speeded pronunciation and lexical decision performance for the compound words using the English Lexicon Project (http://elexicon.wustl.edu) data. The results from both tasks indicated that our human-judged semantic transparency ratings for both the first and second constituents play a significant role in compound word processing. Moreover, additional analyses indicated that the human-judged semantic transparency scores at the constituent level accounted for more variance in compound word recognition performance than did either whole-word semantic transparency scores or corpus-based semantic distance scores.
Assuntos
Tomada de Decisões/fisiologia , Semântica , Humanos , Leitura , Reconhecimento PsicológicoRESUMO
The Large Database of English Compounds (LADEC) consists of over 8,000 English words that can be parsed into two constituents that are free morphemes, making it the largest existing database specifically for use in research on compound words. Both monomorphemic (e.g., wheel) and multimorphemic (e.g., teacher) constituents were used. The items were selected from a range of sources, including CELEX, the English Lexicon Project, the British Lexicon Project, the British National Corpus, and Wordnet, and were hand-coded as compounds (e.g., snowball). Participants rated each compound in terms of how predictable its meaning is from its parts, as well as the extent to which each constituent retains its meaning in the compound. In addition, we obtained linguistic characteristics that might influence compound processing (e.g., frequency, family size, and bigram frequency). To show the usefulness of the database in investigating compound processing, we conducted a number of analyses that showed that compound processing is consistently affected by semantic transparency, as well as by many of the other variables included in LADEC. We also showed that the effects of the variables associated with the two constituents are not symmetric. In short, LADEC provides the opportunity for researchers to investigate a number of questions about compounds that have not been possible to investigate in the past, due to the lack of sufficiently large and robust datasets. In addition to directly allowing researchers to test hypotheses using the information included in LADEC, the database will contribute to future compound research by allowing better stimulus selection and matching.
Assuntos
Bases de Dados Factuais , Coleta de Dados , Linguística , SemânticaRESUMO
A central issue in visual and spoken word recognition is the lexical representation of complex words-in particular, whether the lexical representation of complex words depends on semantic transparency: Is a complex verb like understand lexically represented as a whole word or via its base stand, given that its meaning is not transparent from the meanings of its parts? To study this issue, a number of stimulus characteristics are of interest that are not yet available in public databases of German. This article provides semantic association ratings, lexical paraphrases, and vector-based similarity measures for German verbs, measuring (a) the semantic transparency between 1,259 complex verbs and their bases, (b) the semantic relatedness between 1,109 verb pairs with 432 different bases, and (c) the vector-based similarity measures of 846 verb pairs. Additionally, we include the verb regularity of all verbs and two counts of verb family size for 184 base verbs, as well as estimates of age of acquisition and age of reading for 200 verbs. Together with lemma and type frequencies from public lexical databases, all measures can be downloaded along with this article. Statistical analyses indicate that verb family size, morphological complexity, frequency, and verb regularity affect the semantic transparency and relatedness ratings as well as the age of acquisition estimates, indicating that these are relevant variables in psycholinguistic experiments. Although lexical paraphrases, vector-based similarity measures, and semantic association ratings may deliver complementary information, the interrater reliability of the semantic association ratings for each verb pair provides valuable information when selecting stimuli for psycholinguistic experiments.
Assuntos
Envelhecimento/psicologia , Aprendizagem , Psicolinguística , Leitura , Semântica , Adulto , Características da Família , Feminino , Alemanha , Humanos , Masculino , Reprodutibilidade dos Testes , Testes de Associação de Palavras , Adulto JovemRESUMO
The present study used event-related potentials to investigate whether the syntactic structure was activated in the comprehension of lexical idioms, and if so, whether it varied as a function of familiarity and semantic transparency. Participants were asked to passively read the "1+2" structural Chinese lexical idioms with each being presented following 3-5 contextual "1+2" (congruent-structure condition) or "2+1" structural Chinese phrases (incongruent-structure condition). The N400 ERP responses showed more positivity in congruent-structure condition relative to incongruent-structure condition in idioms with high familiarity and high semantic transparency, but less positivity in congruent-structure condition in idioms with high familiarity but low semantic transparency, idioms with low familiarity but high semantic transparency, and idioms with low familiarity and low semantic transparency. Our results suggest that syntactic structure, as the unnecessarity of lexical idiomatic words, was nevertheless activated, independent of familiarity and semantic transparency.
Assuntos
Compreensão/fisiologia , Potenciais Evocados Visuais/fisiologia , Psicolinguística , Semântica , Feminino , Humanos , Masculino , Processos Mentais , Leitura , Reconhecimento Psicológico , Adulto JovemRESUMO
Despite flourishing research on the relationship between emotion and literal language, and despite the pervasiveness of figurative expressions in communication, the role of figurative language in conveying affect has been underinvestigated. This study provides affective and psycholinguistic norms for 619 German idiomatic expressions and explores the relationships between affective and psycholinguistic idiom properties. German native speakers rated each idiom for emotional valence, arousal, familiarity, semantic transparency, figurativeness, and concreteness. They also described the figurative meaning of each idiom and rated how confident they were about the attributed meaning. The results showed that idioms rated high in valence were also rated high in arousal. Negative idioms were rated as more arousing than positive ones, in line with results from single words. Furthermore, arousal correlated positively with figurativeness (supporting the idea that figurative expressions are more emotionally engaging than literal expressions) and with concreteness and semantic transparency. This suggests that idioms may convey a more direct reference to sensory representations, mediated by the meanings of their constituting words. Arousal correlated positively with familiarity. In addition, positive idioms were rated as more familiar than negative idioms. Finally, idioms without a literal counterpart were rated as more emotionally valenced and arousing than idioms with a literal counterpart. Although the meanings of ambiguous idioms were less correctly defined than those of unambiguous idioms, ambiguous idioms were rated as more concrete than unambiguous ones. We also discuss the relationships between the various psycholinguistic variables characterizing idioms, with reference to the literature on idiom structure and processing.
Assuntos
Afeto , Emoções , Idioma , Reconhecimento Psicológico , Adulto , Idoso , Feminino , Humanos , Masculino , Metáfora , Pessoa de Meia-Idade , Psicolinguística , Semântica , Adulto JovemRESUMO
The present study tests theories about the representation of compound nouns and grammatical gender in the mental lexicon. Comprehension and production of determiner-compound-noun phrases were examined in three aphasic native speakers of German, a language that marks grammatical gender on definite determiners of nouns. In picture naming, participants were more impaired in retrieving compounds than matched simple nouns and showed different error patterns. However, retrieving the correct determiner was equally impaired for compounds and simple nouns. Clear dissociations between impaired determiner retrieval in production and relatively preserved processing of determiner-noun phrases in comprehension were observed for existing compounds and simple nouns. In contrast, processing of novel compounds was more impaired in both modalities, and gender-mismatch effects were especially observed for novel compounds. The results support the account of decomposed word forms and holistic lemma representations of compound nouns in the mental lexicon.
Assuntos
Afasia/psicologia , Compreensão , Idioma , Semântica , Vocabulário , Adulto , Feminino , Humanos , Masculino , Psicolinguística , Adulto JovemRESUMO
There is no converging evidence on how a word's semantic transparency affects morphemes' potential semantic activation. The inconsistent results may be due to the limitation of traditional univariate analyses, in which the semantic transparency was treated as discrete categories. In the current study, Chinese two-character words were used as stimuli and functional magnetic resonance imaging (fMRI) techniques were combined with a priming paradigm. Unlike most previous studies, the multivariate representation similarity analysis (RSA) was used to treat semantic transparency as a continuous variable. The RSA results showed that widespread regions in the frontal-parietal-temporal network represent the semantic perception of characters in all words and transparent words, but no brain areas were identified in opaque words. Unlike RSA results, univariate analyses showed no significant difference between the opaque and transparent words. These results suggest that RSA is more suitable to examine the neural mechanism related to continuous variables such as semantic transparency.
Assuntos
Encéfalo , Semântica , Encéfalo/diagnóstico por imagem , Encéfalo/fisiologia , Mapeamento Encefálico , Análise Multivariada , Percepção , Imageamento por Ressonância MagnéticaRESUMO
Although extensive research has been carried out on collocation processing, it is still unclear how cross-language overlap and transparency influence the processing of collocations by L2 learners. In the current study, a phrase judgment task was used to investigate the processing of congruent (i.e., exist in both English and Arabic) and incongruent collocations (i.e., exist only in English) by Arabic non-native speakers of English. The semantic transparency of the items was controlled for. Results demonstrated the effect of congruency on processing: congruent items yielded more correct responses and faster response times than incongruent items. The effect of congruency was modulated by proficiency, with congruency having a stronger effect on lower-proficiency learners than higher-proficiency learners. Transparency had no effect, with no differences in response times and accuracy between transparent and opaque collocations. The findings have implications for the learning and teaching of L2 collocations.
RESUMO
Introduction: Research on collocations has become an essential issue in L2 acquisition and cognitive psychology. Previous studies have mainly focused on phonographic languages such as English, Swedish, and German, and primarily discussed the effect of semantic transparency and translational congruency. However, these studies have lacked (1) an analysis of the interactions between presentation modalities (visual vs. auditory) and the semantic transparency and translational congruency, and (2) a discussion of an ideographic language, such as Chinese and Japanese. Methods: We conducted an experiment with 36 Chinese Japanese-as-a-Foreign-Language learners to examine the processing of Japanese collocations. In the experiment, we manipulated the presentation modality, semantic transparency, and translational congruency during a lexical judgment task. Results: Data analysis using linear mixed-effects models revealed the following. (1) In both conditions of semantic transparency and translational congruency, the auditory presentation was associated with longer reaction times than the visual presentation. (2) In the visual presentation condition, neither semantic transparency nor translational congruency showed significant effects. (3) In the auditory presentation condition, the reaction time for collocations with high semantic transparency tended to be longer than that for collocations with medium semantic transparency and significantly longer than that for collocations with low semantic transparency. The reaction time for collocations with congruent translation was longer than that for collocations with incongruent translation. Discussion: These results support the dual-route model of Japanese collocational processing by Chinese Japanese-as-a-Foreign-Language learners. Our findings suggest that whether the analytic or holistic processing dominates is closely related to the learners' knowledge of Chinese and Japanese Kanji words and strongly influenced by the presentation modality, semantic transparency, and translational congruency.
RESUMO
The present study examined the effect of both morphological complexity and semantic transparency in Chinese compound word recognition. Using a visual lexical decision task, our electrophysiological results showed that transparent and opaque compounds induced stronger Left Anterior Negativity (LAN) than monomorphemic words. This result suggests that Chinese compounds might be decomposed into their constituent morphemes at the lemma level, whereas monomorphemic words are accessed as a whole-word lemma directly from the form level. In addition, transparent and opaque compounds produced a similar N400 as each other, suggesting that transparency did not show an effect on the involvement of constituent morphemes during access to the whole-word lemma. Two behavioral experiments additionally showed similar patterns to the EEG results. These findings support morphological decomposition for compounds at the lemma level as proposed by the full-parsing model, and no evidence is found to support the role of transparency during Chinese compound word recognition.
Assuntos
Eletroencefalografia , Idioma , Vocabulário , Humanos , Potenciais Evocados/fisiologia , SemânticaRESUMO
Complex verbs with the same preverb/prefix/particle that is both linguistically productive and analyzable can be compositional as well as non-compositional in meaning. For example, the English on has compositional spatial uses (put a hat on) but also a non-spatial "continuative" use, where its semantic contribution is consistent with multiple verbs (we played / worked / talked on despite the interruption). Comparable examples can be given with German preverbs or Russian prefixes, which are the main data analyzed in the present paper. The preverbs/prefixes/particles that encode non-compositional, construction-specific senses have been extensively studied; however, it is still far from clear how their semantic idiosyncrasies arise. Even when one can identify the contribution of the base, it is counterintuitive to assign the remaining sememes to the preverb/prefix/particle part. Therefore, on one hand, there seems to be an element without meaning, and on the other, there is a word sense that apparently comes from nowhere. In this article, I suggest analyzing compositional and non-compositional complex verbs as instantiations of two different types of constructions: one with an open slot for the preverb/prefix/particle and a fixed base verb and another with a fixed preverb/prefix/particle and an open slot for the base verb. Both experimental and corpus evidence supporting this decision is provided for Russian data. I argue that each construction implies its own meaning-processing model and that the actual choice between the two can be predicted by taking into account the discrepancy in probabilities of transition from preverb/prefix/particle to base and from base to preverb/prefix/particle.
RESUMO
This study tried to replicate and extend the semantic transparency morphological effect using the flanker lexical decision paradigm (Grainger et al., 2020). In the first experiment, stems were used as flankers of target words that could be truly morphological (hunt hunter hunt), pseudomorphological (corn corner corn), or form-related with the flanker (broth brothel broth). In half of the trials, a related flanker was employed, and in the other half, an unrelated word was presented as flanker (e.g., table player table). The results showed a facilitative effect for the related condition as a main effect with no difference between experimental conditions. These results were interpreted in terms of an orthographic facilitation taking place when whole stems are presented as flankers. In the second experiment, short derivational suffixes were used as flankers of the same targets employed in the first experiment. The results showed an inhibitory effect of the same magnitude for the transparent and pseudomorphological conditions with no effect for the form condition. This finding suggests an inhibitory effect by which morphemes activate several lexical candidates that compete for recognition. Overall, the results are interpreted in terms of the cognitive requirements of the experimental task, the items selected, and the current models of morphological processing.
Assuntos
Reconhecimento Psicológico , Semântica , Humanos , LeituraRESUMO
This study examined how semantic transparency modulated the processing of spoken Chinese compound words with event-related potential (ERP) recording. A reverse-block passive oddball paradigm was adopted to elicit mismatch negativity (MMN), which responds to holistic and combinatorial processing in opposite directions. Specifically, linguistic inputs that are processed as holistic lexical representations will elicit stronger MMNs (lexical enhancement) than those that do not have such representations. In contrast, when they are processed by combining the constituents, smaller MMNs will be elicited (combinatorial reduction) as compared to non-combinable inputs. We compared the strengths of MMNs among transparent words, opaque words, and pseudocompounds (that did not have lexical representations and were non-combinable). It was shown that transparent words triggered smaller MMNs than pseudocompounds, which supported combinatorial processing. Opaque words did not differ from pseudocompounds, which was interpreted as parallel employment of the holistic and combinatorial processing routes. Overall, the results are consistent with the idea that native Chinese speakers routinely attempt to process Chinese compound words by retrieving and combining morphemes. However, because the meanings of opaque words are irrelevant to their constituent morphemes, Chinese speakers must construct and retrieve their holistic representations to ensure accurate processing.
Assuntos
Semântica , Processamento de Texto , China , Eletroencefalografia , Potenciais EvocadosRESUMO
The combination of "Big Data" and Artificial Intelligence (AI) is frequently promoted as having the potential to deliver valuable health benefits when applied to medical decision-making. However, the responsible adoption of AI-based clinical decision support systems faces several challenges at both the individual and societal level. One of the features that has given rise to particular concern is the issue of explainability, since, if the way an algorithm arrived at a particular output is not known (or knowable) to a physician, this may lead to multiple challenges, including an inability to evaluate the merits of the output. This "opacity" problem has led to questions about whether physicians are justified in relying on the algorithmic output, with some scholars insisting on the centrality of explainability, while others see no reason to require of AI that which is not required of physicians. We consider that there is merit in both views but find that greater nuance is necessary in order to elucidate the underlying function of explainability in clinical practice and, therefore, its relevance in the context of AI for clinical use. In this paper, we explore explainability by examining what it requires in clinical medicine and draw a distinction between the function of explainability for the current patient versus the future patient. This distinction has implications for what explainability requires in the short and long term. We highlight the role of transparency in explainability, and identify semantic transparency as fundamental to the issue of explainability itself. We argue that, in day-to-day clinical practice, accuracy is sufficient as an "epistemic warrant" for clinical decision-making, and that the most compelling reason for requiring explainability in the sense of scientific or causal explanation is the potential for improving future care by building a more robust model of the world. We identify the goal of clinical decision-making as being to deliver the best possible outcome as often as possible, and find-that accuracy is sufficient justification for intervention for today's patient, as long as efforts to uncover scientific explanations continue to improve healthcare for future patients.