Sound symbolism in Japanese names: Machine learning approaches to gender classification.
Ngai, Chun Hau; Kilpatrick, Alexander J; Cwiek, Aleksandra.
Affiliation
  • Ngai CH; East Asian Languages and Cultures Department, Indiana University, Bloomington, Indiana, United States of America.
  • Kilpatrick AJ; Faculty of International Studies, Nagoya University of Business and Commerce, Nisshin, Aichi, Japan.
  • Cwiek A; Leibniz-Centre General Linguistics, Laboratory Phonology, Berlin, Germany.
PLoS One; 19(3): e0297440, 2024.
Article in En | MEDLINE | ID: mdl-38466741
ABSTRACT
This study investigates the sound symbolic expression of gender in Japanese names using machine learning algorithms. Its main goal is to explore how gender is expressed in the phonemes that make up Japanese names and whether the systematic sound-meaning mappings observed in Indo-European languages extend to Japanese. In addition, the study compares the performance of two machine learning algorithms. Random Forest and XGBoost models are trained using the sounds of names as predictors and the typical gender of the referents as the dependent variable. Each algorithm is cross-validated using k-fold cross-validation (28 folds) and tested on samples not included in the training cycle. Both algorithms are shown to be reasonably accurate at classifying names into gender categories; however, the XGBoost model performs significantly better than the Random Forest model. Feature importance scores reveal that certain sounds carry gender information. Namely, the voiced bilabial nasal /m/ and the voiceless velar consonant /k/ were associated with femininity, and the high front vowel /i/ was associated with masculinity. The associations observed for /i/ and /k/ run contrary to typical patterns found in other languages, suggesting that Japanese is unique in its sound symbolic expression of gender. This study highlights the importance of considering cultural and linguistic nuances in sound symbolism research and underscores the advantage of XGBoost in capturing complex relationships within the data for improved classification accuracy. These findings contribute to the understanding of sound symbolism and gender associations in language.
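As a rough illustration of the pipeline the abstract describes, the sketch below turns names into phoneme-count features and builds k-fold train/test splits. It is not the authors' code: the toy names, the labels, the one-letter-per-phoneme simplification, and the helper names `phoneme_counts` and `k_fold_indices` are all assumptions made for this example. In practice a classifier such as Random Forest or XGBoost would be fitted on each training split and scored on the held-out fold.

```python
from collections import Counter

# Hypothetical toy data: romanized names with a typical-gender label.
# Illustrative only; these are not drawn from the study's dataset.
NAMES = [
    ("mika", "F"), ("kumiko", "F"), ("momoko", "F"),
    ("taro", "M"), ("kenji", "M"), ("hiroshi", "M"),
]


def phoneme_counts(name, inventory):
    """Count how often each symbol in `inventory` occurs in `name`.

    Assumes one letter per phoneme, which is a simplification
    (for example, romanized "sh" is really a single sound).
    """
    counts = Counter(name)
    return [counts.get(symbol, 0) for symbol in inventory]


def k_fold_indices(n_samples, k):
    """Yield (train, test) index lists for k contiguous, near-equal folds."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        stop = start + size
        test = list(range(start, stop))
        train = [i for i in range(n_samples) if i < start or i >= stop]
        yield train, test
        start = stop


# Build the feature matrix X (one row of counts per name) and labels y.
INVENTORY = sorted({ch for name, _ in NAMES for ch in name})
X = [phoneme_counts(name, INVENTORY) for name, _ in NAMES]
y = [label for _, label in NAMES]

# Three folds here only because the toy data is tiny; the study reports 28.
folds = list(k_fold_indices(len(X), 3))
```

Each row of `X` is aligned with `INVENTORY`, so a per-feature importance score from a fitted model could be read back as a per-sound score, which is the kind of analysis the abstract reports for /m/, /k/, and /i/.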
Subjects

Full text: 1 | Collections: 01-international | Database: MEDLINE | Main subject: Symbolism / Language | Limits: Female / Humans / Male | Country/Region as subject: Asia | Language: En | Journal: PLoS One | Journal subject: SCIENCE / MEDICINE | Year of publication: 2024 | Document type: Article | Country of affiliation: United States | Country of publication: United States
