1.
Behav Res Methods
; 47(4): 1085-1094, 2015 Dec.
Artigo
em Inglês
| MEDLINE
| ID: mdl-25319039
RESUMO
This article introduces childLex, an online database of German read by children. childLex is based on a corpus of children's books and comprises 10 million words that were syntactically annotated and lemmatized. childLex reports linguistic norms for lexical, superlexical, and sublexical variables in three different age groups: 6-8 (grades 1-2), 9-10 (grades 3-4), and 11-12 years (grades 5-6). Here, we describe how childLex was collected and analyzed. In addition, we provide information about the distributions of word frequency, word length, and orthographic neighborhood size, as well as their intercorrelations. Finally, we explain how childLex can be accessed using a Web interface.