The ChildPoeDE Corpus: 1082 German children’s poems for computational and experimental studies on poetry reception
Date issued
Authors
Lehmann, Marina
Heumann, Anne
Kuijpers, Moniek M.
Lauer, Gerhard
Lüdtke, Jana
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
License
Abstract
We introduce childPoeDE: the first corpus of German poetry for children comprising poems which are still read today and cover a wide range of topics and authors. ChildPoeDE contains poem texts and both poem-level and token-level metadata. Poem-level metadata includes information about the anthologies and authors, quantitative text features, rhyme and lexical richness. Token-level metadata covers word length, position and frequency, parts-of-speech, onomatopoeia and sonority. This corpus can be used for computational text analysis, but also as a source for stimulus material in experimental studies. The corpus metadata is freely accessible via Zenodo. The poem texts are protected by copyright.