The ChildPoeDE Corpus: 1082 German children’s poems for computational and experimental studies on poetry reception

ItemData PaperOpen Access

Abstract

We introduce childPoeDE: the first corpus of German poetry for children comprising poems which are still read today and cover a wide range of topics and authors. ChildPoeDE contains poem texts and both poem-level and token-level metadata. Poem-level metadata includes information about the anthologies and authors, quantitative text features, rhyme and lexical richness. Token-level metadata covers word length, position and frequency, parts-of-speech, onomatopoeia and sonority. This corpus can be used for computational text analysis, but also as a source for stimulus material in experimental studies. The corpus metadata is freely accessible via Zenodo. The poem texts are protected by copyright.

Description

Keywords

Citation

Published in

Journal of open humanities data, 9, Ubiquity Press, London, 2023, https://doi.org/10.5334/johd.102

Relationships

Collections