PoeTree: Poetry Treebanks in Czech, English,  French, German, Hungarian, Italian, Portuguese,  Russian, Slovenian and Spanish

Petr  Plecháč; Silvie Cinková; Robert Kolár; Artjoms Šeļa; Mirella De Sisto; Lara Nugues; Thomas Haider; Neža Kočnik

doi:10.1163/24523666-bja10044

Authors

Petr Plecháč Czech Academy of Sciences https://orcid.org/0000-0002-1003-4541
Silvie Cinková Czech Academy of Sciences https://orcid.org/0000-0003-4526-3915
Robert Kolár Czech Academy of Sciences https://orcid.org/0000-0001-8061-1917
Artjoms Šeļa Polish Academy of Sciences https://orcid.org/0000-0002-2272-2077
Mirella De Sisto Tilburg University https://orcid.org/0000-0002-0899-5976
Lara Nugues University of Basel https://orcid.org/0000-0003-1381-8090
Thomas Haider University of Passau, Passau, Germany https://orcid.org/0000-0003-1522-4026
Neža Kočnik University of Ljubljana https://orcid.org/0009-0003-8318-2179

DOI:

https://doi.org/10.1163/24523666-bja10044

Keywords:

poetry, computational poetics, corpus linguistics, digital humanities

Abstract

This article presents a set of standardised corpora of poetry comprising over 330,000 poems in ten languages (Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian, and Spanish). Each corpus has been deduplicated, enriched with Universal Dependencies, provided with additional metadata, and converted into a unified json structure.

Author Biographies

Petr Plecháč, Czech Academy of Sciences

Corresponding author
Institute of Czech Literature, Czech Academy of Sciences, Prague, Czechia
Silvie Cinková, Czech Academy of Sciences

Institute of Czech Literature, Czech Academy of Sciences, Prague, Czechia
Charles University, Prague, Czechia
Robert Kolár, Czech Academy of Sciences

Institute of Czech Literature, Czech Academy of Sciences, Prague, Czechia
Artjoms Šeļa, Polish Academy of Sciences

Institute of Polish Language, Polish Academy of Sciences, Warsaw, Poland
Mirella De Sisto, Tilburg University

Tilburg University, Tilburg, the Netherlands
Lara Nugues, University of Basel

University of Basel, Basel, Switzerland
Neža Kočnik, University of Ljubljana

PoeTree: Poetry Treebanks in Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian and Spanish

Authors

DOI:

Keywords:

Abstract

Author Biographies

Downloads

Published

Issue

Section

License

How to Cite

Make a Submission

Indexed by:

PoeTree: Poetry Treebanks in Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian and Spanish

Authors

DOI:

Keywords:

Abstract

Author Biographies

Downloads

Published

Issue

Section

License

How to Cite

Make a Submission

social media

Indexed by: