PELCRA Polish-English parallel corpus of literary works (CC-BY)

PELCRA-LIT-1

Pęzik P., Ogrodniczuk M., Przepiórkowski A (2011). Parallel and spoken corpora in an open repository of Polish language resources. Human Language Technologies as a Challenge for Computer Science and Linguistics. LTC Poznań 2011.

ID:

507

A subset of the PELCRA Polish parallel corpora licensed under the CC-BY license. This resource contains 17 public-domain literary works and their English-Polish/Polish-English translations. The texts have been aligned manually on the sentence level. The texts are provided as TEI P5-compliant XML files with custom PELCRA extensions to mark complex translation equivalence types, and in the XLIFF format.

You don’t have the permission to edit this resource.

  • xmllint