Directorate-General for Communications Networks, Content and Technology
-
Polish Ministry of Foreign Affairs Regional Dataset
A collection of Polish-English whitepapers published by the Polish Ministry of Foreign Affairs, including "Eastern Partnership" (10K words in 492 segments) and "Poland's 10 years in the...
XML PDF ZIP (547 views) (437 Downloads)
-
Natolin European Centre Dataset
The Polish-English parallel corpus is composed of three volumes (100K words in total) of Natolin Papers, a periodical issued by the Polish Natolin European Centre, a research centre dealing...
XML PDF ZIP (233 views) (196 Downloads)
-
Parallel Corpus from the Web Site of the MFA of Latvia
The corpus has been built from the news and press releases published on the website of the Ministry of Foreign Affairs of the Republic of Latvia. This dataset has been created within...
XML PDF ZIP (674 views) (571 Downloads)
-
Polish Ministry of Foreign Affairs Historical Dataset
A collection of parallel Polish-English texts published by the Polish Ministry of Foreign Affairs. Sentence-level alignment of translation segments was carried out manually and encoded in...
XML PDF ZIP (481 views) (361 Downloads)
-
Polish Ministry of Foreign Affairs Youth 2011 Report
A parallel Polish-English version of the Youth 2011 report published by the Polish Ministry of Foreign Affairs. Sentence-level alignment of translation segments was carried out manually...
XML PDF ZIP (513 views) (397 Downloads)