-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page, (Processed) This dataset has been created within the framework of the European Language...
ZIP (666 views) (557 Downloads)
-
National Health Fund Dataset (Processed)
The dataset is a 274K-token Polish-English parallel resource in XLIFF format created on the basis of "Diagnosis-Related Groups in Europe" publication of the Polish National Health Fund....
ZIP (345 views) (231 Downloads)
-
Letter of rights for persons arrested and or detained
Police form, 12 pages. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation...
ZIP (413 views) (296 Downloads)
-
Polish Court Rulings Corpus (Processed)
The Polish Court Rulings Corpus contains 62 726 rulings of Polish courts, over 178 million words of running text. The texts of the rulings together with some metadata were acquired from...
ZIP (285 views) (174 Downloads)
-
Polish Ministry of Foreign Affairs reports in EN and PL (Processed)
The dataset comprises the EN and PL versions of two reports created by the Polish Ministry of Foreign Affairs, “Rules for communicating the POLSKA brand” and “Polish Presidency of the...
ZIP (407 views) (303 Downloads)
-
Monolingual Polish corpus in the public administration domain
Monolingual Polish corpus, containing 22372690 tokens and 1805280 lexical types in the public administration domain. This dataset has been created within the framework of the European...
ZIP (431 views) (317 Downloads)
-
Polish-English parallel corpus from the website of the National Digital Archives (Processed)
Polish-English parallel corpus from the website of the National Digital Archives (https://www.nac.gov.pl) This dataset has been created within the framework of the European Language...
ZIP (412 views) (313 Downloads)
-
Parallel corpus (en-pl) from the Export Promotion Portal of Poland (Processed)
A paralell corpus constructed from data acquired form the *.trade.gov.pl websites This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (405 views) (289 Downloads)
-
Parallel texts from Swedish Work environment Authority (Processed)
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (615 views) (487 Downloads)
-
Letter of rights for persons arrested and or detained (Processed)
Collection of transaltion units (1906 in total) in 21 language pairs extracted from 7 Police forms (one form 12 pages long in each of the following languages: BG, EL, EN, FR, LV, PL, RO)....
ZIP (452 views) (338 Downloads)
-
Parallel texts from Swedish Labour market agency. Part 2 (Processed)
Same as part 1, but with the Readme-file. (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (439 views) (332 Downloads)
-
Polish-English Internal Aviation Glossaries (Processed)
A set of bilingual glossaries developed by the Civil Aviation Authority of Republic of Poland, totalling 8548 Polish and English terms with commentaries and reference notes, including...
ZIP (269 views) (175 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Digitization (Processed)
Polish-English parallel corpus from the website of the Ministry of Digitization, Republic of Poland (http://mac.gov.pl) This dataset has been created within the framework of the European...
ZIP (330 views) (229 Downloads)
-
Polish-English parallel corpus from the website "Polish Aid" (Processed)
Polish-English parallel corpus from the website of the website "Polish Aid" (http://www.polskapomoc.gov.pl) This dataset has been created within the framework of the European Language...
ZIP (339 views) (233 Downloads)
-
Polish-English parallel corpus from the website of the U.S. EMBASSY and CONSULATE IN POLAND (Processed)
Polish-English parallel corpus from the website of the U.S. EMBASSY and CONSULATE IN POLAND (https://pl.usembassy.gov/) This dataset has been created within the framework of the European...
ZIP (385 views) (270 Downloads)
-
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network) (Processed)
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network, https://eures.praca.gov.pl) This dataset has been created within the...
ZIP (549 views) (440 Downloads)
-
Financial Stability Reports from the National Bank of Poland (2015-16) (Processed)
Financial Stability Reports from the National Bank of Poland (2015-16) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (316 views) (208 Downloads)
-
Parallel texts from Swedish Work environment Authority
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (406 views) (296 Downloads)
-
Translations of Hungarian from public websites
A webcrawl of 14 different websites covering parallel corpora of Hungarian with Polish, Czech, Swedish, Finnish, French, German, Italian, English and Slovenian This dataset has been...
ZIP (388 views) (287 Downloads)
-
Public Procurement Dataset 2 (Processed)
A collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in...
ZIP (441 views) (312 Downloads)