Ryšių tinklų, turinio ir technologijų generalinis direktoratas
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page, (Processed) This dataset has been created within the framework of the European Language...
ZIP (666 rodiniai) (557 Siuntos)
-
National Health Fund Dataset (Processed)
The dataset is a 274K-token Polish-English parallel resource in XLIFF format created on the basis of "Diagnosis-Related Groups in Europe" publication of the Polish National Health Fund....
ZIP (345 rodiniai) (231 Siuntos)
-
Letter of rights for persons arrested and or detained
Police form, 12 pages. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation...
ZIP (413 rodiniai) (296 Siuntos)
-
Polish Court Rulings Corpus (Processed)
The Polish Court Rulings Corpus contains 62 726 rulings of Polish courts, over 178 million words of running text. The texts of the rulings together with some metadata were acquired from...
ZIP (285 rodiniai) (174 Siuntos)
-
Polish Ministry of Foreign Affairs reports in EN and PL (Processed)
The dataset comprises the EN and PL versions of two reports created by the Polish Ministry of Foreign Affairs, “Rules for communicating the POLSKA brand” and “Polish Presidency of the...
ZIP (407 rodiniai) (303 Siuntos)
-
Monolingual Polish corpus in the public administration domain
Monolingual Polish corpus, containing 22372690 tokens and 1805280 lexical types in the public administration domain. This dataset has been created within the framework of the European...
ZIP (431 rodiniai) (317 Siuntos)
-
Polish-English parallel corpus from the website of the National Digital Archives (Processed)
Polish-English parallel corpus from the website of the National Digital Archives (https://www.nac.gov.pl) This dataset has been created within the framework of the European Language...
ZIP (412 rodiniai) (313 Siuntos)
-
Parallel corpus (en-pl) from the Export Promotion Portal of Poland (Processed)
A paralell corpus constructed from data acquired form the *.trade.gov.pl websites This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (405 rodiniai) (289 Siuntos)
-
Parallel texts from Swedish Work environment Authority (Processed)
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (615 rodiniai) (487 Siuntos)
-
Letter of rights for persons arrested and or detained (Processed)
Collection of transaltion units (1906 in total) in 21 language pairs extracted from 7 Police forms (one form 12 pages long in each of the following languages: BG, EL, EN, FR, LV, PL, RO)....
ZIP (452 rodiniai) (338 Siuntos)
-
Parallel texts from Swedish Labour market agency. Part 2 (Processed)
Same as part 1, but with the Readme-file. (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (439 rodiniai) (332 Siuntos)
-
Polish-English Internal Aviation Glossaries (Processed)
A set of bilingual glossaries developed by the Civil Aviation Authority of Republic of Poland, totalling 8548 Polish and English terms with commentaries and reference notes, including...
ZIP (269 rodiniai) (175 Siuntos)
-
Polish-English parallel corpus from the website of the Ministry of Digitization (Processed)
Polish-English parallel corpus from the website of the Ministry of Digitization, Republic of Poland (http://mac.gov.pl) This dataset has been created within the framework of the European...
ZIP (330 rodiniai) (229 Siuntos)
-
Polish-English parallel corpus from the website "Polish Aid" (Processed)
Polish-English parallel corpus from the website of the website "Polish Aid" (http://www.polskapomoc.gov.pl) This dataset has been created within the framework of the European Language...
ZIP (339 rodiniai) (233 Siuntos)
-
Polish-English parallel corpus from the website of the U.S. EMBASSY and CONSULATE IN POLAND (Processed)
Polish-English parallel corpus from the website of the U.S. EMBASSY and CONSULATE IN POLAND (https://pl.usembassy.gov/) This dataset has been created within the framework of the European...
ZIP (385 rodiniai) (270 Siuntos)
-
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network) (Processed)
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network, https://eures.praca.gov.pl) This dataset has been created within the...
ZIP (549 rodiniai) (440 Siuntos)
-
Financial Stability Reports from the National Bank of Poland (2015-16) (Processed)
Financial Stability Reports from the National Bank of Poland (2015-16) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (316 rodiniai) (208 Siuntos)
-
Parallel texts from Swedish Work environment Authority
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (406 rodiniai) (296 Siuntos)
-
Translations of Hungarian from public websites
A webcrawl of 14 different websites covering parallel corpora of Hungarian with Polish, Czech, Swedish, Finnish, French, German, Italian, English and Slovenian This dataset has been...
ZIP (388 rodiniai) (287 Siuntos)
-
Public Procurement Dataset 2 (Processed)
A collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in...
ZIP (441 rodiniai) (312 Siuntos)