Resources for Language Technologies
-
National Health Fund Dataset (Processed)
The dataset is a 274K-token Polish-English parallel resource in XLIFF format, created on the basis of the "Diagnosis-Related Groups in Europe" publication of the Polish National Health Fund....
ZIP (345 views) (231 Downloads)
-
Polish-English parallel corpus from the website of the National Digital Archives (Processed)
Polish-English parallel corpus from the website of the National Digital Archives (https://www.nac.gov.pl). This dataset has been created within the framework of the European Language...
ZIP (412 views) (313 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Digitization (Processed)
Polish-English parallel corpus from the website of the Ministry of Digitization, Republic of Poland (http://mac.gov.pl). This dataset has been created within the framework of the European...
ZIP (330 views) (229 Downloads)
-
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network) (Processed)
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network, https://eures.praca.gov.pl). This dataset has been created within the...
ZIP (549 views) (440 Downloads)
-
Polish-English parallel corpus from the website of the ING Polish Art Foundation (Processed)
Polish-English parallel corpus from the website of the ING Polish Art Foundation (https://ingart.pl). This dataset has been created within the framework of the European Language Resource...
ZIP (310 views) (212 Downloads)
-
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (Processed)
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (https://www.rpo.gov.pl/en). This dataset has been created within the framework of the...
ZIP (523 views) (420 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Digital Affairs (Processed)
Polish-English parallel corpus from the website of the Ministry of Digital Affairs (http://archiwum.mc.gov.pl and http://krmc.mc.gov.pl). This dataset has been created within the...
ZIP (314 views) (214 Downloads)
-
Polish-English parallel corpus from the website of the National Audiovisual Institute (Processed)
Polish-English parallel corpus from the website of the National Audiovisual Institute (http://www.nina.gov.pl). This dataset has been created within the framework of the European Language...
ZIP (348 views) (237 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of the Interior and Administration (Processed)
Polish-English parallel corpus from the website of the Ministry of the Interior and Administration, Republic of Poland (https://www.mswia.gov.pl/). This dataset has been created within...
ZIP (334 views) (241 Downloads)
-
Polish-English parallel corpus from the website of the Institute of Mathematics of the Polish Academy of Sciences (Processed)
Polish-English parallel corpus from the website of the Institute of Mathematics of the Polish Academy of Sciences (https://www.impan.gov.pl, which redirects to https://www.impan.pl). This dataset...
ZIP (346 views) (229 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Culture and National Heritage (Processed)
Polish-English parallel corpus from the website of the Ministry of Culture and National Heritage, Republic of Poland (http://www.mkidn.gov.pl). This dataset has been created within the...
ZIP (277 views) (179 Downloads)
-
Polish-English parallel corpus from the website of the Citizens Information Board (Processed)
Polish-English parallel corpus from the website of the Citizens Information Board, Ireland (http://www.citizensinformation.ie). This dataset has been created within the framework of the...
ZIP (373 views) (269 Downloads)
-
Polish-English parallel corpus from the website of the Polish Tourism Organisation (Processed)
Polish-English parallel corpus from the website of the Polish Tourism Organisation (https://pot.gov.pl/en). This dataset has been created within the framework of the European Language...
ZIP (621 views) (508 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Development (Processed)
Polish-English parallel corpus from the website of the Ministry of Development, Republic of Poland (http://www.mr.gov.pl). This dataset has been created within the framework of the...
ZIP (406 views) (296 Downloads)
-
Polish-English parallel corpus from the website of the National Science Centre (Processed)
Polish-English parallel corpus from the website of the National Science Centre (http://ncn.gov.pl). This dataset has been created within the framework of the European Language Resource...
ZIP (337 views) (224 Downloads)
-
Polish-English parallel corpus from the website "geoportal.gov.pl" (Processed)
Polish-English parallel corpus from the website "geoportal.gov.pl" (https://www.geoportal.gov.pl). This dataset has been created within the framework of the European Language Resource...
ZIP (195 views) (146 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Science and Higher Education (Processed)
Polish-English parallel corpus from the website of the Ministry of Science and Higher Education (http://www.eng.nauka.gov.pl/en/). This dataset has been created within the framework of...
ZIP (358 views) (255 Downloads)
-
ENGLISH/POLISH PHRASE BOOK FOR ADMINISTRATIVE STAFF of LOCAL GOVERNMENT UNITS (Processed)
An English/Polish phrase book for the administrative staff of local government units (LGUs). This dataset has been created within the framework of the European Language Resource...
ZIP (612 views) (514 Downloads)
-
Compendium The Social Insurance Institution (Processed)
A compendium on the Polish Social Insurance Institution (ZUS), covering the following issues: short presentation of ZUS, its history, tasks, organizational structure, employees, Social...
ZIP (323 views) (203 Downloads)
-
Parallel Global Voices (English - Polish) (Processed)
Parallel Global Voices EN-PL is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (606 views) (504 Downloads)