-
Polish-English parallel corpus from the website of the Ministry of Digitization (Processed)
Polish-English parallel corpus from the website of the Ministry of Digitization, Republic of Poland (http://mac.gov.pl) This dataset has been created within the framework of the European...
ZIP (330 views) (229 Downloads)
-
Polish-English parallel corpus from the website "Polish Aid" (Processed)
Polish-English parallel corpus from the website of the website "Polish Aid" (http://www.polskapomoc.gov.pl) This dataset has been created within the framework of the European Language...
ZIP (339 views) (233 Downloads)
-
Financial Stability Reports from the National Bank of Poland (2015-16) (Processed)
Financial Stability Reports from the National Bank of Poland (2015-16) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (316 views) (208 Downloads)
-
Parallel corpus from Bank of Estonia
Parallel corpus from content of Bank of Estonia website This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe...
ZIP (283 views) (180 Downloads)
-
Macroeconomic Developments (Processed)
Bulletins of Macroeconomic Developments This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (365 views) (1 Downloads)
-
Bilingual Icelandic-English parallel corpus from Statistics Iceland website
Contents of https://www.statice.is and https://hagstofa.is/ websites downloaded, aligned and converted into parallel corpus This dataset has been created within the framework of the...
ZIP (416 views) (317 Downloads)
-
English-Icelandic parallel corpus from Statistics Iceland (Processed)
English-Icelandic parallel corpus compiled from parallel content collected from Statistics Iceland English and Icelandic home pages https://www.statice.is/ This dataset has been created...
ZIP (481 views) (364 Downloads)
-
Monolingual documents from the Government of Lithuania (Processed)
Monolingual documents received from the Government of the Republic of Lithuania. (Processed) This dataset has been created within the framework of the European Language Resource...
ZIP (490 views) (376 Downloads)
-
Public Procurement Dataset 2 (Processed)
A collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in...
ZIP (441 views) (312 Downloads)
-
Methodological Reconciliation
Methodological Reconciliation Table Council Directive 2011_85_EU_3_2016 This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (581 views) (486 Downloads)
-
Bilingual Danish-English parallel corpus from the State Audit Office (Rigsrevisionen) website
Contents of http://rigsrevisionen.dk/ website downloaded, aligned and converted into parallel corpus This dataset has been created within the framework of the European Language Resource...
ZIP (322 views) (218 Downloads)
-
Bilingual English-Danish parallel corpus from Danish Ministry of Finance website
Contents of https://uk.fm.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (522 views) (407 Downloads)
-
English-Swedish parallel corpus from Annual Reports of the Swedish Pension System (Processed)
Source PDF files as parallel documents. The original texts are all always Swedish, the English text is its translation. This dataset has been created within the framework of the European...
ZIP (681 views) (578 Downloads)
-
Polish-English parallel corpus from the website of the Central Statistical Office (Processed)
Polish-English parallel corpus from the website of the Central Statistical Office (http://stat.gov.pl/) This dataset has been created within the framework of the European Language...
ZIP (346 views) (231 Downloads)
-
Polish-English parallel corpus from the website "Business in Poland" (Processed)
Polish-English parallel corpus from the website of the website "Business in Poland" (https://www.biznes.gov.pl/en) This dataset has been created within the framework of the European...
ZIP (217 views) (159 Downloads)
-
Bilingual English-Danish parallel corpus from The Danish Gambling Authority website
Contents of https://spillemyndigheden.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...
ZIP (326 views) (231 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Agriculture and Rural Development (Processed)
Polish-English parallel corpus from the website of the Ministry of Agriculture and Rural Development, Republic of Poland (http://www.minrol.gov.pl) This dataset has been created within...
ZIP (309 views) (202 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Justice (Processed)
Polish-English parallel corpus from the website of the Ministry of Justice, Republic of Poland (https://www.ms.gov.pl) This dataset has been created within the framework of the European...
ZIP (346 views) (248 Downloads)
-
Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website (Processed)
Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website. The resource contains pdf files with each file containing the text in...
ZIP (350 views) (256 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Development (Processed)
Polish-English parallel corpus from the website of the Ministry of Development, Republic of Poland (http://www.mr.gov.pl) This dataset has been created within the framework of the...
ZIP (406 views) (296 Downloads)