-
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network) (Processed)
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network, https://eures.praca.gov.pl) This dataset has been created within the...
ZIP (549 views) (440 Downloads)
-
Bilingual Icelandic-English parallel corpus from Statistics Iceland website
Contents of https://www.statice.is and https://hagstofa.is/ websites downloaded, aligned and converted into parallel corpus This dataset has been created within the framework of the...
ZIP (416 views) (317 Downloads)
-
Monolingual documents from the Government of Lithuania (Processed)
Monolingual documents received from the Government of the Republic of Lithuania. (Processed) This dataset has been created within the framework of the European Language Resource...
ZIP (490 views) (376 Downloads)
-
Bilingual resource with Bulgarian strategic documents in the field of telecommunications and broadband (Bulgarian - English)
Bilingual collection of documents in the field of telecommunications and broadband, size on disk 440 kB, Bulgarian-English This dataset has been created within the framework of the...
ZIP (369 views) (258 Downloads)
-
Polish-English parallel corpus from the website of the ING Polish Art Foundation (Processed)
Polish-English parallel corpus from the website of the ING Polish Art Foundation (https://ingart.pl) This dataset has been created within the framework of the European Language Resource...
ZIP (310 views) (212 Downloads)
-
DA-EN Danish Ministry of Higher Education and Science 3 (Processed)
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size 110,000 words, topic: research policy (Processed) This dataset has been created within the...
ZIP (330 views) (216 Downloads)
-
English-Slovak corpus of annual reports on immigration and asylum policies from the EMN National Contact Point for the Slovak Republic website (Processed)
English-Slovak corpus of annual reports on immigration and asylum policies from the EMN National Contact Point for the Slovak Republic website (https://emn.sk/en/) This dataset has been...
ZIP (310 views) (209 Downloads)
-
Bilingual hr-en parallel corpus from the National and University Library in Zagreb website (Processed)
Contents of http://www.nsk.hr were crawled, aligned on document and sentence level and converted into a parallel corpus This dataset has been created within the framework of the European...
ZIP (384 views) (290 Downloads)
-
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (Processed)
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (https://www.rpo.gov.pl/en) This dataset has been created within the framework of the...
ZIP (523 views) (420 Downloads)
-
Bilingual documents Bulgarian-English in the field of ICT and Transport (Processed)
Bilingual collection of documents in the field of Internet governance, implementation of the Digital Agenda in Bulgaria, cloud computing and terminological dataset in Internet domain, and...
ZIP (368 views) (270 Downloads)
-
English-Swedish parallel corpus from the translation of 'Sweden a Pocket Guide' book (Processed)
A guide for foreigners who move to Sweden. Source language is Swedish. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (541 views) (371 Downloads)
-
Bilingual hr-en parallel corpus from Croatian Mine Action website (Processed)
Contents of http://www.hcr.hr website downloaded, aligned on document and segment level and converted into parallel corpus This dataset has been created within the framework of the...
ZIP (397 views) (297 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Digital Affairs (Processed)
Polish-English parallel corpus from the website of the Ministry of Digital Affairs (http://archiwum.mc.gov.pl and http://krmc.mc.gov.pl) This dataset has been created within the...
ZIP (314 views) (214 Downloads)
-
English-Swedish parallel corpus from Annual Reports of the Swedish Pension System (Processed)
Source PDF files as parallel documents. The original texts are all always Swedish, the English text is its translation. This dataset has been created within the framework of the European...
ZIP (681 views) (578 Downloads)
-
Polish-English parallel corpus from the website of the National Audiovisual Institute (Processed)
Polish-English parallel corpus from the website of the National Audiovisual Institute (http://www.nina.gov.pl) This dataset has been created within the framework of the European Language...
ZIP (348 views) (237 Downloads)
-
Bilingual English-Danish parallel corpus from The Agency for Culture and Palaces website
Contents of https://slks.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the European...
ZIP (359 views) (255 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of the Interior and Administration (Processed)
Polish-English parallel corpus from the website of the Ministry of the Interior and Administration, Republic of Poland (https://www.mswia.gov.pl/) This dataset has been created within...
ZIP (334 views) (241 Downloads)
-
Polish-English parallel corpus from the website of the Institute of Mathematics of the Polish Academy of Sciences (Processed)
Polish-English parallel corpus from the website of the Institute of Mathematics of the Polish Academy of Sciences (https://www.impan.gov.pl redirect to https://www.impan.pl) This dataset...
ZIP (346 views) (229 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Culture and National Heritage (Processed)
Polish-English parallel corpus from the website of the Ministry of Culture and National Heritage, Republic of Poland (http://www.mkidn.gov.pl) This dataset has been created within the...
ZIP (277 views) (179 Downloads)
-
Polish-English parallel corpus from the website of the Citizens Information Board (Processed)
Polish-English parallel corpus from the website of the Citizens Information Board, Ireland (http://www.citizensinformation.ie) This dataset has been created within the framework of the...
ZIP (373 views) (269 Downloads)