-
English-Finnish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (850 views) (724 Downloads)
-
English-Swedish parallel corpus from Annual Reports of the Swedish Pension System (Processed)
Source PDF files as parallel documents. The original texts are all always Swedish, the English text is its translation. This dataset has been created within the framework of the European...
ZIP (681 views) (578 Downloads)
-
English-Swedish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (641 views) (524 Downloads)
-
Polish-English parallel corpus from the website of the Polish Tourism Organisation (Processed)
Polish-English parallel corpus from the website of the Polish Tourism Organisation (https://pot.gov.pl/en) This dataset has been created within the framework of the European Language...
ZIP (621 views) (508 Downloads)
-
ENGLISH/POLISH PHRASE BOOK FOR ADMINISTRATIVE STAFF of LOCAL GOVERNMENT UNITS (Processed)
An English/Polish phrase book for the administrative staff of local government units (LGUs). This dataset has been created within the framework of the European Language Resource...
ZIP (612 views) (514 Downloads)
-
Romanian - English literature corpus (Processed)
Bilingual Romanian – English literature corpus built from a small set of freely available literature books (drama, sci-fi, etc.). The texts are positionally aligned, i.e. the sentence on...
ZIP (594 views) (474 Downloads)
-
English-Danish Parallel corpus from Tatoeba project (Processed)
Parallel corpus from English-Danish translations from tatoeba.org website This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (566 views) (466 Downloads)
-
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network) (Processed)
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network, https://eures.praca.gov.pl) This dataset has been created within the...
ZIP (549 views) (440 Downloads)
-
English-Swedish parallel corpus from the translation of 'Sweden a Pocket Guide' book (Processed)
A guide for foreigners who move to Sweden. Source language is Swedish. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (541 views) (371 Downloads)
-
English-Estonian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (534 views) (420 Downloads)
-
Parallel corpus from Social Insurance Agency -- Försäkringskassan (Sweden) (Processed)
Parallel corpus from Social Insurance Agency (Sweden) (Försäkringskassan) This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (530 views) (401 Downloads)
-
Bilingual English-Norwegian parallel corpus from Norwegian Institute of Public Health website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (528 views) (434 Downloads)
-
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (Processed)
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (https://www.rpo.gov.pl/en) This dataset has been created within the framework of the...
ZIP (523 views) (420 Downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic (Processed)
Dataset of various English-Slovak legal texts within agenda of the Ministry, plain text format alligned at the sentence level, the size: 105791 words It is converted into a 2609-TUs...
ZIP (512 views) (400 Downloads)
-
Bilingual English-Danish parallel corpus from Aarhus 2017 - European Capital of Culture website
Contents of http://www.aarhus2017.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (508 views) (391 Downloads)
-
Bilingual English-Norwegian parallel corpus from KORO / Public Art Norway website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (505 views) (393 Downloads)
-
English-Finnish corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
ZIP (496 views) (378 Downloads)
-
Bilingual English-Norwegian parallel corpus from the Immigration Appeals Board website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (487 views) (371 Downloads)
-
English-Estonian corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (439 views) (337 Downloads)
-
English-Swedish corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
ZIP (432 views) (327 Downloads)