-
English-Swedish parallel corpus from the translation of 'Sweden a Pocket Guide' book (Processed)
A guide for foreigners who move to Sweden. Source language is Swedish. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (541 views) (371 Downloads)
-
English-Estonian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (534 views) (420 Downloads)
-
Parallel corpus from Social Insurance Agency -- Försäkringskassan (Sweden) (Processed)
Parallel corpus from Social Insurance Agency (Sweden) (Försäkringskassan) This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (530 views) (401 Downloads)
-
English-Estonian EASTIN-CL Multilingual Ontology of Assistive Technology
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
XML PDF ZIP (530 views) (427 Downloads)
-
Bilingual English-Norwegian parallel corpus from Norwegian Institute of Public Health website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (528 views) (434 Downloads)
-
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (Processed)
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (https://www.rpo.gov.pl/en) This dataset has been created within the framework of the...
ZIP (523 views) (420 Downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic (Processed)
Dataset of various English-Slovak legal texts within agenda of the Ministry, plain text format alligned at the sentence level, the size: 105791 words It is converted into a 2609-TUs...
ZIP (512 views) (400 Downloads)
-
Bilingual English-Danish parallel corpus from Aarhus 2017 - European Capital of Culture website
Contents of http://www.aarhus2017.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (508 views) (391 Downloads)
-
Bilingual English-Norwegian parallel corpus from KORO / Public Art Norway website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (505 views) (393 Downloads)
-
English-Finnish corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
ZIP (496 views) (378 Downloads)
-
Monolingual documents from the Government of Lithuania (Processed)
Monolingual documents received from the Government of the Republic of Lithuania. (Processed) This dataset has been created within the framework of the European Language Resource...
ZIP (490 views) (376 Downloads)
-
Bilingual English-Norwegian parallel corpus from the Immigration Appeals Board website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (487 views) (371 Downloads)
-
English-Latvian EASTIN-CL Multilingual Ontology of Assistive Technology
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
XML PDF ZIP (457 views) (358 Downloads)
-
National Health Fund Dataset
The dataset is a 274K-token Polish-English parallel resource in XLIFF format created on the basis of "Diagnosis-Related Groups in Europe" publication of the Polish National Health Fund....
XML PDF ZIP (452 views) (374 Downloads)
-
English-Estonian corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (439 views) (337 Downloads)
-
English-Swedish corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
ZIP (432 views) (327 Downloads)
-
Central Statistical Office Dataset (Processed)
Two Polish-English publications of the Polish Central Statistical Office in the XLIFF format: 1. "Statistical Yearbook of the Republic of Poland 2015" is the main summary publication...
ZIP (431 views) (326 Downloads)
-
Parallel corpus from Social Insurance Agency -- Försäkringskassan (Sweden)
Parallel corpus from Social Insurance Agency (Sweden) (Försäkringskassan) This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (429 views) (303 Downloads)
-
English-Danish EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (424 views) (316 Downloads)
-
Bilingual Icelandic-English parallel corpus from Statistics Iceland website
Contents of https://www.statice.is and https://hagstofa.is/ websites downloaded, aligned and converted into parallel corpus This dataset has been created within the framework of the...
ZIP (416 views) (317 Downloads)