Directorate-General for Communications Networks, Content and Technology
-
Compendium The Social Insurance Institution (Processed)
A compendium on the Polish Social Insurance Institution (ZUS), covering the following issues: short presentation of ZUS, its history, tasks, organizational structure, employees, Social...
ZIP (323 views) (203 downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic (Processed)
Dataset of various English-Slovak legal texts within the agenda of the Ministry, in plain text format, aligned at the sentence level. Size: 105791 words. It is converted into a 2609-TUs...
ZIP (512 views) (400 downloads)
-
Romanian - English literature corpus (Processed)
Bilingual Romanian – English literature corpus built from a small set of freely available literature books (drama, sci-fi, etc.). The texts are positionally aligned, i.e. the sentence on...
ZIP (594 views) (474 downloads)
-
Central Statistical Office Dataset (Processed)
Two Polish-English publications of the Polish Central Statistical Office in the XLIFF format: 1. "Statistical Yearbook of the Republic of Poland 2015" is the main summary publication...
ZIP (431 views) (326 downloads)
-
English-Danish EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (424 views) (316 downloads)
-
Monolingual documents from the Government of Lithuania
Monolingual documents received from the Government of the Republic of Lithuania. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (299 views) (185 downloads)
-
Parallel texts from Swedish Labour market agency
Parallel texts, all in pdf files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (357 views) (249 downloads)
-
OROSSIMO Corpus - Medicine & health
A corpus of academic discourse texts belonging to the Medicine & health domain (according to the Dewey Decimal classification, DDC61 - Medicine & health), annotated at structural...
ZIP (568 views) (481 downloads)
-
English-Danish Parallel corpus from Tatoeba project (Processed)
Parallel corpus of English-Danish translations from the tatoeba.org website. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (566 views) (466 downloads)
-
Expression of interest
International call for expression of interest for the selection of the President of the Hellenic Statistical Authority (EL.STAT.). This dataset has been created within the framework of...
ZIP (528 views) (422 downloads)
-
Slovak corpus of texts from the Ministry of Culture of the Slovak Republic
Dataset of Slovak legal texts within the agenda of the Ministry, in plain text format. Size: 108448 words. This dataset has been created within the framework of the European Language...
ZIP (560 views) (438 downloads)
-
Collection of Greek National Spatial Plans
Dataset, 268KB, 5 txt files, national spatial plans (general, aquaculture, tourism, industry, RES, detention facilities). This dataset has been created within the framework of the...
ZIP (211 views) (170 downloads)
-
English-Lithuanian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (368 views) (256 downloads)
-
OROSSIMO Corpus - Photography, film & video
A corpus of academic discourse texts belonging to the Photography, film & video domain (according to the Dewey Decimal classification, DDC77 - Photography, computer art, film &...
ZIP (572 views) (447 downloads)
-
Parallel corpus from Social Insurance Agency -- Försäkringskassan (Sweden) (Processed)
Parallel corpus from Social Insurance Agency (Sweden) (Försäkringskassan). This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (530 views) (401 downloads)
-
Parallel corpus from Social Insurance Agency -- Försäkringskassan (Sweden)
Parallel corpus from Social Insurance Agency (Sweden) (Försäkringskassan). This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (429 views) (303 downloads)
-
English-Latvian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (328 views) (226 downloads)
-
Parallel texts from Swedish Social Security Authority (Processed)
Parallel texts, email templates and forms in pdf file format. Original in Swedish, all the other texts are translations. One original with translations per folder. Language info is...
ZIP (349 views) (238 downloads)
-
Parallel texts from Swedish Social Security Authority
Parallel texts, email templates and forms in pdf file format. Original in Swedish, all the other texts are translations. One original with translations per folder. Language info is...
ZIP (336 views) (233 downloads)
-
Parallel corpus from Social Insurance Agency - Socialstyrelsen (Sweden)
Large term bank of medical terms in Swedish, with an explanation for each term in Swedish. This dataset has been created within the framework of the European Language Resource Coordination...
ZIP (389 views) (294 downloads)