-
DGT-TM translation memory
DGT-TM is a translation memory covering 24 languages, consisting of sentences and their manually produced translations. It contains segments from the body of European legislation, the Acquis Communautaire, which includes...
PDF ZIP (45005 views) (4502 downloads)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 views) (122 downloads)
-
Romanian – English parallel wordlists
English and Romanian lemmatized wordlists extracted from various resources (including RO-EN Wordnets, the Romanian – English news corpus, the Romanian – English literature corpus, and...
ZIP (885 views) (765 downloads)
-
IATE
IATE (= “Inter-Active Terminology for Europe”) is the EU's inter-institutional terminology database. IATE has been used by the language services of the EU institutions and agencies since...
HTML JavaScript ZIP (6456 views) (6082 downloads)
-
National Health Fund Dataset (Processed)
The dataset is a 274K-token Polish-English parallel resource in XLIFF format, created on the basis of the "Diagnosis-Related Groups in Europe" publication of the Polish National Health Fund....
ZIP (345 views) (231 downloads)
-
DA-EN Danish Ministry of Higher Education and Science 2
Danish-English parallel texts from the Danish Ministry of Higher Education and Science, size: 115,000 words, topic: research policy. This dataset has been created within the framework of...
ZIP (333 views) (209 downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic
Dataset of various English-Slovak legal texts within the agenda of the Ministry, in plain text format aligned at the sentence level, size: 105791 words. This dataset has been created within...
ZIP (357 views) (249 downloads)
-
Romanian – English literature corpus
Bilingual Romanian - English literature corpus built from a small set of freely available literature books (drama, sci-fi, etc.). The texts are positionally aligned, i.e. the sentence on...
ZIP (411 views) (321 downloads)
-
English-Estonian corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland. This dataset has been created within the framework of the European...
ZIP (288 views) (186 downloads)
-
English-Swedish corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland. This dataset has been created within the framework of the European...
ZIP (432 views) (327 downloads)
-
English-Finnish corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland. This dataset has been created within the framework of the European...
ZIP (496 views) (378 downloads)
-
English-Estonian corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland. This dataset has been created within the framework of the European...
XML PDF ZIP (439 views) (337 downloads)
-
English-Swedish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland. This dataset has been created within the framework of the European...
XML PDF ZIP (641 views) (524 downloads)
-
English-Finnish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland. This dataset has been created within the framework of the European...
XML PDF ZIP (850 views) (724 downloads)
-
English-Estonian Parallel corpus compiled from translated annual reports from Estonian Academy of Sciences
English-Estonian translated annual reports as source data for a parallel corpus, collected from the website of the Estonian Academy of Sciences (http://www.akadeemia.ee/). This dataset has...
ZIP (296 views) (204 downloads)
-
Parallel Global Voices (Greek - Spanish)
Parallel Global Voices EL-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
(661 views) (557 downloads)
-
Bilingual documents Bulgarian-English in the field of open data, broadband and information society (Processed)
English-Bulgarian collection in the field of open data, broadband, strategic document of the Information society in the Republic of Bulgaria. This dataset has been created within the...
ZIP (503 views) (389 downloads)
-
English-Estonian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (534 views) (420 downloads)
-
Polish-English parallel corpus from the website of the National Digital Archives (Processed)
Polish-English parallel corpus from the website of the National Digital Archives (https://www.nac.gov.pl). This dataset has been created within the framework of the European Language...
ZIP (412 views) (313 downloads)
-
DA-EN Danish Ministry of Higher Education and Science
Danish-English parallel texts from the Danish Ministry of Higher Education and Science, size: 120,000 words, topic: innovation, science. This dataset has been created within the framework...
ZIP (453 views) (360 downloads)