-
EuroVoc
EuroVoc is a multilingual, multidisciplinary thesaurus covering the activities of the EU. It contains terms in 24 EU languages (Bulgarian, Croatian, Czech, Danish, Dutch,...
XML HTML RDF XML ZIP (37050 views) (319 Downloads)
-
German-English website parallel corpus from the Federal Foreign Office Berlin
German-English texts extracted from the website of the Federal Foreign Office Berlin. This includes 53,849 pairs that were translated between October 2013 and the beginning of November...
XML PDF ZIP (1379 views) (1177 Downloads)
-
Multilingual Public Procurement Terminology
An internal terminology developed by the Polish Public Procurement Office containing 1408 terms in 11 languages (English, Danish, Spanish, German, Greek, French, Italian, Portuguese,...
XML PDF ZIP (1046 views) (930 Downloads)
-
Portuguese-English bilingual corpus from the Portuguese Constitution
Complete text of the Portuguese Constitution in Portuguese and English; bilingual TMX file in PT-EN. This dataset has been created within the framework of the European Language Resource...
XML PDF ZIP (1026 views) (884 Downloads)
-
English-Danish Parallel corpus from Tatoeba project
Parallel corpus of English-Danish translations from the tatoeba.org website. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
XML PDF ZIP (958 views) (815 Downloads)
-
BMVI Website
TMX file with 2718 TUs, bilingual German/English, texts from the website of the Federal Ministry of Transport and Digital Infrastructure (BMVI) on transport issues. This dataset has been...
XML PDF ZIP (914 views) (774 Downloads)
-
Polish Food Dataset
Polish Food is a quarterly issued by the Polish Ministry of Agriculture and Rural Development and The Agency for Restructuring and Modernisation of Agriculture. The dataset comprises a...
XML PDF ZIP (893 views) (736 Downloads)
-
English-Bulgarian Legal Terms
The resource is a bilingual terminological database representing 1175 terms in English and their translations to Bulgarian. The terms belong to the Law domain (Civil law, Criminal law,...
XML PDF ZIP (862 views) (715 Downloads)
-
English-Finnish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland. This dataset has been created within the framework of the European...
XML PDF ZIP (850 views) (724 Downloads)
-
Parallel Global Voices (Greek - English)
Parallel Global Voices EL-EN is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
XML PDF ZIP (835 views) (710 Downloads)
-
Parallel corpus (Bulgarian - English) in the public administration domain
Parallel (bg-en) corpus of 11262 translation units in the public administration domain. This dataset has been created within the framework of the European Language Resource Coordination...
XML PDF ZIP (814 views) (691 Downloads)
-
Corpus on Finance and Economics from Bank of Latvia
Contents of the websites https://makroekonomika.lv/ (Latvian) and https://www.macroeconomics.lv/ (English), aligned as a parallel corpus. This dataset has been created within the...
XML PDF ZIP (809 views) (693 Downloads)
-
Bilingual Croatian-English Parallel Corpus
Bilingual Croatian-English Parallel Corpus of 21340 translation units in the public administration domain. This dataset has been created within the framework of the European Language...
XML PDF ZIP (808 views) (701 Downloads)
-
BMVI Publications
TMX file with 11555 TUs, bilingual German/English, publications/brochures of the Federal Ministry of Transport and Digital Infrastructure on transport issues. This dataset has been...
XML PDF ZIP (779 views) (671 Downloads)
-
Orossimo Terminological Resource - Medicine & health
A bilingual terminological glossary extracted from academic discourse texts belonging to the Medicine & health domain. This dataset has been created within the framework of the...
XML PDF ZIP (766 views) (651 Downloads)
-
Health Multilingual Terminologies
17 multilingual medical terminologies from Termcat in the following domains: - Anatomy (3610 terms; languages: es, en, ca) - Integrated care (75 terms; languages: es, en, ca) -...
XML PDF ZIP (739 views) (628 Downloads)
-
ANR translation memory containing major publications, as well as several administrative documents and news
Documents / language resources from ANR – Translation memory (.xliff) fr>en(uk) containing 9611 translation units (17 Mb). Major publications: • Rapport d’activité 2014 (110...
XML PDF ZIP (717 views) (612 Downloads)
-
Polish Food 4 & Food Policy Dataset
A collection of Polish-English translations of the Polish Food quarterly published by the Polish Ministry of Agriculture, comprising issues 65-68 (85K words in 2473 segments) and the...
XML PDF ZIP (714 views) (610 Downloads)
-
BMI Brochures and Website 2016
Bilingual tmx file of German to English translations of the Federal Ministry of the Interior's website and brochures. Topics include terrorism, cyber security, asylum, cultural property,...
XML PDF ZIP (705 views) (602 Downloads)
-
Hallituskausi 2011-2015 fi-en
Information on the "Hallituskausi 2011–" translation memory: The "Hallituskausi 2011–" translation memory is intended for those translating administrative texts between Finnish and...
XML PDF ZIP (688 views) (573 Downloads)