Spanish-English website parallel corpus
This is a parallel corpus of bilingual texts crawled from multilingual websites, containing 21,007 TUs. Crawling period: 15/11/2016 - 23/01/2017. A strict validation process has...
Orossimo Terminological Resource - Medicine & health
A bilingual terminological glossary extracted from academic discourse texts belonging to the Medicine & health domain.
German-English website parallel corpus from the Federal Foreign Office Berlin
German-English texts extracted from the website of the Federal Foreign Office Berlin. This includes 53,849 pairs that were translated between October 2013 and the beginning of November...
Multilingual Public Procurement Terminology
An internal terminology developed by the Polish Public Procurement Office containing 1408 terms in 11 languages (English, Danish, Spanish, German, Greek, French, Italian, Portuguese,...
English-Danish Parallel corpus from Tatoeba project
Parallel corpus of English-Danish translations from the tatoeba.org website
Portuguese-English bilingual corpus from the Portuguese Constitution
Complete text of the Portuguese Constitution in Portuguese and English; bilingual TMX file in PT-EN
Polish Food Dataset
Polish Food is a quarterly issued by the Polish Ministry of Agriculture and Rural Development and The Agency for Restructuring and Modernisation of Agriculture. The dataset comprises a...
Health Multilingual Terminologies
17 multilingual medical terminologies from Termcat in the following domains: - Anatomy (3610 terms; languages: es, en, ca) - Integrated care (75 terms; languages: es, en, ca) -...
Romanian Ombudsman archive
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
TMX file with 2718 TUs, bilingual German/English, texts from the website of the Federal Ministry of Transport and Digital Infrastructure (BMVI) on transport issues
Bilingual Croatian-English Parallel Corpus
Bilingual Croatian-English Parallel Corpus of 21340 translation units in the public administration domain.
BMI Brochures and Website 2016
Bilingual TMX file of German-to-English translations of the Federal Ministry of the Interior's website and brochures. Topics include terrorism, cyber security, asylum, cultural property,...
Corpus on Finance and Economics from Bank of Latvia
Contents of the websites https://makroekonomika.lv/ (Latvian) and https://www.macroeconomics.lv/ (English), aligned as a parallel corpus
ANR translation memory containing major publications, as well as several administrative documents and news
Documents / language resources from ANR – Translation memory (.xliff), fr>en(uk), containing 9611 translation units (17 MB). Major publications: • Rapport d’activité 2014 (110 pages)...
Romanian – English parallel wordlists
English and Romanian lemmatized wordlists extracted from various resources (including RO-EN Wordnets, the Romanian – English news corpus, the Romanian – English literature corpus, and...
TMX file with 11555 TUs, bilingual German/English, publications/brochures of the Federal Ministry of Transport and Digital Infrastructure on transport issues
English-Finnish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland
Parallel corpus (Bulgarian - English) in the public administration domain
Parallel (bg-en) corpus of 11262 translation units in the public administration domain.
Secretariat-General parallel corpus SL-EN and EN-SL (part 1)
English-Slovenian parallel corpus in TMX format from the Secretariat-General of the Government of the Republic of Slovenia in the legal domain
Austrian Armed Forces Military Dictionaries
A collection of military dictionaries in the following language pairs: Austrian German-Hungarian, Austrian German-Italian, Austrian German-English, Austrian German-French