German-English website parallel corpus from the Federal Foreign Office Berlin
German-English texts extracted from the website of the Federal Foreign Office Berlin. This includes 53,849 pairs that were translated between October 2013 and the beginning of November...
XML PDF ZIP (362 views) (120 Downloads)
Portuguese-English bilingual corpus from the Portuguese Constitution
Complete text of the Portuguese Constitution in Portuguese and English; Bilingual tmx file in PT-EN
XML PDF ZIP (328 views) (77 Downloads)
English-Danish Parallel corpus from Tatoeba project
Parallel corpus from English-Danish translations from tatoeba.org website
XML PDF ZIP (317 views) (72 Downloads)
Polish Food Dataset
Polish Food is a quarterly issued by the Polish Ministry of Agriculture and Rural Development and The Agency for Restructuring and Modernisation of Agriculture. The dataset comprises a...
XML PDF ZIP (298 views) (93 Downloads)
tmx file, 2718 TUs, bilingual German/English, texts from the website of the Federal Ministry of Transport and Digital Infrastructure (BMVI) on transport issues
XML PDF ZIP (292 views) (56 Downloads)
Corpus on Finance and Economics from Bank of Latvia
Contents of web site https://makroekonomika.lv/ -- Latvian and https://www.macroeconomics.lv/ -- English aligned as a parallel corpus
XML PDF ZIP (266 views) (63 Downloads)
TMX file with 11555 TUs, bilingual German/English, publications/brochures of the Federal Ministry of Transport and Digital Infrastructure on transport issues
XML PDF ZIP (251 views) (57 Downloads)
Bilingual Croatian-English Parallel Corpus
Bilingual Croatian-English Parallel Corpus of 21340 translation units in the public administration domain.
XML PDF ZIP (249 views) (72 Downloads)
Parallel corpus (Bulgarian - English) in the public administration domain
Parallel (bg-en) corpus of 11262 translation units in the public administration domain.
XML PDF ZIP (246 views) (60 Downloads)
ANR translation memory containing major publications, as well as several administrative documents and news
Documents / language resources from ANR – Translation memory (.xliff) fr>en(uk) containing 9611 translation units (17 Mb) Major publications • Rapport d’activité 2014 (110 pages)...
XML PDF ZIP (242 views) (21 Downloads)
BMI Brochures and Website 2016
Bilingual tmx file of German to English translations of the Federal Ministry of the Interior's website and brochures. Topics include terrorism, cyber security, asylum, cultural property,...
XML PDF ZIP (240 views) (59 Downloads)
Parallel Global Voices (Greek - English)
Parallel Global Voices EL-EN is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
XML PDF ZIP (235 views) (62 Downloads)
English-Finnish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland
XML PDF ZIP (223 views) (16 Downloads)
Polish Food 4 & Food Policy Dataset
A collection of Polish-English translations of the Polish Food quarterly published by the Polish Ministry of Agriculture, comprising issues 65-68 (85K words in 2473 segments) and the...
XML PDF ZIP (222 views) (60 Downloads)
Parallel Corpus from the Web Site of the the MFA of Latvia
The Corpus has been built from the News and Press Releases published in the Web Site of the Ministry of Foreign Affairs of the Republic of Latvia.
XML PDF ZIP (216 views) (54 Downloads)
Romanian Ombudsman archive
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
XML PDF ZIP (204 views) (16 Downloads)
Civil Aviation Regulations
A collection of parallel Polish-English texts published by the Polish Civil Aviation Office and complemented by two acts of Civil Aviation Legislation. Sentence-level alignment of...
XML PDF ZIP (190 views) (7 Downloads)
Hallituskausi 2011-2015 fi-en
Information on the 'Hallituskausi 2011–' translation memory: The 'Hallituskausi 2011–' translation memory is intended for those translating administrative texts between Finnish and...
XML PDF ZIP (184 views) (54 Downloads)
Central Statistical Office Dataset
Two Polish-English publications of the Polish Central Statistical Office in the XLIFF format: 1. 'Statistical Yearbook of the Republic of Poland 2015' is the main summary publication of...
XML PDF ZIP (178 views) (11 Downloads)
English-Icelandic parallel corpus from Statistics Iceland
English-Icelandic parallel corpus compiled from parallel content collected from Statistics Iceland English and Icelandic home pages
XML PDF ZIP (167 views) (6 Downloads)