-
DGT-Translation Memory
DGT-TM is a translation memory (sentences and their manually produced translations) in 24 languages. It contains segments from the Acquis Communautaire, the body of European legislation,...
ZIP (45005 views) (4502 Downloads)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 views) (122 Downloads)
-
Translations of Hungarian from public websites
A webcrawl of 14 different websites covering parallel corpora of Hungarian with Polish, Czech, Swedish, Finnish, French, German, Italian, English and Slovenian This dataset has been...
ZIP (388 views) (287 Downloads)
-
English-Swedish parallel corpus from the www.visitestonia.com web site
Parallel English-Swedish corpus compiled from the www.visitestonia.com web site by crawling the contents and aligning the parallel data. This dataset has been created within the...
ZIP (317 views) (201 Downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Swedish) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (317 views) (203 Downloads)
-
English-Swedish parallel corpus from the web site of Finnish Tax Administration
English-Swedish parallel corpus created from the contents of Finnish Tax Administration web site https://www.vero.fi/ This dataset has been created within the framework of the European...
ZIP (273 views) (162 Downloads)
-
Parallel Global Voices (English - Swedish) (Processed)
Parallel Global Voices EN-SV is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (272 views) (167 Downloads)
-
Khresmoi (Processed)
Parallel data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, German, Hungarian, Polish, Spanish...
ZIP (266 views) (184 Downloads)
-
English-Swedish parallel corpus from the Prime Minister's Office of Finland web site
English-Swedish parallel corpus created from the web site of Prime Minister's Office of Finland This dataset has been created within the framework of the European Language Resource...
ZIP (264 views) (168 Downloads)
-
English-Swedish parallel corpus from the Finnish Government web site
English-Swedish parallel corpus created from the parallel content of the Finnish Government web site https://valtioneuvosto.fi/ This dataset has been created within the framework of the...
ZIP (261 views) (161 Downloads)
-
Avibase (processed)
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (258 views) (152 Downloads)
-
English-Swedish parallel corpus from the contents of City of Turku web site
English-Swedish parallel corpus built from the contents of City of Turku web site http://www.turku.fi/ This dataset has been created within the framework of the European Language...
ZIP (252 views) (156 Downloads)
-
English-Swedish parallel corpus from National Audit Office of Finland
English-Swedish parallel corpus compiled from the contents of National Audit Office of Finland web site http://www.vtv.fi/, mainly audit reports This dataset has been created within the...
ZIP (154 views) (87 Downloads)