-
DGT-Translation Memory
DGT-TM is a translation memory (sentences and their manually produced translations) in 24 languages. It contains segments from the Acquis Communautaire, the body of European legislation,...
ZIP (45005 views) (4502 Downloads)
-
EuroVoc
EuroVoc is a multilingual, multidisciplinary thesaurus covering the activities of the EU. It contains terms in 24 EU languages (Bulgarian, Croatian, Czech, Danish, Dutch,...
XML HTML RDF XML ZIP (37050 views) (319 Downloads)
-
[DEPRECATED] Official Journals of the European Union (Swedish)
This Dataset has been deprecated, and it is now replaced by the following datasets: Official Journals of the European Union 2021 Official Journals of the European Union 2020...
PDF HTML Formex 4 ZIP Excel XLS (1635 views) (1506 Downloads)
-
[DEPRECATED] Official Journals of the European Union (English)
This Dataset has been deprecated, and it is now replaced by the following datasets: Official Journals of the European Union 2021 Official Journals of the European Union 2020...
PDF HTML Formex 4 ZIP Excel XLS (4713 views) (7 Downloads)
-
English-Swedish parallel corpus from the www.visitestonia.com web site
Parallel English-Swedish corpus compiled from the www.visitestonia.com web site by crawling the contents and aligning the parallel data. This dataset has been created within the...
ZIP (317 views) (201 Downloads)
-
English-Swedish parallel corpus from the web site of Finnish Tax Administration
English-Swedish parallel corpus created from the contents of Finnish Tax Administration web site https://www.vero.fi/ This dataset has been created within the framework of the European...
ZIP (273 views) (162 Downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Swedish) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (317 views) (203 Downloads)
-
English-Swedish parallel corpus from the contents of City of Turku web site
English-Swedish parallel corpus built from the contents of City of Turku web site http://www.turku.fi/ This dataset has been created within the framework of the European Language...
ZIP (252 views) (156 Downloads)
-
Finnish legislation as a Swedish monolingual corpus
Finnish legislation as a Swedish monolingual corpus This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (286 views) (179 Downloads)
-
Translations of Hungarian from public websites
A webcrawl of 14 different websites covering parallel corpora of Hungarian with Polish, Czech, Swedish, Finnish, French, German, Italian, English and Slovenian This dataset has been...
ZIP (388 views) (287 Downloads)
-
Parallel Global Voices (English - Swedish) (Processed)
Parallel Global Voices EN-SV is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (272 views) (167 Downloads)
-
English-Swedish parallel corpus from the Finnish Government web site
English-Swedish parallel corpus created from the parallel content of the Finnish Government web site https://valtioneuvosto.fi/ This dataset has been created within the framework of the...
ZIP (261 views) (161 Downloads)
-
English-Swedish parallel corpus from the Prime Minister's Office of Finland web site
English-Swedish parallel corpus created from the web site of Prime Minister's Office of Finland This dataset has been created within the framework of the European Language Resource...
ZIP (264 views) (168 Downloads)
-
Khresmoi (Processed)
Parallel data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, German, Hungarian, Polish, Spanish...
ZIP (266 views) (184 Downloads)
-
English-Swedish parallel corpus from National Audit Office of Finland
English-Swedish parallel corpus compiled from the contents of National Audit Office of Finland web site http://www.vtv.fi/, mainly audit reports This dataset has been created within the...
ZIP (154 views) (87 Downloads)
-
Avibase (processed)
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (258 views) (152 Downloads)