-
DGT-Translation Memory
DGT-TM is a translation memory (sentences and their manually produced translations) in 24 languages. It contains segments from the Acquis Communautaire, the body of European legislation,...
PDF ZIP (45005 views) (4502 Downloads)
-
EuroVoc
EuroVoc is a multilingual, multidisciplinary thesaurus covering the activities of the EU. It contains terms in 24 EU languages (Bulgarian, Croatian, Czech, Danish, Dutch,...
XML HTML RDF XML ZIP (37050 views) (319 Downloads)
-
[DEPRECATED] Official Journals of the European Union (English)
This Dataset has been deprecated, and it is now replaced by the following datasets: Official Journals of the European Union 2021 Official Journals of the European Union 2020...
PDF HTML Formex 4 ZIP Excel XLS (4713 views) (7 Downloads)
-
Spanish-English website parallel corpus
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 21,007 TUs. Period of crawling : 15/11/2016 - 23/01/2017 A strict validation...
ZIP (3291 views) (126 Downloads)
-
[DEPRECATED] Official Journals of the European Union (Spanish)
This Dataset has been deprecated, and it is now replaced by the following datasets: Official Journals of the European Union 2021 Official Journals of the European Union 2020...
PDF HTML Formex 4 ZIP Excel XLS (1776 views) (1643 Downloads)
-
Agricultural and Vegetable Catalogue
The seed of varieties of agricultural and plant species and varieties of vegetable species that are published in the EU level Common Catalogue is subject to no marketing restrictions with...
HTML ZIP (1733 views) (1602 Downloads)
-
Multilingual Public Procurement Terminology
An internal terminology developed by the Polish Public Procurement Office containing 1408 terms in 11 languages (English, Danish, Spanish, German, Greek, French, Italian, Portugese,...
XML PDF ZIP (1046 views) (930 Downloads)
-
Spanish-Portuguese website parallel corpus
This is a parallel and aligned corpus of bilingual texts crawled from multilingual websites, which contains 1,249 TUs. This dataset has been created within the framework of the European...
ZIP (780 views) (645 Downloads)
-
Health Multilingual Terminologies
17 multilingual medical terminologies from Termcat in the following domains: - Anatomy (3610 terms; languages: es, en, ca) - Integrated care (75 terms; languages: es, en,ca) -...
XML PDF ZIP (739 views) (628 Downloads)
-
Spanish-French website parallel corpus
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 15,797 TUs. Period of crawling : 15/11/2016 - 23/01/2017. A strict validation...
ZIP (643 views) (535 Downloads)
-
Parallel texts from Swedish Work environment Authority (Processed)
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (615 views) (487 Downloads)
-
Spanish-Italian website parallel corpus
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 3,319 TUs. Date of crawling : 23/01/2017 A strict validation process has been...
ZIP (586 views) (483 Downloads)
-
Parallel texts from Swedish Labour market agency. Part 2
Same as part 1, but with the Readme-file. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (548 views) (435 Downloads)
-
Spanish-English website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 21,007 TUs. Period of crawling : 15/11/2016 - 23/01/2017 A strict validation...
ZIP (509 views) (413 Downloads)
-
Parallel texts from Swedish National Food Agency
Parallel texts in pdf file format. Original in Swedish, all the other texts are translations. One original with translations per folder. Language info is included in the file's name....
ZIP (478 views) (367 Downloads)
-
Term lists and Dictionaries from Swedish Authorities
This resource also includes a Dictionary from the ELMN that has a set of terms translated from English to all the EU languages. The list of languages that is indicated with this resource...
ZIP (478 views) (365 Downloads)
-
EUIPO - IP case law Spanish-English (Processed)
EUIPO - IP case law (BOA) Spanish-English Years: 2002-2017 This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe...
ZIP (453 views) (335 Downloads)
-
EUIPO - list of goods and services German and Spanish (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (451 views) (335 Downloads)
-
Parallel texts from Swedish Labour market agency. Part 2 (Processed)
Same as part 1, but with the Readme-file. (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (439 views) (332 Downloads)
-
EUIPO - list of goods and services Spanish and French (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (439 views) (334 Downloads)