Resources for Language Technologies
-
DGT-Translation Memory
DGT-Translation Memory DGT-TM er en oversættelseshukommelse (sætninger og deres manuelt fremstillede oversættelse) på 24 sprog. Den indeholder segmenter fra den gældende fællesskabsret –...
PDF ZIP (45005 visninger) (4502 Downloads)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 visninger) (122 Downloads)
-
EUIPO - list of goods and services Spanish and English (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (423 visninger) (313 Downloads)
-
EUIPO - list of goods and services German and Spanish (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (451 visninger) (335 Downloads)
-
EUIPO - list of goods and services Spanish and French (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (439 visninger) (334 Downloads)
-
EUIPO - list of goods and services Italian and Spanish (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (416 visninger) (317 Downloads)
-
Thematic Vocabulary of Geography (processed)
Thematic Vocabulary of Geography This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (214 visninger) (131 Downloads)
-
English-Spanish website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 909 TUs. Manual validation has been performed on a sample of the data. This dataset...
ZIP (291 visninger) (184 Downloads)
-
Electronics and opto-electronics Vocabulary (processed)
Vocabulary used for indexing bibliographical records dealing with “Electronics” in the PASCAL database, until 2014. This resource contains 4454 entries classified under 19 collections....
ZIP (205 visninger) (118 Downloads)
-
Spanish monolingual corpus from contents of Spanish State Official Gazette
Spanish monolingual corpus from contents of Spanish State Official Gazette as collected from https://boe.es/ web site This dataset has been created within the framework of the European...
ZIP (229 visninger) (143 Downloads)
-
PAeSI : Public Administration and Foreign Immigrants (Processed)
The service PAeSI for the immigration has been realized within of the project PAeSI (Public Administration and Foreign Immigrants), in order to prepare an electronic access to...
ZIP (278 visninger) (170 Downloads)
-
Parallel texts from Swedish Labour market agency (Processed)
Parallel texts, all in pdf files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (334 visninger) (237 Downloads)
-
Parallel Global Voices (Greek - Spanish)
Parallel Global Voices EL-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
(661 visninger) (557 Downloads)
-
Term lists and Dictionaries from Swedish Authorities
This resource also includes a Dictionary from the ELMN that has a set of terms translated from English to all the EU languages. The list of languages that is indicated with this resource...
ZIP (478 visninger) (365 Downloads)
-
Parallel texts from Swedish Work environment Authority (Processed)
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (615 visninger) (487 Downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Spanish) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (220 visninger) (119 Downloads)
-
Parallel Global Voices (English - Spanish) (Processed)
Parallel Global Voices EN-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (227 visninger) (142 Downloads)
-
Parallel texts from Swedish Labour market agency. Part 2 (Processed)
Same as part 1, but with the Readme-file. (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (439 visninger) (332 Downloads)
-
CHARTER OF VALUES OF CITIZENSHIP AND INTEGRATION (Processed)
The integration agreement form prepared for signing the pact between foreign and state, in addition to providing the alien's commitments, indicates, the statement by the person concerned,...
ZIP (291 visninger) (184 Downloads)
-
Parallel texts from Swedish Work environment Authority
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (406 visninger) (296 Downloads)