Resources for Language Technologies
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 visninger) (122 Downloads)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page, (Processed) This dataset has been created within the framework of the European Language...
ZIP (666 visninger) (557 Downloads)
-
EUIPO - list of goods and services German and English (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (427 visninger) (314 Downloads)
-
Parallel Global Voices (English - German) (Processed)
Parallel Global Voices EN-DE is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (246 visninger) (137 Downloads)
-
Austrian Criminal Office Police Glossary
An English/Austrian German glossary of Police-related terminology This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
XML PDF ZIP (618 visninger) (513 Downloads)
-
Vienna Environmental Report2004/2005 (Processed)
2221 Translation Pairs (DE-AT, EN) in a report on environment by the City of Vienna (Stadt Wien). In a stripped and validated TMX file. This dataset has been created within the framework...
ZIP (127 visninger) (72 Downloads)
-
Webpage_Foreign_Office_AA_de-en_2016 (Processed)
Translations from the Website of the German Foreign Office, 2016. Contains 12555 TUs. DE-EN This dataset has been created within the framework of the European Language Resource...
ZIP (179 visninger) (99 Downloads)
-
Energy Report of the City of Vienna (Processed)
Data for 2015/ Year of reporting 2017, Municipal Department 20. Contains 481 TUs in EN-AT and EN, checked and stripped This dataset has been created within the framework of the European...
ZIP (196 visninger) (115 Downloads)
-
Parallel texts from Swedish Labour market agency (Processed)
Parallel texts, all in pdf files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (334 visninger) (237 Downloads)
-
BMVI Publications (Processed)
TMX file with 11555 TUs, bilingual German/English, publications/brochures of the Federal Ministry of Transport and Digital Infrastructure on transport issues. During processing of the...
ZIP (368 visninger) (261 Downloads)
-
ISAP Legal Terminology (Processed).
Words relating to law from Informační systém pro aproximaci práva (ISAP) in the Czech Republic. Terms are in Czech and English, French and sometimes in German. For the time being, one...
ZIP (272 visninger) (170 Downloads)
-
Term lists and Dictionaries from Swedish Authorities
This resource also includes a Dictionary from the ELMN that has a set of terms translated from English to all the EU languages. The list of languages that is indicated with this resource...
ZIP (478 visninger) (365 Downloads)
-
German-English website parallel corpus from the Federal Foreign Office Berlin (Processed)
German-English texts extracted from the website of the Federal Foreign Office Berlin. This includes 53,849 pairs that were translated between October 2013 and the beginning of November...
ZIP (262 visninger) (161 Downloads)
-
University of Vienna Termbanks
3 Termbanks about Risk Management, Austrian Asylum Law and University Law/Education Administration This dataset has been created within the framework of the European Language Resource...
XML PDF ZIP (596 visninger) (487 Downloads)
-
Parallel texts from Swedish Work environment Authority (Processed)
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (615 visninger) (487 Downloads)
-
National Bank of Belgium Terminology (Processed)
A termbase in 4 languages by the National Bank of Belgium. Includes descriptions. Transformed onto TBX This dataset has been created within the framework of the European Language...
ZIP (290 visninger) (180 Downloads)
-
Parallel texts from Swedish Labour market agency. Part 2 (Processed)
Same as part 1, but with the Readme-file. (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (439 visninger) (332 Downloads)
-
Webpage_Foreign_Office_AA_de-en_2017 (Processed)
Translations from the Website of the German Foreign Office, 2017. Contains 24727 TUs. DE-EN This dataset has been created within the framework of the European Language Resource...
ZIP (257 visninger) (171 Downloads)
-
Webpage_Foreign_Office_AA_de-en_2018 (Processed)
Translations from the Website of the German Foreign Office, 2018. Contains 20554 TUs. DE-EN This dataset has been created within the framework of the European Language Resource...
ZIP (150 visninger) (78 Downloads)
-
2017 Activity Report Hohe Tauern National Park (Processed)
1020 Translation Units about the activities of the Hohe TauernNational Park (DE-AT, EN-GB). in TMX stripped and validated files. This dataset has been created within the framework of the...
ZIP (263 visninger) (178 Downloads)