Resources for Language Technologies
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 views) (122 downloads)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page (Processed). This dataset has been created within the framework of the European Language...
ZIP (666 views) (557 downloads)
-
EUIPO - list of goods and services German and English (Processed)
EUIPO list of goods and services. Format: TMX. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (427 views) (314 downloads)
-
Parallel Global Voices (English - German) (Processed)
Parallel Global Voices EN-DE is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (246 views) (137 downloads)
-
Austrian Criminal Office Police Glossary
An English/Austrian German glossary of police-related terminology. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
XML PDF ZIP (618 views) (513 downloads)
-
Vienna Environmental Report 2004/2005 (Processed)
2221 translation pairs (DE-AT, EN) from a report on the environment by the City of Vienna (Stadt Wien), in a stripped and validated TMX file. This dataset has been created within the framework...
ZIP (127 views) (72 downloads)
-
Webpage_Foreign_Office_AA_de-en_2016 (Processed)
Translations from the website of the German Foreign Office, 2016. Contains 12555 TUs, DE-EN. This dataset has been created within the framework of the European Language Resource...
ZIP (179 views) (99 downloads)
-
Energy Report of the City of Vienna (Processed)
Data for 2015 / year of reporting 2017, Municipal Department 20. Contains 481 TUs in EN-AT and EN, checked and stripped. This dataset has been created within the framework of the European...
ZIP (196 views) (115 downloads)
-
Parallel texts from Swedish Labour market agency (Processed)
Parallel texts, all in PDF files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (334 views) (237 downloads)
-
BMVI Publications (Processed)
TMX file with 11555 TUs, bilingual German/English, publications/brochures of the Federal Ministry of Transport and Digital Infrastructure on transport issues. During processing of the...
ZIP (368 views) (261 downloads)
-
ISAP Legal Terminology (Processed)
Legal terms from Informační systém pro aproximaci práva (ISAP) in the Czech Republic. Terms are in Czech, English, French and sometimes German. For the time being, one...
ZIP (272 views) (170 downloads)
-
Term lists and Dictionaries from Swedish Authorities
This resource also includes a Dictionary from the ELMN that has a set of terms translated from English to all the EU languages. The list of languages that is indicated with this resource...
ZIP (478 views) (365 downloads)
-
German-English website parallel corpus from the Federal Foreign Office Berlin (Processed)
German-English texts extracted from the website of the Federal Foreign Office Berlin. This includes 53,849 pairs that were translated between October 2013 and the beginning of November...
ZIP (262 views) (161 downloads)
-
University of Vienna Termbanks
3 termbanks about Risk Management, Austrian Asylum Law and University Law/Education Administration. This dataset has been created within the framework of the European Language Resource...
XML PDF ZIP (596 views) (487 downloads)
-
Parallel texts from Swedish Work environment Authority (Processed)
Parallel texts from the Swedish Work Environment Authority, all in PDF format. The original is in Swedish; all other texts are translations. One original with translations per folder....
ZIP (615 views) (487 downloads)
-
National Bank of Belgium Terminology (Processed)
A termbase in 4 languages by the National Bank of Belgium. Includes descriptions. Transformed into TBX. This dataset has been created within the framework of the European Language...
ZIP (290 views) (180 downloads)
-
Parallel texts from Swedish Labour market agency. Part 2 (Processed)
Same as part 1, but with the README file (Processed). This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (439 views) (332 downloads)
-
Webpage_Foreign_Office_AA_de-en_2017 (Processed)
Translations from the website of the German Foreign Office, 2017. Contains 24727 TUs, DE-EN. This dataset has been created within the framework of the European Language Resource...
ZIP (257 views) (171 downloads)
-
Webpage_Foreign_Office_AA_de-en_2018 (Processed)
Translations from the website of the German Foreign Office, 2018. Contains 20554 TUs, DE-EN. This dataset has been created within the framework of the European Language Resource...
ZIP (150 views) (78 downloads)
-
2017 Activity Report Hohe Tauern National Park (Processed)
1020 translation units about the activities of the Hohe Tauern National Park (DE-AT, EN-GB), in stripped and validated TMX files. This dataset has been created within the framework of the...
ZIP (263 views) (178 downloads)