Resources for Language Technologies
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 views) (122 downloads)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page (Processed). This dataset has been created within the framework of the European Language...
ZIP (666 views) (557 downloads)
-
EUIPO - list of goods and services German and Spanish (Processed)
EUIPO list of goods and services. Format: TMX. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (451 views) (335 downloads)
-
EUIPO - list of goods and services German and Italian (Processed)
EUIPO list of goods and services. Format: TMX. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (441 views) (326 downloads)
-
EUIPO - list of goods and services German and English (Processed)
EUIPO list of goods and services. Format: TMX. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (427 views) (314 downloads)
-
EUIPO - list of goods and services German and French (Processed)
EUIPO list of goods and services. Format: TMX. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (374 views) (262 downloads)
-
Parallel Global Voices (English - German) (Processed)
Parallel Global Voices EN-DE is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (246 views) (137 downloads)
-
Terminology_of_international_contracts_German_(Processed)
The German terms extracted from the multilingual terminology of international contracts as provided by the German Foreign Office. Transformed into TBX. 1726 terms corresponding to 1463...
ZIP (298 views) (202 downloads)
-
Austrian Criminal Office Police Glossary
An English/Austrian German glossary of police-related terminology. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
XML PDF ZIP (618 views) (513 downloads)
-
Terminology_of_the_German_Foreign_Office_German_(Processed)
German terms extracted from the main terminology collection of the German Foreign Office. Transformed into the TBX Format. 33543 terms corresponding to 27297 termEntry IDs. Can be linked...
ZIP (271 views) (168 downloads)
-
Vienna Environmental Report 2004/2005 (Processed)
2221 translation pairs (DE-AT, EN) from a report on the environment by the City of Vienna (Stadt Wien), in a stripped and validated TMX file. This dataset has been created within the framework...
ZIP (127 views) (72 downloads)
-
Webpage_Foreign_Office_AA_de-en_2016 (Processed)
Translations from the website of the German Foreign Office, 2016. Contains 12555 TUs, DE-EN. This dataset has been created within the framework of the European Language Resource...
ZIP (179 views) (99 downloads)
-
Energy Report of the City of Vienna (Processed)
Data for 2015, year of reporting 2017, Municipal Department 20. Contains 481 TUs in EN-AT and EN, checked and stripped. This dataset has been created within the framework of the European...
ZIP (196 views) (115 downloads)
-
German Legal monolingual corpus from the contents of the https://www.gesetze-im-internet.de/ web site
German Legal monolingual corpus from the contents of the https://www.gesetze-im-internet.de/ web site. This dataset has been created within the framework of the European Language...
ZIP (190 views) (110 downloads)
-
Parallel texts from Swedish Labour market agency (Processed)
Parallel texts, all in PDF files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (334 views) (237 downloads)
-
BMVI Publications (Processed)
TMX file with 11555 TUs, bilingual German/English, publications/brochures of the Federal Ministry of Transport and Digital Infrastructure on transport issues. During processing of the...
ZIP (368 views) (261 downloads)
-
ISAP Legal Terminology (Processed)
Words relating to law from Informační systém pro aproximaci práva (ISAP) in the Czech Republic. Terms are in Czech, English, and French, and sometimes in German. For the time being, one...
ZIP (272 views) (170 downloads)
-
Webpage_Foreign_Office_AA_de-fr_2016-2018 (Processed)
Translations from the website of the German Foreign Office, 2016-2018, containing 24,931 translation units, DE-FR. This dataset has been created within the framework of the European...
ZIP (307 views) (193 downloads)
-
Term lists and Dictionaries from Swedish Authorities
This resource also includes a Dictionary from the ELMN that has a set of terms translated from English to all the EU languages. The list of languages that is indicated with this resource...
ZIP (478 views) (365 downloads)
-
German-English website parallel corpus from the Federal Foreign Office Berlin (Processed)
German-English texts extracted from the website of the Federal Foreign Office Berlin. This includes 53,849 pairs that were translated between October 2013 and the beginning of November...
ZIP (262 views) (161 downloads)