Directorate-General for Communications Networks, Content and Technology
-
English-Slovak parallel corpus of texts from The Ministry of Justice of the Slovak Republic (Processed)
Dataset of various English-Slovak legal texts within the agenda of the Ministry, in plain text format aligned at the sentence level; size: 112580 words. It was converted into a 2895-TUs...
ZIP (508 views) (405 downloads)
-
ISAP Legal Terminology (Processed)
Words relating to law from Informační systém pro aproximaci práva (ISAP) in the Czech Republic. Terms are in Czech, English, and French, and sometimes in German. For the time being, one...
ZIP (272 views) (170 downloads)
-
Parallel corpus from Parliament of Estonia (Processed)
Parallel corpus compiled from the contents of the website of the Parliament of Estonia. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (72 views) (56 downloads)
-
OROSSIMO Corpus - Economics (Processed)
A corpus of academic discourse texts belonging to the Economics domain (according to the Dewey Decimal Classification, DDC33 - Economics), annotated at the structural level conformant to the...
ZIP (272 views) (173 downloads)
-
Parallel Corpus from the Web Site of the MFA of Latvia (Processed)
The corpus has been built from the news and press releases published on the website of the Ministry of Foreign Affairs of the Republic of Latvia. (Processed) This dataset has been...
ZIP (401 views) (290 downloads)
-
Parallel Global Voices (English - French) (Processed)
Parallel Global Voices EN-FR is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (213 views) (130 downloads)
-
English-Estonian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (534 views) (420 downloads)
-
Secretariat-General parallel corpus SL-EN and EN-SL (part 1) (Processed)
English-Slovenian parallel corpus in TMX format from the Secretariat-General of the Government of the Republic of Slovenia in the legal domain. This dataset has been created within the...
ZIP (327 views) (219 downloads)
-
Orossimo Terminological Resource - Computer Science
A bilingual terminological glossary extracted from academic discourse texts belonging to the Computer Science domain. This dataset has been created within the framework of the European...
XML PDF ZIP (539 views) (427 downloads)
-
English-Bulgarian Computer Terms
The resource is a bilingual terminological database representing 729 terms in English and their translations into Bulgarian. The terms belong to the Computer domain. The terminological...
XML PDF ZIP (581 views) (478 downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Portuguese) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (266 views) (168 downloads)
-
Maltese-English website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 26,622 TUs. Date of crawling: 16/12/2016. A strict validation process has been...
ZIP (274 views) (179 downloads)
-
Polish-English parallel corpus from the website of the National Digital Archives (Processed)
Polish-English parallel corpus from the website of the National Digital Archives (https://www.nac.gov.pl). This dataset has been created within the framework of the European Language...
ZIP (412 views) (313 downloads)
-
Croatian-English parallel corpus from the website of the Embassy of Finland, Zagreb (Processed)
Croatian-English parallel corpus from the website of the Embassy of Finland, Zagreb (http://www.finland.hr). This dataset has been created within the framework of the European Language...
ZIP (363 views) (272 downloads)
-
Webpage_Foreign_Office_AA_de-fr_2016-2018 (Processed)
Translations from the website of the German Foreign Office, 2016-2018, containing 24,931 translation units (DE-FR). This dataset has been created within the framework of the European...
ZIP (307 views) (193 downloads)
-
Hellenic Ministry of Foreign Affairs Greek-English announcements corpus (Processed)
The Hellenic Ministry of Foreign Affairs Greek-English announcements corpus contains announcements from the Hellenic Ministry of Foreign Affairs. This dataset has been created within the...
ZIP (578 views) (485 downloads)
-
Croatian-English corpus with studies on the challenges to the Croatian Accession to the European Union from the Croatian Institute of Public Finance website (Processed)
Croatian-English corpus with studies on the challenges to the Croatian Accession to the European Union from the Croatian Institute of Public Finance website. This dataset has been...
ZIP (336 views) (241 downloads)
-
Parallel corpus (en-pl) from the Export Promotion Portal of Poland (Processed)
A parallel corpus constructed from data acquired from the *.trade.gov.pl websites. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (405 views) (289 downloads)
-
The Coimisineir Teanga Bilingual Corpus of Reference Documents (Processed)
General reference content from the Language Commissioner's Office. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (334 views) (209 downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Slovak) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (267 views) (164 downloads)