-
Translation memories from Forbruker Europa
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (496 kuvad) (392 Allalaadimised)
-
Bilingual Bulgarian-English corpus from the National Revenue Agency (BG) (Processed)
Bilingual Bulgarian-English corpus of administrative documents on the Refund of Value Added Tax from the Bulgarian National Revenue Agency. This dataset has been created within the...
ZIP (595 kuvad) (494 Allalaadimised)
-
Secretariat-General parallel corpus SL-EN and EN-SL (part 2) (Processed)
English-Slovenian parallel corpus in TMX format from the Secretariat-General of the Government of the Republic of Slovenia in the legal domain This dataset has been created within the...
ZIP (505 kuvad) (393 Allalaadimised)
-
English-Swedish parallel corpus from the www.visitestonia.com web site
Parallel English-Swedish corpus compiled from the www.visitestonia.com web site by crawling the contents and aligning the parallel data. This dataset has been created within the...
ZIP (317 kuvad) (201 Allalaadimised)
-
English-Slovak parallel corpus of texts from The Ministry of Justice of the Slovak Republic
Dataset of various English-Slovak legal texts within agenda of the Ministry, plain text format alligned at the sentence level, the size: 112580 words This dataset has been created within...
ZIP (384 kuvad) (1 Allalaadimised)
-
Parallel corpus from Bank of Estonia (Processed)
Parallel corpus from content of Bank of Estonia website (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (330 kuvad) (217 Allalaadimised)
-
Terminology in the domain of Information and Communication Technology (ICT)
Terminology in the domain of Information and Communication Technology (ICT) by Terminology Commission of the Academy of Sciences of Latvia (LAS-TC) This dataset has been created within...
ZIP (267 kuvad) (181 Allalaadimised)
-
Terminology_of_international_contracts_Portuguese_(Processed)
The Portuguese terms extracted from the multilingual terminology of international contracts as provided by the German Foreign Office. Transformed into TBX. 728 terms corresponding to 671...
ZIP (241 kuvad) (139 Allalaadimised)
-
English-Croatian translation memory from the Ministry of Regional Development and EU Funds (Processed)
A translation memory in tmx format with source texts from the Ministry of Regional Development and EU Funds and translations in Croatian by Ciklopea d.o.o. This dataset has been created...
ZIP (323 kuvad) (211 Allalaadimised)
-
Irish Monolingual Corpus from contents of health.gov.ie web site
Irish Monolingual Corpus from contents of health.gov.ie web site This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (252 kuvad) (163 Allalaadimised)
-
Thematic Vocabulary of Geography (processed)
Thematic Vocabulary of Geography This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (214 kuvad) (131 Allalaadimised)
-
Citizens Information Bilingual Web-Corpus (Processed)
A web corpus crawled from http://www.citizensinformation.ie. Contains 10,297 parallel sentences of English/Irish that have undergone manual cleaning. May be reproduced and/or re-used free...
ZIP (243 kuvad) (156 Allalaadimised)
-
Documents from the Ministry of Agriculture, Forestry and Food of the Republic of Slovenia (EN-SL) (Processed)
Documents from the Ministry of Agriculture, Forestry and Food of the Republic of Slovenia (https://www.program-podezelja.si) This dataset has been created within the framework of the...
ZIP (303 kuvad) (193 Allalaadimised)
-
English-Swedish parallel corpus from the web site of Finnish Tax Administration
English-Swedish parallel corpus created from the contents of Finnish Tax Administration web site https://www.vero.fi/ This dataset has been created within the framework of the European...
ZIP (273 kuvad) (162 Allalaadimised)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Swedish) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (317 kuvad) (203 Allalaadimised)
-
EUIPO - Trade mark Guidelines (October 2017) (English-French) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (279 kuvad) (175 Allalaadimised)
-
Parallel Global Voices (English - German) (Processed)
Parallel Global Voices EN-DE is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (246 kuvad) (137 Allalaadimised)
-
Romanian – English parallel wordlists (Processed)
English and Romanian lemmatized wordlists extracted from various resources (including RO-EN Wordnets, the Romanian – English news corpus, the Romanian – English literature corpus, and...
ZIP (297 kuvad) (198 Allalaadimised)
-
EIR Romanian-English TM (ECHR-33234/12) (Processed)
Converted ECHR translation memory EN-RO (CASE OF AL NASHIRI v. ROMANIA - Application no. 33234/12); This dataset has been created within the framework of the European Language Resource...
ZIP (276 kuvad) (169 Allalaadimised)
-
EIR Romanian-English Newsletter (2009-March 2011) (Processed)
Translation units were extracted from a collection of 392 files (386 Word and 6 Excel files) in the domain of European affairs (the main 4 EIR’s key areas: studies, training, translation...
ZIP (280 kuvad) (180 Allalaadimised)