-
Secretariat-General parallel corpus SL-EN and EN-SL (part 2) (Processed)
English-Slovenian parallel corpus in TMX format from the Secretariat-General of the Government of the Republic of Slovenia in the legal domain This dataset has been created within the...
ZIP (505 visningar) (393 Nedladdningar)
-
English-Swedish parallel corpus from the www.visitestonia.com web site
Parallel English-Swedish corpus compiled from the www.visitestonia.com web site by crawling the contents and aligning the parallel data. This dataset has been created within the...
ZIP (317 visningar) (201 Nedladdningar)
-
English-Slovak parallel corpus of texts from The Ministry of Justice of the Slovak Republic
Dataset of various English-Slovak legal texts within agenda of the Ministry, plain text format alligned at the sentence level, the size: 112580 words This dataset has been created within...
ZIP (384 visningar) (1 Nedladdningar)
-
Parallel corpus from Bank of Estonia (Processed)
Parallel corpus from content of Bank of Estonia website (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (330 visningar) (217 Nedladdningar)
-
Terminology in the domain of Information and Communication Technology (ICT)
Terminology in the domain of Information and Communication Technology (ICT) by Terminology Commission of the Academy of Sciences of Latvia (LAS-TC) This dataset has been created within...
ZIP (267 visningar) (181 Nedladdningar)
-
Terminology_of_international_contracts_Portuguese_(Processed)
The Portuguese terms extracted from the multilingual terminology of international contracts as provided by the German Foreign Office. Transformed into TBX. 728 terms corresponding to 671...
ZIP (241 visningar) (139 Nedladdningar)
-
English-Croatian translation memory from the Ministry of Regional Development and EU Funds (Processed)
A translation memory in tmx format with source texts from the Ministry of Regional Development and EU Funds and translations in Croatian by Ciklopea d.o.o. This dataset has been created...
ZIP (323 visningar) (211 Nedladdningar)
-
Irish Monolingual Corpus from contents of health.gov.ie web site
Irish Monolingual Corpus from contents of health.gov.ie web site This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (252 visningar) (163 Nedladdningar)
-
Thematic Vocabulary of Geography (processed)
Thematic Vocabulary of Geography This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (214 visningar) (131 Nedladdningar)
-
Citizens Information Bilingual Web-Corpus (Processed)
A web corpus crawled from http://www.citizensinformation.ie. Contains 10,297 parallel sentences of English/Irish that have undergone manual cleaning. May be reproduced and/or re-used free...
ZIP (243 visningar) (156 Nedladdningar)
-
Documents from the Ministry of Agriculture, Forestry and Food of the Republic of Slovenia (EN-SL) (Processed)
Documents from the Ministry of Agriculture, Forestry and Food of the Republic of Slovenia (https://www.program-podezelja.si) This dataset has been created within the framework of the...
ZIP (303 visningar) (193 Nedladdningar)
-
English-Swedish parallel corpus from the web site of Finnish Tax Administration
English-Swedish parallel corpus created from the contents of Finnish Tax Administration web site https://www.vero.fi/ This dataset has been created within the framework of the European...
ZIP (273 visningar) (162 Nedladdningar)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Swedish) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (317 visningar) (203 Nedladdningar)
-
EUIPO - Trade mark Guidelines (October 2017) (English-French) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (279 visningar) (175 Nedladdningar)
-
Parallel Global Voices (English - German) (Processed)
Parallel Global Voices EN-DE is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (246 visningar) (137 Nedladdningar)
-
Romanian – English parallel wordlists (Processed)
English and Romanian lemmatized wordlists extracted from various resources (including RO-EN Wordnets, the Romanian – English news corpus, the Romanian – English literature corpus, and...
ZIP (297 visningar) (198 Nedladdningar)
-
EIR Romanian-English TM (ECHR-33234/12) (Processed)
Converted ECHR translation memory EN-RO (CASE OF AL NASHIRI v. ROMANIA - Application no. 33234/12); This dataset has been created within the framework of the European Language Resource...
ZIP (276 visningar) (169 Nedladdningar)
-
EIR Romanian-English Newsletter (2009-March 2011) (Processed)
Translation units were extracted from a collection of 392 files (386 Word and 6 Excel files) in the domain of European affairs (the main 4 EIR’s key areas: studies, training, translation...
ZIP (280 visningar) (180 Nedladdningar)
-
Press Releases (01.2018-01.2019) of the PIO (Processed)
It contains 5162 translation units,extracted from the press releases (01.2018-01.2019) of the Press and Information Office (PIO), Ministry of Interior, Republic of Cyprus . This dataset...
ZIP (259 visningar) (162 Nedladdningar)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Greek) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (192 visningar) (112 Nedladdningar)