-
Monolingual corpus from Minutes of the Plenary Sessions of the Croatian Parliament (2016-2018) (Processed)
Minutes of the Plenary Sessions of the Croatian Parliament (2016-2018) were downloaded from http://edoc.sabor.hr. This dataset has been created within the framework of the European...
ZIP (169 views) (85 downloads)
-
EUIPO - IP case law Italian-English (Processed)
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (274 views) (175 downloads)
-
Monolingual Greek corpus in the public administration domain
Monolingual Greek corpus containing 14,261,776 tokens and 840,314 lexical types in the public administration domain. This dataset has been created within the framework of the European...
ZIP (395 views) (280 downloads)
-
Portuguese legislation in FR
Portuguese legislation in French (the Parliament's official translations). This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (492 views) (372 downloads)
-
The Coimisineir Teanga Bilingual Corpus of Reference Documents
General reference content from the Language Commissioner's Office. Size: 6 bilingual Word documents and 44 parallel Word documents. This dataset has been created within the framework...
ZIP (339 views) (248 downloads)
-
The Gaois bilingual corpus of English-Irish legislation
Bilingual corpus of English-Irish legislation provided by the Department of Justice, in two parallel .txt files. Contains 98,758 parallel sentences. This dataset has been created within...
ZIP (432 views) (317 downloads)
-
Corpus of State-related content from the Latvian Web (Processed)
Home pages of ministries, state public services, the army, etc. on the Latvian Web were crawled, and parallel Latvian-English content was collected. (Processed) This dataset has been created...
ZIP (451 views) (346 downloads)
-
Translation memories from The Ministry of Foreign Affairs of Norway
Translation memories containing translations of EU legislative acts from English to Norwegian Bokmål.
XML PDF ZIP (663 views) (540 downloads)
-
Corpus RIZIV
Corpus of Dutch and French texts from the National Institute for Illness and Invalidity Insurance (RIZIV).
ZIP (625 views) (545 downloads)
-
English-Swedish parallel corpus from the www.visitestonia.com web site
Parallel English-Swedish corpus compiled from the www.visitestonia.com web site by crawling the contents and aligning the parallel data. This dataset has been created within the...
ZIP (317 views) (201 downloads)
-
Terminology in the domain of Information and Communication Technology (ICT)
Terminology in the domain of Information and Communication Technology (ICT) by the Terminology Commission of the Academy of Sciences of Latvia (LAS-TC). This dataset has been created within...
ZIP (267 views) (181 downloads)
-
Terminology_of_international_contracts_Portuguese_(Processed)
The Portuguese terms extracted from the multilingual terminology of international contracts as provided by the German Foreign Office. Transformed into TBX. 728 terms corresponding to 671...
ZIP (241 views) (139 downloads)
-
English-Croatian translation memory from the Ministry of Regional Development and EU Funds (Processed)
A translation memory in TMX format with source texts from the Ministry of Regional Development and EU Funds and translations into Croatian by Ciklopea d.o.o. This dataset has been created...
ZIP (323 views) (211 downloads)
-
Irish Monolingual Corpus from contents of health.gov.ie web site
Irish monolingual corpus from the contents of the health.gov.ie web site. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (252 views) (163 downloads)
-
Thematic Vocabulary of Geography (processed)
Thematic Vocabulary of Geography. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (214 views) (131 downloads)
-
Citizens Information Bilingual Web-Corpus (Processed)
A web corpus crawled from http://www.citizensinformation.ie. Contains 10,297 parallel sentences of English/Irish that have undergone manual cleaning. May be reproduced and/or re-used free...
ZIP (243 views) (156 downloads)
-
Documents from the Ministry of Agriculture, Forestry and Food of the Republic of Slovenia (EN-SL) (Processed)
Documents from the Ministry of Agriculture, Forestry and Food of the Republic of Slovenia (https://www.program-podezelja.si). This dataset has been created within the framework of the...
ZIP (303 views) (193 downloads)
-
English-Swedish parallel corpus from the web site of Finnish Tax Administration
English-Swedish parallel corpus created from the contents of the Finnish Tax Administration web site (https://www.vero.fi/). This dataset has been created within the framework of the European...
ZIP (273 views) (162 downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Swedish) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (317 views) (203 downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-French) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (279 views) (175 downloads)