-
DGT-Translation Memory
DGT-TM is a translation memory (sentences and their manually produced translations) in 24 languages. It contains segments from the Acquis...
ZIP (45005 visits) (4502 downloads)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 visits) (122 downloads)
-
IATE
IATE (= “Inter-Active Terminology for Europe”) is the EU's inter-institutional terminology database. IATE has been used by the language services of the EU institutions and agencies since...
HTML JavaScript ZIP (6456 visits) (6082 downloads)
-
EJTN Handbook (Processed)
Handbook on judicial training (Processed). This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (373 visits) (266 downloads)
-
Monolingual corpus from Minutes of the Plenary Sessions of the Croatian Parliament (2016-2018) (Processed)
Minutes of the Plenary Sessions of the Croatian Parliament (2016-2018) were downloaded from http://edoc.sabor.hr . This dataset has been created within the framework of the European...
ZIP (169 visits) (85 downloads)
-
EUIPO - IP case law Italian-English (Processed)
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (274 visits) (175 downloads)
-
Bilingual English-Norwegian parallel corpus from Norwegian Maritime Authority website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (453 visits) (340 downloads)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page (Processed). This dataset has been created within the framework of the European Language...
ZIP (666 visits) (557 downloads)
-
National Health Fund Dataset (Processed)
The dataset is a 274K-token Polish-English parallel resource in XLIFF format, created on the basis of the "Diagnosis-Related Groups in Europe" publication of the Polish National Health Fund....
ZIP (345 visits) (231 downloads)
-
Monolingual Greek corpus in the public administration domain
Monolingual Greek corpus, containing 14261776 tokens and 840314 lexical types in the public administration domain. This dataset has been created within the framework of the European...
ZIP (395 visits) (280 downloads)
-
Letter of rights for persons arrested and or detained
Police form, 12 pages. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation...
ZIP (413 visits) (296 downloads)
-
DA-EN Danish Ministry of Higher Education and Science 2
Danish-English parallel texts from the Danish Ministry of Higher Education and Science; size: 115,000 words; topic: research policy. This dataset has been created within the framework of...
ZIP (333 visits) (209 downloads)
-
Portuguese legislation in FR
Portuguese legislation in French (the Parliament's official translations). This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (492 visits) (372 downloads)
-
The Coimisineir Teanga Bilingual Corpus of Reference Documents
General reference content from the Language Commissioner's Office. Size: 6 bilingual Word documents and 44 parallel Word documents. This dataset has been created within the framework...
ZIP (339 visits) (248 downloads)
-
The Gaois bilingual corpus of English-Irish legislation
Bilingual corpus of English-Irish legislation provided by the Department of Justice, in two parallel .txt files. Contains 98,758 parallel sentences. This dataset has been created within...
ZIP (432 visits) (317 downloads)
-
Corpus of State-related content from the Latvian Web (Processed)
Home pages of ministries, state public services, the army, etc. on the Latvian Web were crawled, and parallel Latvian-English content was collected. (Processed) This dataset has been created...
ZIP (451 visits) (346 downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic
Dataset of various English-Slovak legal texts within the agenda of the Ministry, in plain text format aligned at the sentence level; size: 105791 words. This dataset has been created within...
ZIP (357 visits) (249 downloads)
-
Convention on the transfer of sentenced persons (English - Greek) (Processed)
Convention, additional protocol on the convention, Recommendation R (84) 11 of the Council of Europe, templates on the approval/rejection of transfer requests regarding the convention on...
ZIP (498 visits) (383 downloads)
-
Romanian – English literature corpus
Bilingual Romanian - English literature corpus built from a small set of freely available literature books (drama, sci-fi, etc.). The texts are positionally aligned, i.e. the sentence on...
ZIP (411 visits) (321 downloads)
-
Translation memories from The Ministry of Foreign Affairs of Norway
Translation memories containing translations of EU legislative acts from English to Norwegian Bokmål.
XML PDF ZIP (663 visits) (540 downloads)