-
The Icelandic Terminology bank (Processed)
The Icelandic Terminology bank republished from http://www.malfong.is/index.php?pg=idord&lang=en This dataset has been created within the framework of the European Language Resource...
ZIP (228 visualizzazioni) (135 Download)
-
SIP Publications (Processed)
Publications from the Luxembourgish government edited by Service information et presse - 11538 Translation Units This dataset has been created within the framework of the European...
ZIP (577 visualizzazioni) (466 Download)
-
DA-EN Danish Ministry of Higher Education and Science (Processed)
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size: 120,000 words, topic: innovation, science (Processed) This dataset has been created within...
ZIP (344 visualizzazioni) (222 Download)
-
Bilingual English-Icelandic parallel corpus from Icelandic Post and Telecom Administration website
Contents of https://www.pfs.is/ website downloaded, aligned and converted into parallel corpus This dataset has been created within the framework of the European Language Resource...
ZIP (446 visualizzazioni) (344 Download)
-
Portuguese-French bilingual corpus from Portuguese law on referendum
Law on the referendum in Portugal; bilingual tmx file in PT-FR This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe...
XML PDF ZIP (409 visualizzazioni) (305 Download)
-
Malta Government Gazette
Bilingual gazette (English-Maltese) of the government of Malta. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe...
ZIP (686 visualizzazioni) (512 Download)
-
OROSSIMO Corpus - Photography - film & video (Processed)
A corpus of academic discourse texts belonging to the Photography, film & video domain (according to the Dewey Decimal classification, DDC77 -Photography, computer art, film &...
ZIP (250 visualizzazioni) (156 Download)
-
Parallel Global Voices (Greek - Spanish) (Processed)
Parallel Global Voices EL-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (227 visualizzazioni) (120 Download)
-
Priorities and schedule of the Dutch Presidency of the EU (in Bulgarian) (Processed)
Priorities of the Dutch Presidency of the EU in Justice and Home affairs This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (223 visualizzazioni) (122 Download)
-
Collection of Greek National Spatial Plans (Processed)
national spatial plans (general, aquaculture, tourism, industry, RES, detention facilities) This dataset has been created within the framework of the European Language Resource...
ZIP (176 visualizzazioni) (101 Download)
-
Czech Association of Medical Physicists - Physics Glossary (Processed)
A dictionary of 3281 terms relating to physics for medicine in Czech - English This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (377 visualizzazioni) (284 Download)
-
Bilingual English-Lithuanian parallel corpus from Seimas of the Republic of Lithuania website
Contents of http://www.lrs.lt were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (330 visualizzazioni) (222 Download)
-
Parallel English-Icelandic corpus from the contents of Icelandic National Debt Management Agency website
Contents of http://www.lanamal.is website downloaded, aligned and converted into a parallel corpus This dataset has been created within the framework of the European Language Resource...
ZIP (260 visualizzazioni) (155 Download)
-
Orossimo Terminological Resource - Law
A bilingual terminological glossary extracted from academic discourse texts belonging to the Law domain. This dataset has been created within the framework of the European Language...
XML PDF ZIP (504 visualizzazioni) (394 Download)
-
Bilingual Croatian-English Parallel Corpus (Processed)
Bilingual Croatian-English Parallel Corpus of 21340 translation units in the public administration domain. This dataset has been created within the framework of the European Language...
ZIP (428 visualizzazioni) (330 Download)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic (Processed)
Dataset of various English-Slovak legal texts within agenda of the Ministry, plain text format alligned at the sentence level, the size: 105791 words It is converted into a 2609-TUs...
ZIP (512 visualizzazioni) (400 Download)
-
Polish Food Dataset (Processed)
Polish Food is a quarterly issued by the Polish Ministry of Agriculture and Rural Development and The Agency for Restructuring and Modernisation of Agriculture. The dataset comprises a...
ZIP (479 visualizzazioni) (367 Download)
-
OROSSIMO Corpus - Law (Processed)
A corpus of academic discourse texts belonging to the Law domain (according to the Dewey Decimal classification, DDC34 - Law). This dataset has been created within the framework of the...
ZIP (265 visualizzazioni) (163 Download)
-
Documents for Translation Tendering Batch 2 (Processed)
This collection contains documents in Dutch used in connection with tendering of translation work. The documents come in a variety of formats, among the Word, PDF and Excel. origin: Dutch...
ZIP (273 visualizzazioni) (182 Download)
-
OROSSIMO Corpus - Computer Science (Processed)
A corpus of academic discourse texts belonging to the Computer Science domain (according to the Dewey Decimal classification, DDC00 - Computer science, knowledge & systems). This...
ZIP (288 visualizzazioni) (183 Download)