Resources for Language Technologies
-
Cuimhne Aistriúcháin Ard-Stiúrthóireacht an Aistriúcháin (DGT-TM)
Cuimhne aistriúcháin is ea DGT-TM (abairtí agus na haistriúcháin a cuireadh orthu) atá ar fáil i 24 theanga. Sa chuimhne seo tá píosaí ón Acquis Communautaire, corpas reachtaíochta an...
PDF ZIP (45005 amharc) (4502 Íoslódálacha)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 amharc) (122 Íoslódálacha)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page, (Processed) This dataset has been created within the framework of the European Language...
ZIP (666 amharc) (557 Íoslódálacha)
-
Monolingual Greek corpus in the public administration domain
Monolingual Greek corpus, containing 14261776 tokens and 840314 lexical types in the public administration domain. This dataset has been created within the framework of the European...
ZIP (395 amharc) (280 Íoslódálacha)
-
Letter of rights for persons arrested and or detained
Police form, 12 pages. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation...
ZIP (413 amharc) (296 Íoslódálacha)
-
Convention on the transfer of sentenced persons (English - Greek) (Processed)
Convention, additional protocol on the convention, recomendation R (84) 11 of the Council of Europe, templates on the approval/rejection of transfer requests regarding the convention on...
ZIP (498 amharc) (383 Íoslódálacha)
-
Bilingual collection of documents about the Cyprus Problem (Processed)
A parallel corpus(Greek-English) regarding the Cyprus Problem. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe...
ZIP (391 amharc) (275 Íoslódálacha)
-
Press Releases (01.2018-01.2019) of the PIO (Processed)
It contains 5162 translation units,extracted from the press releases (01.2018-01.2019) of the Press and Information Office (PIO), Ministry of Interior, Republic of Cyprus . This dataset...
ZIP (259 amharc) (162 Íoslódálacha)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Greek) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (192 amharc) (112 Íoslódálacha)
-
Macroeconomic Developments
Bulletins of Macroeconomic Developments This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (371 amharc) (261 Íoslódálacha)
-
The Constitution of Greece (English-Greek) (Processed)
The Constitution of Greece in EL and EN This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (256 amharc) (150 Íoslódálacha)
-
Monolingual corpus from Minutes of the Plenary Sessions of the Hellenic Parliament (2018) (Processed)
Minutes of the Plenary Sessions of the Hellenic Parliament (2018) were downloaded from https://www.hellenicparliament.gr . This dataset has been created within the framework of the...
ZIP (273 amharc) (166 Íoslódálacha)
-
Greek-English parallel corpus from EQF Referencing Report (Processed)
Greek-English parallel corpus from EQF Referencing Report This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe...
ZIP (195 amharc) (110 Íoslódálacha)
-
Parallel Global Voices (Greek - Spanish)
Parallel Global Voices EL-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
(661 amharc) (557 Íoslódálacha)
-
OROSSIMO Corpus - Economics (Processed)
A corpus of academic discourse texts belonging to the Economics domain (according to the Dewey Decimal classification, DDC33 - Economics), annotated at structural level conformant to the...
ZIP (272 amharc) (173 Íoslódálacha)
-
Orossimo Terminological Resource - Computer Science
A bilingual terminological glossary extracted from academic discourse texts belonging to the Computer Science domain. This dataset has been created within the framework of the European...
XML PDF ZIP (539 amharc) (427 Íoslódálacha)
-
Hellenic Ministry of Foreign Affairs Greek-English announcements corpus (Processed)
The Hellenic Ministry of Foreign Affairs Greek-English announcements corpus contains announcements from the Hellenic Ministry of Foreign Affairs. This dataset has been created within the...
ZIP (578 amharc) (485 Íoslódálacha)
-
Term lists and Dictionaries from Swedish Authorities
This resource also includes a Dictionary from the ELMN that has a set of terms translated from English to all the EU languages. The list of languages that is indicated with this resource...
ZIP (478 amharc) (365 Íoslódálacha)
-
Parallel Global Voices (Greek - French) (Processed)
Parallel Global Voices EL-FR is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (222 amharc) (138 Íoslódálacha)
-
Parallel corpus (Greek - English) in the law domain (Processed) (Part1)
Parallel (el-en) corpus of 1979 translation units in the law domain. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (333 amharc) (223 Íoslódálacha)