Resources for Language Technologies
-
Cuimhne Aistriúcháin Ard-Stiúrthóireacht an Aistriúcháin (DGT-TM)
Cuimhne aistriúcháin is ea DGT-TM (abairtí agus na haistriúcháin a cuireadh orthu) atá ar fáil i 24 theanga. Sa chuimhne seo tá píosaí ón Acquis Communautaire, corpas reachtaíochta an...
PDF ZIP (45005 amharc) (4502 Íoslódálacha)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 amharc) (122 Íoslódálacha)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page, (Processed) This dataset has been created within the framework of the European Language...
ZIP (666 amharc) (557 Íoslódálacha)
-
Letter of rights for persons arrested and or detained
Police form, 12 pages. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation...
ZIP (413 amharc) (296 Íoslódálacha)
-
Portuguese legislation in FR
Portuguese legislation in French (the Parliament's official translations) This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (492 amharc) (372 Íoslódálacha)
-
Corpus RIZIV
Corpus with Dutch and French of the national institute for illness and invalidity insurance
ZIP (625 amharc) (545 Íoslódálacha)
-
EUIPO - list of goods and services French and English (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (509 amharc) (396 Íoslódálacha)
-
EUIPO - list of goods and services Spanish and French (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (439 amharc) (334 Íoslódálacha)
-
EUIPO - list of goods and services Italian and French (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (408 amharc) (315 Íoslódálacha)
-
EUIPO - list of goods and services German and French (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (374 amharc) (262 Íoslódálacha)
-
Thematic Vocabulary of Geography (processed)
Thematic Vocabulary of Geography This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (214 amharc) (131 Íoslódálacha)
-
EUIPO - Trade mark Guidelines (October 2017) (English-French) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (279 amharc) (175 Íoslódálacha)
-
Electronics and opto-electronics Vocabulary (processed)
Vocabulary used for indexing bibliographical records dealing with “Electronics” in the PASCAL database, until 2014. This resource contains 4454 entries classified under 19 collections....
ZIP (205 amharc) (118 Íoslódálacha)
-
French Monolingual legal corpus from Official Journal of France
French Monolingual legal corpus from Official Journal of France as collected from https://www.legifrance.gouv.fr/ web site This dataset has been created within the framework of the...
ZIP (288 amharc) (194 Íoslódálacha)
-
PAeSI : Public Administration and Foreign Immigrants (Processed)
The service PAeSI for the immigration has been realized within of the project PAeSI (Public Administration and Foreign Immigrants), in order to prepare an electronic access to...
ZIP (278 amharc) (170 Íoslódálacha)
-
Parallel texts from Swedish Labour market agency (Processed)
Parallel texts, all in pdf files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (334 amharc) (237 Íoslódálacha)
-
Portuguese legislation in English and French (Processed)
Portuguese legislation in English and French (the Parliament's official translations) (Processed) This dataset has been created within the framework of the European Language Resource...
ZIP (291 amharc) (185 Íoslódálacha)
-
ISAP Legal Terminology (Processed).
Words relating to law from Informační systém pro aproximaci práva (ISAP) in the Czech Republic. Terms are in Czech and English, French and sometimes in German. For the time being, one...
ZIP (272 amharc) (170 Íoslódálacha)
-
Parallel Global Voices (English - French) (Processed)
Parallel Global Voices EN-FR is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (213 amharc) (130 Íoslódálacha)
-
Webpage_Foreign_Office_AA_de-fr_2016-2018 (Processed)
Translations from the Website of the German Foreign Office, 2016-2018, containing 24.931 Translation Units DE-FR This dataset has been created within the framework of the European...
ZIP (307 amharc) (193 Íoslódálacha)