Resources for Language Technologies
-
Cuimhne Aistriúcháin Ard-Stiúrthóireacht an Aistriúcháin (DGT-TM)
Cuimhne aistriúcháin is ea DGT-TM (abairtí agus na haistriúcháin a cuireadh orthu) atá ar fáil i 24 theanga. Sa chuimhne seo tá píosaí ón Acquis Communautaire, corpas reachtaíochta an...
PDF ZIP (45005 amharc) (4502 Íoslódálacha)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 amharc) (122 Íoslódálacha)
-
EUIPO - list of goods and services Spanish and English (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (423 amharc) (313 Íoslódálacha)
-
EUIPO - list of goods and services German and Spanish (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (451 amharc) (335 Íoslódálacha)
-
EUIPO - list of goods and services Spanish and French (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (439 amharc) (334 Íoslódálacha)
-
EUIPO - list of goods and services Italian and Spanish (Processed)
EUIPO list of goods and services format: TMX This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (416 amharc) (317 Íoslódálacha)
-
Thematic Vocabulary of Geography (processed)
Thematic Vocabulary of Geography This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (214 amharc) (131 Íoslódálacha)
-
English-Spanish website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 909 TUs. Manual validation has been performed on a sample of the data. This dataset...
ZIP (291 amharc) (184 Íoslódálacha)
-
Electronics and opto-electronics Vocabulary (processed)
Vocabulary used for indexing bibliographical records dealing with “Electronics” in the PASCAL database, until 2014. This resource contains 4454 entries classified under 19 collections....
ZIP (205 amharc) (118 Íoslódálacha)
-
Spanish monolingual corpus from contents of Spanish State Official Gazette
Spanish monolingual corpus from contents of Spanish State Official Gazette as collected from https://boe.es/ web site This dataset has been created within the framework of the European...
ZIP (229 amharc) (143 Íoslódálacha)
-
PAeSI : Public Administration and Foreign Immigrants (Processed)
The service PAeSI for the immigration has been realized within of the project PAeSI (Public Administration and Foreign Immigrants), in order to prepare an electronic access to...
ZIP (278 amharc) (170 Íoslódálacha)
-
Parallel texts from Swedish Labour market agency (Processed)
Parallel texts, all in pdf files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (334 amharc) (237 Íoslódálacha)
-
Parallel Global Voices (Greek - Spanish)
Parallel Global Voices EL-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
(661 amharc) (557 Íoslódálacha)
-
Term lists and Dictionaries from Swedish Authorities
This resource also includes a Dictionary from the ELMN that has a set of terms translated from English to all the EU languages. The list of languages that is indicated with this resource...
ZIP (478 amharc) (365 Íoslódálacha)
-
Parallel texts from Swedish Work environment Authority (Processed)
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (615 amharc) (487 Íoslódálacha)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Spanish) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (220 amharc) (119 Íoslódálacha)
-
Parallel Global Voices (English - Spanish) (Processed)
Parallel Global Voices EN-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (227 amharc) (142 Íoslódálacha)
-
Parallel texts from Swedish Labour market agency. Part 2 (Processed)
Same as part 1, but with the Readme-file. (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (439 amharc) (332 Íoslódálacha)
-
CHARTER OF VALUES OF CITIZENSHIP AND INTEGRATION (Processed)
The integration agreement form prepared for signing the pact between foreign and state, in addition to providing the alien's commitments, indicates, the statement by the person concerned,...
ZIP (291 amharc) (184 Íoslódálacha)
-
Parallel texts from Swedish Work environment Authority
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (406 amharc) (296 Íoslódálacha)