Resources for Language Technologies
-
Thematic Vocabulary of Geography (processed)
Thematic Vocabulary of Geography This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (214 views) (131 Downloads)
-
English-Spanish website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 909 TUs. Manual validation has been performed on a sample of the data. This dataset...
ZIP (291 views) (184 Downloads)
-
Electronics and opto-electronics Vocabulary (processed)
Vocabulary used for indexing bibliographical records dealing with “Electronics” in the PASCAL database, until 2014. This resource contains 4454 entries classified under 19 collections....
ZIP (205 views) (118 Downloads)
-
Spanish monolingual corpus from contents of Spanish State Official Gazette
Spanish monolingual corpus from contents of Spanish State Official Gazette as collected from https://boe.es/ web site This dataset has been created within the framework of the European...
ZIP (229 views) (143 Downloads)
-
PAeSI : Public Administration and Foreign Immigrants (Processed)
The service PAeSI for the immigration has been realized within of the project PAeSI (Public Administration and Foreign Immigrants), in order to prepare an electronic access to...
ZIP (278 views) (170 Downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Spanish) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (220 views) (119 Downloads)
-
Parallel Global Voices (English - Spanish) (Processed)
Parallel Global Voices EN-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (227 views) (142 Downloads)
-
CHARTER OF VALUES OF CITIZENSHIP AND INTEGRATION (Processed)
The integration agreement form prepared for signing the pact between foreign and state, in addition to providing the alien's commitments, indicates, the statement by the person concerned,...
ZIP (291 views) (184 Downloads)
-
Spanish-Italian website parallel corpus
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 3,319 TUs. Date of crawling : 23/01/2017 A strict validation process has been...
ZIP (586 views) (483 Downloads)
-
INFORMATION FOR VICTIMS OF A CRIME (Processed)
INFORMATION FOR VICTIMS OF A CRIME (Processed) Provided by Flavia Vecchione. Procura della Repubblica website Corpora Multilingual This dataset has been created within the...
ZIP (255 views) (159 Downloads)
-
Spanish-English website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 21,007 TUs. Period of crawling : 15/11/2016 - 23/01/2017 A strict validation...
ZIP (509 views) (413 Downloads)
-
Spanish-Portuguese website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 1,249 TUs. Manual validation has been performed on a sample of the data. This dataset...
ZIP (283 views) (177 Downloads)
-
Parallel Global Voices (Greek - Spanish) (Processed)
Parallel Global Voices EL-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (227 views) (120 Downloads)
-
Spanish-German website parallel corpus
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 2,840 TUs. Period of crawling : 15/11/2016 - 23/01/2017. A strict validation process...
ZIP (377 views) (315 Downloads)
-
Spanish-Italian website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 3,319 TUs. Date of crawling : 23/01/2017 A strict validation process was already...
ZIP (277 views) (178 Downloads)
-
PAeSI : Public Administration and Foreign Immigrants
The service PAeSI for the immigration has been realized within of the project PAeSI (Public Administration and Foreign Immigrants), in order to prepare an electronic access to...
ZIP (252 views) (162 Downloads)
-
Spanish-English website parallel corpus
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 21,007 TUs. Period of crawling : 15/11/2016 - 23/01/2017 A strict validation...
ZIP (3291 views) (126 Downloads)
-
Spanish-German website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 2,840 TUs. Period of crawling : 15/11/2016 - 23/01/2017. A strict validation process...
ZIP (194 views) (126 Downloads)
-
Spanish-French website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 15,797 TUs. Period of crawling : 15/11/2016 - 23/01/2017. A strict validation...
ZIP (227 views) (146 Downloads)
-
Spanish-French website parallel corpus
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 15,797 TUs. Period of crawling : 15/11/2016 - 23/01/2017. A strict validation...
ZIP (643 views) (535 Downloads)