-
Spanish-English website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 21,007 TUs. Period of crawling : 15/11/2016 - 23/01/2017 A strict validation...
ZIP (509 views) (413 Downloads)
-
Spanish-Portuguese website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 1,249 TUs. Manual validation has been performed on a sample of the data. This dataset...
ZIP (283 views) (177 Downloads)
-
Parallel Global Voices (Greek - Spanish) (Processed)
Parallel Global Voices EL-ES is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (227 views) (120 Downloads)
-
Spanish-Italian website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 3,319 TUs. Date of crawling : 23/01/2017 A strict validation process was already...
ZIP (277 views) (178 Downloads)
-
PAeSI : Public Administration and Foreign Immigrants
The service PAeSI for the immigration has been realized within of the project PAeSI (Public Administration and Foreign Immigrants), in order to prepare an electronic access to...
ZIP (252 views) (162 Downloads)
-
Parallel texts from Swedish Labour market agency
Parallel texts, all in pdf files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (357 views) (249 Downloads)
-
Spanish-German website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 2,840 TUs. Period of crawling : 15/11/2016 - 23/01/2017. A strict validation process...
ZIP (194 views) (126 Downloads)
-
Spanish-French website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 15,797 TUs. Period of crawling : 15/11/2016 - 23/01/2017. A strict validation...
ZIP (227 views) (146 Downloads)
-
Parallel texts from Swedish Social Security Authority (Processed)
Parallel texts, email templates and forms in pdf file format. Original in Swedish, all the other texts are translations. One original with translations per folder. Language info is...
ZIP (349 views) (238 Downloads)
-
Parallel texts from Swedish Social Security Authority
Parallel texts, email templates and forms in pdf file format. Original in Swedish, all the other texts are translations. One original with translations per folder. Language info is...
ZIP (336 views) (233 Downloads)
-
Avibase (processed)
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (258 views) (152 Downloads)
-
Cyprus at a glance
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (268 views) (197 Downloads)