-
Romanian-English corpus with studies, reports and statistical data in the field of culture from the National Institute for Cultural Research and Training website (Processed)
Romanian-English corpus with studies, reports and statistical data in the field of culture from the National Institute for Cultural Research and Training website This dataset has been...
ZIP (362 views) (254 Downloads)
-
Parallel texts from Swedish Work environment Authority
Parallel texts from the Swedish Work Environment authority, all in pdf format. Original in Swedish, all the other texts are translations. One original with translations per folder....
ZIP (406 views) (296 Downloads)
-
Romanian Ombudsman archive (Processed)
Parallel aligned corpus in tmx format built from the Romanian Ombudsman archive. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (341 views) (240 Downloads)
-
Rural Development Programme of Romania (Processed)
Rural Development Programme of Romania available at http://madr.ro (Ministry of Agriculture and Rural Development) This dataset has been created within the framework of the European...
ZIP (157 views) (95 Downloads)
-
Romanian - English literature corpus (Processed)
Bilingual Romanian – English literature corpus built from a small set of freely available literature books (drama, sci-fi, etc.). The texts are positionally aligned, i.e. the sentence on...
ZIP (594 views) (474 Downloads)
-
Romanian - English news corpus (Processed)
Bilingual Romanian – English news corpus built from SouthEast European Times (2008 dump). The texts are positionaly aligned, i.e. the sentence on line i in the English text is aligned...
ZIP (362 views) (253 Downloads)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page, This dataset has been created within the framework of the European Language Resource...
ZIP (367 views) (249 Downloads)
-
Parallel texts from Swedish Labour market agency
Parallel texts, all in pdf files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (357 views) (249 Downloads)
-
Romanian – English New Criminal Procedure Code
The New Civil Procedure Code in Romanian and English (bilingual) comprising 364.816 words. This dataset has been created within the framework of the European Language Resource...
ZIP (418 views) (331 Downloads)
-
General Romanian-English bilingual corpus
Romanian – English corpus built from a Wikipedia dump. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (504 views) (385 Downloads)
-
Romanian – English news corpus
Bilingual Romanian - English news corpus built from SouthEast European Times (2008 dump). The texts are positionaly aligned, i.e. the sentence on line i in the English text is aligned...
ZIP (565 views) (454 Downloads)
-
General Romanian-English bilingual corpus (Processed)
Romanian – English corpus built from a Wikipedia dump. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (512 views) (403 Downloads)
-
Parallel texts from Swedish Social Security Authority (Processed)
Parallel texts, email templates and forms in pdf file format. Original in Swedish, all the other texts are translations. One original with translations per folder. Language info is...
ZIP (349 views) (238 Downloads)
-
Parallel texts from Swedish Social Security Authority
Parallel texts, email templates and forms in pdf file format. Original in Swedish, all the other texts are translations. One original with translations per folder. Language info is...
ZIP (336 views) (233 Downloads)
-
Romanian Ombudsman archive
Parallel aligned corpus in tmx format built from the Romanian Ombudsman archive. The source texts are also included. This dataset has been created within the framework of the European...
XML PDF ZIP (554 views) (456 Downloads)
-
Parallel texts from Swedish Labour market agency. Part 2
Same as part 1, but with the Readme-file. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (548 views) (435 Downloads)
-
Romanian – English New Criminal Procedure Code (Processed)
The New Civil Procedure Code in Romanian and English (bilingual) comprising 6.495 Translation Units. This dataset has been created within the framework of the European Language Resource...
ZIP (503 views) (406 Downloads)
-
Romanian Ombudsman archive
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (20 views) (8 Downloads)