-
DGT Translation Memory
DGT-TM is a translation memory (sentences and their manually produced translations) in 24 languages. It contains segments from the Acquis Communautaire, the...
PDF ZIP (45005 views) (4502 downloads)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 views) (122 downloads)
-
Romanian – English parallel wordlists
English and Romanian lemmatized wordlists extracted from various resources (including RO-EN Wordnets, the Romanian – English news corpus, the Romanian – English literature corpus, and...
ZIP (885 views) (765 downloads)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page (Processed). This dataset has been created within the framework of the European Language...
ZIP (666 views) (557 downloads)
-
Letter of rights for persons arrested and or detained
Police form, 12 pages. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation...
ZIP (413 views) (296 downloads)
-
Romanian – English literature corpus
Bilingual Romanian - English literature corpus built from a small set of freely available literature books (drama, sci-fi, etc.). The texts are positionally aligned, i.e. the sentence on...
ZIP (411 views) (321 downloads)
-
Romanian – English parallel wordlists (Processed)
English and Romanian lemmatized wordlists extracted from various resources (including RO-EN Wordnets, the Romanian – English news corpus, the Romanian – English literature corpus, and...
ZIP (297 views) (198 downloads)
-
EIR Romanian-English TM (ECHR-33234/12) (Processed)
Converted ECHR translation memory EN-RO (CASE OF AL NASHIRI v. ROMANIA - Application no. 33234/12); This dataset has been created within the framework of the European Language Resource...
ZIP (276 views) (169 downloads)
-
EIR Romanian-English Newsletter (2009-March 2011) (Processed)
Translation units were extracted from a collection of 392 files (386 Word and 6 Excel files) in the domain of European affairs (the EIR’s 4 key areas: studies, training, translation...
ZIP (280 views) (180 downloads)
-
Parallel Global Voices (English - Romanian) (Processed)
Parallel Global Voices EN-RO is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (280 views) (181 downloads)
-
Monolingual Romanian corpus in the public administration domain (Processed)
Monolingual Romanian corpus, containing 360833 sentences (9064764 words) in the public administration domain. This dataset has been created within the framework of the European Language...
ZIP (295 views) (186 downloads)
-
Romanian Parliament Transcripts 1996-2018 (Processed)
The data is obtained from the cdep.ro website and contains 500k+ instances of speech from the parliament podium from 1996 to 2018. Sentence splitting and deduplication at the sentence level have...
ZIP (210 views) (127 downloads)
-
Parallel texts from Swedish Labour market agency (Processed)
Parallel texts, all in PDF files, have been gathered from Arbetsförmedlingen. The language of each document is indicated in its title. The original version is always in Swedish (with...
ZIP (334 views) (237 downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Romanian) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (239 views) (141 downloads)
-
EIR terminology (banking) (RO-EN) (Processed)
Banking terms (RO, EN). This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation...
ZIP (261 views) (173 downloads)
-
Parallel texts from Swedish Work environment Authority (Processed)
Parallel texts from the Swedish Work Environment Authority, all in PDF format. The original is in Swedish; all the other texts are translations. One original with translations per folder....
ZIP (615 views) (487 downloads)
-
Letter of rights for persons arrested and or detained (Processed)
Collection of translation units (1906 in total) in 21 language pairs extracted from 7 police forms (one form 12 pages long in each of the following languages: BG, EL, EN, FR, LV, PL, RO)....
ZIP (452 views) (338 downloads)
-
Parallel texts from Swedish Labour market agency. Part 2 (Processed)
Same as part 1, but with the Readme-file. (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (439 views) (332 downloads)
-
EIR terminology (legal) (RO-EN) (Processed)
Legal terminology (CJUE: legal glossary and entries extracted from the Treaty of Lisbon; RO, EN). This dataset has been created within the framework of the European Language...
ZIP (191 views) (121 downloads)
-
EIR Romanian-English SPOS (2011-2017) (Processed)
Translation units were extracted from 18 Word files (9 Romanian and 9 English) in the field of European Affairs - Strategy and Policy Studies (SPOS); 101 849 words (in Romanian). This...
ZIP (251 views) (150 downloads)