-
Orossimo Terminological Resource - Economics
A bilingual terminological glossary extracted from academic discourse texts belonging to the Economics domain. This dataset has been created within the framework of the European Language...
XML PDF ZIP (640 visualizzazioni) (528 Download)
-
Bilingual resource with Bulgarian strategic documents in the field of telecommunications and broadband (Bulgarian - English) (Processed)
Bilingual collection of documents in the field of telecommunications and broadband, size on disk 440 kB, Bulgarian-English (Processed) This dataset has been created within the framework...
ZIP (387 visualizzazioni) (283 Download)
-
Public Procurement Dataset 1 (Processed)
A collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in...
ZIP (410 visualizzazioni) (304 Download)
-
Polish Food Dataset 2 (Processed)
A collection of Polish-English translations of the Polish Food quarterly published by the Polish Ministry of Agriculture, including issues 69-73 (103K words in 3025 segments). The...
ZIP (392 visualizzazioni) (301 Download)
-
Bilingual English-Swedish parallel corpus from the official Nordic cooperation website
Contents of the Nordic Co-operation web site http://www.norden.org downloaded and converted into a parallel corpus This dataset has been created within the framework of the European...
ZIP (344 visualizzazioni) (250 Download)
-
Romanian - English news corpus (Processed)
Bilingual Romanian – English news corpus built from SouthEast European Times (2008 dump). The texts are positionaly aligned, i.e. the sentence on line i in the English text is aligned...
ZIP (362 visualizzazioni) (253 Download)
-
PKN Orlen Dataset (Processed)
Dataset of the Polish public sector company PKN Orlen, a major Polish oil refiner and petrol retailer. The dataset comprises 4 Polish-English files in XLIFF format, 100K word tokens in...
ZIP (362 visualizzazioni) (254 Download)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page, This dataset has been created within the framework of the European Language Resource...
ZIP (367 visualizzazioni) (249 Download)
-
Secretariat-General parallel corpus SL-EN and EN-SL (part 1)
English-Slovenian parallel corpus in TMX format from the Secretariat-General of the Government of the Republic of Slovenia in the legal domain This dataset has been created within the...
XML PDF ZIP (524 visualizzazioni) (430 Download)
-
Translations of Lithuanian legislation from Seimas of the Republic of Lithuania
Translation Memories of Lithuanian legislation from Seimas of the Republic of Lithuania This dataset has been created within the framework of the European Language Resource Coordination...
XML PDF ZIP (470 visualizzazioni) (359 Download)
-
Public Procurement Dataset 2
A collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in...
XML PDF ZIP (336 visualizzazioni) (276 Download)
-
Austrian Armed Forces Military Dictionaries
A collection of military dictionaries in the following language pairs: Austrian German-Hungarian Austrian German-Italian Austrian German-English Austrian German-French This...
XML PDF ZIP (650 visualizzazioni) (543 Download)
-
Parallel corpus (Polish - English) from the website of the Polish Investment and Trade Agency (Processed)
Parallel (pl-en) corpus of 14736 translation units in the "BUSINESS AND COMPETITION" and "ECONOMICS" domains. This dataset has been created within the framework of the European Language...
ZIP (554 visualizzazioni) (425 Download)
-
The UCD Bórd na Gaeilge Corpus of bilingual PDFs and Word documents
Parallel data provided by the language office at UCD (University College Dublin) Size: 3 Word documents, 67 PDFs This dataset has been created within the framework of the European...
ZIP (294 visualizzazioni) (196 Download)
-
Parallel corpus from Parliament of Estonia
Parallel corpus compiled from contents of website of Parliament of Estonia This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (407 visualizzazioni) (307 Download)
-
The Coimisineir Teanga Bilingual Web Corpus
Web content from the Language Commissioner's Office. Two TXT files containing 6808 words of parallel data This dataset has been created within the framework of the European Language...
ZIP (268 visualizzazioni) (180 Download)
-
Czech Banking Association Terminology
Terms in Czech - English relating to finance This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
XML PDF ZIP (578 visualizzazioni) (488 Download)
-
Corpus on Finance and Economics from Bank of Latvia (Processed)
Contents of web site https://makroekonomika.lv/ -- Latvian and https://www.macroeconomics.lv/ -- English aligned as a parallel corpus This dataset has been created within the...
ZIP (285 visualizzazioni) (184 Download)
-
Polish Ministry of Foreign Affairs Regional Dataset
A collection of Polish-English whitepapers published by the Polish Ministry of Foreign Affairs, including "Eastern Partnership" (10K words in 492 segments) and "Poland's 10 years in the...
XML PDF ZIP (547 visualizzazioni) (437 Download)
-
Documents concerning Federal Constitutional Law in Austria
Alignment documents concerning Austrian Federal Constitutional Law This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (458 visualizzazioni) (360 Download)