-
Polish-English parallel corpus from the website of the Ministry of Science and Higher Education (Processed)
Polish-English parallel corpus from the website of the Ministry of Science and Higher Education (http://www.eng.nauka.gov.pl/en/). This dataset has been created within the framework of...
ZIP (358 views) (255 downloads)
-
Bilingual English-Danish parallel corpus from Danmarks Statistik website
Contents of https://www.dst.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (289 views) (181 downloads)
-
Bilingual English-Icelandic parallel corpus from Nordisk eTax website
Contents of https://www.nordisketax.net/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...
ZIP (273 views) (175 downloads)
-
Unofficial Consolidated legislative texts (Slovene) (Processed)
A text file resulting from a collection (corpus in JSON format) of unofficial consolidated texts of the laws, regulations and other general acts in Slovenia. The collection comprised 21556...
ZIP (241 views) (147 downloads)
-
Bilingual English-Danish parallel corpus from Danish Ministry of Higher Education and Science website
Contents of https://ufm.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the European...
ZIP (423 views) (313 downloads)
-
Laws of Malta (Processed)
Compilation of bilingual Maltese legislation (Maltese-English). This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe...
ZIP (442 views) (339 downloads)
-
Parallel corpus from the website of the Chancellery of the Prime Minister of Poland (Processed)
Polish-English parallel corpus from the website of the Chancellery of the Prime Minister of Poland (https://www.premier.gov.pl). This dataset has been created within the framework of the...
ZIP (474 views) (349 downloads)
-
Legal Texts (Processed)
Corpora of parallel legal texts supplied by Claudia Foti. - GRAND CHAMBER CASE OF AL-SKEINI AND OTHERS v. THE UNITED KINGDOM (Application no. 55721/07) JUDGMENT STRASBOURG 7 July 2011...
ZIP (308 views) (200 downloads)
-
Bilingual English-Danish parallel corpus from The Viking Ship Museum website
Contents of https://www.vikingeskibsmuseet.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the...
ZIP (394 views) (293 downloads)
-
Bilingual collection of reports of the Greek Public Power Corporation (Processed)
A bilingual collection of translation units extracted from the annual financial and corporate responsibility reports of the Greek Public Power Corporation. This dataset has been created...
ZIP (393 views) (303 downloads)
-
Quarterly Reports of the Parliamentary Budget Office (Hellenic Parliament) (Processed)
A collection of 32 reports (16 in EL and 16 in EL) of the Parliamentary Budget Office (Hellenic Parliament). This dataset has been created within the framework of the European Language...
ZIP (517 views) (399 downloads)
-
Bilingual English-Danish parallel corpus from The Danish Environmental Protection Agency website
Contents of https://eng.mst.dk/ and https://mst.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the...
ZIP (326 views) (217 downloads)
-
Parallel texts from the Swedish Competition Authority - Konkurrensverket (Processed)
Parallel texts. The original texts are always in Swedish; the English texts are their translations. This dataset has been created within the framework of the European Language Resource...
ZIP (346 views) (251 downloads)
-
Translation memory from Swedish National Audit Office (NAO) - Riksrevisionen (Processed)
Translation memory from the Swedish National Audit Office. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility...
ZIP (322 views) (225 downloads)
-
Bilingual English-Danish parallel corpus from Danish Ministry of Transport, Building and Housing website
Contents of https://www.trm.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (281 views) (179 downloads)
-
Spanish-English website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, containing 21,007 TUs. Period of crawling: 15/11/2016 - 23/01/2017. A strict validation...
ZIP (509 views) (413 downloads)
-
Bilingual English-Danish parallel corpus from The Danish Medicines Agency website
Contents of https://laegemiddelstyrelsen.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework...
ZIP (340 views) (233 downloads)
-
Portuguese-English bilingual corpus from Legislation concerning the Portuguese Parliament (Processed)
Legislation concerning the Portuguese Parliament; three bilingual TMX files in PT-EN. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (293 views) (197 downloads)
-
Portuguese-English bilingual corpus from the Portuguese Constitution (Processed)
Complete text of the Portuguese Constitution in Portuguese and English; bilingual TMX file in PT-EN. This dataset has been created within the framework of the European Language Resource...
ZIP (301 views) (208 downloads)
-
Bilingual hr-en parallel corpus from Croatian National Bank website (Processed)
Contents of http://www.hnb.hr were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the European...
ZIP (388 views) (293 downloads)