-
National Health Fund Dataset (Processed)
The dataset is a 274K-token Polish-English parallel resource in XLIFF format created on the basis of "Diagnosis-Related Groups in Europe" publication of the Polish National Health Fund....
ZIP (345 visningar) (231 Nedladdningar)
-
DA-EN Danish Ministry of Higher Education and Science 2
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size 115,000 words, topic: research policy This dataset has been created within the framework of...
ZIP (333 visningar) (209 Nedladdningar)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic
Dataset of various English-Slovak legal texts within agenda of the Ministry, plain text format alligned at the sentence level, the size: 105791 words This dataset has been created within...
ZIP (357 visningar) (249 Nedladdningar)
-
Romanian – English literature corpus
Bilingual Romanian - English literature corpus built from a small set of freely available literature books (drama, sci-fi, etc.). The texts are positionally aligned, i.e. the sentence on...
ZIP (411 visningar) (321 Nedladdningar)
-
English-Estonian corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
ZIP (288 visningar) (186 Nedladdningar)
-
English-Swedish corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
ZIP (432 visningar) (327 Nedladdningar)
-
English-Finnish corpus from Finnish Information Bank (Processed)
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
ZIP (496 visningar) (378 Nedladdningar)
-
English-Estonian corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (439 visningar) (337 Nedladdningar)
-
English-Swedish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (641 visningar) (524 Nedladdningar)
-
English-Finnish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (850 visningar) (724 Nedladdningar)
-
English-Estonian Parallel corpus compiled from translated annual reports from Estonian Academy of Sciences
English-Estonian translated annual reports as source data for parallel corpus -- collected from the web site of Estonian Academy of Sciences http://www.akadeemia.ee/ This dataset has...
ZIP (296 visningar) (204 Nedladdningar)
-
Bilingual documents Bulgarian-English in the field of open data, broadband and information society (Processed)
English-Bulgarian collection in the field of open data, broadband, strategic document of the Information society in the Republic of Bulgaria This dataset has been created within the...
ZIP (503 visningar) (389 Nedladdningar)
-
English-Estonian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (534 visningar) (420 Nedladdningar)
-
Polish-English parallel corpus from the website of the National Digital Archives (Processed)
Polish-English parallel corpus from the website of the National Digital Archives (https://www.nac.gov.pl) This dataset has been created within the framework of the European Language...
ZIP (412 visningar) (313 Nedladdningar)
-
DA-EN Danish Ministry of Higher Education and Science
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size: 120,000 words, topic: innovation, science This dataset has been created within the framework...
ZIP (453 visningar) (360 Nedladdningar)
-
Parallel Global Voices (Bulgarian - English) (Processed)
Parallel Global Voices BG-EN is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (622 visningar) (520 Nedladdningar)
-
Romanian-English corpus with studies, reports and statistical data in the field of culture from the National Institute for Cultural Research and Training website (Processed)
Romanian-English corpus with studies, reports and statistical data in the field of culture from the National Institute for Cultural Research and Training website This dataset has been...
ZIP (362 visningar) (254 Nedladdningar)
-
English-Swedish parallel corpus from the web site of the Swedish Migration Board - Migrationsverket (Processed)
All texts have been collected from their website of the Swedish Migration Board. The original text is always in Swedish, the other texts are translations from Swedish. This dataset has...
ZIP (312 visningar) (221 Nedladdningar)
-
Polish-English parallel corpus from the website of the Ministry of Digitization (Processed)
Polish-English parallel corpus from the website of the Ministry of Digitization, Republic of Poland (http://mac.gov.pl) This dataset has been created within the framework of the European...
ZIP (330 visningar) (229 Nedladdningar)
-
Bilingual English-Danish parallel corpus from Aarhus 2017 - European Capital of Culture website
Contents of http://www.aarhus2017.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (508 visningar) (391 Nedladdningar)