-
English-Finnish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland. This dataset has been created within the framework of the European...
XML PDF ZIP (850 views) (724 downloads)
-
English-Swedish parallel corpus from Annual Reports of the Swedish Pension System (Processed)
Source PDF files as parallel documents. The original texts are all in Swedish; the English texts are their translations. This dataset has been created within the framework of the European...
ZIP (681 views) (578 downloads)
-
English-Swedish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland. This dataset has been created within the framework of the European...
XML PDF ZIP (641 views) (524 downloads)
-
Corpus RIZIV
Corpus of Dutch and French texts from the national institute for illness and invalidity insurance
ZIP (625 views) (545 downloads)
-
Polish-English parallel corpus from the website of the Polish Tourism Organisation (Processed)
Polish-English parallel corpus from the website of the Polish Tourism Organisation (https://pot.gov.pl/en). This dataset has been created within the framework of the European Language...
ZIP (621 views) (508 downloads)
-
ENGLISH/POLISH PHRASE BOOK FOR ADMINISTRATIVE STAFF of LOCAL GOVERNMENT UNITS (Processed)
An English/Polish phrase book for the administrative staff of local government units (LGUs). This dataset has been created within the framework of the European Language Resource...
ZIP (612 views) (514 downloads)
-
Romanian - English literature corpus (Processed)
Bilingual Romanian – English literature corpus built from a small set of freely available literature books (drama, sci-fi, etc.). The texts are positionally aligned, i.e. the sentence on...
ZIP (594 views) (474 downloads)
-
OROSSIMO Corpus - Photography, film & video
A corpus of academic discourse texts belonging to the Photography, film & video domain (according to the Dewey Decimal classification, DDC77 - Photography, computer art, film &...
ZIP (572 views) (447 downloads)
-
OROSSIMO Corpus - Medicine & health
A corpus of academic discourse texts belonging to the Medicine & health domain (according to the Dewey Decimal classification, DDC61 - Medicine & health), annotated at structural...
ZIP (568 views) (481 downloads)
-
English-Danish Parallel corpus from Tatoeba project (Processed)
Parallel corpus of English-Danish translations from the tatoeba.org website. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (566 views) (466 downloads)
-
Slovak corpus of texts from the Ministry of Culture of the Slovak Republic
Dataset of Slovak legal texts within the agenda of the Ministry, in plain text format; size: 108448 words. This dataset has been created within the framework of the European Language...
ZIP (560 views) (438 downloads)
-
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network) (Processed)
Polish-English parallel corpus from the website of Public Employment Services in Poland (member of EURES network, https://eures.praca.gov.pl). This dataset has been created within the...
ZIP (549 views) (440 downloads)
-
English-Swedish parallel corpus from the translation of 'Sweden a Pocket Guide' book (Processed)
A guide for foreigners who move to Sweden. Source language is Swedish. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (541 views) (371 downloads)
-
English-Estonian EASTIN-CL Multilingual Ontology of Assistive Technology (Processed)
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
ZIP (534 views) (420 downloads)
-
Parallel corpus from Social Insurance Agency -- Försäkringskassan (Sweden) (Processed)
Parallel corpus from Social Insurance Agency (Sweden) (Försäkringskassan). This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (530 views) (401 downloads)
-
Bilingual English-Norwegian parallel corpus from Norwegian Institute of Public Health website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (528 views) (434 downloads)
-
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (Processed)
Polish-English parallel corpus from the website of the Office of the Commissioner for Human Rights (https://www.rpo.gov.pl/en). This dataset has been created within the framework of the...
ZIP (523 views) (420 downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic (Processed)
Dataset of various English-Slovak legal texts within the agenda of the Ministry, in plain text format aligned at the sentence level; size: 105791 words. It is converted into a 2609-TUs...
ZIP (512 views) (400 downloads)
-
Bilingual English-Danish parallel corpus from Aarhus 2017 - European Capital of Culture website
Contents of http://www.aarhus2017.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (508 views) (391 downloads)
-
Bilingual English-Norwegian parallel corpus from KORO / Public Art Norway website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (505 views) (393 downloads)