Resources for Language Technologies
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 views) (122 Downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic
Dataset of various English-Slovak legal texts within the agenda of the Ministry, in plain text format aligned at the sentence level; size: 105791 words. This dataset has been created within...
ZIP (357 views) (249 Downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Justice of the Slovak Republic
Dataset of various English-Slovak legal texts within the agenda of the Ministry, in plain text format aligned at the sentence level; size: 112580 words. This dataset has been created within...
ZIP (384 views) (1 Downloads)
-
Bilingual en-sk parallel corpus of annual reports from the Statistical Office of the Slovak Republic (Processed)
Bilingual en-sk parallel corpus of annual reports from the Statistical Office of the Slovak Republic for the 2006-2017 period. This dataset has been created within the framework of the...
ZIP (187 views) (122 Downloads)
-
Slovak–English glossary of diseases (Processed)
Slovak–English glossary of diseases from the Wikidata knowledge base. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (241 views) (174 Downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Justice of the Slovak Republic (Processed)
Dataset of various English-Slovak legal texts within the agenda of the Ministry, in plain text format aligned at the sentence level; size: 112580 words. It was converted into a 2895-TUs...
ZIP (508 views) (405 Downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Slovak) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (267 views) (164 Downloads)
-
English-Slovak corpus of annual reports from the Slovak National Centre for Human Rights website (Processed)
English-Slovak corpus of annual reports from the Slovak National Centre for Human Rights for the 2008-2017 period. This dataset has been created within the framework of the European...
ZIP (337 views) (249 Downloads)
-
Monolingual corpus from Minutes of the Sessions of the National Council of the Slovak Republic (2017-2018) (Processed)
Minutes of the Sessions of the National Council of the Slovak Republic (2017-2018) were downloaded from https://www.nrsr.sk/dl/Browser/Default?legId=13&termNr=7 . This dataset has...
ZIP (245 views) (154 Downloads)
-
English-Slovak corpus of annual reports on immigration and asylum policies from the EMN National Contact Point for the Slovak Republic website (Processed)
English-Slovak corpus of annual reports on immigration and asylum policies from the EMN National Contact Point for the Slovak Republic website (https://emn.sk/en/). This dataset has been...
ZIP (310 views) (209 Downloads)
-
Slovak corpus of texts from the Ministry of Culture of the Slovak Republic (Processed)
Dataset of Slovak legal texts within the agenda of the Ministry, in plain text format. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (239 views) (149 Downloads)
-
Slovak corpus of texts from the Ministry of Justice of the Slovak Republic (Processed)
Dataset of various legal texts within the agenda of the Ministry, in plain text format; size: 107745 words. This dataset has been created within the framework of the European Language Resource...
ZIP (240 views) (149 Downloads)
-
English-Slovak parallel corpus of texts from The Ministry of Culture of the Slovak Republic (Processed)
Dataset of various English-Slovak legal texts within the agenda of the Ministry, in plain text format aligned at the sentence level; size: 105791 words. It is converted into a 2609-TUs...
ZIP (512 views) (400 Downloads)
-
Slovak corpus of texts from the Ministry of Culture of the Slovak Republic
Dataset of Slovak legal texts within the agenda of the Ministry, in plain text format; size: 108448 words. This dataset has been created within the framework of the European Language...
ZIP (560 views) (438 Downloads)
-
Slovak corpus of texts from the Ministry of Justice of the Slovak Republic
Dataset of various legal texts within the agenda of the Ministry, in plain text format; size: 107745 words. This dataset has been created within the framework of the European Language Resource...
ZIP (381 views) (269 Downloads)
-
Avibase (processed)
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (258 views) (152 Downloads)
-
DGT-Translation Memory
DGT-TM is a translation memory (sentences and their manually produced translations) in 24 languages. It contains segments from the body of Community law in force –...
ZIP (45005 views) (4502 Downloads)