Třídit podle:
-
Překladová paměť DGT
Překladová paměť DGT DGT-TM je překladová paměť (soubor obsahující věty v původním jazyce a jejich překlady od profesionálních překladatelů) ve 24 jazycích. Obsahuje texty acquis...
ZIP (45005 zobrazení) (4502 Počet stažení)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 zobrazení) (122 Počet stažení)
-
Polish Court Rulings Corpus (Processed)
The Polish Court Rulings Corpus contains 62 726 rulings of Polish courts, over 178 million words of running text. The texts of the rulings together with some metadata were acquired from...
ZIP (285 zobrazení) (174 Počet stažení)
-
Polish Ministry of Foreign Affairs reports in EN and PL (Processed)
The dataset comprises the EN and PL versions of two reports created by the Polish Ministry of Foreign Affairs, “Rules for communicating the POLSKA brand” and “Polish Presidency of the...
ZIP (407 zobrazení) (303 Počet stažení)
-
Monolingual Polish corpus in the public administration domain
Monolingual Polish corpus, containing 22372690 tokens and 1805280 lexical types in the public administration domain. This dataset has been created within the framework of the European...
ZIP (431 zobrazení) (317 Počet stažení)
-
Polish-English Internal Aviation Glossaries (Processed)
A set of bilingual glossaries developed by the Civil Aviation Authority of Republic of Poland, totalling 8548 Polish and English terms with commentaries and reference notes, including...
ZIP (269 zobrazení) (175 Počet stažení)
-
Translations of Hungarian from public websites
A webcrawl of 14 different websites covering parallel corpora of Hungarian with Polish, Czech, Swedish, Finnish, French, German, Italian, English and Slovenian This dataset has been...
ZIP (388 zobrazení) (287 Počet stažení)
-
International Statistical Classification of Diseases and Related Health Problems - ICD-10 (EN-PL) (Processed)
International Classification of Diseases is a widely used classification in healthcare and healthcare management, i.a. for and coding of diseases for financial settlement between...
ZIP (208 zobrazení) (142 Počet stažení)
-
Monolingual Polish corpus in the culture domain (part1) (Processed)
Monolingual Polish corpus from the Warsaw - Official Tourist Website. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (291 zobrazení) (182 Počet stažení)
-
Monolingual Polish corpus in the law domain (Processed)
Monolingual (pol) corpus, including content of websites that are relevant to law and justice This dataset has been created within the framework of the European Language Resource...
ZIP (298 zobrazení) (192 Počet stažení)
-
Khresmoi (Processed)
Parallel data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, German, Hungarian, Polish, Spanish...
ZIP (266 zobrazení) (184 Počet stažení)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Polish) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (251 zobrazení) (137 Počet stažení)
-
Polish-English parallel corpus from the website of the Ministry of the Interior and Administration (Processed)
Polish-English parallel corpus from the website of the Ministry of the Interior and Administration, Republic of Poland (https://www.mswia.gov.pl/) This dataset has been created within...
ZIP (334 zobrazení) (241 Počet stažení)
-
Monolingual Polish corpus in the public administration domain (Processed)
Monolingual Polish corpus, containing 22372690 tokens and 1805280 lexical types in the public administration domain. This dataset has been created within the framework of the European...
ZIP (212 zobrazení) (115 Počet stažení)
-
Monolingual corpus from Minutes of the Polish Senat (Posiedzenia) (2015-2018) (Processed)
The Monolingual Corpus from Minutes of the the Polish Senat (Posiedzenia) (2015-2018) is part of "The Polish Parliamentary Corpus" which is available at http://clip.ipipan.waw.pl/PPC ....
ZIP (259 zobrazení) (164 Počet stažení)
-
Avibase (processed)
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (258 zobrazení) (152 Počet stažení)