Resources for Language Technologies
-
DA-EN Danish Ministry of Higher Education and Science 2
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size 115,000 words, topic: research policy This dataset has been created within the framework of...
ZIP (333 visninger) (209 Downloads)
-
Bilingual resource with Bulgarian strategic documents in the field of telecommunications and broadband (Bulgarian - English)
Bilingual collection of documents in the field of telecommunications and broadband, size on disk 440 kB, Bulgarian-English This dataset has been created within the framework of the...
ZIP (369 visninger) (258 Downloads)
-
DA-EN Danish Ministry of Higher Education and Science 3 (Processed)
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size 110,000 words, topic: research policy (Processed) This dataset has been created within the...
ZIP (330 visninger) (216 Downloads)
-
Bilingual hr-en parallel corpus from the Journal of the Croatian Association of Civil Engineers website (Processed)
Contents of http://casopis-gradjevinar.hr were crawled, aligned on document and sentence level and converted into a parallel corpus This dataset has been created within the framework of...
ZIP (451 visninger) (357 Downloads)
-
Polish-English parallel corpus from the website of the National Centre for Research and Development (Processed)
Polish-English parallel corpus from the website of the National Centre for Research and Development (https://www.ncbr.gov.pl) This dataset has been created within the framework of the...
ZIP (532 visninger) (420 Downloads)
-
Polish-English parallel corpus from the website of the National Centre for Nuclear Research (Processed)
Polish-English parallel corpus from the website of the National Centre for Nuclear Research (https://www.ncbj.gov.pl/) This dataset has been created within the framework of the European...
ZIP (462 visninger) (338 Downloads)
-
Polish-English parallel corpus from the website "geoportal.gov.pl" (Processed)
Polish-English parallel corpus from the website "geoportal.gov.pl (https://www.geoportal.gov.pl) This dataset has been created within the framework of the European Language Resource...
ZIP (195 visninger) (146 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Science and Higher Education (Processed)
Polish-English parallel corpus from the website of the Ministry of Science and Higher Education (http://www.eng.nauka.gov.pl/en/) This dataset has been created within the framework of...
ZIP (358 visninger) (255 Downloads)
-
Bilingual English-Danish parallel corpus from Danish Ministry of Higher Education and Science website
Contents of https://ufm.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the European...
ZIP (423 visninger) (313 Downloads)
-
Polish-English parallel corpus from the website "Science in Poland" (Processed)
Polish-English parallel corpus from the website "Science in Poland" (https://scienceinpoland.pap.pl/en and https://naukawpolsce.pap.pl/) This dataset has been created within the...
ZIP (201 visninger) (157 Downloads)
-
Czech Association of Medical Physicists - Physics Glossary (Processed)
A dictionary of 3281 terms relating to physics for medicine in Czech - English This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (377 visninger) (284 Downloads)
-
DA-EN Danish Ministry of Higher Education and Science 2 (Processed)
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size 115,000 words, topic: research policy (Processed) This dataset has been created within the...
ZIP (302 visninger) (203 Downloads)
-
Bilingual resource with Bulgarian strategic documents in the field of telecommunications and broadband (Bulgarian - English) (Processed)
Bilingual collection of documents in the field of telecommunications and broadband, size on disk 440 kB, Bulgarian-English (Processed) This dataset has been created within the framework...
ZIP (387 visninger) (283 Downloads)
-
OROSSIMO Corpus - Computer Science
A corpus of academic discourse texts belonging to the Computer Science domain (according to the Dewey Decimal classification, DDC00 - Computer science, knowledge & systems), annotated...
ZIP (368 visninger) (256 Downloads)
-
DA-EN Danish Ministry of Higher Education and Science 3
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size 110,000 words, topic: research policy This dataset has been created within the framework of...
ZIP (366 visninger) (264 Downloads)
-
DA-EN Danish Ministry of Higher Education and Science 4 (Processed)
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size 115,000 words, topis: research policy (Processed) This dataset has been created within the...
ZIP (318 visninger) (216 Downloads)
-
DA-EN Danish Ministry of Higher Education and Science 4
Parallel texts Danish-English from the Danish Ministry of Higher Education and Science, size 115,000 words, topis: research policy This dataset has been created within the framework of...
ZIP (466 visninger) (370 Downloads)
-
Bilingual English-Norwegian parallel corpus from Nofima institute website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (371 visninger) (255 Downloads)
-
Bilingual English-Norwegian parallel corpus from Petroleum Safety Authority Norway website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (362 visninger) (255 Downloads)
-
Bilingual English-Norwegian parallel corpus from Geological survey of Norway website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (394 visninger) (265 Downloads)