Bilingual hr-en parallel corpus from the National and University Library in Zagreb website (Processed)
Description
Contents of http://www.nsk.hr were crawled, aligned on document and sentence level and converted into a parallel corpus
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) actions SMART 2014/1074 and SMART 2015/1091. For further information on the project: http://lr-coordination.eu.
eurovoc domains
- Identifier
- ELRC_1058
- Modified Date
- 2017-12-21
- Language
- Croatian, English
- Catalogue
- European Union Open Data Portal