Bilingual English-Danish parallel corpus from Aarhus 2017 - European Capital of Culture website
Description
Contents of http://www.aarhus2017.dk were crawled, aligned on document and sentence level and converted into a parallel corpus.
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) actions SMART 2014/1074 and SMART 2015/1091. For further information on the project: http://lr-coordination.eu.
eurovoc domains
- Identifier
- ELRC_885
- Landing Page
- http://data.europa.eu/euodp/en/data/dataset/elrc_885
- Modified Date
- 2018-10-16
- Language
- English, Danish
- Catalogue
- European Union Open Data Portal