Public Procurement Dataset 1 (Processed)
Description
A collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in the XLiFF format. There are two publications in the collection: a) Report on functioning of public procurement system in 2009 (raport_uzp_2009.xlf, 1495 segments 65237 words) and b) Report on functioning of public procurement system in 2010 (raport_uzp_2010.xlf, 1188 segments, 58684 words). The total size of the collection is 123 921 words in 2683 parallel segments. It was converted into a 1578-TUs English-Polish resource in TMX format.
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) actions SMART 2014/1074 and SMART 2015/1091. For further information on the project: http://lr-coordination.eu.
eurovoc domains
- Identifier
- ELRC_486
- Landing Page
- http://data.europa.eu/euodp/en/data/dataset/elrc_486
- Modified Date
- 2017-03-31
- Language
- Polish, English
- Catalogue
- European Union Open Data Portal