Romanian - English news corpus (Processed)
Descrizione
Bilingual Romanian – English news corpus built from SouthEast European Times (2008 dump). The texts are positionaly aligned, i.e. the sentence on line i in the English text is aligned with the sentence on line i in the Romanian text. Alignment was manually validated.
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) actions SMART 2014/1074 and SMART 2015/1091. For further information on the project: http://lr-coordination.eu.
Settori Eurovoc
- Identificatore
- ELRC_493
- Pagina principale
- http://data.europa.eu/euodp/en/data/dataset/elrc_493
- Data della modifica
- 2016-12-20
- Lingua
- rumeno, inglese
- Catalogue
- European Union Open Data Portal