Resources for Language Technologies
-
Macroeconomic Developments
Bulletins of Macroeconomic Developments This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (371 visninger) (261 Downloads)
-
Expression of interest (Processed)
International call for expression of interest for the selection of the President of the Hellenic Statistical Authority (EL.STAT.) This dataset has been created within the framework of...
ZIP (426 visninger) (309 Downloads)
-
Croatian-English corpus with the Rural Development Programme for the Period 2014-2020 from the Croatian Rural Development Programme website (Processed)
Croatian-English corpus with the Rural Development Programme for the Period 2014-2020 from the Croatian Rural Development Programme website This dataset has been created within the...
ZIP (370 visninger) (267 Downloads)
-
Macroeconomic Developments (Processed)
Bulletins of Macroeconomic Developments This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (365 visninger) (1 Downloads)
-
Bilingual Icelandic-English parallel corpus from Statistics Iceland website
Contents of https://www.statice.is and https://hagstofa.is/ websites downloaded, aligned and converted into parallel corpus This dataset has been created within the framework of the...
ZIP (416 visninger) (317 Downloads)
-
Monolingual documents from the Government of Lithuania (Processed)
Monolingual documents received from the Government of the Republic of Lithuania. (Processed) This dataset has been created within the framework of the European Language Resource...
ZIP (490 visninger) (376 Downloads)
-
Polish-English parallel corpus from the website "Business in Poland" (Processed)
Polish-English parallel corpus from the website of the website "Business in Poland" (https://www.biznes.gov.pl/en) This dataset has been created within the framework of the European...
ZIP (217 visninger) (159 Downloads)
-
Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website (Processed)
Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website. The resource contains pdf files with each file containing the text in...
ZIP (350 visninger) (256 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Development (Processed)
Polish-English parallel corpus from the website of the Ministry of Development, Republic of Poland (http://www.mr.gov.pl) This dataset has been created within the framework of the...
ZIP (406 visninger) (296 Downloads)
-
Employment in Poland 2009 report in EN-PL (Processed)
The report "Employment in Poland 2009 – Entrepreneurship for Work" is a thorough study of the most significant processes occurring in the Polish and European labour markets. The dataset...
ZIP (321 visninger) (271 Downloads)
-
Quarterly Reports of the Parliamentary Budget Office (Hellenic Parliament) (Processed)
A collection of 32 reports (16 in EL and 16 In EL) of the Parliamentary Budget Office (Hellenic Parliament) This dataset has been created within the framework of the European Language...
ZIP (517 visninger) (399 Downloads)
-
English-Swedish parallel texts from The Swedish Agency for Economic and Regional Growth - Tillväxtverket (Processed)
Parallel texts from The Swedish Agency for Economic and Regional Growth (Tillväxtverket). Original texts are in Swedish, the English texts are translations. This dataset has been created...
ZIP (634 visninger) (520 Downloads)
-
Parallel English-Icelandic corpus from the contents of Icelandic National Debt Management Agency website
Contents of http://www.lanamal.is website downloaded, aligned and converted into a parallel corpus This dataset has been created within the framework of the European Language Resource...
ZIP (260 visninger) (155 Downloads)
-
Parallel corpus (Polish - English) from the website of the Polish Investment and Trade Agency (Processed)
Parallel (pl-en) corpus of 14736 translation units in the "BUSINESS AND COMPETITION" and "ECONOMICS" domains. This dataset has been created within the framework of the European Language...
ZIP (554 visninger) (425 Downloads)
-
Corpus on Finance and Economics from Bank of Latvia (Processed)
Contents of web site https://makroekonomika.lv/ -- Latvian and https://www.macroeconomics.lv/ -- English aligned as a parallel corpus This dataset has been created within the...
ZIP (285 visninger) (184 Downloads)
-
Monolingual documents from the Government of Lithuania
Monolingual documents received from the Government of the Republic of Lithuania. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (299 visninger) (185 Downloads)
-
Collection of Greek National Spatial Plans
Dataset, 268KB, 5 txt files, national spatial plans (general, aquaculture, tourism, industry, RES, detention facilities) This dataset has been created within the framework of the...
ZIP (211 visninger) (170 Downloads)
-
OROSSIMO Corpus - Economics
A corpus of academic discourse texts belonging to the Economics domain (according to the Dewey Decimal classification, DDC33 - Economics), annotated at structural level conformant to the...
ZIP (366 visninger) (255 Downloads)
-
Bilingual English-Norwegian parallel corpus from the Office of the Auditor General (Riksrevisjonen) website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (819 visninger) (675 Downloads)