Resources for Language Technologies
-
Macroeconomic Developments
Bulletins of Macroeconomic Developments This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (371 visninger) (261 Downloads)
-
Expression of interest (Processed)
International call for expression of interest for the selection of the President of the Hellenic Statistical Authority (EL.STAT.) This dataset has been created within the framework of...
ZIP (426 visninger) (309 Downloads)
-
Croatian-English corpus with the Rural Development Programme for the Period 2014-2020 from the Croatian Rural Development Programme website (Processed)
Croatian-English corpus with the Rural Development Programme for the Period 2014-2020 from the Croatian Rural Development Programme website This dataset has been created within the...
ZIP (370 visninger) (267 Downloads)
-
Macroeconomic Developments (Processed)
Bulletins of Macroeconomic Developments This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (365 visninger) (1 Downloads)
-
Bilingual Icelandic-English parallel corpus from Statistics Iceland website
Contents of https://www.statice.is and https://hagstofa.is/ websites downloaded, aligned and converted into parallel corpus This dataset has been created within the framework of the...
ZIP (416 visninger) (317 Downloads)
-
Monolingual documents from the Government of Lithuania (Processed)
Monolingual documents received from the Government of the Republic of Lithuania. (Processed) This dataset has been created within the framework of the European Language Resource...
ZIP (490 visninger) (376 Downloads)
-
Bilingual English-Danish parallel corpus from Visit Vejle website
Contents of https://www.visitvejle.com were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...
ZIP (342 visninger) (238 Downloads)
-
Bilingual English-Danish parallel corpus from Danish Ministry of Foreign Affairs website
Contents of http://um.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the European...
ZIP (367 visninger) (264 Downloads)
-
Bilingual English-Danish parallel corpus from Odense Municipality website
Contents of https://www.odense.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (364 visninger) (256 Downloads)
-
Bilingual English-Danish parallel corpus from The Geological Survey of Denmark and Greenland (GEUS) website
Contents of http://www.geus.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (391 visninger) (287 Downloads)
-
Polish-English parallel corpus from the website "Business in Poland" (Processed)
Polish-English parallel corpus from the website of the website "Business in Poland" (https://www.biznes.gov.pl/en) This dataset has been created within the framework of the European...
ZIP (217 visninger) (159 Downloads)
-
Bilingual English-Danish parallel corpus from VisitDenmark - The official tourism site of Denmark website
Contents of https://www.visitdenmark.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...
ZIP (403 visninger) (284 Downloads)
-
Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website (Processed)
Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website. The resource contains pdf files with each file containing the text in...
ZIP (350 visninger) (256 Downloads)
-
Polish-English parallel corpus from the website of the Ministry of Development (Processed)
Polish-English parallel corpus from the website of the Ministry of Development, Republic of Poland (http://www.mr.gov.pl) This dataset has been created within the framework of the...
ZIP (406 visninger) (296 Downloads)
-
Employment in Poland 2009 report in EN-PL (Processed)
The report "Employment in Poland 2009 – Entrepreneurship for Work" is a thorough study of the most significant processes occurring in the Polish and European labour markets. The dataset...
ZIP (321 visninger) (271 Downloads)
-
Bilingual English-Danish parallel corpus from National Museum of Denmark website
Contents of https://natmus.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (301 visninger) (195 Downloads)
-
Quarterly Reports of the Parliamentary Budget Office (Hellenic Parliament) (Processed)
A collection of 32 reports (16 in EL and 16 In EL) of the Parliamentary Budget Office (Hellenic Parliament) This dataset has been created within the framework of the European Language...
ZIP (517 visninger) (399 Downloads)
-
English-Swedish parallel texts from The Swedish Agency for Economic and Regional Growth - Tillväxtverket (Processed)
Parallel texts from The Swedish Agency for Economic and Regional Growth (Tillväxtverket). Original texts are in Swedish, the English texts are translations. This dataset has been created...
ZIP (634 visninger) (520 Downloads)
-
Parallel English-Icelandic corpus from the contents of Icelandic National Debt Management Agency website
Contents of http://www.lanamal.is website downloaded, aligned and converted into a parallel corpus This dataset has been created within the framework of the European Language Resource...
ZIP (260 visninger) (155 Downloads)
-
Orossimo Terminological Resource - Economics
A bilingual terminological glossary extracted from academic discourse texts belonging to the Economics domain. This dataset has been created within the framework of the European Language...
XML PDF ZIP (640 visninger) (528 Downloads)