Resources for Language Technologies
-
Macroeconomic Developments
Bulletins of Macroeconomic Developments. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (371 views) (261 downloads)
-
Expression of interest (Processed)
International call for expression of interest for the selection of the President of the Hellenic Statistical Authority (EL.STAT.). This dataset has been created within the framework of...
ZIP (426 views) (309 downloads)
-
Croatian-English corpus with the Rural Development Programme for the Period 2014-2020 from the Croatian Rural Development Programme website (Processed)
Croatian-English corpus with the Rural Development Programme for the Period 2014-2020 from the Croatian Rural Development Programme website. This dataset has been created within the...
ZIP (370 views) (267 downloads)
-
Macroeconomic Developments (Processed)
Bulletins of Macroeconomic Developments. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (365 views) (1 download)
-
Bilingual Icelandic-English parallel corpus from Statistics Iceland website
Contents of the https://www.statice.is and https://hagstofa.is/ websites, downloaded, aligned and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (416 views) (317 downloads)
-
Monolingual documents from the Government of Lithuania (Processed)
Monolingual documents received from the Government of the Republic of Lithuania. (Processed) This dataset has been created within the framework of the European Language Resource...
ZIP (490 views) (376 downloads)
-
Bilingual English-Danish parallel corpus from Visit Vejle website
Contents of https://www.visitvejle.com were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...
ZIP (342 views) (238 downloads)
-
Bilingual English-Danish parallel corpus from Danish Ministry of Foreign Affairs website
Contents of http://um.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the European...
ZIP (367 views) (264 downloads)
-
Bilingual English-Danish parallel corpus from Odense Municipality website
Contents of https://www.odense.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (364 views) (256 downloads)
-
Bilingual English-Danish parallel corpus from The Geological Survey of Denmark and Greenland (GEUS) website
Contents of http://www.geus.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (391 views) (287 downloads)
-
Polish-English parallel corpus from the website "Business in Poland" (Processed)
Polish-English parallel corpus from the website "Business in Poland" (https://www.biznes.gov.pl/en). This dataset has been created within the framework of the European...
ZIP (217 views) (159 downloads)
-
Bilingual English-Danish parallel corpus from VisitDenmark - The official tourism site of Denmark website
Contents of https://www.visitdenmark.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...
ZIP (403 views) (284 downloads)
-
Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website (Processed)
Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website. The resource contains pdf files with each file containing the text in...
ZIP (350 views) (256 downloads)
-
Polish-English parallel corpus from the website of the Ministry of Development (Processed)
Polish-English parallel corpus from the website of the Ministry of Development, Republic of Poland (http://www.mr.gov.pl). This dataset has been created within the framework of the...
ZIP (406 views) (296 downloads)
-
Employment in Poland 2009 report in EN-PL (Processed)
The report "Employment in Poland 2009 – Entrepreneurship for Work" is a thorough study of the most significant processes occurring in the Polish and European labour markets. The dataset...
ZIP (321 views) (271 downloads)
-
Bilingual English-Danish parallel corpus from National Museum of Denmark website
Contents of https://natmus.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (301 views) (195 downloads)
-
Quarterly Reports of the Parliamentary Budget Office (Hellenic Parliament) (Processed)
A collection of 32 reports (16 in EL and 16 in EL) of the Parliamentary Budget Office (Hellenic Parliament). This dataset has been created within the framework of the European Language...
ZIP (517 views) (399 downloads)
-
English-Swedish parallel texts from The Swedish Agency for Economic and Regional Growth - Tillväxtverket (Processed)
Parallel texts from The Swedish Agency for Economic and Regional Growth (Tillväxtverket). Original texts are in Swedish, the English texts are translations. This dataset has been created...
ZIP (634 views) (520 downloads)
-
Parallel English-Icelandic corpus from the contents of Icelandic National Debt Management Agency website
Contents of the http://www.lanamal.is website, downloaded, aligned and converted into a parallel corpus. This dataset has been created within the framework of the European Language Resource...
ZIP (260 views) (155 downloads)
-
Orossimo Terminological Resource - Economics
A bilingual terminological glossary extracted from academic discourse texts belonging to the Economics domain. This dataset has been created within the framework of the European Language...
XML PDF ZIP (640 views) (528 downloads)