Resources for Language Technologies
-
English-Estonian corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (439 visninger) (337 Downloads)
-
English-Swedish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (641 visninger) (524 Downloads)
-
English-Finnish corpus from Finnish Information Bank
http://www.infopankki.fi - Finland in your language - Information about Finland - Moving to Finland - Living in Finland This dataset has been created within the framework of the European...
XML PDF ZIP (850 visninger) (724 Downloads)
-
Orossimo Terminological Resource - Medicine & health
A bilingual terminological glossary extracted from academic discourse texts belonging to the Medicine & health domain. This dataset has been created within the framework of the...
XML PDF ZIP (766 visninger) (651 Downloads)
-
English-Danish Parallel corpus from Tatoeba project
Parallel corpus from English-Danish translations from tatoeba.org website This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
XML PDF ZIP (958 visninger) (815 Downloads)
-
English-Estonian EASTIN-CL Multilingual Ontology of Assistive Technology
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
XML PDF ZIP (530 visninger) (427 Downloads)
-
Central Statistical Office Dataset
Two Polish-English publications of the Polish Central Statistical Office in the XLIFF format: 1. "Statistical Yearbook of the Republic of Poland 2015" is the main summary publication...
XML PDF ZIP (663 visninger) (565 Downloads)
-
Health Multilingual Terminologies
17 multilingual medical terminologies from Termcat in the following domains: - Anatomy (3610 terms; languages: es, en, ca) - Integrated care (75 terms; languages: es, en,ca) -...
XML PDF ZIP (739 visninger) (628 Downloads)
-
English-Latvian EASTIN-CL Multilingual Ontology of Assistive Technology
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
XML PDF ZIP (457 visninger) (358 Downloads)
-
Parallel Global Voices (Greek - French)
Parallel Global Voices EL-FR is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
XML PDF ZIP (640 visninger) (514 Downloads)
-
English-Lithuanian EASTIN-CL Multilingual Ontology of Assistive Technology
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
XML PDF ZIP (555 visninger) (455 Downloads)
-
Parallel Global Voices (Greek - English)
Parallel Global Voices EL-EN is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
XML PDF ZIP (835 visninger) (710 Downloads)
-
TERMIS: Slovene-English terminology in the field of public relations
The terminological database for the field of public relations contains 2000 terms with information on accent, norm, explanations, English translations, typical collocations and examples...
XML PDF ZIP (657 visninger) (550 Downloads)
-
Orossimo Terminological Resource - Photography, film & video
A bilingual terminological glossary extracted from academic discourse texts belonging to the Photography, film & video domain. This dataset has been created within the framework of...
XML PDF ZIP (683 visninger) (564 Downloads)
-
English-Danish EASTIN-CL Multilingual Ontology of Assistive Technology
EASTIN-CL Multilingual Ontology of Assistive Technology was created within the EASTIN-CL project aimed at applying language technologies to portal of assistive technologies...
XML PDF ZIP (591 visninger) (483 Downloads)
-
National Health Fund Dataset
The dataset is a 274K-token Polish-English parallel resource in XLIFF format created on the basis of "Diagnosis-Related Groups in Europe" publication of the Polish National Health Fund....
XML PDF ZIP (452 visninger) (374 Downloads)