Resources for Language Technologies
-
DGT-Translation Memory
DGT-Translation Memory. DGT-TM is a translation memory (sentences and their manually produced translations) in 24 languages. It contains segments from current Community law –...
PDF ZIP (45005 views) (4502 Downloads)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 views) (122 Downloads)
-
EJTN Handbook (Processed)
Handbook on judicial training (Processed). This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (373 views) (266 Downloads)
-
Letter of rights for persons arrested on the basis of a European Arrest Warrant (Processed)
Letter of rights for persons arrested on the basis of a European Arrest Warrant (EAW), 1 page (Processed). This dataset has been created within the framework of the European Language...
ZIP (666 views) (557 Downloads)
-
Letter of rights for persons arrested and or detained
Police form, 12 pages. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation...
ZIP (413 views) (296 Downloads)
-
Bilingual Bulgarian-English corpus from the National Revenue Agency (BG) (Processed)
Bilingual Bulgarian-English corpus of administrative documents on the Refund of Value Added Tax from the Bulgarian National Revenue Agency. This dataset has been created within the...
ZIP (595 views) (494 Downloads)
-
English-Bulgarian Computer Terms (Processed)
The resource is a bilingual terminological database representing 729 terms in English and their translations to Bulgarian. The terms belong to the Computer domain. The terminological...
ZIP (273 views) (180 Downloads)
-
Bilingual documents Bulgarian-English in the field of open data, broadband and information society (Processed)
English-Bulgarian collection in the field of open data, broadband, strategic document of the Information society in the Republic of Bulgaria. This dataset has been created within the...
ZIP (503 views) (389 Downloads)
-
English-Bulgarian Computer Terms
The resource is a bilingual terminological database representing 729 terms in English and their translations to Bulgarian. The terms belong to the Computer domain. The terminological...
XML PDF ZIP (581 views) (478 Downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Bulgarian) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (228 views) (145 Downloads)
-
Parallel corpus (Bulgarian - English) in the public administration domain (Processed)
Parallel (bg-en) corpus of 11262 translation units in the public administration domain. This dataset has been created within the framework of the European Language Resource Coordination...
ZIP (366 views) (258 Downloads)
-
Parallel texts from Swedish Work environment Authority (Processed)
Parallel texts from the Swedish Work Environment Authority, all in PDF format. The originals are in Swedish; all other texts are translations. One original with translations per folder....
ZIP (615 views) (487 Downloads)
-
Letter of rights for persons arrested and or detained (Processed)
Collection of translation units (1906 in total) in 21 language pairs extracted from 7 Police forms (one form 12 pages long in each of the following languages: BG, EL, EN, FR, LV, PL, RO)....
ZIP (452 views) (338 Downloads)
-
Bulgarian–English glossary of diseases (Processed)
Bulgarian–English glossary of diseases from the Wikidata knowledge base. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (144 views) (94 Downloads)
-
Parallel Global Voices (Bulgarian - English) (Processed)
Parallel Global Voices BG-EN is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (622 views) (520 Downloads)
-
Monolingual Bulgarian corpus in the culture domain (part 2) (Processed)
Monolingual Bulgarian corpus including content from websites related to the "culture" domain. This dataset has been created within the framework of the European Language Resource...
ZIP (245 views) (160 Downloads)
-
Parallel texts from Swedish Work environment Authority
Parallel texts from the Swedish Work Environment Authority, all in PDF format. The originals are in Swedish; all other texts are translations. One original with translations per folder....
ZIP (406 views) (296 Downloads)
-
Bilingual resource with Bulgarian strategic documents in the field of telecommunications and broadband (Bulgarian - English)
Bilingual (Bulgarian-English) collection of documents in the field of telecommunications and broadband, size on disk 440 kB. This dataset has been created within the framework of the...
ZIP (369 views) (258 Downloads)
-
EJTN Handbook
Handbook on judicial training. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation...
ZIP (336 views) (227 Downloads)
-
English-Bulgarian Economy Terms (Processed)
The resource is a bilingual terminological database representing 899 terms in English and their translations to Bulgarian. The terms belong to the domain of Economy (Business...
ZIP (154 views) (90 Downloads)