-
ГД „Писмени преводи“ — Преводаческа памет
DGT-TM е преводаческа памет (изречения и техния ръчен превод) на 24 езика. Тя съдържа сегменти от достиженията на правото на ЕС — европейското законодателство, включващо всички договори,...
ZIP (45005 начини на показване) (4502 Изтегляния)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 начини на показване) (122 Изтегляния)
-
The Coimisineir Teanga Bilingual Corpus of Reference Documents
General Reference content from the Language Commissioner's Office Size: 6 bilingual Word documents and 44 parallel Word documents This dataset has been created within the framework...
ZIP (339 начини на показване) (248 Изтегляния)
-
The Gaois bilingual corpus of English-Irish legislation
Bilingual corpus of English-Irish legislation provided by the Department of Justice, in two parallel .txt files. Contains 98,758 parallel sentences. This dataset has been created within...
ZIP (432 начини на показване) (317 Изтегляния)
-
Irish Monolingual Corpus from contents of health.gov.ie web site
Irish Monolingual Corpus from contents of health.gov.ie web site This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (252 начини на показване) (163 Изтегляния)
-
Citizens Information Bilingual Web-Corpus (Processed)
A web corpus crawled from http://www.citizensinformation.ie. Contains 10,297 parallel sentences of English/Irish that have undergone manual cleaning. May be reproduced and/or re-used free...
ZIP (243 начини на показване) (156 Изтегляния)
-
English-Irish website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 1134 TUs. Manual validation has been performed on a sample of the data. This dataset...
ZIP (214 начини на показване) (126 Изтегляния)
-
Legal acts of Ireland as Irish Monolingual Corpus
Legal acts of Ireland as Irish Monolingual Corpus collected from documents of http://acts.ie/ web site This dataset has been created within the framework of the European Language...
ZIP (248 начини на показване) (157 Изтегляния)
-
The Coimisineir Teanga Bilingual Corpus of Reference Documents (Processed)
General Reference content from the Language Commissioner's Office. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (334 начини на показване) (209 Изтегляния)
-
The Coimisineir Teanga Bilingual Web Corpus (Processed)
Web content from the Language Commissioner's Office. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (288 начини на показване) (188 Изтегляния)
-
The UCD Bord na Gaeilge Corpus of bilingual PDFs and Word documents (Processed)
Parallel data provided by the language office at UCD (University College Dublin) This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (291 начини на показване) (189 Изтегляния)
-
The Coimisineir Teanga Bilingual Corpus of Reports and Press Releases (Processed)
Reports and Press Release data from the Language Commissioner's Office. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (478 начини на показване) (375 Изтегляния)
-
The Gaois bilingual corpus of English-Irish legislation (Processed)
Bilingual corpus of English-Irish legislation provided by the Department of Justice. This dataset has been created within the framework of the European Language Resource Coordination...
ZIP (383 начини на показване) (275 Изтегляния)
-
The Udáras na Gaeltachta Corpus of bilingual PDFs and Word documents (Processed)
Information brochures and leaflets. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (256 начини на показване) (167 Изтегляния)
-
The UCD Bórd na Gaeilge Corpus of bilingual PDFs and Word documents
Parallel data provided by the language office at UCD (University College Dublin) Size: 3 Word documents, 67 PDFs This dataset has been created within the framework of the European...
ZIP (294 начини на показване) (196 Изтегляния)
-
The Coimisineir Teanga Bilingual Web Corpus
Web content from the Language Commissioner's Office. Two TXT files containing 6808 words of parallel data This dataset has been created within the framework of the European Language...
ZIP (268 начини на показване) (180 Изтегляния)
-
The Udáras na Gaeltachta Corpus of bilingual PDFs and Word documents
Word documents and PDF files of information brochures and leaflets. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (292 начини на показване) (219 Изтегляния)
-
The Coimisineir Teanga Bilingual Corpus of Reports and Press Releases
Reports and Press Release data from the Language Commissioner's Office. 19 parallel Word documents. This dataset has been created within the framework of the European Language Resource...
ZIP (258 начини на показване) (156 Изтегляния)