Otsingukriteerium:
-
The Coimisineir Teanga Bilingual Corpus of Reference Documents
General Reference content from the Language Commissioner's Office Size: 6 bilingual Word documents and 44 parallel Word documents This dataset has been created within the framework...
ZIP (339 kuvad) (248 Allalaadimised)
-
The Gaois bilingual corpus of English-Irish legislation
Bilingual corpus of English-Irish legislation provided by the Department of Justice, in two parallel .txt files. Contains 98,758 parallel sentences. This dataset has been created within...
ZIP (432 kuvad) (317 Allalaadimised)
-
Irish Monolingual Corpus from contents of health.gov.ie web site
Irish Monolingual Corpus from contents of health.gov.ie web site This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (252 kuvad) (163 Allalaadimised)
-
Citizens Information Bilingual Web-Corpus (Processed)
A web corpus crawled from http://www.citizensinformation.ie. Contains 10,297 parallel sentences of English/Irish that have undergone manual cleaning. May be reproduced and/or re-used free...
ZIP (243 kuvad) (156 Allalaadimised)
-
English-Irish website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 1134 TUs. Manual validation has been performed on a sample of the data. This dataset...
ZIP (214 kuvad) (126 Allalaadimised)
-
Legal acts of Ireland as Irish Monolingual Corpus
Legal acts of Ireland as Irish Monolingual Corpus collected from documents of http://acts.ie/ web site This dataset has been created within the framework of the European Language...
ZIP (248 kuvad) (157 Allalaadimised)
-
The Coimisineir Teanga Bilingual Corpus of Reference Documents (Processed)
General Reference content from the Language Commissioner's Office. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (334 kuvad) (209 Allalaadimised)
-
The Coimisineir Teanga Bilingual Web Corpus (Processed)
Web content from the Language Commissioner's Office. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility -...
ZIP (288 kuvad) (188 Allalaadimised)
-
The UCD Bord na Gaeilge Corpus of bilingual PDFs and Word documents (Processed)
Parallel data provided by the language office at UCD (University College Dublin) This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (291 kuvad) (189 Allalaadimised)
-
The Coimisineir Teanga Bilingual Corpus of Reports and Press Releases (Processed)
Reports and Press Release data from the Language Commissioner's Office. This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (478 kuvad) (375 Allalaadimised)
-
The Gaois bilingual corpus of English-Irish legislation (Processed)
Bilingual corpus of English-Irish legislation provided by the Department of Justice. This dataset has been created within the framework of the European Language Resource Coordination...
ZIP (383 kuvad) (275 Allalaadimised)
-
The Udáras na Gaeltachta Corpus of bilingual PDFs and Word documents (Processed)
Information brochures and leaflets. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated...
ZIP (256 kuvad) (167 Allalaadimised)
-
The UCD Bórd na Gaeilge Corpus of bilingual PDFs and Word documents
Parallel data provided by the language office at UCD (University College Dublin) Size: 3 Word documents, 67 PDFs This dataset has been created within the framework of the European...
ZIP (294 kuvad) (196 Allalaadimised)
-
The Coimisineir Teanga Bilingual Web Corpus
Web content from the Language Commissioner's Office. Two TXT files containing 6808 words of parallel data This dataset has been created within the framework of the European Language...
ZIP (268 kuvad) (180 Allalaadimised)
-
The Udáras na Gaeltachta Corpus of bilingual PDFs and Word documents
Word documents and PDF files of information brochures and leaflets. This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting...
ZIP (292 kuvad) (219 Allalaadimised)
-
The Coimisineir Teanga Bilingual Corpus of Reports and Press Releases
Reports and Press Release data from the Language Commissioner's Office. 19 parallel Word documents. This dataset has been created within the framework of the European Language Resource...
ZIP (258 kuvad) (156 Allalaadimised)