Resources for Language Technologies
-
Bilingual English-Norwegian parallel corpus from Norwegian Maritime Authority website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (453 visninger) (340 Downloads)
-
BMVI Publications (Processed)
TMX file with 11555 TUs, bilingual German/English, publications/brochures of the Federal Ministry of Transport and Digital Infrastructure on transport issues. During processing of the...
ZIP (368 visninger) (261 Downloads)
-
Civil Aviation Regulations (Processed)
A collection of parallel Polish-English texts published by the Polish Civil Aviation Office and complemented by two acts of Civil Aviation Legislation. Sentence-level alignment of...
ZIP (395 visninger) (300 Downloads)
-
Bilingual English-Danish parallel corpus from Danish Maritime Authority website
Contents of https://www.dma.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (315 visninger) (212 Downloads)
-
Bilingual documents Bulgarian-English in the field of ICT and Transport (Processed)
Bilingual collection of documents in the field of Internet governance, implementation of the Digital Agenda in Bulgaria, cloud computing and terminological dataset in Internet domain, and...
ZIP (368 visninger) (270 Downloads)
-
Bilingual documents Bulgarian-English in the field of transport (Processed)
Bilingual Bulgarian-English collection of documents; 549 KB (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (547 visninger) (428 Downloads)
-
Polish-English parallel corpus from the website of the Citizens Information Board (Processed)
Polish-English parallel corpus from the website of the Citizens Information Board, Ireland (http://www.citizensinformation.ie) This dataset has been created within the framework of the...
ZIP (373 visninger) (269 Downloads)
-
Bilingual English-Danish parallel corpus from The Viking Ship Museum website
Contents of https://www.vikingeskibsmuseet.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the...
ZIP (394 visninger) (293 Downloads)
-
Bilingual English-Danish parallel corpus from Danish Ministry of Transport, Building and Housing website
Contents of https://www.trm.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...
ZIP (281 visninger) (179 Downloads)
-
Polish-English parallel corpus from the website of the State Marine Accident Investigation Commission (Processed)
Polish-English parallel corpus from the website of the State Marine Accident Investigation Commission (http://pkbwm.gov.pl) This dataset has been created within the framework of the...
ZIP (336 visninger) (231 Downloads)
-
BMVI Website (Processed)
tmx file, 2718 TUs, bilingual German/English, texts from the website of the Federal Ministry of Transport and Digital Infrastructure (BMVI) on transport issues. Original tmx file...
ZIP (602 visninger) (490 Downloads)
-
BMVI Website
tmx file, 2718 TUs, bilingual German/English, texts from the website of the Federal Ministry of Transport and Digital Infrastructure (BMVI) on transport issues This dataset has been...
XML PDF ZIP (914 visninger) (774 Downloads)
-
Polish-English Internal Aviation Glossaries
A set of bilingual glossaries developed by the Civil Aviation Authority of Republic of Poland, totalling 8548 Polish and English terms with commentaries and reference notes, including...
XML PDF ZIP (486 visninger) (366 Downloads)
-
BMVI Publications
TMX file with 11555 TUs, bilingual German/English, publications/brochures of the Federal Ministry of Transport and Digital Infrastructure on transport issues This dataset has been...
XML PDF ZIP (779 visninger) (671 Downloads)
-
Civil Aviation Regulations
A collection of parallel Polish-English texts published by the Polish Civil Aviation Office and complemented by two acts of Civil Aviation Legislation. Sentence-level alignment of...
XML PDF ZIP (617 visninger) (519 Downloads)
-
Bilingual documents Bulgarian-English in the field of transport
Bilingual Bulgarian-English collection of documents; 549 KB This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe...
ZIP (322 visninger) (216 Downloads)
-
Bilingual English-Norwegian parallel corpus from BarentsWatch website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (391 visninger) (286 Downloads)
-
Bilingual English-Norwegian parallel corpus from the Accident Investigation Board Norway website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (456 visninger) (347 Downloads)
-
Bilingual English-Norwegian parallel corpus from Avinor company website
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (603 visninger) (486 Downloads)
-
Statens Vegvesen Translation Memories
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action SMART...
ZIP (377 visninger) (269 Downloads)