-
DGT-Translation Memory
DGT-TM is a translation memory (sentences and their manually produced translations) in 24 languages. It contains segments from the Acquis Communautaire, the body of European legislation,...
ZIP (45005 views) (4502 Downloads)
-
COVID-19 multilingual terminology in IATE
The dataset is a collection of multilingual entries related to the SARS-CoV-2 virus and the COVID-19 pandemic, available in IATE, the European Union terminology database. It is a...
Excel XLSX (1490 views) (122 Downloads)
-
Terminology_of_international_contracts_Portuguese_(Processed)
The Portuguese terms extracted from the multilingual terminology of international contracts as provided by the German Foreign Office. Transformed into TBX. 728 terms corresponding to 671...
ZIP (241 views) (139 Downloads)
-
Terminology_of_the _German_Foreign_Office_Portuguese_(Processed)
Portuguese terms extracted from the main terminology collection of the German Foreign Office. Transformed into the TBX Format. 3754 terms corresponding to 3055 termEntry IDs. Can be...
ZIP (243 views) (147 Downloads)
-
English-Portuguese website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 843 TUs. Manual validation has been performed on a sample of the data. This dataset...
ZIP (260 views) (177 Downloads)
-
Portuguese monolingual corpus from contents of the Official Journal of Portugal
Portuguese monolingual corpus from contents of the Official Journal of Portugal This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (259 views) (162 Downloads)
-
EUIPO - Trade mark Guidelines (October 2017) (English-Portuguese) (Processed)
The EUIPO Guidelines are the main point of reference for users of the European Union trade mark system and professional advisers who want to make sure they have the latest information on...
ZIP (266 views) (168 Downloads)
-
Legislation PT (Processed)
Portuguese legislation in PT (laws published in the Diário da República - Official Journal of the Portuguese Republic) (Processed) This dataset has been created within the framework of...
ZIP (87 views) (33 Downloads)
-
German-Portuguese website parallel corpus from the Federal Foreign Office Berlin (Processed)
German-Portuguese texts extracted from the website of the Federal Foreign Office Berlin. This includes 415 pairs that were translated between September 2013 and the beginning of December...
ZIP (229 views) (132 Downloads)
-
Parallel Global Voices (English - Portuguese) (Processed)
Parallel Global Voices EN-PT is a parallel corpus generated from the Global Voices multilingual group of websites (http://globalvoices.org/), where volunteers publish and translate news...
ZIP (210 views) (131 Downloads)
-
Khresmoi (Processed)
Parallel data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, German, Hungarian, Polish, Spanish...
ZIP (266 views) (184 Downloads)
-
Spanish-Portuguese website parallel corpus (Processed)
This is a parallel corpus of bilingual texts crawled from multilingual websites, which contains 1,249 TUs. Manual validation has been performed on a sample of the data. This dataset...
ZIP (283 views) (177 Downloads)
-
Portuguese-French bilingual corpus from Portuguese law on referendum (Processed)
Law on the referendum in Portugal; bilingual tmx file in PT-FR (Processed) This dataset has been created within the framework of the European Language Resource Coordination (ELRC)...
ZIP (287 views) (184 Downloads)
-
German-Portuguese website parallel corpus from the Federal Foreign Office Berlin
German-Portuguese texts extracted from the website of the Federal Foreign Office Berlin. This includes 415 pairs that were translated between September 2013 and the beginning of December...
XML PDF ZIP (535 views) (427 Downloads)
-
Spanish-Portuguese website parallel corpus
This is a parallel and aligned corpus of bilingual texts crawled from multilingual websites, which contains 1,249 TUs. This dataset has been created within the framework of the European...
ZIP (780 views) (645 Downloads)