OLAC Record
oai:lindat.mff.cuni.cz:11234/1-1477

Metadata
Title:Europarl QTLeap WSD/NED corpus
Bibliographic Citation:http://hdl.handle.net/11234/1-1477
Creator:Agirre, Eneko
Branco, António
Popel, Martin
Simov, Kiril
Date (W3CDTF):2015-05-16T17:30:22Z
Date Available:2015-05-16T17:30:22Z
Description:This corpora is part of Deliverable 5.5 of the European Commission project QTLeap FP7-ICT-2013.4.1-610516 (http://qtleap.eu). The texts are sentences from the Europarl parallel corpus (Koehn, 2005). We selected the monolingual sentences from parallel corpora for the following pairs: Bulgarian-English, Czech-English, Portuguese-English and Spanish-English. The English corpus is comprised by the English side of the Spanish-English corpus. Basque is not in Europarl. In addition, it contains the Basque and English sides of the GNOME corpus. The texts have been automatically annotated with NLP tools, including Word Sense Disambiguation, Named Entity Disambiguation and Coreference resolution. Please check deliverable D5.6 in http://qtleap.eu/deliverables for more information.
Identifier (URI):http://hdl.handle.net/11234/1-1477
Language:Basque
Bulgarian
Czech
English
Portuguese
Spanish
Language (ISO639):eus
bul
ces
eng
por
spa
Publisher:University of the Basque Country, UPV/EHU
Faculty of Science, Univeristy of Lisbon, FCUL
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Bulgarian Academy of Sciences, IICT-BAS
Rights:Creative Commons - Attribution 4.0 International (CC BY 4.0)
http://creativecommons.org/licenses/by/4.0/
Subject:annotated corpus
multilingual
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-1477
DateStamp:  2021-06-29
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Agirre, Eneko; Branco, António; Popel, Martin; Simov, Kiril. 2015. University of the Basque Country, UPV/EHU.
Terms: area_Europe country_BG country_CZ country_ES country_GB country_PT dcmi_Text iso639_bul iso639_ces iso639_eng iso639_eus iso639_por iso639_spa olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-1477
Up-to-date as of: Thu Oct 5 0:40:23 EDT 2023