OLAC Record
oai:catalogue.elra.info:ELRA-W0018

Metadata
Title:ARCADE/ROMANSEVAL corpus
Abstract:The corpus contains raw data from the JOC corpus developed in the MULTEXT project financed by the European Commission (LRE 62-050), composed of 1 million words in English and four romance languages: French, Italian, Spanish and Portuguese (Written Question and Answers from the Official Journal of the European Commission). The annotation concerns all the contexts of 60 different test words (20 nouns, 20 adjectives, 20 verbs), i.e. ca. 3700 contexts all together. It comprises: semantic tagging of all the occurrences of the test words in the JOC corpus for French and Italian; and word-level alignment of all the occurrences of the test words between French and English.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):1998-11-23
Date Issued (W3CDTF):2004-09-14
Date Modified (W3CDTF):2013-01-24
Description:Written Corpora
The ARCADE/ROMANSEVAL corpus was used as a reference corpus in two international competitions: ? ARCADE, an exercise on multilingual text alignment financed by AUPELF-UREF ? ROMANSEVAL, part of the SENSEVAL exercise sponsored by ACL-SIGLEX and EURALEX, on word sense disambiguation. The corpus contains raw data from the JOC corpus developed in the MULTEXT project financed by the European Commission (LRE 62-050), composed of 1 million words in English and four romance languages: French, Italian, Spanish and Portuguese (Written Question and Answers from the Official Journal of the European Commission). The annotation concerns all the contexts of 60 different test words (20 nouns, 20 adjectives, 20 verbs), i.e. ca. 3700 contexts all together, and comprises: ? semantic tagging of all the occurrences of the test words in the JOC corpus for French and Italian; ? word-level alignment of all the occurrences of the test words between French and English. Additional information:http://www.lpl.univ-aix.fr/projects/arcadehttp://www.lpl.univ-aix.fr/projects/romanseval
Identifier:ELRA-W0018
http://catalog.elra.info/product_info.php?products_id=535
Language:English
French
Italian
Language (ISO639):eng
fra
ita
Medium:CD-ROM
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0018
DateStamp:  1998-11-23
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: area_Europe country_FR country_GB country_IT dcmi_Text iso639_eng iso639_fra iso639_ita olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0018
Up-to-date as of: Fri May 5 1:18:24 EDT 2017