OLAC Record
oai:catalogue.elra.info:ELRA-W0033

Metadata
Title:CRATER 2 Corpus
Abstract:The CRATER 2 parallel corpus is an extension of the CRATER corpus, available in the catalogue under reference W0003. It consists of 1,500,000 tokens for English and French and of 1,000,000 tokens for Spanish, with morphosyntactical annotations. CRATER 2 (ref. ELRA-W0033) includes CRATER (ref. ELRA-W0003)
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2002-01-31
Date Issued (W3CDTF):2004-09-14
Date Modified (W3CDTF):2012-05-23
Description:Written Corpora
The CRATER corpus was built upon the foundations of an earlier project, ET10/63, which was funded in the final phase of the Eurotra programme. The Corpus Resources and Terminology Extraction project (MLAP-93 20) extended the bilingual annotated English-French International Telecommunications Union corpus produced within ET10/63 to include Spanish. The CRATER 2 corpus was produced by the Department of Linguistics & Modern English Language, Lancaster University (United Kingdom) with funding from ELRA. The ELRA funding in turn was provided by the European Commission project LRsP Modern English Language, Lancaster University (United Kingdom) with funding from ELRA. The ELRA funding in turn was provided by the European Commission project LRsP&P (Language Resources Production & Packaging - LE4-8335). This project has enhanced the CRATER corpus, available under the reference ELRA-W0003 in the ELRA catalogue. CRATER 2 has significantly expanded the French/English component of the parallel corpus by increasing the size of the English/French corpus from 1,000,000 words per language to approximately 1,500,000 tokens per language. The offer consists of 1,500,000 tokens for English and French and of 1,000,000 tokens for Spanish, with morphosyntactical annotations (human-edited). CRATER 2 (ref. ELRA-W0033) includes CRATER (ref. ELRA-W0003).
Identifier:ELRA-W0033
http://catalog.elra.info/product_info.php?products_id=636
Language:French
English
Spanish, Castilian
Language (ISO639):fra
eng
spa
Medium:CD-ROM
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0033
DateStamp:  2002-01-31
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: area_Europe country_ES country_FR country_GB dcmi_Text iso639_eng iso639_fra iso639_spa olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0033
Up-to-date as of: Mon Oct 9 1:51:11 EDT 2017