OLAC Record
oai:catalogue.elra.info:ELRA-W0057

Metadata
Title:PANACEA English-French and English-Greek parallel corpus acquired for Environment domain
Abstract:This package consists of an English-French and English-Greek sentence-aligned parallel corpus from the Environment domain automatically acquired from the web during 2010 and 2011. It was acquired in the framework of the PANACEA project. Data and language pairs are split into training, test and development test sets.
Access Rights:Rights available for: Research Use
Date Available (W3CDTF):2013-01-30
Date Issued (W3CDTF):2012-10-31
Date Modified (W3CDTF):2013-01-30
Description:Written Corpora
The PANACEA English-French and English-Greek parallel corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework Programme. This package consists of an English-French and English-Greek sentence-aligned parallel corpus from the Environment domain automatically acquired from the web during 2010 and 2011. Data and language pairs are split into training, test and development test sets as follows:
filename              sentences   tokens  vocabulary
env.en-el.dev.el           1000    30510        6065
env.en-el.dev.en           1000    27865        4325
env.en-el.test.el          2000    63551        9263
env.en-el.test.en          2000    58073        6078
env.en-el.train.el         9653   267742       23011
env.en-el.train.en         9653   240822       14581
env.en-fr.dev.en           1392    41382        5888
env.en-fr.dev.fr           1392    49657        6386
env.en-fr.test.en          2000    58871        7076
env.en-fr.test.fr          2000    70744        7727
env.en-fr.train.en        10240   300786       15668
env.en-fr.train.fr        10240   362921       17485
All corpus files are provided as plain text in UTF-8 character encoding, one sentence per line, with line numbers identifying parallel sentences.
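The file layout described above can be read with a few lines of Python. The following is a minimal sketch, not part of the ELRA package. It assumes that "line numbers identifying parallel sentences" means that line N of one file is aligned with line N of its counterpart, and it assumes whitespace-separated tokens, which may not match the tokenization used to produce the published counts. The function names `read_parallel` and `corpus_stats` are hypothetical.

```python
from itertools import zip_longest


def read_parallel(src_path, tgt_path):
    """Yield (line_number, source_sentence, target_sentence) tuples.

    Assumption: line N of src_path is aligned with line N of tgt_path.
    """
    with open(src_path, encoding="utf-8") as src, \
         open(tgt_path, encoding="utf-8") as tgt:
        for number, (s, t) in enumerate(zip_longest(src, tgt), start=1):
            if s is None or t is None:
                # One file ended before the other, so the alignment is broken.
                raise ValueError(f"files differ in length at line {number}")
            yield number, s.rstrip("\r\n"), t.rstrip("\r\n")


def corpus_stats(path):
    """Return (sentences, tokens, vocabulary) for one corpus file.

    Assumption: tokens are separated by whitespace.
    """
    sentences = 0
    tokens = 0
    vocabulary = set()
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            words = line.split()
            sentences += 1
            tokens += len(words)
            vocabulary.update(words)
    return sentences, tokens, len(vocabulary)
```

For example, iterating over `read_parallel("env.en-fr.train.en", "env.en-fr.train.fr")` would yield one aligned English-French pair per line, and `corpus_stats("env.en-fr.train.en")` could be compared against the corresponding row of the table, bearing in mind that the counts may differ if the original tokenization was not whitespace-based.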
Identifier:ELRA-W0057
http://catalog.elra.info/product_info.php?products_id=1182
Language:English
French
Language (ISO639):eng
fra
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0057
DateStamp:  2013-01-30
GetRecord:  OAI-PMH request for simple DC format
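The GetRecord requests listed on this page follow the OAI-PMH convention of passing `verb`, `identifier` and `metadataPrefix` as query parameters. The sketch below shows how such a request URL could be assembled. The base URL is a hypothetical placeholder, since this record does not state the actual endpoint, and `get_record_url` is an illustrative helper name. The prefixes `oai_dc` (simple DC) and `olac` (OLAC format) are the conventional ones, but the prefixes a given repository supports should be checked against the repository itself.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the real OAI-PMH endpoint is not given in this record.
BASE_URL = "https://example.org/oai"


def get_record_url(base_url, identifier, metadata_prefix):
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Request for simple DC format, using the identifier from this record.
dc_url = get_record_url(BASE_URL, "oai:catalogue.elra.info:ELRA-W0057", "oai_dc")

# Request for OLAC format (assumes the repository exposes the "olac" prefix).
olac_url = get_record_url(BASE_URL, "oai:catalogue.elra.info:ELRA-W0057", "olac")
```

Fetching either URL with an HTTP client would be expected to return an XML response containing the record, provided the placeholder is replaced with the repository's real base URL.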

Search Info

Citation: n.a. 2012. ELRA (European Language Resources Association).
Terms: area_Europe country_FR country_GB dcmi_Text iso639_eng iso639_fra olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0057
Up-to-date as of: Sun Nov 12 1:45:28 EST 2017