OLAC Record
oai:catalogue.elra.info:ELRA-W0053

Metadata
Title:Catalan-Spanish Parallel Corpus
Abstract:This corpus contains more than 100 million words and it contains 10 years of bilingual articles from ?El Peri?dico de Catalunya?. The data are aligned at sentence level and stored in text files, in a one sentence per line basis. The data are provided in plain text, with no encoding whatsoever.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2010-06-18
Date Issued (W3CDTF):2010-06-18
Date Modified (W3CDTF):2010-06-18
Description:Written Corpora
This corpus contains more than 100 million words and it contains 10 years of bilingual articles from ?El Peri?dico de Catalunya?. Both language data are rather close as the Catalan text is a translation of the Spanish one, partly achieved by means of Machine translation and then post-edited. The data are aligned at sentence level and stored in text files, in a one sentence per line basis. The data are provided in plain text, with no encoding whatsoever.
Identifier:ELRA-W0053
http://catalog.elra.info/product_info.php?products_id=1122
Language:Spanish, Castilian
Catalan, Valencian
Language (ISO639):spa
cat
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0053
DateStamp:  2010-06-18
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2010. ELRA (European Language Resources Association).
Terms: area_Europe country_ES dcmi_Text iso639_cat iso639_spa olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0053
Up-to-date as of: Fri May 5 1:20:21 EDT 2017