OLAC Record
oai:catalogue.elra.info:ELRA-S0250

Metadata
Title:TC-STAR English-Spanish Training Corpora for Machine Translation: Aligned Final Text Editions of EPPS
Abstract:This corpus consists of respectively 34 million (English) and 38 million (Spanish) running words of bilingual sentence segmented and aligned texts in English and Spanish obtained from the Final Text Editions provided by the European Parliament (from April 1996 to Sept. 2004, Dec. 2004 to May 2005, and Dec. 2005 to May 2006. The data is accompanied by tools for further preprocessing.
Access Rights:Rights available for: Commercial Use, Research Use
Date Available (W3CDTF):2007-11-15
Date Issued (W3CDTF):2007-11-15
Date Modified (W3CDTF):2007-11-15
Description:Speech Related
TC-STAR is a European integrated project focusing on all core technologies for Speech-to-Speech Translation (SST): Automatic Speech Recognition (ASR), Spoken Language Translation (SLT), and Text to Speech Synthesis (TTS). This corpus consists of respectively 34 million (English) and 38 million (Spanish) running words of bilingual sentence segmented and aligned texts in English and Spanish obtained from the Final Text Editions provided by the European Parliament (http://www.europarl.europa.eu) from April 1996 to Sept. 2004, Dec. 2004 to May 2005, and Dec. 2005 to May 2006. The data is accompanied by tools for further preprocessing.
Identifier:ELRA-S0250
http://catalog.elra.info/product_info.php?products_id=1033
Language:English
Spanish, Castilian
Language (ISO639):eng
spa
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0250
DateStamp:  2007-11-15
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2007. ELRA (European Language Resources Association).
Terms: area_Europe country_ES country_GB dcmi_Sound iso639_eng iso639_spa olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0250
Up-to-date as of: Fri Jun 23 1:05:55 EDT 2017