OLAC Record

Title:UMC 0.1: Czech-Russian-English Multilingual Corpus
Bibliographic Citation:http://hdl.handle.net/11858/00-097C-0000-0001-4909-7
Creator:Klyueva, Natalia
Bojar, Ondřej
Date (W3CDTF):2011-06-28T10:42:32Z
Date Available:2008-10-02T00:00:00Z
Description:UMC 0.1 Czech-English-Russian is a multilingual parallel corpus of texts in Czech, Russian and English languages with automatic pairwise sentence alignments. The primary aim of UMC is to extend the set of languages covered by the corpus CzEng mainly for the purposes of machine translation. All the texts were downloaded from a single source — The Project Syndicate (Copyright: Project Syndicate 1995-2008), which contains a huge collection of high-quality news articles and commentaries. We were given the permission to use the texts for research and non-commercial purposes.
FP6-IST-5-034291-STP (EuroMatrix)
Identifier (URI):http://hdl.handle.net/11858/00-097C-0000-0001-4909-7
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Attribution-NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND 3.0)
Subject:multi-language corpus
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-4909-7
DateStamp:  2021-03-22
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Klyueva, Natalia; Bojar, Ondřej. 2011. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_primary_text

Up-to-date as of: Tue Mar 23 7:05:30 EDT 2021