OLAC Record
oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-4916-9

Metadata
Title:CzEng 0.7
Bibliographic Citation:http://hdl.handle.net/11858/00-097C-0000-0001-4916-9
Creator:Bojar, Ondřej
Žabokrtský, Zdeněk
Češka, Pavel
Beňa, Peter
Janíček, Miroslav
Date (W3CDTF):2011-06-28T16:13:23Z
Date Available:2009-11-02T10:32:27Z
Description:CzEng 0.7 is a Czech-English parallel corpus compiled at the Institute of Formal and Applied Linguistics (ÚFAL), Charles University, Prague. The corpus contains no manual annotation. It is limited only to texts which have been already available in an electronic form and which are not protected by authors' rights in the Czech Republic. The main purpose of the corpus is to support Czech-English and English-Czech machine translation research with the necessary data. CzEng 0.7 consists of a large set of parallel textual documents mainly from the fields of European law, information technology, and fiction, all of them converted into a uniform XML-based file format and provided with automatic sentence alignment.
Identifier (URI):http://hdl.handle.net/11858/00-097C-0000-0001-4916-9
Is Replaced By (URI):http://hdl.handle.net/11234/1-1458
Language:Czech
English
Language (ISO639):ces
eng
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
http://creativecommons.org/licenses/by-nc-sa/3.0/
Subject:parallel corpus
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-4916-9
DateStamp:  2021-06-29
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Bojar, Ondřej; Žabokrtský, Zdeněk; Češka, Pavel; Beňa, Peter; Janíček, Miroslav. 2011. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ country_GB dcmi_Text iso639_ces iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11858/00-097C-0000-0001-4916-9
Up-to-date as of: Thu Oct 5 0:38:47 EDT 2023