OLAC Record
oai:lindat.mff.cuni.cz:11234/1-3739

Metadata
Title:HaCzech: Dataset of Handwritten Czech
Bibliographic Citation:http://hdl.handle.net/11234/1-3739
Creator:Procházka, Štěpán
Straka, Milan
Date (W3CDTF):2021-07-26T12:50:14Z
Date Available:2021-07-26T12:50:14Z
Description:The dataset of handwritten Czech text lines, sourced from two chronicles (municipal chronicles 1931-1944, school chronicles 1913-1933). The dataset comprises 25k lines machine-extracted from scanned pages, and provides manual annotation of text contents for a subset of size 2k.
Identifier (URI):http://hdl.handle.net/11234/1-3739
Language:Czech
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
http://creativecommons.org/licenses/by-nc-sa/4.0/
Subject:htr
ocr
manuscripts
chronicles
handwriting
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-3739
DateStamp:  2021-07-26
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Procházka, Štěpán; Straka, Milan. 2021. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-3739
Up-to-date as of: Thu Oct 5 0:41:23 EDT 2023