OLAC Record
oai:catalogue.elra.info:ELRA-S0283

Metadata
Title:Laboratory Conditions Czech Audio-Visual Speech Corpus
Abstract:This is an audio-visual speech database for training and testing of Czech audio-visual continuous speech recognition systems. The corpus consists of about 25 hours of audio-visual records of 65 speakers in laboratory conditions. Data collection was done with static illumination, and recorded subjects were instructed to remain static. The average speaker age was 22 years old. Speakers were asked to read 200 sentences each (50 common for all speakers and 150 specific to each speaker).
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2008-11-05
Date Issued (W3CDTF):2008-09-18
Date Modified (W3CDTF):2008-11-05
Description:Multimodal/Multimedia Resources
This is an audio-visual speech database for training and testing of Czech audio-visual continuous speech recognition systems. The corpus consists of about 25 hours of audio-visual records of 65 speakers in laboratory conditions. Data collection was done with static illumination, and recorded subjects were instructed to remain static. The average speaker age was 22 years old. Speakers were asked to read 200 sentences each (50 common for all speakers and 150 specific to each speaker). The average total length of recording per speaker is 23 minutes. All audio-visual data are transcribed (.trs files) and divided into sentences (one sentence per file). For each video file we get the description file containing information about the position and size of the region of interest. Acoustic data are stored in wave files using PCM format, sampling frequency 44kHz, resolution 16 bits. Each speaker?s acoustic data set represents about 140 MB of disk space (about 9 GB as a whole). Visual data are stored in video files (.avi format) using the digital video (DV) codec. Visual data per speaker take about 3 GB of disk (about 195 GB as a whole) and are stored on an IDE hard disk (NTFS format).
Identifier:ELRA-S0283
http://catalog.elra.info/product_info.php?products_id=1081
Language:Czech
Language (ISO639):ces
Publisher:ELRA (European Language Resources Association)
Type (DCMI):MovingImage
Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0283
DateStamp:  2008-11-05
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2008. ELRA (European Language Resources Association).
Terms: area_Europe country_CZ dcmi_MovingImage dcmi_Sound iso639_ces olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0283
Up-to-date as of: Mon Feb 27 0:31:29 EST 2017