OLAC Record

Title:German SpeechDat-Car
Abstract:This speech database contains the recordings in a car of 338 speakers, who uttered around 120 read and spontaneous items. Recordings have been made through 5 different channels, of which 4 were in-car microphones (1 close-talk microphone, 3 far-talk microphones) and 1 channel over the GSM network.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2002-04-04
Date Issued (W3CDTF):2004-09-14
Date Modified (W3CDTF):2009-08-05
The German SpeechDat-Car database comprises 338 German speakers recorded over the mobile telephone network. This database is partitioned into 17 DVDs and 1 CD. The speech databases made within the SpeechDat-Car project were validated by SPEX, the Netherlands, to assess their compliance with the SpeechDat-Car format and content specifications. The speech data files are in two formats. The signal data format for the in-car mobile platform recordings is 16 kHz, 16 bit, uncompressed unsigned integers in Intel format (lo-hi byte order); the channels are multiplexed in a single file, with the channel sequence being 0-1-2-3. The format of the fixed platform audio files is 8 kHz, 8 bit alaw encoding. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information. Each speaker uttered the following items: - 2 voice activation keywords - 1 sequence of 10 isolated digits - 7 connected digits : 1 sheet number (4+ digits), 1 spontaneous telephone number (9-11 digits), 3 read telephone numbers, 1 credit card number (16 digits), 1 PIN code (6 digits) - 3 dates : 1 spontaneous date (e.g. birthday), 1 prompted date, 1 relative or general date expression - 2 word spotting phrases using an application word (embedded) - German data phrases - 4 isolated digits - 7 spelled words : 1 spontaneous (own forename or surname), 1 spelling of directory city name, 4 real word/name, 1 artificial name for coverage - 1 money amount - 1 natural number - 7 directory assistance names : 1 spontaneous (own forename or surname), 1 city of birth / growing up (spontaneous), 2 most frequent cities, 2 most frequent company/agency, 1 "forename surname" - 9 phonetically rich sentences - 2 time phrases : 1 time of day (spontaneous), 1 time phrase (word style) - 4 phonetically rich words - 69 application words: 13 mobile phone application words, 22 IVR function keywords, 32 car products keywords, 2 additional common application words - 2 additional language dependent keywords - spontaneous sentences The following age distribution has been obtained: 187 speakers are between 16 and 30, 72 speakers are between 31 and 45, 70 speakers are between 46 and 60, and 9 speakers are over 60. A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
Language (ISO639):deu
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0122
DateStamp:  2002-04-04
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: area_Europe country_DE dcmi_Sound iso639_deu olac_primary_text

Up-to-date as of: Mon Jan 14 13:32:13 EST 2019