OLAC Record

Title:American English SpeechDat-Car
Abstract:The American English SpeechDat-Car database comprises recordings in a car of 314 speakers (150 males, 164 females), who uttered around 120 read and spontaneous items. Recordings have been made through 5 different channels, of which 4 were in-car microphones (1 close-talk microphone, 3 far-talk microphones) and 1 channel over the GSM network.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2001-11-05
Date Issued (W3CDTF):2005-05-09
Date Modified (W3CDTF):2007-02-22
The American English SpeechDat-Car database contains the recordings of 314 American English speakers (150 males, 164 females) recorded over the mobile telephone network. This database is partitioned into 94 CDs (or 13 DVDs).

The speech data files are in two formats. Four of the microphones were recorded on the computer in the trunk of the car; these recordings are stored as uncompressed 16 kHz, 16-bit audio. The fifth microphone was connected to the cell phone and was recorded on a remote machine. The U.S. telephone network uses a digital encoding of 8 bit, 8 kHz, with mu-law compression. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.

This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat-Car format and content specifications.

Each speaker uttered the following items:
* 2 voice activation keywords
* 1 sequence of 10 isolated digits
* 7 connected digits (1 sheet number of 5 digits, 1 spontaneous telephone number, 3 read telephone numbers, 1 credit card number of 14/16 digits, 1 PIN code of 6 digits)
* 3 dates (1 spontaneous date, e.g. birthday, 1 prompted date, 1 relative or general date expression)
* 2 word spotting phrases using an embedded application word
* 4 isolated digits
* 7 spelled words (1 spontaneous, e.g. own forename or surname, 1 directory city name, 4 real words/names, 1 artificial name for coverage)
* 1 money amount
* 1 natural number
* 7 directory assistance names (1 spontaneous, e.g. own forename or surname, 1 city of birth/growing up, 2 most frequent cities, 2 most frequent companies/agencies, 1 "forename surname")
* 9 phonetically rich sentences
* 2 time phrases (1 spontaneous time of day, 1 word style time phrase)
* 4 phonetically rich words
* 67 application words (13 mobile phone application words, 22 IVR function keywords, 32 car products keywords)
* 2 additional language dependent keywords
* spontaneous sentences for the last 100 speakers

The following age distribution has been obtained: 130 speakers are between 16 and 30, 101 speakers are between 31 and 45, 79 speakers are between 46 and 60, and 4 speakers are over 60.

A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
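The description above says the telephone channel uses 8-bit, 8 kHz mu-law encoding, while the in-car channels are 16-bit linear. To compare the two, a mu-law byte has to be expanded to a linear sample first. The following is a minimal sketch of the standard G.711 mu-law expansion, not code shipped with the database. The function names are illustrative, and it assumes the caller already holds the raw mu-law bytes; the record does not describe the file layout, so reading the files is left out.

```python
def mulaw_byte_to_pcm16(byte_value: int) -> int:
    """Expand one G.711 mu-law byte to a signed 16-bit linear sample."""
    u = ~byte_value & 0xFF           # mu-law bytes are stored bit-inverted
    sign = u & 0x80                  # top bit: set means negative
    exponent = (u >> 4) & 0x07       # 3-bit segment number
    mantissa = u & 0x0F              # 4-bit step within the segment
    # 0x84 is the G.711 bias added at encoding time and removed here.
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if sign else magnitude


def decode_mulaw(data: bytes) -> list[int]:
    """Expand a buffer of mu-law bytes (assumed headerless) to linear samples."""
    return [mulaw_byte_to_pcm16(b) for b in data]
```

For example, `decode_mulaw(bytes([0xFF, 0x80, 0x00]))` yields silence, the largest positive value, and the largest negative value that mu-law can represent.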
Language (ISO639):eng
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0115
DateStamp:  2001-11-05
GetRecord:  OAI-PMH request for simple DC format
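
The GetRecord links above correspond to OAI-PMH requests for this identifier. As a rough sketch of how such a request URL is formed, the snippet below assembles the three query arguments that OAI-PMH defines for GetRecord (`verb`, `identifier`, `metadataPrefix`). The base URL `https://example.org/oai` is a placeholder, since this record does not state the repository endpoint, and the function name is illustrative. `oai_dc` is the prefix OAI-PMH reserves for simple Dublin Core; the OLAC format is assumed to use the prefix `olac`.

```python
from urllib.parse import urlencode


def build_getrecord_url(base_url: str, identifier: str,
                        metadata_prefix: str = "oai_dc") -> str:
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Placeholder endpoint: replace with the repository's real base URL.
url = build_getrecord_url("https://example.org/oai",
                          "oai:catalogue.elra.info:ELRA-S0115")
```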

Search Info

Citation: n.a. 2005. ELRA (European Language Resources Association).
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text

Up-to-date as of: Sun Jan 21 1:36:47 EST 2018