OLAC Record
oai:catalogue.elra.info:ELRA-S0293

Metadata
Title:The HIWIRE database, a noisy and non-native English speech corpus for cockpit communication
Access Rights: Rights available for: nonCommercialUse, commercialUse
Date Available (W3CDTF):2008-11-25
Date Issued (W3CDTF):2008-11-25
Date Modified (W3CDTF):2008-11-25
Description:This database has been collected and packaged under the auspices of the IST-EU STREP project HIWIRE (Human Input that Works In Real Environments). The database was designed to be used as a tool for development and test of speech processing and recognition techniques dealing with robust non-native speech recognition.The database contains 8,099 English utterances pronounced by non-native speakers (31 French, 20 Greek, 20 Italian, and 10 Spanish speakers). The collected utterances correspond to human input in a command and control aeronautics application. The data was recorded in studio with a close-talking microphone and real noise recorded in an airplane cockpit was artificially added to the data. The signals are provided in clean (studio recordings with close talking microphone), low, mid and high noise conditions. The three noise levels correspond approximately to signal-to-noise ratios of 10dB, 5dB and -5 dB respectively.Clean audio data has been recorded in different office rooms using a close-talking microphone for lowest ambient acoustic effects (Plantronics USB-45). The used sampling frequency is 16 kHz and data is stored in Windows PCM WAV 16 bits mono format.Recordings correspond to prompts extracted from an aeronautic command and control application. A total of 8,099 utterances have been recorded corresponding to 81 speakers pronouncing 100 utterances each. The speaker distribution is as follows:
Country# Speakers# Utterances
France31 (38.3%)3100
Greece20 (24.7%)2000
Italy20 (24.7%)2000
Spain10 (12.3%)999
Total818099
To generate the noisy data utterances, the speech level is maintained and only the noise amplitude is modified to obtain the desired SNR. The noise amplitude is adjusted to obtain three different averaged SNR values of 10dB, 5dB and -5dB which are referenced as low noise (LN), mid noise (MN) and high noise (HN) conditions. For each given condition the noise level remains constant.The speech data are pcm-wav files (16kHz / 16 bits / mono) stored on one DVD. The total size is 3.03 Gbytes for 33.053 files.
Identifier:ELRA-S0293
ISLRN: 934-733-835-065-0
Identifier (URI):https://catalog.elra.info/en-us/repository/browse/ELRA-S0293/
Language:English
Language (ISO639):eng
Medium:Not specified
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0293
DateStamp:  2008-11-25
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2008. ELRA (European Language Resources Association).
Terms: area_Europe country_GB dcmi_Sound iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0293
Up-to-date as of: Fri Mar 8 7:25:56 EST 2024