OLAC Record: SPINA Corpus ("Robots Commands")

OLAC Record
oai:catalogue.elra.info:ELRA-S0044

Metadata

Title: SPINA Corpus ("Robots Commands")

Access Rights: Rights available for: nonCommercialUse, commercialUse

Date Available (W3CDTF): 1997-10-28

Date Issued (W3CDTF): 1997-10-28

Date Modified (W3CDTF): 2005-05-02

Description: This German corpus contains read speech of 22 different speakers (6 male, 16 female). The corpus consists of 10 robot command sentences and 62 robot command words. Each speaker reads the whole corpus 5 times, except one speaker who reads the sentence corpus 16 times and the word corpus 51 times. The speakers were recorded at two different sites in Germany (University of Goettingen, University of Bochum).The corpus contains a total of 10,810 recorded utterances.All speakers are within the age of 25-30. Two speakers are non-native speakers. One file gives information about the speakers (speaker ID, recording site, sex).The task for the speaker was to read carefully but fluently. If an error occurred, the recording was interrupted by the supervisor and the sentence was repeated. The signal files are raw files without any header, 16 bit per sample, linear, most significant byte first, 16 kHz sample frequency.The orthography of the corpus is given in two distinct files which contain the prompted words and the prompted sentences as an ordered list.The recording conditions are as follows:Microphone:AKG acoustics, C414B-TL, condensator microphone omnidirectional, built-in attenuator and high pass filter switched off, distance to mouth 50 cm.Environment: Studio Quality, echo cancelled room, about 121 qqmPreamplifier: John Hardy, M-1Sampling rate: 48 kHz to DAT recorder, filtered to 16 kHzResolution: 16 Bit, most significant byte firstThe speech data were digitally filtered to 8 kHz cut-off frequency and downsampled to 16 kHz.The corpus consists of 1 volume, total size 266,361 KB uncompressed data. The signal of each utterance is stored in a separate file. Symbolic information like segmentations or labelling (e.g. Phonological Segmentation of words or Word Segmentation of sentences) are stored in files with the same prefix but with different extensions.

Identifier: ELRA-S0044

ISLRN: 259-624-258-966-0

Identifier (URI): https://catalog.elra.info/en-us/repository/browse/ELRA-S0044/

Language: German

Language (ISO639): deu

Medium: Not specified

Publisher: ELRA (European Language Resources Association)

Type (DCMI): Sound

Type (OLAC): primary_text

OLAC Info

Archive: ELRA Catalogue of Language Resources

Description: http://www.language-archives.org/archive/catalogue.elra.info

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:catalogue.elra.info:ELRA-S0044

DateStamp: 1997-10-28

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: n.a. 1997. ELRA (European Language Resources Association).
Terms: area_Europe country_DE dcmi_Sound iso639_deu olac_primary_text

http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0044
Up-to-date as of: Wed Oct 1 0:54:41 EDT 2025

Metadata
Title:		SPINA Corpus ("Robots Commands")
Access Rights:		Rights available for: nonCommercialUse, commercialUse
Date Available (W3CDTF):		1997-10-28
Date Issued (W3CDTF):		1997-10-28
Date Modified (W3CDTF):		2005-05-02
Description:		This German corpus contains read speech of 22 different speakers (6 male, 16 female). The corpus consists of 10 robot command sentences and 62 robot command words. Each speaker reads the whole corpus 5 times, except one speaker who reads the sentence corpus 16 times and the word corpus 51 times. The speakers were recorded at two different sites in Germany (University of Goettingen, University of Bochum).The corpus contains a total of 10,810 recorded utterances.All speakers are within the age of 25-30. Two speakers are non-native speakers. One file gives information about the speakers (speaker ID, recording site, sex).The task for the speaker was to read carefully but fluently. If an error occurred, the recording was interrupted by the supervisor and the sentence was repeated. The signal files are raw files without any header, 16 bit per sample, linear, most significant byte first, 16 kHz sample frequency.The orthography of the corpus is given in two distinct files which contain the prompted words and the prompted sentences as an ordered list.The recording conditions are as follows:Microphone:AKG acoustics, C414B-TL, condensator microphone omnidirectional, built-in attenuator and high pass filter switched off, distance to mouth 50 cm.Environment: Studio Quality, echo cancelled room, about 121 qqmPreamplifier: John Hardy, M-1Sampling rate: 48 kHz to DAT recorder, filtered to 16 kHzResolution: 16 Bit, most significant byte firstThe speech data were digitally filtered to 8 kHz cut-off frequency and downsampled to 16 kHz.The corpus consists of 1 volume, total size 266,361 KB uncompressed data. The signal of each utterance is stored in a separate file. Symbolic information like segmentations or labelling (e.g. Phonological Segmentation of words or Word Segmentation of sentences) are stored in files with the same prefix but with different extensions.
Identifier:		ELRA-S0044
Identifier:		ISLRN: 259-624-258-966-0
Identifier (URI):		https://catalog.elra.info/en-us/repository/browse/ELRA-S0044/
Language:		German
Language (ISO639):		deu
Medium:		Not specified
Publisher:		ELRA (European Language Resources Association)
Type (DCMI):		Sound
Type (OLAC):		primary_text
OLAC Info
Archive:		ELRA Catalogue of Language Resources
Description:		http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:catalogue.elra.info:ELRA-S0044
DateStamp:		1997-10-28
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		n.a. 1997. ELRA (European Language Resources Association).
Terms:		area_Europe country_DE dcmi_Sound iso639_deu olac_primary_text