OLAC Record: aGender

OLAC Record
oai:catalogue.elra.info:ELRA-S0365

Metadata

Title: aGender

Access Rights: Rights available for: nonCommercialUse, commercialUse

Date Available (W3CDTF): 2013-12-17

Date Issued (W3CDTF): 2013-12-17

Date Modified (W3CDTF): 2013-12-17

Description: aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous speech. Native German speakers called a voice portal from their private phone, and read text + answered some open questions. The purpose of the corpus is the automatic detection of gender and/or age (7 mixed classes ranging from 7 - 80 years). The corpus contains the voices of 945 German speakers (approx. minimum of 100 speakers per class), each delivering 18 speech items in up to six different sessions. The time/date of the individual recordings sessions were not controlled, neither the total number of sessions per speaker.The audio signal was recorded over standard cell phones (GSM standard) and landline connections in 8000 Hz, 8 bit alaw format. Data were then expanded to 8000Hz, 16bit PCM (all 16 bits are valid!).The selection of speakers is approximately evenly distributed over the seven target classes, with class 1 also being balanced for gender. The read material consists of an altered version of the SpeechDat text material, containing short fixed and free text typical for automated call centers.A typical utterance is about 2 seconds in length, but there are also some utterances are between 3 and 6 seconds. In total, the corpus consists of 47 hours of speech. Two sets were defined on that data: A training set (81.5%) and a test set (175 speakers, 25 per class, 18.5%), each with disjunctive speaker sets. For the test set no class information is given in this corpus. Number of speakers in training/development set: 770Number of speakers in test set: 175Number of sessions in train/devel: 3625Number of utterances: 65241Number of training/development utterances: 53076Number of test utterances: 12165For a general information, see also:Felix Burkhardt, Martin Eckert, Wiebke Johannsen, Joachim Stegmann (2010): A Database of Age and Gender Annotated Telephone Speech. In: Proceedings of the LREC 2010, Malta.

Identifier: ELRA-S0365

ISLRN: 038-476-412-610-4

Identifier (URI): https://catalog.elra.info/en-us/repository/browse/ELRA-S0365/

Language: German

Language (ISO639): deu

Medium: Not specified

Publisher: ELRA (European Language Resources Association)

Type (DCMI): Sound

Type (OLAC): primary_text

OLAC Info

Archive: ELRA Catalogue of Language Resources

Description: http://www.language-archives.org/archive/catalogue.elra.info

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:catalogue.elra.info:ELRA-S0365

DateStamp: 2013-12-17

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: n.a. 2013. ELRA (European Language Resources Association).
Terms: area_Europe country_DE dcmi_Sound iso639_deu olac_primary_text

http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0365
Up-to-date as of: Wed Oct 1 0:56:13 EDT 2025

Metadata
Title:		aGender
Access Rights:		Rights available for: nonCommercialUse, commercialUse
Date Available (W3CDTF):		2013-12-17
Date Issued (W3CDTF):		2013-12-17
Date Modified (W3CDTF):		2013-12-17
Description:		aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous speech. Native German speakers called a voice portal from their private phone, and read text + answered some open questions. The purpose of the corpus is the automatic detection of gender and/or age (7 mixed classes ranging from 7 - 80 years). The corpus contains the voices of 945 German speakers (approx. minimum of 100 speakers per class), each delivering 18 speech items in up to six different sessions. The time/date of the individual recordings sessions were not controlled, neither the total number of sessions per speaker.The audio signal was recorded over standard cell phones (GSM standard) and landline connections in 8000 Hz, 8 bit alaw format. Data were then expanded to 8000Hz, 16bit PCM (all 16 bits are valid!).The selection of speakers is approximately evenly distributed over the seven target classes, with class 1 also being balanced for gender. The read material consists of an altered version of the SpeechDat text material, containing short fixed and free text typical for automated call centers.A typical utterance is about 2 seconds in length, but there are also some utterances are between 3 and 6 seconds. In total, the corpus consists of 47 hours of speech. Two sets were defined on that data: A training set (81.5%) and a test set (175 speakers, 25 per class, 18.5%), each with disjunctive speaker sets. For the test set no class information is given in this corpus. Number of speakers in training/development set: 770Number of speakers in test set: 175Number of sessions in train/devel: 3625Number of utterances: 65241Number of training/development utterances: 53076Number of test utterances: 12165For a general information, see also:Felix Burkhardt, Martin Eckert, Wiebke Johannsen, Joachim Stegmann (2010): A Database of Age and Gender Annotated Telephone Speech. In: Proceedings of the LREC 2010, Malta.
Identifier:		ELRA-S0365
Identifier:		ISLRN: 038-476-412-610-4
Identifier (URI):		https://catalog.elra.info/en-us/repository/browse/ELRA-S0365/
Language:		German
Language (ISO639):		deu
Medium:		Not specified
Publisher:		ELRA (European Language Resources Association)
Type (DCMI):		Sound
Type (OLAC):		primary_text
OLAC Info
Archive:		ELRA Catalogue of Language Resources
Description:		http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:catalogue.elra.info:ELRA-S0365
DateStamp:		2013-12-17
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		n.a. 2013. ELRA (European Language Resources Association).
Terms:		area_Europe country_DE dcmi_Sound iso639_deu olac_primary_text