![]() |
OLAC Record oai:catalogue.elra.info:ELRA-S0496 |
Metadata | ||
Title: | Chinese Kids Speech database (Lower Grade) | |
Access Rights: | Rights available for: nonCommercialUse, commercialUse | |
Date Available (W3CDTF): | 2025-07-18 | |
Date Issued (W3CDTF): | 2025-07-18 | |
Description: | The Chinese Kids Speech database (Lower Grade) contains the total recordings of 184 Chinese Kids speakers (98 males and 86 females), from 6 to 10 years’ old recorded in quiet rooms using smartphone. This database may be combined with the Chinese Kids Speech database (Upper Grade) also available in the ELRA Catalogue under reference ELRA-S0497.Number of speakers, utterances, duration and age are as follows :Number of speakers (Male/Female): 184 (98/86)Number of utterances (average): 237 utt/spkrTotal number of utterances: 43,667Age: from 6 to 10Total hours of data: 871,426 sentences were used. Recordings were made through smartphones and audio data stored in .wav files as sequences of 16KHz Mono, 16 bits, Linear PCM.Database・Audio data: WAV format, 16KHz, 16bit, mono (recorded with smartphone)・Transcription data: TSV format(tab-delimited), UTF-8 (without BOM) ), Line ending: LF・Size: 9.4GBAgeMaleFemaleTotal611617711819818294794736831011718Structure of database :├─ readme.txt├─ Chinese Kids Speech Database (Lower grade).pdfDescription document of the database├─ transcription(Lower).tsvTranscription└─ Low/directory of audio data └─ (1st/2nd/3rd)directory of version ID └─(0/1)directory of gender (0: male, 1: female) └─(audio_file)audio file (WAV format, 16KHz, 16bit, mono)Field information of “transcription(Lower).tsv” are as follows:Field numberContents0Script ID1Speaker ID2Audio file name3Transcription (in Chinese)File naming conventions of audio files are as follows:Field numberContentsDescriptionRemarks0Script IDFour digitsXXXX: four digits1Speaker IDThree digitsXXX: three digits2AgeTwo digitsFrom 06 to 103Gender0: male, 1: female4Utterance No.Three digitsSequential numbering starting from 001 within each speaker5Recording dateYYYYMMDDHHMM6Recording device nameRecording device nameEx. NTH-AN007OSOperating System info of recording deviceEx. android-118Durationduration in msecDuration of the actual spoken utteranceFiled separation character is “_”.For example, if the audio file name is “1318_373_09_1_010_202205041857_NTH-AN00_android-11_5480.wav “, this file has the following meaning:1318: script ID373: speaker ID09: age (nine years old)1: gender (female)010: utterance number202205041857: recording date (May 4, 2022, at 6:57 PM)NTH-AN00: recording device nameandroid-11: operating system info of recording device5480: duration of the actual spoken utterance (5,480 msec) | |
Identifier: | ELRA-S0496 | |
ISLRN: 369-011-475-593-5 | ||
Identifier (URI): | https://catalog.elra.info/en-us/repository/browse/ELRA-S0496/ | |
Language: | Chinese | |
Language (ISO639): | zho | |
Medium: | Not specified | |
Publisher: | ELRA (European Language Resources Association) | |
Type (DCMI): | Sound | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | ELRA Catalogue of Language Resources | |
Description: | http://www.language-archives.org/archive/catalogue.elra.info | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:catalogue.elra.info:ELRA-S0496 | |
DateStamp: | 2025-07-18 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | n.a. 2025. ELRA (European Language Resources Association). | |
Terms: | dcmi_Sound iso639_zho olac_primary_text |