OLAC Record
oai:catalogue.elra.info:ELRA-S0228-01

Metadata
Title:Mandarin Chinese Speech Synthesis Corpus (Basic Corpus)
Abstract:This corpus contains the recordings of 1 native Chinese speaker (female). The corpus is composed of 20 texts with 109,227 words and has been proofread manually. The corpus contents include: phrases, digit strings, letter strings, uncommon words, neutral tone, final retroflexion, Latin alphabet, interrogative sentences, 282 English words. The speaker has been recorded in a professional recording studio over 2 channels: microphone and glottis wave (fundamental frequency) signals for a total of 18.2 hours. Speech samples are stored as sequences of 16-bit 44,1 kHz PCM on two channels. The total data size is 5.67 Gb for a total of 12,679 files. The data is encoded in GB-2312 format. The transcriptions include labels for four-class pause boundaries. This database is aimed to be used within text-to-speech and speech synthesis applications.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2007-01-17
Date Issued (W3CDTF):2006-12-20
Date Modified (W3CDTF):2007-01-17
Description:Desktop/Microphone
This corpus contains the recordings of 1 native Chinese speaker (female). The corpus is composed of 20 texts with 109,227 words and has been proofread manually. The corpus contents include: phrases, digit strings, letter strings, uncommon words, neutral tone, final retroflexion, Latin alphabet, interrogative sentences, 282 English words. The speaker has been recorded in a professional recording studio over 2 channels: microphone and glottis wave (fundamental frequency) signals for a total of 18.2 hours. Speech samples are stored as sequences of 16-bit 44,1 kHz PCM on two channels. The total data size is 5.67 Gb for a total of 12,679 files. The data is encoded in GB-2312 format. The transcriptions include labels for four-class pause boundaries. This database is aimed to be used within text-to-speech and speech synthesis applications.
Identifier:ELRA-S0228-01
http://catalog.elra.info/product_info.php?products_id=922
Language:Chinese
Language (ISO639):zho
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0228-01
DateStamp:  2007-01-17
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2006. ELRA (European Language Resources Association).
Terms: dcmi_Sound iso639_zho olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0228-01
Up-to-date as of: Fri Jun 23 1:05:34 EDT 2017