OLAC Record
oai:catalogue.elra.info:ELRA-S0068

Metadata
Title:Portuguese SpeechDat(M) database
Abstract:This speech database contains the recordings of 1,001 speakers (453 males, 548 females) recorded over the Portuguese fixed telephone network. Each speaker uttered around 40 read and spontaneous items.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):1999-03-05
Date Issued (W3CDTF):2004-09-14
Date Modified (W3CDTF):2007-08-28
Description:Telephone
The Portuguese SpeechDat(M) database contains the recordings of 1,001 speakers (453 males, 548 females). This speech database was collected by Portugal Telecom within the European SpeechDat project. Speech signals are stored as sequences of 8 kHz, 8-bit A-law. Files are stored according to the file specifications proposed in the SpeechDat database format specification. The file formats and headers follow the SAM recommendations (header files separated from signal files). This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat format and content specifications. Each speaker uttered the following items: * 3 natural numbers * 1 isolated digit * 2 connected digits (1 credit card number, 1 telephone number) * 2 money amounts * 2 dates * 1 time phrase * 6 application words * 3 spelled-out words * 3 word spotting phrases * 9 sentences * 4 yes/no questions * 1 spontaneous date * 1 spontaneous time * 1 region name The approach adopted for speaker recruitment involved selecting speakers among the employees of Portugal Telecom (about 20,000) and their relatives. The company has a wide geographical coverage, thus guaranteeing a good representation of many regional accents. The following age distribution has been obtained: 12 speakers are under 16, 345 speakers are between 17 and 30, 436 speakers are between 31 and 45, 196 speakers are between 46 and 60 and 8 speakers are over 60; the age of two speakers is unknown and two others said they were born in 1996. A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
Identifier:ELRA-S0068
http://catalog.elra.info/product_info.php?products_id=538
Language:Portuguese
Language (ISO639):por
Medium:CD-ROM
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0068
DateStamp:  1999-03-05
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: area_Europe country_PT dcmi_Sound iso639_por olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0068
Up-to-date as of: Mon Feb 27 0:30:02 EST 2017