OLAC Record
oai:catalogue.elra.info:ELRA-S0381

Metadata
Title:TRAD Pashto Broadcast News Speech Corpus
Abstract:This corpus contains 108 hours of transcribed broadcast news recordings, covering more than 1,000 speakers. Transcriptions are provided together with the audio files and include about 46,000 segments and 1.1M words.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2016-04-06
Date Issued (W3CDTF):2016-04-06
Date Modified (W3CDTF):2016-04-06
Description:Broadcast Resources
This corpus contains transcribed broadcast news recordings in Pashto. Recordings were collected from 5 sources: Ashna TV, Azadi Radio, Deewa Radio, Mashaal Radio and Shamshad TV. The corpus contains 108 hours of recordings covering more than 1,000 speakers. Transcriptions are provided together with the audio files and include about 46,000 segments and 1.1M words. Pashto is an Indo-Iranian language spoken by the Pashtun people, mainly in Pakistan and Afghanistan. This corpus was produced by ELDA within the PEA TRAD project supported by the French Ministry of Defence (DGA).
Identifier:ELRA-S0381
http://catalog.elra.info/product_info.php?products_id=1265
Language:Pushto
Language (ISO639):pus
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0381
DateStamp:  2016-04-06
GetRecord:  OAI-PMH request for simple DC format
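
The GetRecord entries above refer to OAI-PMH requests for this record. As a rough sketch only, such a request URL could be assembled as shown below. The `verb`, `identifier` and `metadataPrefix` parameters come from the OAI-PMH protocol, and `oai_dc` is its prefix for simple DC. The base URL is a placeholder, not the archive's real endpoint, and the `olac` prefix for the OLAC format is an assumption; `build_getrecord_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Placeholder endpoint: the archive's actual OAI-PMH base URL is not given in this record.
BASE_URL = "https://example.org/oai"

# Identifier taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:catalogue.elra.info:ELRA-S0381"


def build_getrecord_url(base_url, identifier, metadata_prefix):
    """Return an OAI-PMH GetRecord request URL for one record (hypothetical helper)."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# Simple DC request ("oai_dc" is the prefix defined by OAI-PMH for unqualified Dublin Core).
dc_url = build_getrecord_url(BASE_URL, OAI_IDENTIFIER, "oai_dc")

# OLAC-format request (the "olac" prefix is assumed here, not confirmed by this record).
olac_url = build_getrecord_url(BASE_URL, OAI_IDENTIFIER, "olac")

print(dc_url)
print(olac_url)
```

Fetching either URL from a real endpoint would return an XML response containing the record. The network call is left out here because the endpoint is not known.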

Search Info

Citation: n.a. 2016. ELRA (European Language Resources Association).
Terms: dcmi_Sound iso639_pus olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0381
Up-to-date as of: Tue Aug 27 21:15:19 EDT 2019