OLAC Record
oai:catalogue.elra.info:ELRA-S0275

Metadata
Title:Slovenian BNSI Broadcast News Speech Corpus
Abstract:This speech database consists of TV news shows (both evening news, "TV Dnevnik", and late night news, "Odmevi"), from the archive of a Slovenian national broadcaster RTV Slovenia. The recordings took place between June 1999 and May 2003. The database comprises a total of 36 hours of recordings, transcribed and manually checked using the Transcriber tool. 1,565 speakers were recorded (1,069 males, 477 females, 19 unspecified).
Access Rights:Rights available for: Commercial Use, Research Use
Date Available (W3CDTF):2008-04-22
Date Issued (W3CDTF):2008-04-21
Date Modified (W3CDTF):2008-04-22
Description:Broadcast Resources
This speech database consists of TV news shows (both evening news, "TV Dnevnik", and late night news, "Odmevi"), from the archive of a Slovenian national broadcaster RTV Slovenia. The recordings took place between June 1999 and May 2003. The database comprises a total of 36 hours of recordings (training set: 30 hours, development set: 3 hours, test set: 3 hours), transcribed and manually checked using the Transcriber tool. Transcription conventions are based on documents defined by LDC, LIMSI and COST 278 BN SIG. The transcriptions contain 268,000 words, of which 37,000 are distinct. The transcription files contain orthographic transcriptions, information on acoustic conditions and background, and segmentation at turn and section level. The topic is described and marked (25 topic categories) for each section of a news show. Speaker information consists of gender, speaking style, accent and origin. 1,565 speakers were recorded (1,069 males, 477 females, 19 unspecified). The speech signal format is: 16 kHz, 16-bit, WAV, 1 channel.
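The signal format stated above (16 kHz, 16-bit, WAV, 1 channel) can be checked against a file header before processing. The following is a minimal sketch using Python's standard `wave` module; the function name `matches_bnsi_format` and the constant names are illustrative assumptions, not part of the corpus distribution, and the check assumes the files are plain PCM WAV.

```python
import wave

# Expected format as stated in the record: 16 kHz, 16-bit, 1 channel.
EXPECTED_RATE_HZ = 16000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits per sample
EXPECTED_CHANNELS = 1


def matches_bnsi_format(path):
    """Return True if the WAV header at `path` matches the stated format.

    Hypothetical helper: only the header is inspected, not the audio data.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )
```

A file that fails this check may have been resampled or converted after delivery.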
Identifier:ELRA-S0275
http://catalog.elra.info/product_info.php?products_id=1064
Language:Slovenian
Language (ISO639):slv
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0275
DateStamp:  2008-04-22
GetRecord:  OAI-PMH request for simple DC format
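The GetRecord entries above refer to OAI-PMH requests for this record. As a rough sketch of how such a request URL is formed under the OAI-PMH protocol (`verb`, `identifier` and `metadataPrefix` query parameters), the example below builds one with Python's standard library. The base URL `https://example.org/oai` is a placeholder, since the actual endpoint is not given in this record, and `get_record_url` is a hypothetical helper. `oai_dc` is the protocol's standard prefix for simple Dublin Core; `olac` as the prefix for the OLAC format is an assumption.

```python
from urllib.parse import urlencode


def get_record_url(base_url, identifier, metadata_prefix):
    """Build an OAI-PMH GetRecord URL (hypothetical helper)."""
    query = urlencode(
        {
            "verb": "GetRecord",
            "identifier": identifier,
            "metadataPrefix": metadata_prefix,
        }
    )
    return f"{base_url}?{query}"


# Placeholder endpoint: replace with the repository's real base URL.
BASE_URL = "https://example.org/oai"
RECORD_ID = "oai:catalogue.elra.info:ELRA-S0275"

simple_dc_url = get_record_url(BASE_URL, RECORD_ID, "oai_dc")
olac_url = get_record_url(BASE_URL, RECORD_ID, "olac")  # prefix is an assumption
```

The colons in the identifier are percent-encoded by `urlencode`, which is valid in a query string.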

Search Info

Citation: n.a. 2008. ELRA (European Language Resources Association).
Terms: area_Europe country_SI dcmi_Sound iso639_slv olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0275
Up-to-date as of: Sun Nov 12 1:45:08 EST 2017