OLAC Record
oai:catalogue.elra.info:ELRA-S0349

Metadata
Title: Quaero Broadcast News Extended Named Entity corpus
Abstract: This corpus consists of the manual annotation of (i) the ESTER 2 (see also ELRA-S0338) manual transcription corpus and (ii) the Quaero Speech Recognition Evaluation corpus (manual and automatic transcriptions coming from 3 different ASR systems). The corpus is fully manually annotated according to the Quaero extended and structured named entity definition.
Access Rights: Rights available for: Research Use, Commercial Use
Date Available (W3CDTF): 2013-02-13
Date Issued (W3CDTF): 2013-02-07
Date Modified (W3CDTF): 2013-02-13
Description: Broadcast Resources
The Quaero Broadcast News Extended Named Entity corpus consists of the manual annotation of (i) the ESTER 2 corpus (see ELRA-S0338) and (ii) the Quaero Speech Recognition Evaluation corpus (manual and automatic transcriptions coming from 3 different ASR systems). The first part is the training corpus and the second one is the test corpus. The corpus is fully manually annotated according to the Quaero extended and structured named entity definition, which differentiates entity "types" and "components".
The training part of the corpus is composed only of broadcast news data and contains 188 shows, 1,291,225 words, 113,885 types and 146,405 components. The test corpus is composed of both broadcast news and broadcast conversation data and contains 18 shows, 108,010 words, 5,523 types and 8,902 components.
The Quaero Broadcast News Extended Named Entity Corpus consists of:
- a manually transcribed and fully annotated radio broadcast news and broadcast conversation corpus amounting to about 1.5 million words,
- a sub-corpus serving as a mini-reference corpus for quality evaluation purposes,
- tools developed for annotation and evaluation,
- guidelines.
Identifier: ELRA-S0349
http://catalog.elra.info/product_info.php?products_id=1195
Language: French
Language (ISO639): fra
Publisher: ELRA (European Language Resources Association)
Type (DCMI): Sound
Type (OLAC): primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0349
DateStamp:  2013-02-13
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2013. ELRA (European Language Resources Association).
Terms: area_Europe country_FR dcmi_Sound iso639_fra olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-S0349
Up-to-date as of: Wed Mar 29 3:50:58 EDT 2017