OLAC Record
oai:www.ldc.upenn.edu:LDC2001S91

Metadata
Title:1997 HUB4 Broadcast News Evaluation Non-English Test Material
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Fiscus, Jonathan G., et al. 1997 HUB4 Broadcast News Evaluation Non-English Test Material LDC2001S91. Web Download. Philadelphia: Linguistic Data Consortium, 2001
Contributor:Fiscus, Jonathan G.
Garofolo, John S.
Przybocki, Mark
Fisher, William
Pallett, David
Date (W3CDTF):2001
Description:*Introduction* 1997 HUB4 Broadcast News Evaluation Non-English Test Material was developed by the Linguistic Data Consortium. It contains the evaluation test material used in the 1997 DARPA/NIST Continuous Speech Recognition Broadcast News HUB4 Non-English Benchmark Test administered by the NIST Spoken Natural Language Processing Group. *Data* The test material is contained in two SPHERE-formatted waveform files. The file h4ne97sp.sph (set1) contains one hour of Spanish broadcast news excerpts from 1997. The file h4ne97ma.sph (set2) contains one hour of Mandarin broadcast news excerpts from 1997. Each file should be recognized separately per the HUB4 Non English Evaluation Specification. Note: 1997 HUB4 English evaluation material is contained in 1997 HUB4 English Evaluation Speech and Transcripts LDC2002S11. *Updates* There are no updates at this time. *Additional Licensing Instructions* This 'members-only' corpus is available to current members, who can request the data at the listed reduced-license fee. Contact ldc@ldc.upenn.edu for information about becoming a member.
Extent:Corpus size: 232448 KB
Identifier:LDC2001S91
https://catalog.ldc.upenn.edu/LDC2001S91
ISBN: 1-58563-182-5
ISLRN: 639-420-515-411-0
DOI: 10.35111/4b0m-6182
Language:Spanish
Mandarin Chinese
Language (ISO639):spa
cmn
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2001S91
Rights Holder:Portions © 1997 China Central TV, KAZN-AM, Televisa, Univision Network Limited Partnership, © 1997, 2001 Trustees of the University of Pennsylvania
Type (DCMI):Sound
Type (OLAC):primary_text
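
The Description field above says the test material ships as two SPHERE-formatted waveform files (h4ne97sp.sph and h4ne97ma.sph). As a rough illustration only, the sketch below reads the plain-text header that NIST SPHERE files generally begin with: a "NIST_1A" line, a header-size line, then "name -type value" fields up to "end_head". The function name parse_sphere_header is hypothetical, nothing here is taken from the LDC documentation for this corpus, and the audio samples themselves (which may be compressed) are not decoded.

```python
def parse_sphere_header(data: bytes) -> dict:
    """Parse the ASCII header at the start of a NIST SPHERE file.

    Assumes the usual layout: line 1 is "NIST_1A", line 2 is the header
    size in bytes, followed by "name -type value" lines and "end_head".
    """
    first_lines = data.split(b"\n", 2)
    if len(first_lines) < 3 or first_lines[0].strip() != b"NIST_1A":
        raise ValueError("not a SPHERE file: missing NIST_1A magic line")

    header_size = int(first_lines[1].decode("ascii").strip())
    header_text = data[:header_size].decode("ascii", errors="replace")

    fields = {"header_size": header_size}
    for line in header_text.split("\n")[2:]:
        line = line.strip()
        if line == "end_head":
            break
        if not line or line.startswith(";"):  # skip blanks and comments
            continue
        parts = line.split(None, 2)
        if len(parts) != 3:
            continue  # ignore lines that do not look like a field
        name, type_code, value = parts
        if type_code == "-i":        # integer field
            fields[name] = int(value)
        elif type_code == "-r":      # real-valued field
            fields[name] = float(value)
        else:                        # "-sN": string of N characters
            fields[name] = value
    return fields


def read_sphere_header(path: str) -> dict:
    """Read just enough of a file on disk to parse its header."""
    with open(path, "rb") as handle:
        # Headers are typically 1024 bytes; 4096 leaves some margin.
        return parse_sphere_header(handle.read(4096))
```

With a local copy of the corpus, read_sphere_header("h4ne97sp.sph") would return the header fields as a dictionary; which fields are present depends on the file itself.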

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2001S91
DateStamp:  2021-07-28
GetRecord:  OAI-PMH request for simple DC format
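
The GetRecord entries above refer to OAI-PMH requests for this record in OLAC and simple DC formats. A minimal sketch of how such a request URL is assembled follows; the verb, identifier and metadataPrefix parameters come from the OAI-PMH protocol, while the base URL is a placeholder assumption (this record does not state the repository endpoint) and build_getrecord_url is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Placeholder endpoint: the real repository base URL is not given in this record.
ASSUMED_BASE_URL = "https://example.org/oai"

# Identifier copied from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:www.ldc.upenn.edu:LDC2001S91"


def build_getrecord_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# "olac" requests the OLAC format, "oai_dc" the simple Dublin Core format.
olac_url = build_getrecord_url(ASSUMED_BASE_URL, OAI_IDENTIFIER, "olac")
dc_url = build_getrecord_url(ASSUMED_BASE_URL, OAI_IDENTIFIER, "oai_dc")
```

Note that urlencode percent-encodes the colons in the identifier, which an OAI-PMH server is expected to decode on receipt.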

Search Info

Citation: Fiscus, Jonathan G.; Garofolo, John S.; Przybocki, Mark; Fisher, William; Pallett, David. 2001. Linguistic Data Consortium.
Terms: area_Asia area_Europe country_CN country_ES dcmi_Sound iso639_cmn iso639_spa olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2001S91
Up-to-date as of: Mon Mar 25 7:19:32 EDT 2024