OLAC Record
oai:www.ldc.upenn.edu:LDC2000T54

Metadata
Title:Speech in Noisy Environments (SPINE) Evaluation Transcripts
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Schmidt-Nielsen, Astrid, et al. Speech in Noisy Environments (SPINE) Evaluation Transcripts LDC2000T54. Web Download. Philadelphia: Linguistic Data Consortium, 2000
Contributor:Schmidt-Nielsen, Astrid
Marsh, Elaine
Cieri, Christopher
Strassel, Stephanie
Rennert, Kara
Date (W3CDTF):2000
Date Issued (W3CDTF):2002-06-17
Description:*Introduction* Speech in Noisy Environments (SPINE) Evaluation Transcripts was developed for the Department of Defense (DoD) Digital Voice Processing Consortium (DDVPC) by Arcon Corp. The corresponding audio, Speech in Noisy Environments (SPINE) Evaluation Audio, is available as LDC2000S96. These corpora supported the 2000 Speech in Noisy Environments evaluation. For an example transcript, please click here. The 2000 Speech in Noisy Environments Evaluation (SPINE1) was a first attempt to assess the state of the art and practice in speech recognition technology in noisy military environments and to exchange information on innovative speech recognition technology in the context of fully implemented systems that perform realistic tasks. It was intended to be of interest to all university, industrial and commercial speech system developers working on the problem of robust speech recognition. The evaluation gave participants the opportunity to participate in a flexible evaluation, suited to development needs and abilities. This work was sponsored in part by National Science Foundation Grant No. IIS-9982201. *Data* The SPINE1 evaluation focused on the task of transcribing speech produced in noisy environments with the emphasis on speech produced in noisy military environments. The evaluation was designed to promote research progress in this area, to provide the opportunity for participants to try out new ideas for developing robust speech recognition systems that were of both scientific and practical interest, and to measure the performance of this technology. The evaluation task was to transcribe speech produced in noisy environments. The training and test speech data used for this evaluation were generated by ARCON Corp. for the DoD Digital Voice Processing Consortium (DDVPC) under controlled conditions. The speech data consists of conversations between two communicators working on a collaborative battleship-like task in which they seek and shoot at targets (ARCON Communicability Exercise, ACE). Participants could talk freely, but the total vocabulary used was fairly limited. Each person was seated in a sound chamber in which a previously recorded military background noise environment was accurately reproduced. The participants used handsets and transmission channels that were resident to the particular environment. The evaluation data includes 20 talker-pairs, with six five-minute conversations per talker-pair (about 600 minutes total), from a set of four scenarios. *Updates* August 13, 2001: A tagging error was discovered in which several files containing occurrences of the incorrect tag "[{noise}]," were converted to the correct tag, "[/noise]." There were 433 occurrences of this error across all files. Also, a single occurrence of two instances of "[noise/]" on the same line was corrected to "[/noise]" in the second instance. The corpus has been corrected.
Extent:Corpus size: 240 KB
Identifier:LDC2000T54
https://catalog.ldc.upenn.edu/LDC2000T54
ISBN: 1-58563-189-2
ISLRN: 742-218-645-985-8
DOI: 10.35111/k42c-jh17
Language:English
Language (ISO639):eng
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2000T54
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2000T54
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Schmidt-Nielsen, Astrid; Marsh, Elaine; Cieri, Christopher; Strassel, Stephanie; Rennert, Kara. 2000. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Text iso639_eng olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2000T54
Up-to-date as of: Mon Mar 25 7:19:31 EDT 2024