OLAC Record
oai:www.ldc.upenn.edu:LDC98T27

Metadata
Title:HUB5 Spanish Transcripts
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Munoz, Elisa, et al. HUB5 Spanish Transcripts LDC98T27. Web Download. Philadelphia: Linguistic Data Consortium, 1998
Contributor:Munoz, Elisa
Alabiso, Jennifer
MacIntyre, Robert
Graff, David
Date (W3CDTF):1998
Description:LDC98S70 - Speech data LDC98T27 - Transcripts *Introduction* This release of HUB5 Spanish training data consists of 106 calls derived from the CALLFRIEND Spanish (Language ID) collection. The transcripts cover a contiguous 10-30 minute segment taken from a recorded conversation lasting up to 30 minutes. These calls were originally collected by the LDC in support of the project on Language Recognition, sponsored by the U.S. Department of Defense. All of these calls are being designated as additional training data for the project on Large Vocabulary Conversational Speech Recognition (LVCSR) in Spanish. *Data* Speakers were solicited by the LDC to participate in this telephone speech collection effort via the internet, publications (advertisements) and personal contacts. A total of 200 call originators were found, each of whom placed a telephone call via a toll-free robot operator maintained by the LDC. Access to the robot operator was possible via a unique Personal Identification Number (PIN) issued by the recruiting staff at the LDC when the caller enrolled in the project. Once a caller was recruited to participate, he/she was given a free choice of whom to call. Recruits were given no guidelines concerning what they should talk about. Most participants called family members or close friends. All calls originated in North America and were placed to various locations within North America, Puerto Rico or the Dominican Republic. The participants were made aware that their telephone call would be recorded, as were the call recipients. The call was allowed only if both parties agreed to being recorded. Each caller was allowed to talk up to 30 minutes. Upon successful completion of the call, the caller was paid $20 (in addition to making a free long-distance telephone call). Each caller was allowed to place only one telephone call. HUB5 Spanish speech and transcript data may be obtained by contacting the LDC *Updates* There are no updates at this time.
Identifier:LDC98T27
https://catalog.ldc.upenn.edu/LDC98T27
ISBN: 1-58563-134-5
ISLRN: 997-940-878-462-1
DOI: 10.35111/z46b-j130
Language:Spanish
Language (ISO639):spa
License:LDC User Agreement for Non-Members: https://catalog.ldc.upenn.edu/license/ldc-non-members-agreement.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC98T27
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC98T27
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Munoz, Elisa; Alabiso, Jennifer; MacIntyre, Robert; Graff, David. 1998. Linguistic Data Consortium.
Terms: area_Europe country_ES dcmi_Text iso639_spa olac_primary_text


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC98T27
Up-to-date as of: Mon Mar 25 7:20:04 EDT 2024