OLAC Record
oai:www.ldc.upenn.edu:LDC97L20

Metadata
Title:CALLHOME American English Lexicon (PRONLEX)
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Kingsbury, Paul, et al. CALLHOME American English Lexicon (PRONLEX) LDC97L20. Web Download. Philadelphia: Linguistic Data Consortium, 1994
Contributor:Kingsbury, Paul
Strassel, Stephanie
McLemore, Cynthia
MacIntyre, Robert
Date (W3CDTF):1994
Description:*Introduction* The CALLHOME English collection includes a lexical component. The CALLHOME American English Lexicon was originally distributed under the name COMLEX Pronouncing Lexicon, or PRONLEX. Organizations that have already received PRONLEX will not be required to purchase the CALLHOME American English Lexicon. *Data* The latest version of PRONLEX contains 90,988 lexical entries and includes coverage of WSJ30, WSJ64, Switchboard and CALLHOME English. (WSJ30K and WSJ64K are word lists selected from several years of Wall Street Journal texts used in recent ARPA Continuous Speech Recognition corpora. Switchboard is a three million word corpus of telephone conversations on a variety of topics.) The PRONLEX documentation describes the principles observed for word transcription. Although predictable variation in pronunciation due to dialect or variable reduction has not been notated in the lexicon itself, the documentation notes systematic dialectal variants, which may be generated by rule. In addition, alternate pronunciations are given for words whose pronunciation varies by part of speech (e.g., abstrAct, Abstract), or in less systematic but salient ways (especially names). Classes of exceptions to the transcription principles, such as names, function, words and foreign words, are tagged. Here is a sample page. The transcripts and documentation (LDC97T14) are available, as well as a corpus of telephone speech (LDC97L20). *Updates* There are no updates at this time.
Extent:Corpus size: 1976 KB
Identifier:LDC97L20
https://catalog.ldc.upenn.edu/LDC97L20
ISBN: 1-58563-110-8
ISLRN: 119-159-358-214-6
Language:English
Language (ISO639):eng
License:CALLHOME Lexicon Agreement (Commercial): https://catalog.ldc.upenn.edu/license/callhome-lexicon-commercial.pdf
CALLHOME Lexicon Agreement (Non-Commercial): https://catalog.ldc.upenn.edu/license/callhome-lexicon-non-commercial.pdf
CALLHOME Lexicon Agreement (Non-Member): https://catalog.ldc.upenn.edu/license/callhome-lexicon-non-member.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC97L20
Rights Holder:Portions © 1994-1997 Trustees of the University of Pennsylvania
Subject:English language
Subject (ISO639):eng
Type (DCMI):Text
Type (OLAC):lexicon

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC97L20
DateStamp:  2017-08-17
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Kingsbury, Paul; Strassel, Stephanie; McLemore, Cynthia; MacIntyre, Robert. 1994. Linguistic Data Consortium.
Terms: area_Europe country_GB dcmi_Text iso639_eng olac_lexicon

Inferred Metadata

Country: United Kingdom
Area: Europe


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC97L20
Up-to-date as of: Sun Nov 26 2:02:49 EST 2017