OLAC Record
oai:www.ldc.upenn.edu:LDC2006T19

Metadata
Title:TDT5 Topics and Annotations
Creator (author):Meghan Glenn, Stephanie Strassel, Junbo Kong, Kazuaki Maeda
Date (W3CDTF):2006-12-19
Description:Release type: General
Non-member fee: 500.00 USD
Online documentation: http://www.ldc.upenn.edu/Catalog/docs/LDC2006T19
Application: topic detection and tracking
Application: machine translation
Application: machine learning
Application: language modeling
Application: information extraction
Application: information detection
Membership year: 2006
Data source: newswire
Extent:Corpus size: 80896.000 KB
Identifier:LDC2006T19
ISBN: 1-58563-418-2
Language:English
Mandarin Chinese
Standard Arabic
Language (ISO639):eng
cmn
arb
Medium:Distribution: Web Download
Rights:Non-member license: http://www.ldc.upenn.edu/Catalog/nonmem_agree/generic.license.html
Subject:English language
Mandarin Chinese language
Standard Arabic language
Subject (ISO639):eng
cmn
arb
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2006T19
DateStamp:  2008-12-17
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Meghan Glenn, Stephanie Strassel, Junbo Kong, Kazuaki Maeda. 2006. The LDC Corpus Catalog.
Terms: area_Asia area_Europe country_CN country_GB country_SA dcmi_Text iso639_arb iso639_cmn iso639_eng olac_primary_text

Inferred Metadata

Country: ChinaUnited KingdomSaudi Arabia
Area: AsiaEurope


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2006T19
Up-to-date as of: Tue Jan 3 4:23:12 EST 2012