OLAC Record
oai:www.ldc.upenn.edu:LDC2006T02

Metadata
Title:Arabic Gigaword Second Edition
Creator (author):David Graff, Ke Chen, Junbo Kong, and Kazuaki Maeda
Date (W3CDTF):2006-01-19
Description:Release type: General
Non-member fee: 3000.00 USD
Reduced-license fee: 1500.00 USD
Extra-copy fee: 200.00 USD
Online documentation: http://catalog.ldc.upenn.edu/docs/LDC2006T02
Application: information retrieval
Application: language modeling
Application: natural language processing
Membership year: 2006
Data source: newswire
Extent:Corpus size: 1153433.600 KB
Identifier:LDC2006T02
ISBN: 1-58563-371-2
Language:Standard Arabic
Language (ISO639):arb
Medium:Distribution: 1 DVD
Rights:Non-member license: http://catalog.ldc.upenn.edu/nonmem_agree/generic.license.html
Subject:Standard Arabic language
Subject (ISO639):arb
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2006T02
DateStamp:  2011-11-16
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: David Graff, Ke Chen, Junbo Kong, and Kazuaki Maeda. 2006. The LDC Corpus Catalog.
Terms: area_Asia country_SA dcmi_Text iso639_arb olac_primary_text

Inferred Metadata

Country: Saudi Arabia
Area: Asia


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2006T02
Up-to-date as of: Mon May 5 0:35:03 EDT 2014