OLAC Record
oai:www.ldc.upenn.edu:LDC2003T12

Metadata
Title:Arabic Gigaword
Creator (author):David Graff
Date (W3CDTF):2003-07-22
Description:Release type: General
Non-member fee: 3000.00 USD
Reduced-license fee: 1500.00 USD
Extra-copy fee: 200.00 USD
Online documentation: http://catalog.ldc.upenn.edu/docs/LDC2003T12
Application: natural language processing
Application: language modeling
Application: information retrieval
Related research project: TIDES
Related research project: GALE
Related research project: EARS
Membership year: 2003
Data source: newswire
Extent:Corpus size: 1153433.600 KB
Identifier:LDC2003T12
ISBN: 1-58563-271-6
Language:Standard Arabic
Language (ISO639):arb
Medium:Distribution: 1 DVD
Rights:Non-member license: http://catalog.ldc.upenn.edu/nonmem_agree/generic.license.html
Subject:Standard Arabic language
Subject (ISO639):arb
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2003T12
DateStamp:  2011-08-23
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: David Graff. 2003. The LDC Corpus Catalog.
Terms: area_Asia country_SA dcmi_Text iso639_arb olac_primary_text

Inferred Metadata

Country: Saudi Arabia
Area: Asia


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2003T12
Up-to-date as of: Thu Apr 3 0:19:31 EDT 2014