OLAC Record
oai:www.ldc.upenn.edu:LDC2007T40

Metadata
Title:Arabic Gigaword Third Edition
Creator (author):David Graff
Date (W3CDTF):2007-11-20
Description:Release type: General
Non-member fee: 4000.00 USD
Reduced-license fee: 2000.00 USD
Extra-copy fee: 200.00 USD
Online documentation: http://catalog.ldc.upenn.edu/docs/LDC2007T40
Application: natural language processing
Application: language modeling
Application: information retrieval
Related research project: GALE
Membership year: 2007
Data source: newswire
Extent:Corpus size: 1887436.800 KB
Identifier:LDC2007T40
ISBN: 1-58563-460-3
Language:Standard Arabic
Language (ISO639):arb
Medium:Distribution: 1 DVD
Rights:Non-member license: http://catalog.ldc.upenn.edu/nonmem_agree/generic.license.html
Subject:Standard Arabic language
Subject (ISO639):arb
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2007T40
DateStamp:  2011-11-14
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: David Graff. 2007. The LDC Corpus Catalog.
Terms: area_Asia country_SA dcmi_Text iso639_arb olac_primary_text

Inferred Metadata

Country: Saudi Arabia
Area: Asia


http://www.language-archives.org/item.php/oai:www.ldc.upenn.edu:LDC2007T40
Up-to-date as of: Thu Apr 3 0:20:26 EDT 2014