OLAC Record
oai:catalogue.elra.info:ELRA-W0027

Metadata
Title:An-Nahar Newspaper Text Corpus
Abstract:The An-Nahar Newspaper Text Corpus comprises articles in Arabic (Lebanon) from 1995 to 2000 (6 years) stored as HTML files onCDRommedia. Each yearcontains 45000 articles and 24 million words.
Access Rights:Rights available for: Research Use
Date Available (W3CDTF):2001-07-25
Date Issued (W3CDTF):2004-05-12
Date Modified (W3CDTF):2010-07-06
Description:Written Corpora
The An-Nahar Lebanon Newspaper Text Corpus comprises articles in standard Arabic from 1995 to 2000 (6 years) stored as HTML files on CDRom media. Each year contains 45 000 articles and 24 million words. Each article includes information such as title, newspaper's name, date, country, type, page, etc. For each year, the size in byte is as follows: 1995 : 128 MB 1996 : 138 MB 1997 : 152 MB 1998 : 140 MB 1999 : 130 MB 2000 : 118 MB
Identifier:ELRA-W0027
http://catalog.elra.info/product_info.php?products_id=767
Language:Arabic
Language (ISO639):ara
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0027
DateStamp:  2001-07-25
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: dcmi_Text iso639_ara olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0027
Up-to-date as of: Wed Mar 29 3:49:45 EDT 2017