OLAC Record: Nepali Monolingual written corpus

OLAC Record
oai:catalogue.elra.info:ELRA-W0076

Metadata

Title: Nepali Monolingual written corpus

Access Rights: Rights available for: nonCommercialUse

Date Available (W3CDTF): 2014-01-15

Date Issued (W3CDTF): 2014-01-15

Date Modified (W3CDTF): 2014-01-15

Description: The Nepali Monolingual written corpus is one of the 3 resources that constitute the Nepali National Corpus. The Nepali National Corpus was produced in 2006 in the framework of the project Bhasha Sanchar (“language communication”), also known as Nelralec, for Nepali Language Resources and Localization for Education and Communication; funded by the EU Asia IT&C programme, reference number ASIE/2004/091-777.The Nepali Monolingual written corpus comprises the core corpus (core sample) and the general corpus. The core sample (CS) represents the collection of Nepali written texts from 15 different genres with 2000 words each published between 1990 and 1992. It is based on FLOB/FROWN corpora and contains 802,000 words. The general corpus (GC) consists of written texts collected opportunistically from a wide range of sources such as the internet webs, newspapers, books, publishers and authors. It contains 1,400,000 words. This part of the corpus was intended to allow corpus analyses that depend on a very large corpus.The written corpus is morphogically-annotated. A part-of-speech (POS) tagset has been produced within the project: the Nelralec Tagset. This is a categorisation system for the manual and automated analysis of morphosyntactic units in Nepali.

Identifier: ELRA-W0076

ISLRN: 325-796-965-405-9

Identifier (URI): https://catalog.elra.info/en-us/repository/browse/ELRA-W0076/

Language: Nepali (macrolanguage)

Language (ISO639): nep

Medium: downloadable

Publisher: ELRA (European Language Resources Association)

Type (DCMI): Text

Type (OLAC): primary_text

OLAC Info

Archive: ELRA Catalogue of Language Resources

Description: http://www.language-archives.org/archive/catalogue.elra.info

GetRecord: OAI-PMH request for OLAC format

GetRecord: Pre-generated XML file

OAI Info

OaiIdentifier: oai:catalogue.elra.info:ELRA-W0076

DateStamp: 2014-01-15

GetRecord: OAI-PMH request for simple DC format

Search Info
Citation: n.a. 2014. ELRA (European Language Resources Association).
Terms: dcmi_Text iso639_nep olac_primary_text

http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0076
Up-to-date as of: Wed Jul 15 7:05:06 EDT 2026

Metadata
Title:		Nepali Monolingual written corpus
Access Rights:		Rights available for: nonCommercialUse
Date Available (W3CDTF):		2014-01-15
Date Issued (W3CDTF):		2014-01-15
Date Modified (W3CDTF):		2014-01-15
Description:		The Nepali Monolingual written corpus is one of the 3 resources that constitute the Nepali National Corpus. The Nepali National Corpus was produced in 2006 in the framework of the project Bhasha Sanchar (“language communication”), also known as Nelralec, for Nepali Language Resources and Localization for Education and Communication; funded by the EU Asia IT&C programme, reference number ASIE/2004/091-777.The Nepali Monolingual written corpus comprises the core corpus (core sample) and the general corpus. The core sample (CS) represents the collection of Nepali written texts from 15 different genres with 2000 words each published between 1990 and 1992. It is based on FLOB/FROWN corpora and contains 802,000 words. The general corpus (GC) consists of written texts collected opportunistically from a wide range of sources such as the internet webs, newspapers, books, publishers and authors. It contains 1,400,000 words. This part of the corpus was intended to allow corpus analyses that depend on a very large corpus.The written corpus is morphogically-annotated. A part-of-speech (POS) tagset has been produced within the project: the Nelralec Tagset. This is a categorisation system for the manual and automated analysis of morphosyntactic units in Nepali.
Identifier:		ELRA-W0076
Identifier:		ISLRN: 325-796-965-405-9
Identifier (URI):		https://catalog.elra.info/en-us/repository/browse/ELRA-W0076/
Language:		Nepali (macrolanguage)
Language (ISO639):		nep
Medium:		downloadable
Publisher:		ELRA (European Language Resources Association)
Type (DCMI):		Text
Type (OLAC):		primary_text
OLAC Info
Archive:		ELRA Catalogue of Language Resources
Description:		http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:		OAI-PMH request for OLAC format
GetRecord:		Pre-generated XML file
OAI Info
OaiIdentifier:		oai:catalogue.elra.info:ELRA-W0076
DateStamp:		2014-01-15
GetRecord:		OAI-PMH request for simple DC format
Search Info
Citation:		n.a. 2014. ELRA (European Language Resources Association).
Terms:		dcmi_Text iso639_nep olac_primary_text