OLAC Record
oai:catalogue.elra.info:ELRA-W0077

Metadata
Title:English-Nepali Parallel Corpus
Abstract:This corpus consists of a collection of national development texts in English and Nepali. A small set of data is aligned at the sentence level (27,060 English words; 21,756 Nepali words), and a larger set of texts at the document level (617,340 English words; 596,571 Nepali words). An additional set of monolingual data in Nepali is also provided (386,879 words in Nepali).
Access Rights:Rights available for: Research Use
Date Available (W3CDTF):2014-01-15
Date Created (W3CDTF):2015-02-10
Date Issued (W3CDTF):2014-01-15
Date Modified (W3CDTF):2014-01-15
Description:Written Corpora
The Nepali Monolingual written corpus is one of the 3 resources that constitute the Nepali National Corpus. The Nepali National Corpus was produced in 2006 in the framework of the project Bhasha Sanchar ("language communication"), also known as Nelralec (Nepali Language Resources and Localization for Education and Communication), funded by the EU Asia IT&C programme, reference number ASIE/2004/091-777. This corpus consists of a collection of national development texts in English and Nepali and is divided as follows:
- a small set of data aligned at the sentence level (27,060 English words; 21,756 Nepali words), provided in the TMX format (XML file);
- a larger set of texts aligned at the document level (617,340 English words; 596,571 Nepali words), provided in raw text and in the original word processing format;
- an additional set of monolingual data in Nepali (386,879 words in Nepali), provided in raw text and in the original word processing format.
Identifier:ELRA-W0077
http://catalog.elra.info/product_info.php?products_id=1217
Language:English
Nepali (macrolanguage); Nepali
Language (ISO639):eng
nep
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0077
DateStamp:  2014-01-15
GetRecord:  OAI-PMH request for simple DC format
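
The GetRecord entries above correspond to standard OAI-PMH requests. As a minimal sketch, such a request URL could be assembled as shown below. The verb, identifier and metadataPrefix parameters are defined by the OAI-PMH protocol, and "olac" and "oai_dc" are the usual prefixes for the OLAC and simple DC formats. The base URL and the function name get_record_url are assumptions made for illustration, since this record does not state the endpoint address.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the record does not give the real base URL.
BASE_URL = "https://example.org/oai"

# Identifier taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:catalogue.elra.info:ELRA-W0077"


def get_record_url(identifier, metadata_prefix, base_url=BASE_URL):
    """Build an OAI-PMH GetRecord request URL for one record."""
    params = {
        "verb": "GetRecord",                # OAI-PMH verb for fetching a single record
        "identifier": identifier,           # OAI identifier of the record
        "metadataPrefix": metadata_prefix,  # requested metadata format
    }
    # urlencode percent-encodes reserved characters such as ":".
    return base_url + "?" + urlencode(params)


olac_url = get_record_url(OAI_IDENTIFIER, "olac")   # OLAC format
dc_url = get_record_url(OAI_IDENTIFIER, "oai_dc")   # simple DC format
```

Replacing BASE_URL with the actual endpoint of the archive or aggregator would be needed before sending either request.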

Search Info

Citation: n.a. 2014. ELRA (European Language Resources Association).
Terms: area_Europe country_GB dcmi_Text iso639_eng iso639_nep olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0077
Up-to-date as of: Mon Feb 27 0:31:53 EST 2017