OLAC Record
oai:lindat.mff.cuni.cz:11372/LRT-1470

Metadata
Title:BasiLex Corpus
Bibliographic Citation:http://hdl.handle.net/11372/LRT-1470
Creator:Tellings, Agnes
Date (W3CDTF):2015-04-21T13:09:59Z
Date Available:2015-04-21T13:09:59Z
Description:The BasiLex Corpus is an annotated collection of texts written for children in the elementary school age. BasiLex contains 13.5 million tokens, 11.5 million of these being words. About 40% of the words come from educative materials, 40% from child literature, and 20% from media. The corpus was lemmatized and POS-tagged by means of Frog.
Identifier (URI):http://hdl.handle.net/11372/LRT-1470
Language:Dutch
Language (ISO639):nld
Publisher:Dutch-Flemish HLT Agency
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11372/LRT-1470
DateStamp:  2016-04-06
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Tellings, Agnes. 2015. Dutch-Flemish HLT Agency.
Terms: area_Europe country_NL dcmi_Text iso639_nld olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11372/LRT-1470
Up-to-date as of: Sun Oct 22 1:40:51 EDT 2017