OLAC Record

Title:BasiLex Corpus
Bibliographic Citation:http://hdl.handle.net/11372/LRT-1470
Creator:Tellings, Agnes
Date (W3CDTF):2015-04-21T13:09:59Z
Date Available:2015-04-21T13:09:59Z
Description:The BasiLex Corpus is an annotated collection of texts written for children of elementary school age. BasiLex contains 13.5 million tokens, 11.5 million of which are words. About 40% of the words come from educational materials, 40% from children's literature, and 20% from media. The corpus was lemmatized and POS-tagged using Frog.
Identifier (URI):http://hdl.handle.net/11372/LRT-1470
Language (ISO639):nld
Publisher:Dutch-Flemish HLT Agency
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11372/LRT-1470
DateStamp:  2016-04-06
GetRecord:  OAI-PMH request for simple DC format
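The GetRecord entries above correspond to OAI-PMH requests for this record in two metadata formats. The following is a minimal sketch of how such request URLs are formed, not the archive's actual links. The endpoint in BASE_URL is a hypothetical placeholder, since this record does not state the repository's OAI-PMH base URL. The "oai_dc" prefix is the one defined by OAI-PMH for simple Dublin Core; "olac" is assumed here as the prefix for the OLAC format.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the record does not give the real OAI-PMH endpoint.
BASE_URL = "https://example.org/oai/request"

# Identifier taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:lindat.mff.cuni.cz:11372/LRT-1470"


def build_get_record_url(base_url, identifier, metadata_prefix):
    """Return an OAI-PMH GetRecord URL for one record in one metadata format."""
    query = urlencode({
        "verb": "GetRecord",                # OAI-PMH verb for fetching a single record
        "identifier": identifier,           # the record's OAI identifier
        "metadataPrefix": metadata_prefix,  # requested metadata format
    })
    return f"{base_url}?{query}"


# "olac" is an assumed prefix for the OLAC format; "oai_dc" is simple Dublin Core.
olac_url = build_get_record_url(BASE_URL, OAI_IDENTIFIER, "olac")
dc_url = build_get_record_url(BASE_URL, OAI_IDENTIFIER, "oai_dc")

print(olac_url)
print(dc_url)
```

Replacing BASE_URL with the repository's real endpoint would yield requests that return the record as XML. The colon and slash characters in the identifier are percent-encoded by urlencode.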

Search Info

Citation: Tellings, Agnes. 2015. Dutch-Flemish HLT Agency.
Terms: area_Europe country_NL dcmi_Text iso639_nld olac_primary_text

Up-to-date as of: Mon May 28 0:43:10 EDT 2018