OLAC Record

Title:CELEX Dutch lexical database - Phonology Subset
Abstract:Dutch lexical database containing lemmas (124136 entries), wordforms (381292 entries), abbreviations (1622 entries), syllables (31358 entries). The database is divided into different subsets: L0029-01 Complete set of data; L0029-02 Subset Orthography; L0029-03 Subset Phonology; L0029-04 Subset Morphology Infl.; L0029-05 Subset Morphology Der.; L0029-06 Subset Syntax; L0029-07 Subset Frequency.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):1997-09-15
Date Issued (W3CDTF):2004-09-14
Date Modified (W3CDTF):2005-10-05
Description:Monolingual Lexicons
The Dutch CELEX data is derived from R.H. Baayen, R. Piepenbrock & L. Gulikers, The CELEX Lexical Database (CD-ROM), Release 2, Dutch Version 3.1, Linguistic Data Consortium, University of Pennsylvania, Philadelphia, PA, 1995. Apart from orthographic features, the CELEX database comprises representations of the phonological, morphological, syntactic and frequency properties of lemmata. For the Dutch data, frequencies have been disambiguated on the basis of the 42.4m Dutch Instituut voor Nederlandse Lexicologie text corpora. To make for greater compatibility with other operating systems, the databases have not been tailored to fit any particular database management program. Instead, the information is presented in a series of plain ASCII files, which can be queried with tools such as AWK and ICON. Unique identity numbers allow the linking of information from different files.This database can be divided into different subsets: ? orthography: with or without diacritics, with or without word division positions, alternative spellings, number of letters/syllables; ? phonology: phonetic transcriptions with syllable boundaries or primary and secondary stress markers, consonant-vowel patterns, number of phonemes/syllables, alternative pronunciations, frequency per phonetic syllable within words; ? morphology: division into stems and affixes, flat or hierarchical representations, stems and their inflections; ? syntax: word class, subcategorisations per word class; ? frequency of the entries: disambiguated for homographic lemmata.
Language:Dutch, Flemish
Language (ISO639):nld
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-L0029-03
DateStamp:  1997-09-15
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: area_Europe country_NL dcmi_Text iso639_nld olac_primary_text

Up-to-date as of: Mon Jan 14 13:31:52 EST 2019