OLAC Record

Abstract:Lexicon for written and spoken French including 440,000 inflected forms with spelling, contextual variants at morphological, phonological and phonetic levels, and morphosyntactic attributes.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2001-04-06
Date Issued (W3CDTF):2004-09-14
Date Modified (W3CDTF):2017-06-26
Description:Speech Related
MHATLex is a new, enhanced lexical resource for the automatic processing of written and spoken French. It is derived from BDLex (see ELRA-S0004). It contains three levels of representation:
- Syntactic level: S
- Phonological word level: W
- Phonetic level: P
At the W level, a word has two representations:
- input representation (W representation), where words are simply imported from the lexicon,
- output representation (W' or phonotypical), where words have the phonotypical representation imposed by their context in the sentence.
The lexicons contain inflected words (among which canonical words):
- MHATLexSt (& BDLex), MHATLexW: about 50,000 entries (canonical) & 440,000 entries (inflected)
- MHATLexW': about 81,000 entries (canonical) & 854,000 entries (inflected)
Words are represented with their orthography, pronunciation, morpho-syntactic features, and frequency indicator. Only the pronunciation-related part changes from one lexicon to another (unless users generate their own lexicon by skipping some features).
Four lexicons can be generated from MHATLex:
- MHATLexW: the central lexical resource, from which the other lexicons can be generated.
- MHATLexW' (or MHATLexPht): gives the word representations for each pertinent context.
- MHATLexSt: uses a standard, simplified format for the pronunciation.
- BDLex (or BDLex50): already distributed by ELDA (ELRA-S0003 and S0004). The current BDLex, derived from MHATLexW, contains some updates.
The MHATLex package includes BDLex (ELRA-S0004). Integrity checks were made and the lexicon was parsed using nsgmls.
Language (ISO639):fra
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Sound
Type (OLAC):primary_text


Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-S0100
DateStamp:  2001-04-06
GetRecord:  OAI-PMH request for simple DC format
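
The GetRecord entries refer to OAI-PMH requests for this record. As a minimal sketch only, such a request URL could be built as follows. The base URL is a placeholder, since this record does not state the archive's OAI-PMH endpoint, and the helper name get_record_url is hypothetical. The prefix "oai_dc" is the simple DC format defined by OAI-PMH; "olac" is assumed here to be the prefix for the OLAC format.

```python
from urllib.parse import urlencode

# Placeholder endpoint: the record does not give the real OAI-PMH base URL.
BASE_URL = "https://example.org/oai"


def get_record_url(identifier: str,
                   metadata_prefix: str = "oai_dc",
                   base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL (hypothetical helper)."""
    query = urlencode({
        "verb": "GetRecord",                # OAI-PMH verb for a single record
        "identifier": identifier,           # OaiIdentifier from this record
        "metadataPrefix": metadata_prefix,  # "oai_dc", or "olac" (assumed)
    })
    return f"{base_url}?{query}"


# Identifier taken from the OaiIdentifier field of this record.
url = get_record_url("oai:catalogue.elra.info:ELRA-S0100")
print(url)
```

Passing metadata_prefix="olac" would request the OLAC format instead, assuming the repository supports that prefix.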

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: area_Europe country_FR dcmi_Sound iso639_fra olac_primary_text

Up-to-date as of: Mon Oct 8 0:56:56 EDT 2018