OLAC Record

Title:Czech SubLex 1.0
Bibliographic Citation:http://hdl.handle.net/11858/00-097C-0000-0022-FF60-B
Creator:Veselovská, Kateřina
Bojar, Ondřej
Date (W3CDTF):2013-12-02T22:10:38Z
Date Available:2013-12-02T22:10:38Z
Description:Czech subjectivity lexicon, i.e. a list of subjectivity clues for sentiment analysis in Czech. The list contains 4626 evaluative items (1672 positive and 2954 negative) together with their part-of-speech tags, polarity orientation and source information. The core of the lexicon was obtained by automatically translating a freely available English subjectivity lexicon downloaded from http://www.cs.pitt.edu/mpqa/subj_lexicon.html. To translate the data into Czech, we used the parallel corpus CzEng 1.0, which contains 15 million parallel sentences (233 million English and 206 million Czech tokens) from seven different types of sources, automatically annotated at surface and deep layers of syntactic representation. The lexicon was then manually refined by an experienced annotator.
Work on this project was supported by the GAUK 3537/2011 grant and by SVV project number 267 314.
Identifier (URI):http://hdl.handle.net/11858/00-097C-0000-0022-FF60-B
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Subject:subjectivity lexicon
sentiment analysis
opinion mining
polarity clues
Czech language
Subject (ISO639):ces
Type (DCMI):Text
Type (OLAC):lexicon


Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11858/00-097C-0000-0022-FF60-B
DateStamp:  2021-03-22
GetRecord:  OAI-PMH request for simple DC format
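The GetRecord entries above refer to OAI-PMH requests for this record. As a minimal sketch of how such a request URL is assembled from the OaiIdentifier, the Python below builds the query string from the protocol's standard GetRecord arguments. The base URL is a hypothetical placeholder, since the archive's actual endpoint is not given in this record, and the metadata prefixes "olac" and "oai_dc" are the conventional names, assumed here rather than confirmed by the record.

```python
from urllib.parse import urlencode


def build_getrecord_url(base_url: str, identifier: str, metadata_prefix: str) -> str:
    """Assemble an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Hypothetical placeholder: substitute the archive's real OAI-PMH base URL.
BASE_URL = "https://example.org/oai/request"

# Identifier taken from the OaiIdentifier field of this record.
OAI_ID = "oai:lindat.mff.cuni.cz:11858/00-097C-0000-0022-FF60-B"

# Assumed prefixes: "olac" for the OLAC format, "oai_dc" for simple DC.
olac_url = build_getrecord_url(BASE_URL, OAI_ID, "olac")
dc_url = build_getrecord_url(BASE_URL, OAI_ID, "oai_dc")

print(olac_url)
print(dc_url)
```

The colons and slash in the identifier are percent-encoded by urlencode, which is what an OAI-PMH endpoint expects for reserved characters in query arguments.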

Search Info

Citation: Veselovská, Kateřina; Bojar, Ondřej. 2013. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_lexicon

Inferred Metadata

Country: Czech Republic
Area: Europe

Up-to-date as of: Tue Mar 23 7:05:39 EDT 2021