Title:Czech SubLex 1.0
Bibliographic Citation:http://hdl.handle.net/11858/00-097C-0000-0022-FF60-B
Creator:Veselovská, Kateřina
Bojar, Ondřej
Date (W3CDTF):2013-12-02T22:10:38Z
Date Available:2013-12-02T22:10:38Z
Description:Czech subjectivity lexicon, i.e. a list of subjectivity clues for sentiment analysis in Czech. The list contains 4626 evaluative items (1672 positive and 2954 negative) together with their part-of-speech tags, polarity orientation and source information. The core of the lexicon was obtained by automatically translating a freely available English subjectivity lexicon downloaded from http://www.cs.pitt.edu/mpqa/subj_lexicon.html. To translate the data into Czech, we used the parallel corpus CzEng 1.0, which contains 15 million parallel sentences (233 million English and 206 million Czech tokens) from seven different types of sources, automatically annotated at the surface and deep layers of syntactic representation. The lexicon was then manually refined by an experienced annotator.
Work on this project was supported by the GAUK 3537/2011 grant and by SVV project number 267 314.
Identifier (URI):http://hdl.handle.net/11858/00-097C-0000-0022-FF60-B
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Subject:subjectivity lexicon
sentiment analysis
opinion mining
polarity clues
Czech language
Subject (ISO639):ces
Type (DCMI):Text
Type (OLAC):lexicon


Citation: Veselovská, Kateřina; Bojar, Ondřej. 2013. Czech SubLex 1.0. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Country: Czech Republic
Area: Europe

