OLAC Record
oai:lindat.mff.cuni.cz:11234/1-3248

Metadata
Title:COSTRA 1.1: A Dataset of Complex Sentence Transformations and Comparisons
Bibliographic Citation:http://hdl.handle.net/11234/1-3248
Creator:Barančíková, Petra
Bojar, Ondřej
Date (W3CDTF):2020-06-19T09:13:14Z
Date Available:2020-06-19T09:13:14Z
Description:Costra 1.1 is a new dataset for testing geometric properties of sentence embeddings spaces. In particular, it concentrates on examining how well sentence embeddings capture complex phenomena such paraphrases, tense or generalization. The dataset is a direct expansion of Costra 1.0, which was extended with more sentences and sentence comparisons.
Identifier (URI):http://hdl.handle.net/11234/1-3248
Language:Czech
Language (ISO639):ces
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Creative Commons - Attribution 4.0 International (CC BY 4.0)
http://creativecommons.org/licenses/by/4.0/
Subject:paraphrases
sentence embeddings
evaluation
sentence
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-3248
DateStamp:  2021-03-22
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Barančíková, Petra; Bojar, Ondřej. 2020. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Europe country_CZ dcmi_Text iso639_ces olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-3248
Up-to-date as of: Tue Mar 23 7:07:38 EDT 2021