OLAC Record

Title:EvaLatin 2020: data
Bibliographic Citation:http://hdl.handle.net/20.500.11752/OPEN-526
Creator:Sprugnoli, Rachele
Pellegrini, Matteo
Cecchini, Flavio Massimiliano
Passarotti, Marco
Date (W3CDTF):2021-03-09T10:26:37Z
Date Available:2021-03-09T10:26:37Z
Description:Training and gold test data released in EvaLatin 2020, the evaluation campaign of NLP tools for Latin. The two shared tasks proposed in EvaLatin 2020, i.e., Lemmatization and Part-of-Speech tagging, were aimed at fostering research in the field of language technologies for Classical languages. The shared dataset consists of texts taken from the Perseus Digital Library, processed with UDPipe models and then manually corrected by Latin experts. The training set includes only prose texts by Classical authors. The test set includes prose texts by the same authors represented in the training set, as well as poetry and texts from the Medieval period.
Identifier (URI):http://hdl.handle.net/20.500.11752/OPEN-526
Language (ISO639):lat
Publisher:CIRCSE Research Centre, Università Cattolica del Sacro Cuore
Rights:Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Subject:POS tagging
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  ILC-CNR for CLARIN-IT repository hosted at Institute for Computational Linguistics "A. Zampolli", National Research Council, in Pisa
Description:  http://www.language-archives.org/archive/dspace-clarin-it.ilc.cnr.it
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:dspace-clarin-it.ilc.cnr.it:20.500.11752/OPEN-526
DateStamp:  2021-03-09
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Sprugnoli, Rachele; Pellegrini, Matteo; Cecchini, Flavio Massimiliano; Passarotti, Marco. 2021. CIRCSE Research Centre, Università Cattolica del Sacro Cuore.
Terms: area_Europe country_VA dcmi_Text iso639_lat olac_primary_text

Up-to-date as of: Thu Apr 20 1:22:37 EDT 2023