OLAC Record
oai:lindat.mff.cuni.cz:11234/1-2899

Metadata
Title:CoNLL 2017 and 2018 Shared Task Blind and Preprocessed Test Data
Bibliographic Citation:http://hdl.handle.net/11234/1-2899
Creator:Zeman, Daniel
Straka, Milan
Date (W3CDTF):2018-11-28T13:44:33Z
Date Available:2018-11-28T13:44:33Z
Description:CoNLL 2017 and 2018 shared tasks: Multilingual Parsing from Raw Text to Universal Dependencies This package contains the test data in the form in which they ware presented to the participating systems: raw text files and files preprocessed by UDPipe. The metadata.json files contain lists of files to process and to output; README files in the respective folders describe the syntax of metadata.json. For full training, development and gold standard test data, see Universal Dependencies 2.0 (CoNLL 2017) Universal Dependencies 2.2 (CoNLL 2018) See the download links at http://universaldependencies.org/. For more information on the shared tasks, see http://universaldependencies.org/conll17/ http://universaldependencies.org/conll18/ Contents: conll17-ud-test-2017-05-09 ... CoNLL 2017 test data conll18-ud-test-2018-05-06 ... CoNLL 2018 test data conll18-ud-test-2018-05-06-for-conll17 ... CoNLL 2018 test data with metadata and filenames modified so that it is digestible by the 2017 systems.
Identifier (URI):http://hdl.handle.net/11234/1-2899
Language:Afrikaans
Arabic
Breton
Bulgarian
Russia Buriat
Catalan
Czech
Church Slavic
Danish
German
Modern Greek (1453-)
English
Estonian
Basque
Faroese
Persian
Finnish
French
Old French (842-ca. 1400)
Irish
Galician
Gothic
Ancient Greek (to 1453)
Hebrew
Hindi
Croatian
Upper Sorbian
Hungarian
Armenian
Indonesian
Italian
Japanese
Kazakh
Northern Kurdish
Korean
Latin
Latvian
Dutch
Norwegian
Nigerian Pidgin
Polish
Portuguese
Romanian
Russian
Slovak
Slovenian
Northern Sami
Spanish
Serbian
Swedish
Thai
Turkish
Uighur
Ukrainian
Urdu
Vietnamese
Chinese
Language (ISO639):afr
ara
bre
bul
bxr
cat
ces
chu
dan
deu
ell
eng
est
eus
fao
fas
fin
fra
fro
gle
glg
got
grc
heb
hin
hrv
hsb
hun
hye
ind
ita
jpn
kaz
kmr
kor
lat
lav
nld
nor
pcm
pol
por
ron
rus
slk
slv
sme
spa
srp
swe
tha
tur
uig
ukr
urd
vie
zho
Publisher:Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Rights:Licence Universal Dependencies v2.2
https://lindat.mff.cuni.cz/repository/xmlui/page/licence-UD-2.2
Subject:tokenization
word segmentation
morphology
tagging
syntax
parsing
universal dependencies
Type:corpus
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University
Description:  http://www.language-archives.org/archive/lindat.mff.cuni.cz
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:lindat.mff.cuni.cz:11234/1-2899
DateStamp:  2021-06-29
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Zeman, Daniel; Straka, Milan. 2018. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL).
Terms: area_Africa area_Asia area_Europe country_AM country_BG country_CN country_CZ country_DE country_DK country_ES country_FI country_FR country_GB country_GR country_HR country_HU country_ID country_IE country_IL country_IN country_IT country_JP country_KR country_KZ country_NG country_NL country_NO country_PK country_PL country_PT country_RO country_RS country_RU country_SE country_SI country_SK country_TH country_TR country_UA country_VA country_VN country_ZA dcmi_Text iso639_afr iso639_ara iso639_bre iso639_bul iso639_bxr iso639_cat iso639_ces iso639_chu iso639_dan iso639_deu iso639_ell iso639_eng iso639_est iso639_eus iso639_fao iso639_fas iso639_fin iso639_fra iso639_fro iso639_gle iso639_glg iso639_got iso639_grc iso639_heb iso639_hin iso639_hrv iso639_hsb iso639_hun iso639_hye iso639_ind iso639_ita iso639_jpn iso639_kaz iso639_kmr iso639_kor iso639_lat iso639_lav iso639_nld iso639_nor iso639_pcm iso639_pol iso639_por iso639_ron iso639_rus iso639_slk iso639_slv iso639_sme iso639_spa iso639_srp iso639_swe iso639_tha iso639_tur iso639_uig iso639_ukr iso639_urd iso639_vie iso639_zho olac_primary_text


http://www.language-archives.org/item.php/oai:lindat.mff.cuni.cz:11234/1-2899
Up-to-date as of: Thu Oct 5 0:40:56 EDT 2023