![]() |
OLAC Record oai:lindat.mff.cuni.cz:11234/1-4629 |
| Metadata | ||
| Title: | Universal Segmentations 1.0 (UniSegments 1.0) | |
| Bibliographic Citation: | http://hdl.handle.net/11234/1-4629 | |
| Creator: | Žabokrtský, Zdeněk | |
| Bafna, Nyati | ||
| Bodnár, Jan | ||
| Kyjánek, Lukáš | ||
| Svoboda, Emil | ||
| Ševčíková, Magda | ||
| Vidra, Jonáš | ||
| Angle, Sachi | ||
| Ansari, Ebrahim | ||
| Arkhangelskiy, Timofey | ||
| Batsuren, Khuyagbaatar | ||
| Bella, Gábor | ||
| Bertinetto, Pier Marco | ||
| Bonami, Olivier | ||
| Celata, Chiara | ||
| Daniel, Michael | ||
| Fedorenko, Alexei | ||
| Filko, Matea | ||
| Giunchiglia, Fausto | ||
| Haghdoost, Hamid | ||
| Hathout, Nabil | ||
| Khomchenkova, Irina | ||
| Khurshudyan, Victoria | ||
| Levonian, Dmitri | ||
| Litta, Eleonora | ||
| Medvedeva, Maria | ||
| Muralikrishna, S. N. | ||
| Namer, Fiammetta | ||
| Nikravesh, Mahshid | ||
| Padó, Sebastian | ||
| Passarotti, Marco | ||
| Plungian, Vladimir | ||
| Polyakov, Alexey | ||
| Potapov, Mihail | ||
| Pruthwik, Mishra | ||
| Rao B, Ashwath | ||
| Rubakov, Sergei | ||
| Samar, Husain | ||
| Sharma, Dipti Misra | ||
| Šnajder, Jan | ||
| Šojat, Krešimir | ||
| Štefanec, Vanja | ||
| Talamo, Luigi | ||
| Tribout, Delphine | ||
| Vodolazsky, Daniil | ||
| Vydrin, Arseniy | ||
| Zakirova, Aigul | ||
| Zeller, Britta | ||
| Date (W3CDTF): | 2022-01-24T15:25:57Z | |
| Date Available: | 2022-01-24T15:25:57Z | |
| Description: | Universal Segmentations (UniSegments) is a collection of lexical resources capturing morphological segmentations harmonised into a cross-linguistically consistent annotation scheme for many languages. The annotation scheme consists of simple tab-separated columns that stores a word and its morphological segmentations, including pieces of information about the word and the segmented units, e.g., part-of-speech categories, type of morphs/morphemes etc. The current public version of the collection contains 38 harmonised segmentation datasets covering 30 different languages. | |
| Identifier (URI): | http://hdl.handle.net/11234/1-4629 | |
| Language: | Czech | |
| Catalan | ||
| German | ||
| English | ||
| Persian | ||
| Finnish | ||
| French | ||
| Serbo-Croatian | ||
| Croatian | ||
| Hungarian | ||
| Italian | ||
| Komi-Zyrian | ||
| Latin | ||
| Moksha | ||
| Mari (Russia) | ||
| Mongolian | ||
| Erzya | ||
| Polish | ||
| Portuguese | ||
| Russian | ||
| Spanish | ||
| Swedish | ||
| Tajik | ||
| Udmurt | ||
| Armenian | ||
| Bengali | ||
| Hindi | ||
| Malayalam | ||
| Marathi | ||
| Kannada | ||
| Language (ISO639): | ces | |
| cat | ||
| deu | ||
| eng | ||
| fas | ||
| fin | ||
| fra | ||
| hbs | ||
| hrv | ||
| hun | ||
| ita | ||
| kpv | ||
| lat | ||
| mdf | ||
| chm | ||
| mon | ||
| myv | ||
| pol | ||
| por | ||
| rus | ||
| spa | ||
| swe | ||
| tgk | ||
| udm | ||
| hye | ||
| ben | ||
| hin | ||
| mal | ||
| mar | ||
| kan | ||
| Publisher: | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) | |
| Rights: | Universal Segmentations 1.0 License Terms | |
| https://lindat.mff.cuni.cz/repository/xmlui/page/licence-unisegs-1.0 | ||
| Subject: | universal segmentations | |
| morphological segmentation | ||
| word segmentation | ||
| segmentation | ||
| morphology | ||
| morphemes | ||
| morphological dictionary | ||
| unisegments | ||
| morph | ||
| multilingual | ||
| Czech language | ||
| Catalan language | ||
| German language | ||
| English language | ||
| Persian language | ||
| Finnish language | ||
| French language | ||
| Serbo-Croatian language | ||
| Croatian language | ||
| Hungarian language | ||
| Italian language | ||
| Komi-Zyrian language | ||
| Latin language | ||
| Moksha language | ||
| Mari (Russia) language | ||
| Mongolian language | ||
| Erzya language | ||
| Polish language | ||
| Portuguese language | ||
| Russian language | ||
| Spanish language | ||
| Swedish language | ||
| Tajik language | ||
| Udmurt language | ||
| Armenian language | ||
| Bengali language | ||
| Hindi language | ||
| Malayalam language | ||
| Marathi language | ||
| Kannada language | ||
| Subject (ISO639): | ces | |
| cat | ||
| deu | ||
| eng | ||
| fas | ||
| fin | ||
| fra | ||
| hbs | ||
| hrv | ||
| hun | ||
| ita | ||
| kpv | ||
| lat | ||
| mdf | ||
| chm | ||
| mon | ||
| myv | ||
| pol | ||
| por | ||
| rus | ||
| spa | ||
| swe | ||
| tgk | ||
| udm | ||
| hye | ||
| ben | ||
| hin | ||
| mal | ||
| mar | ||
| kan | ||
| Type: | lexicalConceptualResource | |
| Type (DCMI): | Text | |
| Type (OLAC): | lexicon | |
OLAC Info |
||
| Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
| Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
| GetRecord: | OAI-PMH request for OLAC format | |
| GetRecord: | Pre-generated XML file | |
OAI Info |
||
| OaiIdentifier: | oai:lindat.mff.cuni.cz:11234/1-4629 | |
| DateStamp: | 2022-01-24 | |
| GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
| Citation: | Žabokrtský, Zdeněk; Bafna, Nyati; Bodnár, Jan; Kyjánek, Lukáš; Svoboda, Emil; Ševčíková, Magda; Vidra, Jonáš; Angle, Sachi; Ansari, Ebrahim; Arkhangelskiy, Timofey; Batsuren, Khuyagbaatar; Bella, Gábor; Bertinetto, Pier Marco; Bonami, Olivier; Celata, Chiara; Daniel, Michael; Fedorenko, Alexei; Filko, Matea; Giunchiglia, Fausto; Haghdoost, Hamid; Hathout, Nabil; Khomchenkova, Irina; Khurshudyan, Victoria; Levonian, Dmitri; Litta, Eleonora; Medvedeva, Maria; Muralikrishna, S. N.; Namer, Fiammetta; Nikravesh, Mahshid; Padó, Sebastian; Passarotti, Marco; Plungian, Vladimir; Polyakov, Alexey; Potapov, Mihail; Pruthwik, Mishra; Rao B, Ashwath; Rubakov, Sergei; Samar, Husain; Sharma, Dipti Misra; Šnajder, Jan; Šojat, Krešimir; Štefanec, Vanja; Talamo, Luigi; Tribout, Delphine; Vodolazsky, Daniil; Vydrin, Arseniy; Zakirova, Aigul; Zeller, Britta. 2022. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL). | |
| Terms: | area_Asia area_Europe country_AM country_BD country_CZ country_DE country_ES country_FI country_FR country_GB country_HR country_HU country_IN country_IT country_PL country_PT country_RU country_SE country_TJ country_VA dcmi_Text iso639_ben iso639_cat iso639_ces iso639_chm iso639_deu iso639_eng iso639_fas iso639_fin iso639_fra iso639_hbs iso639_hin iso639_hrv iso639_hun iso639_hye iso639_ita iso639_kan iso639_kpv iso639_lat iso639_mal iso639_mar iso639_mdf iso639_mon iso639_myv iso639_pol iso639_por iso639_rus iso639_spa iso639_swe iso639_tgk iso639_udm olac_lexicon | |
Inferred Metadata | ||
| Country: | ArmeniaBangladeshCzech RepublicGermanySpainFinlandFranceUnited KingdomCroatiaHungaryIndiaItalyPolandPortugalRussian FederationSwedenTajikistanVatican State | |
| Area: | AsiaEurope | |