OLAC Record oai:lindat.mff.cuni.cz:11234/1-3691 |
| Metadata | ||
| Title: | RobeCzech Base | |
| Bibliographic Citation: | http://hdl.handle.net/11234/1-3691 | |
| Creator: | Straka, Milan | |
| Náplava, Jakub | ||
| Straková, Jana | ||
| Samuel, David | ||
| Date (W3CDTF): | 2021-05-25T08:15:54Z | |
| Date Available: | 2021-05-25T08:15:54Z | |
| Description: | RobeCzech is a monolingual RoBERTa language representation model trained on Czech data. RoBERTa is a robustly optimized Transformer-based pretraining approach. We show that RobeCzech considerably outperforms equally-sized multilingual and Czech-trained contextualized language representation models, surpasses current state of the art in all five evaluated NLP tasks and reaches state-of-the-art results in four of them. The RobeCzech model is released publicly at https://hdl.handle.net/11234/1-3691 and https://huggingface.co/ufal/robeczech-base, both for PyTorch and TensorFlow. | |
| Identifier (URI): | http://hdl.handle.net/11234/1-3691 | |
| Language: | Czech | |
| Language (ISO639): | ces | |
| Publisher: | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) | |
| Rights: | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) | |
| http://creativecommons.org/licenses/by-nc-sa/4.0/ | ||
| Subject: | Czech | |
| BERT | ||
| RoBERTa | ||
| Czech language | ||
| Subject (ISO639): | ces | |
| Type: | languageDescription | |
| Type (DCMI): | Text | |
| Type (OLAC): | language_description | |
OLAC Info |
||
| Archive: | LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University | |
| Description: | http://www.language-archives.org/archive/lindat.mff.cuni.cz | |
| GetRecord: | OAI-PMH request for OLAC format | |
| GetRecord: | Pre-generated XML file | |
OAI Info |
||
| OaiIdentifier: | oai:lindat.mff.cuni.cz:11234/1-3691 | |
| DateStamp: | 2021-06-29 | |
| GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
| Citation: | Straka, Milan; Náplava, Jakub; Straková, Jana; Samuel, David. 2021. RobeCzech Base. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL). | |
| Terms: | area_Europe country_CZ dcmi_Text iso639_ces olac_language_description | |
Inferred Metadata | ||
| Country: | Czech Republic | |
| Area: | Europe | |
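
The GetRecord entries above refer to OAI-PMH requests for this record in the OLAC and simple DC formats. As a minimal sketch of how such request URLs are typically assembled, the Python snippet below builds them from the record's OaiIdentifier. The base URL is a hypothetical placeholder, since this record does not state the repository's OAI-PMH endpoint; the `verb`, `identifier`, and `metadataPrefix` parameters come from the OAI-PMH protocol, and the `olac` and `oai_dc` prefixes are assumed to be the ones the repository exposes.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the record does not state the repository's
# actual OAI-PMH endpoint, so substitute the real base URL before use.
BASE_URL = "https://example.org/oai/request"

# Taken from the OaiIdentifier field of this record.
OAI_IDENTIFIER = "oai:lindat.mff.cuni.cz:11234/1-3691"


def get_record_url(identifier: str, metadata_prefix: str,
                   base_url: str = BASE_URL) -> str:
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Assumed metadata prefixes: "olac" for the OLAC format and
# "oai_dc" for simple Dublin Core.
olac_url = get_record_url(OAI_IDENTIFIER, "olac")
dc_url = get_record_url(OAI_IDENTIFIER, "oai_dc")

print(olac_url)
print(dc_url)
```

Fetching either URL against the real endpoint should return the record as XML; the identifier is percent-encoded because it contains `:` and `/`.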