OLAC Record oai:www.ldc.upenn.edu:LDC2006T01 |
Metadata | ||
Title: | Prague Dependency Treebank 2.0 | |
Access Rights: | Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining | |
Bibliographic Citation: | Hajič, Jan , et al. Prague Dependency Treebank 2.0 LDC2006T01. Web Download. Philadelphia: Linguistic Data Consortium, 2006 | |
Contributor: | Hajič, Jan | |
Panevová, Jarmila | ||
Hajičová, Eva | ||
Sgall, Petr | ||
Pajas, Petr | ||
Štěpánek, Jan | ||
Havelka, Jiří | ||
Mikulová, Marie | ||
Žabokrtský, Zdeněk | ||
Ševčíková-Razímová, Magda | ||
Urešová, Zdeňka | ||
Date (W3CDTF): | 2006 | |
Date Issued (W3CDTF): | 2006-07-21 | |
Description: | *Introduction* The Prague Dependency Treebank 2.0 (PDT 2.0) contains a large amount of Czech text with complex and interlinked morphological (two million words), syntactic (1.5 million words) and complex semantic annotation (0.8 million words). In addition, certain properties of sentence information structure and coreference relations are annotated at the semantic level. PDT 2.0 is based on the long-standing Praguian linguistic tradition, adapted to current computational linguistics research needs. The corpus itself uses the latest annotation technology. Software tools for corpus search, annotation and language analysis are included. Extensive documentation (in English) is provided as well. *Samples* For an example of the data in this publication, please examine these samples. | |
Extent: | Corpus size: 515072 KB | |
Identifier: | LDC2006T01 | |
https://catalog.ldc.upenn.edu/LDC2006T01 | ||
ISBN: 1-58563-370-4 | ||
ISLRN: 942-053-729-014-3 | ||
DOI: 10.35111/e6p0-9s32 | ||
Language: | Czech | |
Language (ISO639): | ces | |
License: | Prague Dependency Treebank 2.0: https://catalog.ldc.upenn.edu/license/prague-dependency-treebank-2.pdf | |
Medium: | Distribution: Web Download | |
Publisher: | Linguistic Data Consortium | |
Publisher (URI): | https://www.ldc.upenn.edu | |
Relation (URI): | https://catalog.ldc.upenn.edu/docs/LDC2006T01 | |
Rights Holder: | Portions © 1991, 1994, 1995 Lidové noviny daily newspapers, © 1992 Mladá fronta Dnes daily newspapers, © 1994 Českomoravský Profit business weekly, © 1992-1993 Vesmír scientific magazine, Academia Publishers, © 1996-2005 Institute of Formal and Applied Linguistics and Center for Computational Linguistics, Faculty of Mathematics and Physics, Charles University, © 2006 Trustees of the University of Pennsylvania | |
Type (DCMI): | Text | |
Type (OLAC): | primary_text | |
OLAC Info |
||
Archive: | The LDC Corpus Catalog | |
Description: | http://www.language-archives.org/archive/www.ldc.upenn.edu | |
GetRecord: | OAI-PMH request for OLAC format | |
GetRecord: | Pre-generated XML file | |
OAI Info |
||
OaiIdentifier: | oai:www.ldc.upenn.edu:LDC2006T01 | |
DateStamp: | 2020-11-30 | |
GetRecord: | OAI-PMH request for simple DC format | |
Search Info | ||
Citation: | Hajič, Jan; Panevová, Jarmila; Hajičová, Eva; Sgall, Petr; Pajas, Petr; Štěpánek, Jan; Havelka, Jiří; Mikulová, Marie; Žabokrtský, Zdeněk; Ševčíková-Razímová, Magda; Urešová, Zdeňka. 2006. Prague Dependency Treebank 2.0. Linguistic Data Consortium. | |
Terms: | area_Europe country_CZ dcmi_Text iso639_ces olac_primary_text |