OLAC Record

Title:Korean Treebank Annotations Version 2.0
Access Rights:Licensing Instructions for Subscription & Standard Members, and Non-Members: http://www.ldc.upenn.edu/language-resources/data/obtaining
Bibliographic Citation:Han, Na-Rae, et al. Korean Treebank Annotations Version 2.0 LDC2006T09. Web Download. Philadelphia: Linguistic Data Consortium, 2006
Contributor:Han, Na-Rae
Ryu, Shijong
Chae, Sook-Hee
Yang, Seung-yun
Lee, Seunghun
Palmer, Martha
Date (W3CDTF):2006
Date Issued (W3CDTF):2006-04-17
Description:*Introduction* The Korean Treebank Annotations Version 2.0 is an extension of the Korean English Treebank Annotations corpus, LDC2002T26 (2002). It is essentially an electronic corpus of Korean texts annotated with morphological and syntactic information. The original texts for the Korean Treebank 2.0 were selected from The Korean Newswire corpus published by LDC, catalog number LDC2000T45, which is a collection of Korean Press Agency news articles from June 2, 1994 to March 20, 2000. Korean Treebank 2.0 is based on the March 2000 portion of the corpus and includes 647 articles. The annotated corpus can find many uses, including training of morphological analyzers, part-of-speech taggers and syntactic parsers. The text is encoded as KSC-5601(EUC-KR). Version 1.1 of the treebank is included in this release. *Samples* For an example of the data in the corpus, please review this sample.
Extent:Corpus size: 19456 KB
ISBN: 1-58563-381-X
ISLRN: 365-025-522-700-1
DOI: 10.35111/02nk-p662
Language (ISO639):kor
License:Korean Treebank Annotations Version 2.0 Agreement: https://catalog.ldc.upenn.edu/license/korean-treebank-annotations-version-2.pdf
Medium:Distribution: Web Download
Publisher:Linguistic Data Consortium
Publisher (URI):https://www.ldc.upenn.edu
Relation (URI):https://catalog.ldc.upenn.edu/docs/LDC2006T09
Rights Holder:© 2001-2002 CoGenTex, Inc., © 2000 Korean Press Agency, © 2000-2005, 2006 Trustees of the University of Pennsylvania
Type (DCMI):Text
Type (OLAC):primary_text


Archive:  The LDC Corpus Catalog
Description:  http://www.language-archives.org/archive/www.ldc.upenn.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:www.ldc.upenn.edu:LDC2006T09
DateStamp:  2020-11-30
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Han, Na-Rae; Ryu, Shijong; Chae, Sook-Hee; Yang, Seung-yun; Lee, Seunghun; Palmer, Martha. 2006. Linguistic Data Consortium.
Terms: area_Asia country_KR dcmi_Text iso639_kor olac_primary_text

Up-to-date as of: Fri Dec 25 20:03:27 EST 2020