Archive Details


Academia Sinica Balanced Corpus of Modern Chinese

Size:1
Repository Name:Academia Sinica Balanced Corpus of Modern Chinese
Institution:Academia Sinica
ArchiveURL:http://www.sinica.edu.tw/SinicaCorpus/
Curator:Corpus Research Group in Institute of Linguistics, Chinese Knowledge Information Processing Group of Institute of Information Science and Academia Sinica Computing Centre of Academia Sinica
Short Location:Taipei, Taiwan
Synopsis:-Purpose: The Academia Sinica Balanced Corpus of Modern Chinese, abbreviated as Sinica Corpus, is designed for analyzing modern Chinese. Every text in the corpus is segmented, and each segmented word is tagged with its part of speech. Texts are collected from different areas and classified according to various topics and modes, so the corpus can be regarded as a representative model of the modern Chinese language. -Coverage: Taiwan, TGN:Chung-hua Min-kuo (nation) -Scope: Newspaper, General Magazine, Academic Journal, Textbook, Reference, Thesis, General Book, Audio/Visual Medium, Conversation/Interview, Elsewhere
Access:This notice regulates your use of this web site and its associated services, including the interface, corpus data, segmenting and tagging standard, etc. All rights are reserved by Academia Sinica. In your research you may use the data resulting from searches in our interface systems. However, you are prohibited from abstracting, altering or publishing any search results voluntarily. The copyright of the corpus data remains with the original author or source, and the data cannot be reproduced, copied or used in any way that violates intellectual property rights.
Administrator:ore@language-archives.org
Base URL:http://www.language-archives.org/sr/www.language-archives.org/devel/sr/SinicaCorpus.xml
Repository ID:SinicaCorpus.sinica.edu.tw
OAI Version:2.0
OLAC Version:1.0
Records in Archive:http://www.language-archives.org/archive_records/SinicaCorpus.sinica.edu.tw
Explore:Visit archive with the Repository Explorer
Last Harvested:2008-12-01
Reports:Archive Metrics and Integrity Checks