| Metadata | ||
| Title: | Sinica Corpus | |
| Academia Sinica Balanced Corpus of Modern Chinese | ||
| Contributor: | [role = editor] Chinese Knowledge Information Processing Group of Institute of Information Science of Academia Sinica | |
| [role = editor] Corpus Research Group in Institute of Linguistics, Academia Sinica | ||
| [role = sponsor] Academia Sinica Computing Centre | ||
| Chiang Ching-Kuo Foundation for International Scholarly Exchange | ||
| National Science Council | ||
| [role = sponsor] Chu-Ren Huang | ||
| Keh-jiann Chen | ||
| Knowledge Representation and Language Engineering for Mandarin Chinese-Core Technology and Tool Libraries for Language Processing | ||
| Coverage: | Modern | |
| Chung-hua Min-kuo (nation) | ||
| Taiwan | ||
| Creator: | Chinese Knowledge Information Processing Group of Institute of Information Science of Academia Sinica | |
| Corpus Research Group in Institute of Linguistics, Academia Sinica | ||
| Academia Sinica Computing Centre | ||
| Date: | 1990 [Created] | |
| 1995, July, 1995 [Available] | ||
| 1996, November, 1996 [Available] | ||
| 1991 [Modified] | ||
| 1994 [Modified] | ||
| 1997 [Modified] | ||
| 2001, December, 2001 [Modified] | ||
| Description: | tute of Information Science, Institute of Linguistics, ASCC. All Rights Reserved. | |
| http://www.sinica.edu.tw/ftms-bin/kiwi1/mkiwi.sh | ||
| http://www.sinica.edu.tw/SinicaCorpus/modern_c_help.html | ||
| Format: | 5 million words, 120868 KB, saved in text file and presented in HTML format | |
| big5 | ||
| Criteria for POS and Feature tagging in Academia Sinica Balanced Corpus of Modern Chinese | ||
| Identifier: | http://www.sinica.edu.tw/SinicaCorpus/ | |
| Language: | [sourcecode = JAVASCRIPT] | |
| [sourcecode = C] | ||
| [language = cmn] Chinese, Mandarin | ||
| Publisher: | Academia Sinica | |
| http://www.sinica.edu.tw/ | ||
| Institute of Linguistics, Preparatory Office Academia Sinica | ||
| http://www.ling.sinica.edu.tw/ | ||
| Institute of Information Science Academia Sinica | ||
| http://www.iis.sinica.edu.tw/ | ||
| Relation: | [cpu = x86] At least 32M memory [Requires] | |
| [cpu = MIPS] At least 32M memory [Requires] | ||
| [cpu = Alpha] At least 32M memory [Requires] | ||
| [cpu = PowerPC] At least 32M memory [Requires] | ||
| [cpu = Sparc] At least 32M memory [Requires] | ||
| [cpu = 680x0] At least 32M memory [Requires] | ||
| [os = Unix/Linux] [Requires] | ||
| Academia Sinica Tagged Corpus of Early Mandarin Chinese [References] | ||
| Academia Sinica Formosan Language Archive [References] | ||
| http://www.sinica.edu.tw/Early_Mandarin/ [References] | ||
| http://www.ling.sinica.edu.tw/formosan/ [References] | ||
| Rights: | This notice regulates your usage of this web site and its associated services including interface, corpus data, segmenting and tagging standard, etc. All rights are reserved by Academia Sinica. In your research you may apply the data resulting from the searching processes of our interface systems. However, you are prohibited to abstract, alter or publish any searching results voluntarily. The copyright of corpus data is still reserved by original author or source and cannot be reproduced, copied or violate anything involving intellectual property. | |
| Source: | General Magazine: Common Wealth-Taiwan leading's magazine, Sinorama Magazine, Travel, WorldScreen. | |
| Newspaper: China Times, Liberty Times, Mandarin Daily News,Newsletter of Computing Centre. | ||
| Academic Journal: Institute of Ethnology Publications, Institute of Biomedical Sciences (IBMS) Newsletter. | ||
| Textbook: Elementary School Mandarin Textbook 12 volumes. | ||
| Reference: CKIP Technical Reports. | ||
| Thesis: Paper. | ||
| General Book:8 volumes of the Common Psychology Series published by Hong's Foundation for Education and Culture, Carnival in Brazil published by China Times Publishing Co. | ||
| Audio/Visual Medium: paragraphs in Bulletin Board System in Taiwan. | ||
| Conversation/Interview: Interview Record of participants in Democracy Movement, Daily conversation of mainland Chinese students in America. | ||
| Elsewhere: Texts that cannot be categorized in above media. | ||
| Subject: | [language = eng] English | |
| [language = cmn] Chinese, Mandarin | ||
| Type: | [DCMIType] Text | |
OLAC Info |
||
| Archive: | Academia Sinica Balanced Corpus of Modern Chinese | |
| Description: | http://www.language-archives.org/archive/SinicaCorpus.sinica.edu.tw | |
| GetRecord: | OAI-PMH request for OLAC format | |
OAI Info |
||
| OaiIdentifier: | oai:SinicaCorpus.sinica.edu.tw:SinicaCorpus | |
| DateStamp: | 2002-12-14 | |
| GetRecord: | OAI-PMH request for simple DC format | |
Search Info |
||
| Terms | dcmi_Text iso639_cmn iso639_eng | |