OLAC Record
oai:catalogue.elra.info:ELRA-W0004

Metadata
Title:ECI/MCI (European Corpus Initiative/Multilingual Corpus I)
Abstract:Over 98 million words, covering most of the major European languages, as well as Turkish, Japanese, Russian, Chinese, Malay and more.
Access Rights:Rights available for: Research Use
Date Available (W3CDTF):1996-09-01
Date Issued (W3CDTF):2004-09-14
Date Modified (W3CDTF):2015-02-05
Description:Written Corpora
The European Corpus Initiative (ECI) was founded to oversee the acquisition and preparation of a large multilingual corpus, and supports existing and projected national and international efforts to carefully design, collect and publish large-scale multilingual written and spoken corpora. ECI has produced the Multilingual Corpus I (ECI/MCI) of over 98 million words, covering most of the major European languages, as well as Turkish, Japanese, Russian, Chinese, Malay and more. The primary focus in this effort is on textual material of all kinds, including transcriptions of spoken material. Just a sampling of the contents of the CD-ROM: German newspaper texts from the Frankfurter Rundschau from July 1992 -March 1993. provided by Universit?t Gesamthochschule, Paderborn, Germany. Approximately 34 million words. French newspaper texts from Le Monde, consisting of material from September 1989, October 1989, and January 1990. Provided by LIMSI CNRS, France. Approximately 4.1 million words. Extracts from the Leiden Corpus of Dutch, consisting of newspapers, transcribed speech, etc. Provided by Institut voor Nederlandse Lexicologie, Leiden, Holland. Approximately 5.5 million words. International Labor Organisation (ILO) "Official Bulletin, B Series". Vols LXVII(1984) - LXXII(1989). Parallel texts in English, French and Spanish provided by the International Labor Organisation. Approximately 5 million words.
Identifier:ELRA-W0004
http://catalog.elra.info/product_info.php?products_id=85
Language:Turkish
Albanian
Bulgarian
Chinese
Czech
Dutch, Flemish
English
Estonian
French
Gaelic, Scottish Gaelic
German
Modern Greek (1453-); Greek, Modern (1453-)
Italian
Japanese
Latin
Lithuanian
Malay (macrolanguage); Malay
Spanish, Castilian
Serbian
Danish
Russian
Norwegian
Uzbek
Portuguese
Swedish
Language (ISO639):tur
sqi
bul
zho
ces
nld
eng
est
fra
gla
deu
ell
ita
jpn
lat
lit
msa
spa
dan
rus
nor
uzb
por
swe
Medium:CD-ROM
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0004
DateStamp:  1996-09-01
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: area_Asia area_Europe country_BG country_CZ country_DE country_DK country_ES country_FR country_GB country_GR country_IT country_JP country_LT country_NL country_NO country_PT country_RU country_SE country_TR country_VA dcmi_Text iso639_bul iso639_ces iso639_dan iso639_deu iso639_ell iso639_eng iso639_est iso639_fra iso639_gla iso639_ita iso639_jpn iso639_lat iso639_lit iso639_msa iso639_nld iso639_nor iso639_por iso639_rus iso639_spa iso639_sqi iso639_swe iso639_tur iso639_uzb iso639_zho olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0004
Up-to-date as of: Wed Mar 29 3:47:43 EDT 2017