OLAC Record
oai:catalogue.elra.info:ELRA-W0005

Metadata
Title:ECI-ELSNET Italian & German tagged sub-corpus
Abstract:The data is extracted from the ECI corpus (the German Frankfurter Rundschau part) and the Italian corpus of ILC/CNR. The German data covers the following domains: Economy (17,000 words), Politics (14,000 words), Culture (18,000 words), Sports (9,000 words), Local Events (8,500 words).
Access Rights:Rights available for: Research Use
Date Available (W3CDTF):1996-09-01
Date Issued (W3CDTF):2004-09-14
Date Modified (W3CDTF):2005-05-24
Description:Written Corpora
The objective is to provide a small but fine-grained morphosyntactically tagged corpus of 50,000 running words for each of the two languages (Italian and German), to be used in research on tagging methods and models. The German text comes from the Frankfurter Rundschau portion of the ECI corpus; the Italian material comes from the Italian corpus of ILC-CNR. The German data covers several domains, including Economy (17,000 word forms), Politics (14,000 word forms), Culture (18,000 word forms), Sports (9,000 word forms), and Local Events (8,500 word forms). The situation for Italian is comparable. Word occurrences are tagged with very fine-grained tagsets based on the EAGLES morphosyntactic guidelines. The tagging was done automatically and then manually checked. The CD-ROM contains the text in SGML format, the DBT software, which supports browsing and other operations on the annotated text, and the EAGLES morphosyntactic guidelines.
Identifier:ELRA-W0005
http://catalog.elra.info/product_info.php?products_id=86
Language:Italian
German
Language (ISO639):ita
deu
Medium:CD-ROM
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0005
DateStamp:  1996-09-01
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: area_Europe country_DE country_IT dcmi_Text iso639_deu iso639_ita olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0005
Up-to-date as of: Mon Feb 27 0:28:59 EST 2017