OLAC Record
oai:catalogue.elra.info:ELRA-W0024-02

Metadata
Title:PAROLE Portuguese Corpus - tagged subset
Abstract:The PAROLE Portuguese corpus contains approximately 3 million running words of European Portuguese distributed by Medium (Newspaper, Book, Periodical, Miscellaneous). The corpus was classified and encoded according to the common core PAROLE encoding standard. The file format of this corpus is SGML. Also available is a subcorpus of about 250,000 morpho-syntactically tagged words. Disambiguation was manually checked.
Access Rights:Rights available for: Research Use, Commercial Use
Date Available (W3CDTF):2000-05-19
Date Issued (W3CDTF):2004-05-12
Date Modified (W3CDTF):2004-05-12
Description:Written Corpora
The PAROLE Portuguese corpus contains approximately 3 million running words of European Portuguese distributed by Medium, as follows:
* Newspaper: about 65%, from 3 titles covering the period 1996-1997;
* Book: about 20%, from 12 titles by 3 publishing houses;
* Periodical: about 5%, from 7 weekly issues of 1 title, 1996;
* Miscellaneous: about 10%, from several files distributed across 8 titles.
The corpus was classified and encoded according to the common core PAROLE encoding standard. The file format of this corpus is SGML. A subcorpus of the PAROLE Portuguese Corpus, which approximately reproduces the distribution by Medium of the whole corpus (Newspaper: about 65%, Book: about 20%, Periodical: about 5%, Miscellaneous: about 10%), is also available. It contains about 250,000 words morpho-syntactically tagged according to the PAROLE common tagset and morpho-syntactic annotation standards. Disambiguation was manually checked.
Identifier:ELRA-W0024-02
http://catalog.elra.info/product_info.php?products_id=766
Language:Portuguese
Language (ISO639):por
Publisher:ELRA (European Language Resources Association)
Type (DCMI):Text
Type (OLAC):primary_text

OLAC Info

Archive:  ELRA Catalogue of Language Resources
Description:  http://www.language-archives.org/archive/catalogue.elra.info
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:catalogue.elra.info:ELRA-W0024-02
DateStamp:  2000-05-19
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: n.a. 2004. ELRA (European Language Resources Association).
Terms: area_Europe country_PT dcmi_Text iso639_por olac_primary_text


http://www.language-archives.org/item.php/oai:catalogue.elra.info:ELRA-W0024-02
Up-to-date as of: Fri Jun 23 1:05:13 EDT 2017