Sample Metadata Record

oai:www.ldc.upenn.edu:LDC2008T01


XML format

<olac:olac>
<dc:title>Hungarian-English Parallel Text, Version 1.0</dc:title>
<dc:creator xsi:type="olac:role" olac:code="author">Dániel Varga, László Németh, Péter Halácsy, András Kornai, et. al.</dc:creator>
<dc:date xsi:type="dcterms:W3CDTF">2008-01-22</dc:date>
<dc:description>Version: 1.0</dc:description>
<dc:description>Data type: Text</dc:description>
<dc:description>Release type: General</dc:description>
<dc:description>Non-member fee: 1000.00 USD</dc:description>
<dc:description>Reduced-license fee: 500.00 USD</dc:description>
<dc:description>Extra-copy fee: 150.00 USD</dc:description>
<dc:description>Online documentation: http://www.ldc.upenn.edu/Catalog/docs/LDC2008T01</dc:description>
<dc:description>Application: cross-lingual information retrieval</dc:description>
<dc:description>Application: machine translation</dc:description>
<dc:description>Membership year: 2008</dc:description>
<dc:description>Data source: web collection</dc:description>
<dc:description>Data source: varied</dc:description>
<dc:description>Data source: newswire</dc:description>
<dc:description>Data source: news magazine</dc:description>
<dcterms:extent>Corpus size: 1992294.400 KB</dcterms:extent>
<dcterms:medium>Distribution: 1 CD</dcterms:medium>
<dc:identifier>LDC2008T01</dc:identifier>
<dc:identifier>ISBN: 1-58563-461-1</dc:identifier>
<dc:rights>Member licence: http://www.ldc.upenn.edu/Catalog/mem_agree/LDC2008T01_agreement.html</dc:rights>
<dc:rights>Non-member license: http://www.ldc.upenn.edu/Catalog/nonmem_agree/LDC2008T01_agreement.html</dc:rights>
</olac:olac>

Display format

 Title  Hungarian-English Parallel Text, Version 1.0
 Creator (author)  Dániel Varga, László Németh, Péter Halácsy, András Kornai, et. al.
 Date  (W3CDTF)  2008-01-22
 Description  Version: 1.0
 Description  Data type: Text
 Description  Release type: General
 Description  Non-member fee: 1000.00 USD
 Description  Reduced-license fee: 500.00 USD
 Description  Extra-copy fee: 150.00 USD
 Description  Online documentation: http://www.ldc.upenn.edu/Catalog/docs/LDC2008T01
 Description  Application: cross-lingual information retrieval
 Description  Application: machine translation
 Description  Membership year: 2008
 Description  Data source: web collection
 Description  Data source: varied
 Description  Data source: newswire
 Description  Data source: news magazine
 Extent  Corpus size: 1992294.400 KB
 Medium  Distribution: 1 CD
 Identifier  LDC2008T01
 Identifier  ISBN: 1-58563-461-1
 Rights  Member licence: http://www.ldc.upenn.edu/Catalog/mem_agree/LDC2008T01_agreement.html
 Rights  Non-member license: http://www.ldc.upenn.edu/Catalog/nonmem_agree/LDC2008T01_agreement.html

Metadata quality analysis

OLAC metadata records are scored for metadata quality on a 10-point scale explained in OLAC Metadata Metrics. The score for the above record (along with comments on changes that could improve the score) is as follows:

Component + - Comments
Title   1   0 
Date   1   0 
Agent   1   0 
About   1   0 
Depth   1   0 
Content Language   0   1  Add a dc:language element with an ISO 639-3 code to identify the language in which the resource is written or spoken.
Subject Language   1   0 
OLAC Type   0   1  Add a dc:type element that uses the OLAC linguistic-type encoding scheme to identify the type of the resource from a linguistic point of view.
DCMI Type   0   1  Add a dc:type element that uses the DCMIType encoding scheme to identify the generic type of the resource.
Precision   0.67   0.33  For the full score, make use of at least one more encoding scheme in addition to the ones counted explicitly in other components of the score. For instance,
  • use dcterms:IMT on dc:format
Quality score  6.67