Recommended metadata extensions

Date issued:2002-11-30
Status of document:Proposed Recommendation. This document is in the midst of open review by the community.
This version:http://www.language-archives.org/REC/olac-extensions-20021130.html
Latest version:http://www.language-archives.org/REC/olac-extensions.html
Previous version:None.
Abstract:

This document lists all the metadata extensions that are recommended by the Open Language Archives Community for use in describing language resources. (The date issued for this document is the revision date for the text in section 1 and the list of extensions included in section 2. The information about a particular extension may be more recent than this date; see the specific revision date given for each extension.)

Editors: Gary Simons, SIL International (mailto:gary_simons@sil.org)
Steven Bird, University of Melbourne (mailto:sb@cs.mu.oz.au)
Copyright © 2002 Gary Simons (SIL International) and Steven Bird (University of Melbourne). This material may be distributed and repurposed subject to the terms and conditions set forth in the Creative Commons Attribution-ShareAlike 2.5 License.

Table of contents

  1. Introduction
  2. Recommended extensions
References

1. Introduction

The OLAC metadata standard [OLAC-MS] follows the generic resource description standard known as "Qualified Dublin Core" [DCQ]. In order to meet the specific needs of the language resources community, the OLAC metadata standard incorporates an extension mechanism that makes it possible to describe language resources with greater precision. The mechanism uses the xsi:type attribute to override the basic definition of a Dublin Core metadata element with a definition that has a more precise semantics. For instance, the following subject description,

<dc:subject>Dschang language</dc:subject>

can be formally identified as pertaining to language and as relating to the specific language associated with a standardized language identification code by employing an OLAC extension:

<dc:subject xsi:type="olac:language" code="x-sil-BAN"/>

Details on how the extension mechanism works may be found in the last two sections of [OLAC-MS].

Any party may develop a metadata extension and use it in OLAC metadata records. When an extension is proven to work and is judged to have wide relevance across the language resources community, then it may be put forward to the community as a recommended best practice. If, after following the process described in [OLAC-Process], the community reaches consensus that the extension should indeed be used where applicable by all OLAC member archives, then the extension achieves the status of an OLAC Recommendation.

This document provides the complete list of all extensions that have been adopted as OLAC Recommendations. Each extension is described in terms of a title and the following five attributes:

Name

The symbolic name that is used as the value of the xsi:type attribute to indicate that the extension is being used in a metadata element.

Date

The latest revision date of the extension or its controlled vocabulary.

Applies to

The Dublin Core elements with which the extension may be used.

Description

A summary description of what the extension is used for.

Documentation

A link to a complete document that defines and exemplifies the extension. If the extension involves a controlled vocabulary, the document should also enumerate and define the terms of the vocabulary.

2. Recommended extensions

The Open Language Archives Community recommends that all participating data providers use the following metadata extensions for describing language resources whenever they are applicable:

Code for Discourse Types

Name:olac:discourse-type
Date:2002-11-21
Applies to:type, subject
Description:Provides a controlled vocabulary for identifying approximately ten discourse types. It is used with Type to identify the genre of a language resource (particularly a primary text). It may also be used with Subject to identify a work as being about a particular genre.
Documentation:http://www.language-archives.org/REC/discourse.html
Schema:http://www.language-archives.org/OLAC/1.0/olac-discourse-type.xsd

Code for Language Identification

Name:olac:language
Date:2003-07-01
Applies to:language, subject
Description:Provides codes for identifying all known languages, both living and extinct.
Documentation:http://www.language-archives.org/REC/language.html
Schema:http://www.language-archives.org/OLAC/1.0/olac-language.xsd

Code for Linguistic Field

Name:olac:linguistic-field
Date:2003-01-21
Applies to:subject
Description:These codes describe the content of a resource as relevant to a particular subfield of linguistic science.
Documentation:http://www.language-archives.org/REC/field.html
Schema:http://www.language-archives.org/OLAC/1.0/olac-linguistic-field.xsd

Code for Linguistic Data Types

Name:olac:linguistic-type
Date:2002-12-12
Applies to:type
Description:Provides a broad classification of the nature of the resource from a linguistic point of view (namely, as a lexicon, a primary text, a data set, or a language description).
Documentation:http://www.language-archives.org/REC/type.html
Schema:http://www.language-archives.org/OLAC/1.0/olac-linguistic-type.xsd

Code for Participant Roles

Name:olac:role
Date:2003-08-27
Applies to:creator, contributor
Description:Provides a controlled vocabulary for identifying the role of a Creator or Contributor more precisely. The vocabulary identifies approximately twenty roles that are common in the development of language resources.
Documentation:http://www.language-archives.org/REC/role.html
Schema:http://www.language-archives.org/OLAC/1.0/olac-role.xsd


References

[DCQ]Dublin Core Qualifiers.
<http://dublincore.org/documents/2000/07/11/dcmes-qualifiers/>
[OLAC-MS]OLAC Metadata Standard.
<http://www.language-archives.org/OLAC/metadata.html>
[OLAC-Process]OLAC Process.
<http://www.language-archives.org/OLAC/process.html>