OLAC Record
oai:scholarspace.manoa.hawaii.edu:10125/26154

Metadata
Title:Novel developments in ELAN
Bibliographic Citation:Sloetjes, Han, Stehouwer, Herman, Drude, Sebastian; 2013-02-28; Kaipuleohone University of Hawai'i Digital Language Archive; http://hdl.handle.net/10125/26154.
Contributor (speaker):Sloetjes, Han
Stehouwer, Herman
Drude, Sebastian
Creator:Sloetjes, Han
Stehouwer, Herman
Drude, Sebastian
Date (W3CDTF):2013-02-28
Description:Modern language documentation depends on suitable software infrastructure. ELAN is a well-known tool developed at The Language Archive / MPI-PL which allows multi-tier, multi-speaker, time-linked annotation of audio and video recordings, in particular in a fieldwork and language documentation setting. In the past two years ELAN has been under constant development. Here we will give an overview of the major recent enhancements to ELAN and ongoing work. Combined, these changes provide a better and much faster process for the field linguist. Below we address five aspects, each consisting of multiple new features. We will briefly discuss their impact on typical workflows. First, there are modes that help you perform specialized tasks more efficiently. These are a) the segmentation mode, b) the transcription mode, and c) the interlinearization mode. With a focused user interface for each task, the segmentation and the transcription modes together provide very efficient means for the initial steps of a typical workflow. The interlinearization mode, which is still in an early phase of development, is optimized for the next steps of (morphological) parsing, glossing and tagging. It does so by providing an interface to a new program: Lexan. Lexan is an extensible system for "annotyzers" (annotation-suggestion modules). These can be used to perform many simple and complex tasks, from tier copying through word segmentation and interlinearization to machine learning. Second, the interoperability with FLEx (FieldWorks Language Explorer) has been improved. An export function for the FLEx file format now complements the updated import function. Third, extensive support for performing operations on multiple files has been added. These include a) file-format conversion (including Toolbox and Praat), and b) creation of similarly structured EAF files for a selection of media files. 
Fourth, facilities have been added to create new tiers with annotations on the basis of existing tiers while applying logical operations. For example, if an annotation occurs in both tier A and tier B, then the combined annotation is copied to tier C. The concept of creating new tiers on the basis of existing ones is currently being explored further in Lexan (mentioned above). However, these features already provide a straightforward interface to basic but extremely helpful operations. Fifth, preliminary interaction with relevant web services (online audio-video and text processors that create annotations) has been implemented. In short, in recent years several crucial features have been added that make ELAN better and faster to use in many respects.
Identifier (URI):http://hdl.handle.net/10125/26154
Language:English
Language (ISO639):eng
Rights:Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported
Table Of Contents:26154.mp3
26154.pdf
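
The Description above mentions creating a new tier C from annotations that occur in both tier A and tier B. The following is a minimal sketch of one possible reading of that operation (time overlap between two tiers), not ELAN's actual implementation or API. The `Annotation` type, the `overlap_tier` function, the millisecond time values, and the space used to join the two values are all assumptions made for illustration.

```python
from typing import List, NamedTuple


class Annotation(NamedTuple):
    """Hypothetical time-aligned annotation (not an ELAN data type)."""
    start_ms: int
    end_ms: int
    value: str


def overlap_tier(tier_a: List[Annotation],
                 tier_b: List[Annotation],
                 sep: str = " ") -> List[Annotation]:
    """Build a new tier from the time spans covered by both input tiers.

    Each result spans the intersection of one annotation from tier A and
    one from tier B, and carries both values joined by `sep`.
    """
    result = []
    for a in tier_a:
        for b in tier_b:
            start = max(a.start_ms, b.start_ms)
            end = min(a.end_ms, b.end_ms)
            if start < end:  # the two annotations overlap in time
                result.append(Annotation(start, end, a.value + sep + b.value))
    return sorted(result)


tier_a = [Annotation(0, 1000, "hello"), Annotation(1500, 2500, "world")]
tier_b = [Annotation(500, 1200, "wave"), Annotation(3000, 3500, "nod")]
tier_c = overlap_tier(tier_a, tier_b)
print(tier_c)
```

Only the first annotation of each example tier overlaps, so `tier_c` holds a single annotation covering 500 to 1000 ms. The nested loop is quadratic in the number of annotations, which is acceptable for a sketch; a sweep over sorted tiers would scale better.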

OLAC Info

Archive:  Language Documentation and Conservation
Description:  http://www.language-archives.org/archive/ldc.scholarspace.manoa.hawaii.edu
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:scholarspace.manoa.hawaii.edu:10125/26154
DateStamp:  2017-05-11
GetRecord:  OAI-PMH request for simple DC format
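
As an illustration of the GetRecord requests listed in this record, the sketch below builds an OAI-PMH GetRecord URL for the identifier shown above. The `verb`, `identifier`, and `metadataPrefix` parameters come from the OAI-PMH protocol, and `oai_dc` is its standard prefix for simple Dublin Core. The base URL is a placeholder and the function name `get_record_url` is hypothetical; the actual endpoint is not stated in this record.

```python
from urllib.parse import urlencode


def get_record_url(base_url: str, identifier: str,
                   metadata_prefix: str = "oai_dc") -> str:
    """Return an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Placeholder endpoint (assumption): replace with the repository's real base URL.
BASE_URL = "https://example.org/oai"
url = get_record_url(BASE_URL, "oai:scholarspace.manoa.hawaii.edu:10125/26154")
print(url)
```

Passing `metadata_prefix="olac"` would request the OLAC format instead, provided the repository supports that prefix.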

Search Info

Citation: Sloetjes, Han; Stehouwer, Herman; Drude, Sebastian. 2013. Language Documentation and Conservation.
Terms: area_Europe country_GB iso639_eng


http://www.language-archives.org/item.php/oai:scholarspace.manoa.hawaii.edu:10125/26154
Up-to-date as of: Sat Apr 20 18:39:37 EDT 2024