OLAC Record

Title:LT TTT
Creator:Andrei Mikheev
Claire Grover
Colin Matheson
Description:The LT TTT system provides a flexible means of tokenising texts and adding markup at various levels. The main component of the LT TTT system is a program called fsgmatch. This is a general purpose cascaded transducer which processes an input stream deterministically and rewrites it according to a set of rules provided in a grammar file. Although it can be used to alter the input in a variety of ways, the grammars provided with the LT TTT system are all used simply to add mark-up information. We have provided grammars to segment texts into paragraphs, segment paragraphs into words, recognise numerical expressions, mark up money, date and time expressions in newspaper texts, and mark up bibliographic information in academic texts. The documentation provides a description of the rule formalis
Contact: grover@cogsci.ed.ac.uk
Documentation: online
Platform: Solaris
Distribution: Online
Price (Academic, Commercial, Multi-user) : free , to negotiate , to negotiate
Identifier:LT TTT
Publisher:LTG, University of Edinburgh
Type:Annotation Tools , Corpus Analysis , Part-of-Speech Tagging , Processing Mark-Up Languages , Tokenization


Archive:  The Natural Language Software Registry
Description:  http://www.language-archives.org/archive/dfki.de
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:dfki.de:LT
DateStamp:  2002-12-14
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: Andrei Mikheev; Claire Grover; Colin Matheson. 2000. LTG, University of Edinburgh.

Up-to-date as of: Mon Jul 18 8:22:55 EDT 2022