OLAC Record

Title:AMI Corpus - Annotations and Metadata
Bibliographic Citation:Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain McCowan, Wilfried Post, Dennis Reidsma, and Pierre Wellner, "The AMI Meeting Corpus: A Pre-announcement", in Machine Learning for Multimodal Interaction II, edited by S. Renals and S. Bengio, LNCS 3869, Springer-Verlag, Berlin/Heidelberg, 2006, pages 28-39.
Jean Carletta, "Unleashing the killer corpus: experiences in creating the multieverything AMI Meeting Corpus", Language Resources and Evaluation, vol. 41, n. 2, 2007, pages 181-190.
Andrei Popescu-Belis and Paula Estrella, "Generating Usable Formats for Metadata and Annotations in a Large Meeting Corpus", Proc. of ACL 2007 (45th International Conference of the Association for Computational Linguistics), Prague, 2007, pages 93-96.
Contributor (annotator):University of Edinburgh
Contributor (author):AMI Consortium
Contributor (editor):Jean Carletta
Contributor (recorder):Idiap Research Institute
University of Edinburgh
TNO (Netherlands Organization for Applied Scientific Research)
Contributor (sponsor):European Union - Framework Programmes 6 and 7
Swiss National Science Foundation - IM2 National Center of Competence in Research
Coverage:Martigny, Switzerland
Edinburgh, Scotland
Delft, The Netherlands
Description:The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. The recordings have been fully transcribed, and annotations for many different phenomena (such as dialogue acts or head movement) have been made, mainly using the NITE XML Toolkit. This sub-part of the corpus references the manual annotation files, including transcripts, along with a sample output of automatic speech recognition. Depending on the annotation dimension, the manual annotations may cover all or only part of the AMI Corpus.
Description (URI):http://corpus.amiproject.org
Extent:171 meetings
NXT (NITE XML Toolkit)
IMDI (ISLE Metadata Initiative)
Format (IMT):text/xml
Is Part Of:AMI Meeting Corpus
Language (ISO639):eng
Publisher:AMI Consortium
Rights:All of the signals and transcription, and some of the annotations, have been released publicly under the AMI Meeting Corpus license, very similar to the Creative Commons Attribution NonCommercial ShareAlike 2.5 License (http://creativecommons.org/licenses/by-nc-sa/2.5).
Rights (URI):http://corpus.amiproject.org/documentations/license
Subject:Multi-party meetings
Table Of Contents:Manual transcripts (all 171 meetings)
Named entity annotation (117 meetings)
Dialogue acts annotation (139 meetings)
Decision point annotation (47 meetings)
Topic segmentation (139 meetings)
Abstractive summaries (142 meetings)
Extractive summaries (137 meetings)
Focus of attention annotation (14 meetings)
Hand movement annotation (17 meetings)
Head movement annotation (46 meetings)
Body ("leg") movement annotation (125 meetings)
Metadata for each meeting and each participant
Type (DCMI):Collection
Type (Discourse):dialogue


Archive:  MMM - Multi-Modal Media File Server
Description:  http://www.language-archives.org/archive/mmm.idiap.ch
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:mmm.idiap.ch:4
DateStamp:  2009-11-18
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: AMI Consortium. 2005-2007. AMI Meeting Corpus.
Terms: area_Europe country_GB dcmi_Collection iso639_eng olac_dialogue

Up-to-date as of: Sun Oct 19 0:15:13 EDT 2014