OLAC Record

Title:AMI Corpus - Non Scenario-based Meetings Media
Bibliographic Citation:Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain McCowan, Wilfried Post, Dennis Reidsma, and Pierre Wellner, "The AMI Meeting Corpus: A Pre-announcement", in Machine Learning for Multimodal Interaction II, edited by S. Renals and S. Bengio, LNCS 3869, Springer-Verlag, Berlin/Heidelberg, 2006, pages 28-39.
Jean Carletta, "Unleashing the killer corpus: experiences in creating the multieverything AMI Meeting Corpus", Language Resources and Evaluation, vol. 41, n. 2, 2007, pages 181-190.
Contributor (annotator):University of Edinburgh
Contributor (author):AMI Consortium
Contributor (editor):Jean Carletta
Contributor (recorder):Idiap Research Institute
University of Edinburgh
TNO (Netherlands Organization for Applied Scientific Research)
Contributor (sponsor):European Union - Framework Programmes 6 and 7
Swiss National Science Foundation - IM2 National Center of Competence in Research
Coverage:Martigny, Switzerland
Edinburgh, Scotland
Delft, The Netherlands
Description:The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. This sub-part consists of about one third of the data, and includes natural meetings on topics such as speech processing or statistical tools, as well as planning for a fictitious movie club, or furnishing a fictitious office. This sub-part groups the media files of non-scenario-based meetings, which have been recorded using synchronised recording devices (close-talking and far-field microphones, individual and room-view video cameras, projection, a whiteboard, individual pens).
The IDs of non-scenario-based meetings in the AMI Corpus take the following form: [EI][NB][1-3]0[0-1][0-9][a-d]. The first character indicates location: E for Edinburgh or I for Idiap. The second character is either N for naturally occurring meetings, or B for the ISSCO office and movie club scenarios. The numbers work in series: 1000 for Idiap, 2000 for Edinburgh, and 4000 for ISSCO (recorded in the Idiap room). The suffix a/b/c/d is optional and indicates membership in a series.
Description (URI):http://corpus.amiproject.org
Extent:ca. 30 hours
33 meetings
Format (IMT):audio/x-wav
Is Part Of:AMI Meeting Corpus
Language (ISO639):eng
Publisher:AMI Consortium
Rights:These media have been released publicly under the AMI Meeting Corpus license, which is very similar to the Creative Commons Attribution NonCommercial ShareAlike 2.5 License (http://creativecommons.org/licenses/by-nc-sa/2.5).
Rights (URI):http://corpus.amiproject.org/documentations/license
Subject:Multi-party meetings
Table Of Contents:EN2001a-e; EN2002a-d; EN2003a; EN2004a; EN2005a; EN2006a-b; EN2009b-d; IB4001; IB4002; IB4003; IB4004; IB4005; IB4010; IB4011; IN1001; IN1002; IN1005; IN1007; IN1008; IN1009; IN1012; IN1013; IN1014; IN1016
Type (DCMI):Sound
Type (Discourse):dialogue
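
The meeting ID scheme given in the Description can be sketched in Python as below. This is an illustrative sketch, not part of the record: the function name `parse_meeting_id` and the returned field names are hypothetical. The pattern is also loosened as an assumption, because the form printed in the Description allows only series digits 1-3 and suffixes a-d, while the Table Of Contents lists IB4001 and EN2001a-e.

```python
import re

# Assumption: accept series digits 1-4 and an optional suffix a-e so that
# every ID in the Table Of Contents matches (e.g. IB4001, EN2001e).
MEETING_ID = re.compile(r"([EI])([NB])([1-4]0[01][0-9])([a-e]?)")

SITES = {"E": "Edinburgh", "I": "Idiap"}
KINDS = {"N": "naturally occurring", "B": "ISSCO office / movie club scenario"}


def parse_meeting_id(meeting_id):
    """Split a non-scenario meeting ID into its parts, or return None."""
    match = MEETING_ID.fullmatch(meeting_id)
    if match is None:
        return None
    site, kind, number, suffix = match.groups()
    return {
        "site": SITES[site],
        "kind": KINDS[kind],
        "number": int(number),
        "suffix": suffix or None,  # None when the meeting is not part of a series
    }
```

For example, `parse_meeting_id("EN2001a")` yields the site Edinburgh, a naturally occurring meeting, number 2001 and suffix `a`, while a string that does not fit the scheme yields `None`.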


Archive:  MMM - Multi-Modal Media File Server
Description:  http://www.language-archives.org/archive/mmm.idiap.ch
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:mmm.idiap.ch:3
DateStamp:  2009-11-18
GetRecord:  OAI-PMH request for simple DC format
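
The GetRecord requests listed in this record follow the OAI-PMH query convention (`verb`, `identifier`, `metadataPrefix`). A minimal sketch of how such a request URL might be built is shown below. The base URL is a hypothetical placeholder, since the record does not state the archive's OAI-PMH endpoint, and the helper name `get_record_url` is likewise illustrative. `oai_dc` is the simple DC prefix defined by OAI-PMH; `olac` is assumed here to be the prefix for the OLAC format.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: the real OAI-PMH endpoint is not given in the record.
BASE_URL = "https://example.org/oai"


def get_record_url(identifier, metadata_prefix, base_url=BASE_URL):
    """Build an OAI-PMH GetRecord request URL for one record."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"


# Simple DC request for this record's identifier.
dc_url = get_record_url("oai:mmm.idiap.ch:3", "oai_dc")
# OLAC-format request (prefix "olac" is an assumption).
olac_url = get_record_url("oai:mmm.idiap.ch:3", "olac")
```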

Search Info

Citation: AMI Consortium. 2005-2007. AMI Meeting Corpus.
Terms: area_Europe country_GB dcmi_Collection dcmi_MovingImage dcmi_Sound dcmi_StillImage iso639_eng olac_dialogue

Up-to-date as of: Sun Oct 19 0:15:13 EDT 2014