OLAC Record
oai:mmm.idiap.ch:5

Metadata
Title:The M4 Meeting Corpus
Bibliographic Citation:I. McCowan, S. Bengio, D. Gatica-Perez, G. Lathoud, F. Monay, D. Moore, P. Wellner, and H. Bourlard, "Modeling Human Interaction in Meetings", in Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hong Kong, 2003, pages 748-751.
Contributor (author):M4 Project
Contributor (recorder):Idiap Research Institute
Contributor (sponsor):European Union - IST Programme
Swiss National Science Foundation - IM2 National Center of Competence in Research
Coverage:Martigny, Switzerland
Date:2002
Description:The M4 Meeting Corpus is a multi-modal data set consisting of about 60 scripted meeting recordings. The corpus has been recorded using synchronised recording devices (close-talking and far-field microphones, individual and room-view video cameras, and video projection. The meetings included in the corpus last about five minutes each, and are scripted, in the sense that participants enact a series of conversational states that are signalled to them externally (e.g. monologue, discussion, or note-taking). The topic of the discussion is selected before the meetings, e.g., presentation of one's research area or latest reading. The corpus has been fully transcribed, and annotations for several phenomena are available.
Description (URI):http://www.idiap.ch/mmm/corpora/m4-corpus
Extent:5 hours
60 meetings
Format:WAV
RealAudio
AVI
RealVideo
XML
HTML
NXT (NITE XML Toolkit)
Format (IMT):audio/x-wav
audio/vnd.rn-realaudio
video/x-msvideo
video/vnd.rn-realvideo
text/xml
text/html
Identifier:M4_Corpus
Language:English
Language (ISO639):eng
Publisher:AMI Consortium
Rights:The data is free for research use. Please acknowledge the bibliographic reference quoted in this record when using results obtained with the M4 Corpus in other publications.
Subject:Multi-party meetings
Table Of Contents:Media files for meetings TRN-01 to TRN-30 and TST-01 to TST-29
Manual transcripts for meetings TRN-01 to TRN-12, TRN-28, TST-03, TST-05, TRN-07, TRN-09 to TRN-28
Annotations: dialogue acts, gaze estimation, speech transcription, transcription segments, word segmentation, adjacency pairs and addressee information. Available for meetings TRN 1, 2, 7, 9, 12 and TST 3, 14, 23, 26, 27, in NXT (NITE XML Toolkit) format.
Type (DCMI):Sound
Text
MovingImage
Collection
Type (Discourse):dialogue
narrative
Type (OLAC):primary_text

OLAC Info

Archive:  MMM - Multi-Modal Media File Server
Description:  http://www.language-archives.org/archive/mmm.idiap.ch
GetRecord:  OAI-PMH request for OLAC format
GetRecord:  Pre-generated XML file

OAI Info

OaiIdentifier:  oai:mmm.idiap.ch:5
DateStamp:  2009-11-18
GetRecord:  OAI-PMH request for simple DC format

Search Info

Citation: M4 Project. 2002. AMI Consortium.
Terms: area_Europe country_GB dcmi_Collection dcmi_MovingImage dcmi_Sound dcmi_Text iso639_eng olac_dialogue olac_narrative olac_primary_text


http://www.language-archives.org/item.php/oai:mmm.idiap.ch:5
Up-to-date as of: Sun Oct 19 0:15:13 EDT 2014