Skip to main content
  • Conference proceedings
  • © 2008

Machine Learning for Multimodal Interaction

5th International Workshop, MLMI 2008, Utrecht, The Netherlands, September 8-10, 2008, Proceedings

Conference proceedings info: MLMI 2008.

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (32 papers)

  1. Front Matter

  2. Face, Gesture and Nonverbal Communication

    1. Visual Focus of Attention in Dynamic Meeting Scenarios

      • Michael Voit, Rainer Stiefelhagen
      Pages 1-13
    2. What Does the Face-Turning Action Imply in Consensus Building Communication?

      • Tetsuro Onishi, Takatsugu Hirayama, Takashi Matsuyama
      Pages 26-37
    3. Distinguishing the Communicative Functions of Gestures

      • Kristiina Jokinen, Costanza Navarretta, Patrizia Paggio
      Pages 38-49
    4. Ambiguity Modeling in Latent Spaces

      • Carl Henrik Ek, Jon Rihan, Philip H. S. Torr, Grégory Rogez, Neil D. Lawrence
      Pages 62-73
  3. Audio-Visual Scene Analysis and Speech Processing

    1. Inclusion of Video Information for Detection of Acoustic Events Using the Fuzzy Integral

      • Taras Butko, Andrey Temko, Climent Nadeu, Cristian Canton
      Pages 74-85
    2. Audio-Visual Clustering for 3D Speaker Localization

      • Vasil Khalidov, Florence Forbes, Miles Hansard, Elise Arnaud, Radu Horaud
      Pages 86-97
    3. A Hybrid Generative-Discriminative Approach to Speaker Diarization

      • Athanasios K. Noulas, Tim van Kasteren, Ben J. A. Kröse
      Pages 98-109
    4. A Neural Network Based Regression Approach for Recognizing Simultaneous Speech

      • Weifeng Li, Kenichi Kumatani, John Dines, Mathew Magimai-Doss, Hervé Bourlard
      Pages 110-118
    5. Hilbert Envelope Based Features for Far-Field Speech Recognition

      • Samuel Thomas, Sriram Ganapathy, Hynek Hermansky
      Pages 119-124
    6. Multimodal Unit Selection for 2D Audiovisual Text-to-Speech Synthesis

      • Wesley Mattheyses, Lukas Latacz, Werner Verhelst, Hichem Sahli
      Pages 125-136
  4. Social Signal Processing

    1. Decision-Level Fusion for Audio-Visual Laughter Detection

      • Boris Reuderink, Mannes Poel, Khiet Truong, Ronald Poppe, Maja Pantic
      Pages 137-148
    2. Daily Routine Classification from Mobile Phone Data

      • Katayoun Farrahi, Daniel Gatica-Perez
      Pages 173-184
  5. Human-Human Spoken Dialogue Processing

    1. Hybrid Multi-step Disfluency Detection

      • Sebastian Germesin, Tilman Becker, Peter Poller
      Pages 185-195
    2. Exploring Features and Classifiers for Dialogue Act Segmentation

      • Harm op den Akker, Christian Schulz
      Pages 196-207
    3. Detecting Action Items in Meetings

      • Gabriel Murray, Steve Renals
      Pages 208-213

Other Volumes

  1. Machine Learning for Multimodal Interaction

About this book

This book constitutes the refereed proceedings of the 5th International Workshop on Machine Learning for Multimodal Interaction, MLMI 2008, held in Utrecht, The Netherlands, in September 2008. The 12 revised full papers and 15 revised poster papers presented together with 5 papers of a special session on user requirements and evaluation of multimodal meeting browsers/assistants were carefully reviewed and selected from 47 submissions. The papers cover a wide range of topics related to human-human communication modeling and processing, as well as to human-computer interaction, using several communication modalities. Special focus is given to the analysis of non-verbal communication cues and social signal processing, the analysis of communicative content, audio-visual scene analysis, speech processing, interactive systems and applications.

Bibliographic Information

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access