Skip to main content
  • Book
  • © 2008

Multimodal Processing and Interaction

Audio, Video, Text

  • Emphasis on multimodal information processing aspects of multimedia and cross-interaction of multiple modalities
  • Broad spectrum of novel perspectives, analytic tools, algorithms, design practices and applications in multimedia science and engineering

Part of the book series: Multimedia Systems and Applications (MMSA, volume 33)

Buy it now

Buying options

eBook USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (15 chapters)

  1. Front Matter

    Pages 1-23
  2. Review of the State of the Art

    1. Front Matter

      Pages 1-1
    2. Cross-Modal Integration for Performance Improving in Multimedia: A Review

      • Petros Maragos, Patrick Gros, Athanassios Katsamanis, George Papandreou
      Pages 1-46
    3. Human-Computer Interfaces to Multimedia Content a Review

      • Alexandros Potamianos, Manolis Perakakis
      Pages 1-39
  3. Integrated Multimedia Analysis And Recognition

    1. Front Matter

      Pages 1-1
    2. Stochastic Models for Multimodal Video Analysis

      • Manolis Delakis, Guillaume Gravier, Patrick Gros
      Pages 1-19
    3. Adaptive Multimodal Fusion by Uncertainty Compensation with Application to Audio-Visual Speech Recognition

      • George Papandreou, Athanassios Katsamanis, Athanassios Katsamanis, Vassilis Pitsikalis, Petros Maragos
      Pages 1-15
    4. Action Recognition in Multimedia Streams

      • Rozenn Dahyot, François Pitié, Daire Lennon, Naomi Harte, Anil Kokaram
      Pages 1-16
    5. Surveillance Using Both Video and Audio

      • Yigithan Dedeoglu, B. Ugur Toreyin, Ugur Gudukbay, A. Enis Cetin
      Pages 1-13
    6. Movie Analysis with Emphasis to Dialogue and Action Scene Detection

      • Emmanouil Benetos, Spyridon Siatras, Constantine Kotropoulos, Nikos Nikolaidis, Ioannis Pitas
      Pages 1-21
    7. Audiovisual Attention Modeling and Salient Event Detection

      • Georgios Evangelopoulos, Konstantinos Rapantzikos, Petros Maragos, Yannis Avrithis, Alexandros Potamianos
      Pages 1-21
    8. Toward the Integration of Natural Language Processing and Automatic Speech Recognition: Using Morpho-Syntax and Pragmatics for Transcription

      • Stéphane Huet, Gwénolé Lecorvé, Guillaume Gravier, Pascale Sébillot
      Pages 1-18
  4. Searching Multimedia Content

    1. Front Matter

      Pages 1-1
    2. Interactive Image Retrieval Using a Hybrid Visual and Conceptual Content Representation

      • Marin Ferecatu, Nozha Boujemaa, Michel Crucianu
      Pages 1-20
  5. Interfaces to Multimedia Content

    1. Front Matter

      Pages 1-1
    2. IDesign Principles for Multimodal Spoken Dialogue Systems

      • Alexandros Potamianos, Manolis Perakakis
      Pages 1-18
    3. Eye Tracking: A New Interface for Visual Exploration

      • Oyewole K. Oyekoya, Fred W. M. Stentiford
      Pages 1-14
    4. User Interaction for Mobile Devices

      • Sanni Siltanen, Charles Woodward, Seppo Valli, Petri Honkamaa, Andreas Rauber
      Pages 1-17

About this book

Multimodal Processing and Interaction: Audio, Video and Text presents high quality, state-of-the-art research ideas and results from theoretic, algorithmic and application viewpoints. This edited volume contains both state-of-the-art reviews and original contributions by leading experts in the scientific and technological field of multimedia. It grew out of a four-year collaboration among research groups participating in the European network of Excellence on Multimedia Understanding, Semantics, Computation and Learning (MUSCLE).

Multimodal Processing and Interaction: Audio, Video and Text covers a broad spectrum of novel perspectives, analytic tools, algorithms, design practices and applications in multimedia science and engineering with emphasis on multimodal integration and modality fusion. This volume also contains contributions in the area of interaction with multimedia, especially multimodal interfaces for accessing multimedia content.

Multimodal Processing and Interaction: Audio, Video and Text is designed for a professional audience composed of practitioners and researchers in industry and academia. This book is suitable for advanced-level students in computer science and engineering as well.

 

Editors and Affiliations

  • School of Electrical &, Computing Engineering, National Technical University of Athens, Athens, Greece

    Petros Maragos

  • Dept. Electronic & Computer, Engineering, Technical University Crete, Chania, Crete, Greece

    Alexandros Potamianos

  • Inst. Recherche en Informatique et, Systemes Aleatoires (IRISA), INRIA Rennes, Rennes CX, France

    Patrick Gros

Bibliographic Information

Buy it now

Buying options

eBook USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access