Skip to main content
Book cover

Machine Learning for Multimedia Content Analysis

  • Book
  • © 2007

Overview

  • First book dedicated to the multimedia community to address unique problems and interesting applications of machine learning in this area
  • Includes examples of unsupervised learning, generative models and discriminative models
  • Includes Maximum Margin Markov (M3) networks, which strives to combine the advantages of both the graphical models and Support Vector Machines (SVM)
  • Includes supplementary material: sn.pub/extras

Part of the book series: Multimedia Systems and Applications (MMSA, volume 30)

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 159.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (10 chapters)

  1. Unsupervised Learning

  2. Discriminative Graphical Models

Keywords

About this book

Challenges in complexity and variability of multimedia data have led to revolutions in machine learning techniques. Multimedia data, such as digital images, audio streams and motion video programs, exhibit richer structures than simple, isolated data items. A number of pixels in a digital image collectively conveys certain visual content to viewers. A TV video program consists of both audio and image streams that unfold the underlying story.  To recognize the visual content of a digital image, or to understand the underlying story of a video program, we may need to label sets of pixels or groups of image and audio frames jointly.

Machine Learning for Multimedia Content Analysis introduces machine learning techniques that are particularly powerful and effective for modeling spatial, temporal structures of multimedia data and for accomplishing common tasks of multimedia content analysis. This book systematically covers these techniques in an intuitive fashion and demonstrates their applications through case studies. This volume uses a large number of figures to illustrate and visualize complex concepts, and provides insights into the characteristics of many algorithms through examinations of their loss functions and straightforward comparisons.

Machine Learning for Multimedia Content Analysis is designed for an academic and professional audience. Researchers will find this book an invaluable tool for applying machine learning techniques to multimedia content analysis. This volume is also suitable for practitioners in industry.

 

Reviews

From the reviews:

"The objectives of this book are to bring together powerful machine learning techniques that are suitable for modeling multimedia data, and to showcase their application to common multimedia content analysis tasks. The book is designed for students and researchers who want to apply machine learning techniques to multimedia content analysis. … Motivated researchers working in this field can certainly benefit by reading about the methods and case studies described here. It could also serve as a good reference … ." (Rao Vemuri, Computing Reviews, Vol. 50 (1), January, 2009)

Authors and Affiliations

  • NEC Laboratories America, Inc., Cupertino, USA

    Yihong Gong, Wei Xu

Bibliographic Information

Publish with us