Skip to main content

Computer Vision and Machine Learning with RGB-D Sensors

  • Book
  • © 2014

Overview

  • Describes recent advances in RGB-D based computer vision algorithms, with an emphasis on advanced machine learning techniques for interpreting the RGBD information
  • Covers a range of different techniques from computer vision, machine learning, audio, speech and signal processing, communications, artificial intelligence and media technology
  • Includes contributions from leading researchers in this area, with strong industrial-research experience of the practical issues
  • Includes supplementary material: sn.pub/extras

Part of the book series: Advances in Computer Vision and Pattern Recognition (ACVPR)

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (14 chapters)

  1. Reconstruction, Mapping and Synthesis

  2. Detection, Segmentation and Tracking

  3. Learning-based Recognition

Keywords

About this book

This book presents an interdisciplinary selection of cutting-edge research on RGB-D based computer vision. Features: discusses the calibration of color and depth cameras, the reduction of noise on depth maps and methods for capturing human performance in 3D; reviews a selection of applications which use RGB-D information to reconstruct human figures, evaluate energy consumption and obtain accurate action classification; presents an approach for 3D object retrieval and for the reconstruction of gas flow from multiple Kinect cameras; describes an RGB-D computer vision system designed to assist the visually impaired and another for smart-environment sensing to assist elderly and disabled people; examines the effective features that characterize static hand poses and introduces a unified framework to enforce both temporal and spatial constraints for hand parsing; proposes a new classifier architecture for real-time hand pose recognition and a novel hand segmentation and gesture recognition system.

Editors and Affiliations

  • University of Sheffield, United Kingdom

    Ling Shao

  • Civolution Technology, Eindhoven, The Netherlands

    Jungong Han

  • Microsoft Research, Cambridge, United Kingdom

    Pushmeet Kohli

  • Microsoft Research, Redmond, USA

    Zhengyou Zhang

About the editors

Dr. Ling Shao is a Senior Lecturer (Associate Professor) in the Department of Electronic and Electrical Engineering at the University of Sheffield, UK. His publications include the Springer title Multimedia Interaction and Intelligent User Interfaces.

Dr. Jungong Han is a Senior Scientist at Civolution Technology, Eindhoven, and a Guest Researcher at the Eindhoven University of Technology, Netherlands.

Dr. Pushmeet Kohli is a Senior Researcher in the Machine Learning and Perception Group at Microsoft Research Cambridge and an Associate in the Psychometrics Centre at the University of Cambridge, UK.

Dr. Zhengyou Zhang, IEEE Fellow and ACM Fellow, is a Principal Researcher and Research Manager of the Multimedia, Interaction, and Communication Group at Microsoft Research Redmond, WA, USA.

Bibliographic Information

Publish with us