  • Book
  • © 2020

Human Centric Visual Analysis with Deep Learning

Authors: Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo

  • Presents effective deep learning based solutions for human centric visual analysis
  • Summarizes the latest studies in human centric visual analysis
  • Covers many practical examples and applications for human centric visual analysis

Buy it now

Buying options

eBook USD 119.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book USD 159.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Table of contents (10 chapters)

  1. Front Matter

    Pages i-xii
  2. Motivation and Overview

    1. Front Matter

      Pages 1-1
    2. The Foundation and Advances of Deep Learning

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 3-13
    3. Human-Centric Visual Analysis: Tasks and Progress

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 15-25
  3. Localizing Persons in Images

    1. Front Matter

      Pages 27-28
    2. Face Localization and Enhancement

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 29-45
    3. Pedestrian Detection with RPN and Boosted Forest

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 47-54
  4. Parsing Person in Detail

    1. Front Matter

      Pages 55-57
    2. Self-supervised Structure-Sensitive Learning for Human Parsing

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 59-68
    3. Instance-Level Human Parsing

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 69-83
    4. Video Instance-Level Human Parsing

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 85-93
  5. Identifying and Verifying Persons

    1. Front Matter

      Pages 95-98
    2. Person Verification

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 99-114
    3. Face Verification

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 115-130
  6. Higher Level Tasks

    1. Front Matter

      Pages 131-133
    2. Human Activity Understanding

      • Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo
      Pages 135-156

About this book

This book introduces the applications of deep learning to a range of human centric visual analysis tasks, from classical ones such as face detection and alignment to emerging ones such as fashion clothing parsing. It opens with an overview of current research in human centric visual analysis, followed by a tutorial on the basic concepts and techniques of deep learning. It then systematically examines the main human centric analysis tasks at different levels, ranging from detection and segmentation to parsing and higher-level understanding. For each task, it presents state-of-the-art solutions based on deep learning, together with ample references and extensive discussion.

Specifically, this book addresses four important research topics: 1) localizing persons in images, such as face and pedestrian detection; 2) parsing persons in detail, such as human pose and clothing parsing; 3) identifying and verifying persons, such as face and human identification; and 4) high-level human centric tasks, such as person attributes and human activity understanding.

This book can serve as reading material and a reference text for academics, students, and industrial engineers working in visual surveillance, biometrics, and human-computer interaction, where human centric visual analysis is indispensable for analysing human identity, pose, attributes, and behaviours.


Authors and Affiliations

  • School of Data and Computer Science, Sun Yat-sen University, Guangzhou, China

    Liang Lin, Dongyu Zhang

  • School of Information Engineering, The Chinese University of Hong Kong, Hong Kong, Hong Kong

    Ping Luo

  • School of Computer Science, Harbin Institute of Technology, Harbin, China

    Wangmeng Zuo

About the authors

Liang Lin is a Full Professor at Sun Yat-sen University and the CEO of DMAI Great China. He served as the Executive Director of the SenseTime Group from 2016 to 2018, leading the R&D teams in developing cutting-edge, deliverable solutions in computer vision, data analysis and mining, and intelligent robotic systems. He has authored or co-authored more than 200 papers in leading academic journals and conferences (e.g., TPAMI/IJCV, CVPR/ICCV/NIPS/ICML/AAAI). He is an associate editor of IEEE Trans. Human-Machine Systems and IET Computer Vision, and he has served as area/session chair for numerous conferences, such as CVPR, ICME, ICCV, and ICMR. He received the Annual Best Paper Award from Pattern Recognition (Elsevier) in 2018, the Diamond Award for best paper at IEEE ICME in 2017, the ACM NPAR Best Paper Runners-Up Award in 2010, the Google Faculty Award in 2012, the award for best student paper at IEEE ICME in 2014, and the Hong Kong Scholars Award in 2014. He is a Fellow of IET.

Dongyu Zhang is a Research Scientist at the School of Data and Computer Science, Sun Yat-sen University (SYSU), China. He received his Master’s and Ph.D. degrees in Computer Science from the Harbin Institute of Technology (HIT), China, in 2003 and 2008, respectively. His current research interests include deep learning, image modeling, and biometrics.

Ping Luo is a Research Assistant Professor at the Chinese University of Hong Kong, where he received his Ph.D. degree in 2014. His research interests focus on machine learning and computer vision, including deep learning optimization and theory, face and pedestrian analysis, image parsing, and large-scale object recognition and detection. Dr. Luo has published more than 60 papers in top-tier academic journals and conferences, including TPAMI, IJCV, NIPS, ICML, and CVPR. His papers have over 6000 citations on Google Scholar. For his contributions to deep learning and computer vision, Dr. Luo was awarded the Microsoft Research Fellowship in 2013, an award given to only ten scholars in the Asia-Pacific area each year. He was also awarded the Hong Kong PhD Fellowship in 2011 by the Research Grants Council of Hong Kong.

Wangmeng Zuo is a Professor at the School of Computer Science and Technology, Harbin Institute of Technology (HIT), China. He received his Ph.D. degree in Computer Application Technology from HIT in 2007. From July 2004 to December 2004, from November 2005 to August 2006, and from July 2007 to February 2008, he was a Research Assistant at the Department of Computing, Hong Kong Polytechnic University. From August 2009 to February 2010, he was a Visiting Professor at Microsoft Research Asia. His current research interests include image restoration, image editing, image classification, object detection, and visual tracking. Dr. Zuo is an Associate Editor of IET Biometrics and a Guest Editor of Neurocomputing, Pattern Recognition, and IEEE Transactions on Neural Networks and Learning Systems.

Bibliographic Information

  • Book Title: Human Centric Visual Analysis with Deep Learning

  • Authors: Liang Lin, Dongyu Zhang, Ping Luo, Wangmeng Zuo

  • DOI: https://doi.org/10.1007/978-981-13-2387-4

  • Publisher: Springer Singapore

  • eBook Packages: Computer Science, Computer Science (R0)

  • Copyright Information: Springer Nature Singapore Pte Ltd. 2020

  • Hardcover ISBN: 978-981-13-2386-7 (Published: 27 November 2019)

  • eBook ISBN: 978-981-13-2387-4 (Published: 13 November 2019)

  • Edition Number: 1

  • Number of Pages: XII, 156

  • Number of Illustrations: 7 b/w illustrations, 46 illustrations in colour

  • Topics: Image Processing and Computer Vision, Pattern Recognition, Biometrics
