Skip to main content

Pathological Voice Analysis

  • Book
  • © 2020

Overview

  • Offers a systematic introduction to pathological voice analysis

  • Provides an overview of key steps in voice analysis for biomedical applications, including sample collection, preprocessing, feature extraction and learning, and classification

  • Presents state-of-the-art algorithms for important techniques, including pitch estimation, GCI detection, feature learning and multi-audio fusion

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (8 chapters)

Keywords

About this book

While voice is widely used in speech recognition and speaker identification, its application in biomedical fields is much less common. This book systematically introduces the authors’ research on voice analysis for biomedical applications, particularly pathological voice analysis.

Firstly, it reviews the field to highlight the biomedical value of voice. It then offers a comprehensive overview of the workflow and aspects of pathological voice analysis, including voice acquisition systems, voice pitch estimation methods, glottal closure instant detection, feature extraction and learning, and the multi-audio fusion approaches. Lastly, it discusses the experimental results that have shown the superiority of these techniques.

This book is useful to researchers, professionals and postgraduate students working in fields such as speech signal processing, pattern recognition, and biomedical engineering. It is also a valuable resource for those involved in interdisciplinary research.  


Authors and Affiliations

  • The Chinese University of Hong Kong (Shenzhen), Shenzhen Institute of Artificial Intelligence and Robotics for Society, Guangdong, China

    David Zhang

  • Huawei Technologies, Beijing, China

    Kebin Wu

About the authors

David Zhang graduated in Computer Science from Peking University. He received his MSc and PhD in Computer Science from the Harbin Institute of Technology (HIT), in 1982 and 1985 respectively. From 1986 to 1988 he was a Postdoctoral Fellow at Tsinghua University and then an Associate Professor at the Academia Sinica, Beijing. In 1994 he received his second PhD in Electrical and Computer Engineering from the University of Waterloo, Ontario, Canada. Currently, he is with the School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), China. He also serves as Visiting Chair Professor at Tsinghua University and HIT, and Adjunct Professor at Jiao Tong University, Peking University, the National University of Defense Technology and the University of Waterloo. He is the founder and editor-in-chief of the International Journal of Image and Graphics (IJIG); book editor for the Springer International Series on Biometrics (KISB); organizer of the first International Conference on Biometrics Authentication (ICBA); and associate editor of more than ten international journals, including IEEE Transactions. He has published over 20 monographs, 400 international journal papers and 40 patents in the USA/Japan/HK/China. He was listed as a Highly Cited Researcher in Engineering by Clarivate Analytics (formerly known as Thomson Reuters) in 2014, 2015, 2016, 2017 and 2018. Professor Zhang is a Croucher Senior Research Fellow, Distinguished Speaker of the IEEE Computer Society, and a Fellow of both IEEE and IAPR.

Kebin Wu received her B.S. degree in Electronic and Information Engineering from the Harbin Institute of Technology in 2011 and her Ph.D. degree from Tsinghua University in 2018. Her research interests include pathological voice analysis, computer vision and statistical pattern recognition.

Bibliographic Information

Publish with us