Name: Pathological Voice Analysis
ISBN: 978-981-32-9196-6

Overview

Authors:

David Zhang
The Chinese University of Hong Kong (Shenzhen), Shenzhen Institute of Artificial Intelligence and Robotics for Society, Guangdong, China
Kebin Wu
Huawei Technologies, Beijing, China

Offers a systematic introduction to pathological voice analysis
Provides an overview of key steps in voice analysis for biomedical applications, including sample collection, preprocessing, feature extraction and learning, and classification
Presents state-of-the-art algorithms for important techniques, including pitch estimation, GCI detection, feature learning and multi-audio fusion

Table of contents (8 chapters)

Front Matter

Pages i-x

Introduction
- David Zhang, Kebin Wu
Pages 1-28
Pathological Voice Acquisition
- David Zhang, Kebin Wu
Pages 29-45
Pitch Estimation
- David Zhang, Kebin Wu
Pages 47-74
Glottal Closure Instants Detection
- David Zhang, Kebin Wu
Pages 75-106
Feature Learning
- David Zhang, Kebin Wu
Pages 107-121
Joint Learning for Voice Based Disease Detection
- David Zhang, Kebin Wu
Pages 123-145
Robust Multi-View Discriminative Learning for Voice Based Disease Detection
- David Zhang, Kebin Wu
Pages 147-166
Book Review and Future Work
- David Zhang, Kebin Wu
Pages 167-170
Back Matter

Pages 171-174

About this book

While voice is widely used in speech recognition and speaker identification, its application in biomedical fields is much less common. This book systematically introduces the authors’ research on voice analysis for biomedical applications, particularly pathological voice analysis.

First, it reviews the field to highlight the biomedical value of voice. It then gives a comprehensive overview of the pathological voice analysis workflow, covering voice acquisition systems, pitch estimation methods, glottal closure instant (GCI) detection, feature extraction and learning, and multi-audio fusion approaches. Finally, it discusses the experimental results reported for these techniques.

This book is useful to researchers, professionals and postgraduate students working in fields such as speech signal processing, pattern recognition, and biomedical engineering. It is also a valuable resource for those involved in interdisciplinary research.

Authors and Affiliations

The Chinese University of Hong Kong (Shenzhen), Shenzhen Institute of Artificial Intelligence and Robotics for Society, Guangdong, China
David Zhang

Huawei Technologies, Beijing, China
Kebin Wu

About the authors

David Zhang graduated in Computer Science from Peking University. He received his MSc and PhD in Computer Science from the Harbin Institute of Technology (HIT), in 1982 and 1985 respectively. From 1986 to 1988 he was a Postdoctoral Fellow at Tsinghua University and then an Associate Professor at the Academia Sinica, Beijing. In 1994 he received his second PhD in Electrical and Computer Engineering from the University of Waterloo, Ontario, Canada. Currently, he is with the School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), China. He also serves as Visiting Chair Professor at Tsinghua University and HIT, and Adjunct Professor at Jiao Tong University, Peking University, the National University of Defense Technology and the University of Waterloo. He is the founder and editor-in-chief of the International Journal of Image and Graphics (IJIG); book editor for the Springer International Series on Biometrics (KISB); organizer of the first International Conference on Biometrics Authentication (ICBA); and associate editor of more than ten international journals, including IEEE Transactions. He has published over 20 monographs, 400 international journal papers and 40 patents in the USA/Japan/HK/China. He was listed as a Highly Cited Researcher in Engineering by Clarivate Analytics (formerly known as Thomson Reuters) in 2014, 2015, 2016, 2017 and 2018. Professor Zhang is a Croucher Senior Research Fellow, Distinguished Speaker of the IEEE Computer Society, and a Fellow of both IEEE and IAPR.

Kebin Wu received her B.S. degree in Electronic and Information Engineering from the Harbin Institute of Technology in 2011 and her Ph.D. degree from Tsinghua University in 2018. Her research interests include pathological voice analysis, computer vision and statistical pattern recognition.

Bibliographic Information

Book Title: Pathological Voice Analysis
Authors: David Zhang, Kebin Wu
DOI: https://doi.org/10.1007/978-981-32-9196-6
Publisher: Springer Singapore
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: Springer Nature Singapore Pte Ltd. 2020
Hardcover ISBN: 978-981-32-9195-9, Published: 04 August 2020
Softcover ISBN: 978-981-32-9198-0, Published: 26 August 2021
eBook ISBN: 978-981-32-9196-6, Published: 03 August 2020
Edition Number: 1
Number of Pages: X, 174
Number of Illustrations: 3 b/w illustrations, 41 illustrations in colour
Topics: Pattern Recognition, Signal, Image and Speech Processing, Biomedical Engineering and Bioengineering, Speech Pathology
