Overview

Editors:

Dorothea Kolossa, Institute of Communication Acoustics, Ruhr-Universität Bochum, Bochum, Germany
Reinhold Häb-Umbach, Dept. of Communications Engineering, University of Paderborn, Paderborn, Germany

Scientists and researchers in the field of speech recognition will find an overview of the state of the art in robust speech recognition.
Professionals working in speech recognition will find strategies for improving results in various conditions of mismatch.
The contributing authors are among the leading researchers in this field.

Table of contents (13 chapters)

Front Matter
Pages i-xviii

Introduction
- Reinhold Haeb-Umbach, Dorothea Kolossa
Pages 1-5

Theoretical Foundations
1. Uncertainty Decoding and Conditional Bayesian Estimation
  Reinhold Haeb-Umbach
  Pages 9-33
2. Uncertainty Propagation
  Ramón Fernandez Astudillo, Dorothea Kolossa
  Pages 35-64

Applications: Noise Robustness
1. Front-End, Back-End, and Hybrid Techniques for Noise-Robust Speech Recognition
  Li Deng
  Pages 67-99
2. Model-Based Approaches to Handling Uncertainty
  M. J. F. Gales
  Pages 101-125
3. Reconstructing Noise-Corrupted Spectrographic Components for Robust Speech Recognition
  Bhiksha Raj, Rita Singh
  Pages 127-156
4. Automatic Speech Recognition Using Missing Data Techniques: Handling of Real-World Data
  Jort F. Gemmeke, Maarten Van Segbroeck, Yujun Wang, Bert Cranen, Hugo Van hamme
  Pages 157-185
5. Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition
  Volker Leutnant, Reinhold Haeb-Umbach
  Pages 187-221

Applications: Reverberation Robustness
1. Variance Compensation for Recognition of Reverberant Speech with Dereverberation Preprocessing
  Marc Delcroix, Shinji Watanabe, Tomohiro Nakatani
  Pages 225-255
2. A Model-Based Approach to Joint Compensation of Noise and Reverberation for Speech Recognition
  Alexander Krueger, Reinhold Haeb-Umbach
  Pages 257-290

Applications: Multiple Speakers and Modalities
1. Evidence Modeling for Missing Data Speech Recognition Using Small Microphone Arrays
  Marco Kühne, Roberto Togneri, Sven Nordholm
  Pages 293-318
2. Recognition of Multiple Speech Sources Using ICA
  Eugen Hoffmann, Dorothea Kolossa, Reinhold Orglmeister
  Pages 319-344
3. Use of Missing and Unreliable Data for Audiovisual Speech Recognition
  Alexander Vorwerk, Steffen Zeiler, Dorothea Kolossa, Ramón Fernandez Astudillo, Dennis Lerch
  Pages 345-375

Back Matter
Pages 377-380

About this book

Automatic speech recognition suffers from a lack of robustness with respect to noise, reverberation and interfering speech. The growing field of speech recognition in the presence of missing or uncertain input data seeks to ameliorate those problems by using not only a preprocessed speech signal but also an estimate of its reliability to selectively focus on those segments and features that are most reliable for recognition. This book presents the state of the art in recognition in the presence of uncertainty, offering examples that utilize uncertainty information for noise robustness, reverberation robustness, simultaneous recognition of multiple speech signals, and audiovisual speech recognition.

The book is appropriate for scientists and researchers in the field of speech recognition who will find an overview of the state of the art in robust speech recognition, professionals working in speech recognition who will find strategies for improving recognition results in various conditions of mismatch, and lecturers of advanced courses on speech processing or speech recognition who will find a reference and a comprehensive introduction to the field. The book assumes an understanding of the fundamentals of speech recognition using Hidden Markov Models.

About the editors

Prof. Dr.-Ing. Dorothea Kolossa is a professor at the Institut für Kommunikationsakustik of the Ruhr-Universität Bochum, Germany; her research interests are automatic speech recognition, digital speech signal processing, and blind source separation.

Prof. Dr.-Ing. Reinhold Haeb-Umbach heads the Dept. of Communications Engineering of the University of Paderborn, Germany; his research interests are speech signal processing and automatic speech recognition, statistical learning and pattern recognition, and signal processing for digital communications.

Bibliographic Information

Book Title: Robust Speech Recognition of Uncertain or Missing Data
Book Subtitle: Theory and Applications
Editors: Dorothea Kolossa, Reinhold Häb-Umbach
DOI: https://doi.org/10.1007/978-3-642-21317-5
Publisher: Springer Berlin, Heidelberg
eBook Packages: Engineering, Engineering (R0)
Copyright Information: Springer-Verlag Berlin Heidelberg 2011
Hardcover ISBN: 978-3-642-21316-8, Published: 14 July 2011
Softcover ISBN: 978-3-642-43868-4, Published: 12 November 2014
eBook ISBN: 978-3-642-21317-5, Published: 14 July 2011
Edition Number: 1
Number of Pages: XVIII, 380
Topics: Signal, Image and Speech Processing, Artificial Intelligence, Computational Linguistics