Name: DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement
ISBN: 978-3-031-02564-8

Overview

Authors:

Richard C. Hendriks,
Timo Gerkmann,
Jesper Jensen

Richard C. Hendriks
1. Delft University of Technology, The Netherlands

Timo Gerkmann
1. Universität Oldenburg, Germany

Jesper Jensen
1. Oticon A/S, Denmark
2. Aalborg University, Denmark

Part of the book series: Synthesis Lectures on Speech and Audio Processing (SLSAP)

About this book

As speech processing devices like mobile phones, voice-controlled devices, and hearing aids have increased in popularity, people expect them to work anywhere and at any time without user intervention. However, the presence of acoustical disturbances limits the use of these applications, degrades their performance, or causes the user difficulties in understanding the conversation or appreciating the device. A common way to reduce the effects of such disturbances is to use single-microphone noise reduction algorithms for speech enhancement. The field of single-microphone noise reduction for speech enhancement has a history of more than 30 years of research. In this survey, we wish to demonstrate the significant advances that have been made during the last decade in the field of discrete Fourier transform (DFT) domain-based single-channel noise reduction for speech enhancement. Furthermore, our goal is to provide a concise description of a state-of-the-art speech enhancement system and to demonstrate the relative importance of the various building blocks of such a system. This allows the non-expert DSP practitioner to judge the relevance of each building block and to implement a close-to-optimal enhancement system for the particular application at hand.

Table of Contents: Introduction / Single Channel Speech Enhancement: General Principles / DFT-Based Speech Enhancement Methods: Signal Model and Notation / Speech DFT Estimators / Speech Presence Probability Estimation / Noise PSD Estimation / Speech PSD Estimation / Performance Evaluation Methods / Simulation Experiments with Single-Channel Enhancement Systems / Future Directions

Table of contents (10 chapters)

- Front Matter (pages i-xii)
- Introduction (pages 1-4)
- Single Channel Speech Enhancement: General Principles (pages 5-11)
- DFT-Based Speech Enhancement Methods: Signal Model and Notation (pages 13-14)
- Speech DFT Estimators (pages 15-22)
- Speech Presence Probability Estimation (pages 23-28)
- Noise PSD Estimation (pages 29-36)
- Speech PSD Estimation (pages 37-42)
- Performance Evaluation Methods (pages 43-48)
- Simulation Experiments with Single-Channel Enhancement Systems (pages 49-53)
- Future Directions (pages 55-56)
- Back Matter (pages 57-70)

All chapters are by Richard C. Hendriks, Timo Gerkmann, and Jesper Jensen.

About the authors

Dr. ir. Richard C. Hendriks obtained his M.Sc. and Ph.D. degrees (both cum laude) in electrical engineering from Delft University of Technology, Delft, The Netherlands, in 2003 and 2008, respectively. From 2003 to 2007 he was a Ph.D. researcher at Delft University of Technology, and from 2007 to 2010 he was a postdoctoral researcher there. Since 2010 he has been an assistant professor in the Signal and Information Processing Lab of the Faculty of Electrical Engineering, Mathematics and Computer Science at Delft University of Technology. In the autumn of 2005, he was a visiting researcher at the Institute of Communication Acoustics, Ruhr-Universität Bochum, Bochum, Germany. From March 2008 to March 2009 he was a visiting researcher at Oticon A/S, Copenhagen, Denmark. His main research interests are digital speech and audio processing, including single-channel and multi-channel acoustical noise reduction, speech enhancement, and intelligibility improvement.

Prof. Dr.-Ing. Timo Gerkmann studied electrical engineering at the universities of Bremen and Bochum, Germany. He received his Dipl.-Ing. degree in 2004 and his Dr.-Ing. degree in 2010, both at the Institute of Communication Acoustics (IKA) at the Ruhr-Universität Bochum, Bochum, Germany. From January 2005 to July 2005 he was with Siemens Corporate Research in Princeton, NJ, USA. In 2011 he was a postdoctoral researcher at the Sound and Image Processing Lab at the Royal Institute of Technology (KTH), Stockholm, Sweden. Since December 2011 he has headed the Speech Signal Processing Group at the Universität Oldenburg, Oldenburg, Germany. His main research interests are speech enhancement algorithms and the modeling of speech signals.

Jesper Jensen received the M.Sc. degree in electrical engineering and the Ph.D. degree in signal processing from Aalborg University, Aalborg, Denmark, in 1996 and 2000, respectively. From 1996 to 2000, he was with the Center for Person Kommunikation (CPK), Aalborg University, as a Ph.D. student and Assistant Research Professor. From 2000 to 2007, he was a Post-Doctoral Researcher and Assistant Professor with Delft University of Technology, Delft, The Netherlands, and an External Associate Professor with Aalborg University. Currently, he is a Senior Researcher with Oticon A/S, Copenhagen, Denmark, where his main responsibility is scouting and development of new signal processing concepts for hearing aid applications. He is also a Professor with the Section for Multimedia Information and Signal Processing (MISP), Department of Electronic Systems at Aalborg University, Denmark. His main interests are in the area of acoustic signal processing, including signal retrieval from noisy observations, coding, speech and audio modification and synthesis, intelligibility enhancement of speech signals, signal processing for hearing aid applications, and perceptual aspects of signal processing.

Bibliographic Information

Book Title: DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement
Authors: Richard C. Hendriks, Timo Gerkmann, Jesper Jensen
Series Title: Synthesis Lectures on Speech and Audio Processing
DOI: https://doi.org/10.1007/978-3-031-02564-8
Publisher: Springer Cham
eBook Packages: Synthesis Collection of Technology (R0), eBColl Synthesis Collection 4
Copyright Information: Springer Nature Switzerland AG 2013
Softcover ISBN: 978-3-031-01436-9 (published 11 February 2013)
eBook ISBN: 978-3-031-02564-8 (published 31 May 2022)
Series ISSN: 1932-121X
Series E-ISSN: 1932-1678
Edition Number: 1
Number of Pages: XII, 70
Topics: Electrical Engineering, Signal, Image and Speech Processing, Engineering Acoustics
