SpringerBriefs in Electrical and Computer Engineering

Speech Enhancement in the STFT Domain

Authors: Benesty, Jacob, Chen, Jingdong, Habets, Emanuël A.P.

  • Addresses this problem in the the short-time Fourier transform.
  • Written for experts in this field.
Show all benefits

Buy this book

eBook $34.99
price for USA (gross)
  • ISBN 978-3-642-23250-3
  • Digitally watermarked, DRM-free
  • Included format: PDF, EPUB
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover $49.95
price for USA
  • ISBN 978-3-642-23249-7
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Rent the ebook  
  • Rental duration: 1 or 6 month
  • low-cost access
  • online reader with highlighting and note-making option
  • can be used across all devices
About this book

This work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the single-channel problem where STFT coefficients at different frames and frequency bands are assumed to be independent. In this case, the noise reduction filter in each frequency band is basically a real gain. Since a gain does not improve the signal-to-noise ratio (SNR) for any given subband and frame, the noise reduction is basically achieved by liftering the subbands and frames that are less noisy while weighing down on those that are more noisy. The second category also concerns the single-channel problem. The difference is that now the interframe correlation is taken into account and a filter is applied in each subband instead of just a gain.
The advantage of using the interframe correlation is that we can improve not only the long-time fullband SNR, but the frame-wise subband SNR as well. The third and fourth classes discuss the problem of multichannel noise reduction in the STFT domain with and without interframe correlation, respectively. In the last category, we consider the interband correlation in the design of the noise reduction filters. We illustrate the basic principle for the single-channel case as an example, while this concept can be generalized to other scenarios. In all categories, we propose different optimization cost functions from which we derive the optimal filters and we also define the performance measures that help analyzing them.

Reviews

From the reviews:

“This work addresses the problem in the short-time Fourier transform (STFT) domain. The general problem is divided into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. … This book is mainly a research book for people doing research in electrical and computer engineering.” (Yuehua Wu, Zentralblatt MATH, Vol. 1242, 2012)


Table of contents (4 chapters)

  • Introduction

    Jacob Benesty, Jingdong Chen, Emanuël A. P. Habets

    Pages 1-13

  • Single-Channel Speech Enhancement with a Gain

    Jacob Benesty, Jingdong Chen, Emanuël A. P. Habets

    Pages 15-28

  • Single-Channel Speech Enhancement with a Filter

    Jacob Benesty, Jingdong Chen, Emanuël A. P. Habets

    Pages 29-49

  • The Bifrequency Spectrum in Speech Enhancement

    Jacob Benesty, Jingdong Chen, Emanuël A. P. Habets

    Pages 93-101

Buy this book

eBook $34.99
price for USA (gross)
  • ISBN 978-3-642-23250-3
  • Digitally watermarked, DRM-free
  • Included format: PDF, EPUB
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover $49.95
price for USA
  • ISBN 978-3-642-23249-7
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Rent the ebook  
  • Rental duration: 1 or 6 month
  • low-cost access
  • online reader with highlighting and note-making option
  • can be used across all devices
Loading...

Recommended for you

Loading...

Bibliographic Information

Bibliographic Information
Book Title
Speech Enhancement in the STFT Domain
Authors
Series Title
SpringerBriefs in Electrical and Computer Engineering
Copyright
2012
Publisher
Springer-Verlag Berlin Heidelberg
Copyright Holder
The Author(s)
eBook ISBN
978-3-642-23250-3
DOI
10.1007/978-3-642-23250-3
Softcover ISBN
978-3-642-23249-7
Series ISSN
2191-8112
Edition Number
1
Number of Pages
VII, 109
Number of Illustrations and Tables
5 b/w illustrations
Topics