Skip to main content
  • Book
  • © 2018

Audio Source Separation

Editors:

  • Offers the first comprehensive treatment of audio source separation based on non-negative matrix factorization, deep neural network, and sparse component analysis
  • Describes fundamentals and application of state-of-the-art audio source separation techniques
  • Presents a comprehensive, authoritative, and accessible treatment to the subject matter

Part of the book series: Signals and Communication Technology (SCT)

Buy it now

Buying options

eBook USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (14 chapters)

  1. Front Matter

    Pages i-viii
  2. Single-Channel Audio Source Separation with NMF: Divergences, Constraints and Algorithms

    • Cédric Févotte, Emmanuel Vincent, Alexey Ozerov
    Pages 1-24
  3. Dynamic Non-negative Models for Audio Source Separation

    • Paris Smaragdis, Gautham Mysore, Nasser Mohammadiha
    Pages 49-71
  4. An Introduction to Multichannel NMF for Audio Source Separation

    • Alexey Ozerov, Cédric Févotte, Emmanuel Vincent
    Pages 73-94
  5. General Formulation of Multichannel Extensions of NMF Variants

    • Hirokazu Kameoka, Hiroshi Sawada, Takuya Higuchi
    Pages 95-124
  6. Determined Blind Source Separation with Independent Low-Rank Matrix Analysis

    • Daichi Kitamura, Nobutaka Ono, Hiroshi Sawada, Hirokazu Kameoka, Hiroshi Saruwatari
    Pages 125-155
  7. Deep Neural Network Based Multichannel Audio Source Separation

    • Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent
    Pages 157-185
  8. Efficient Source Separation Using Bitwise Neural Networks

    • Minje Kim, Paris Smaragdis
    Pages 187-206
  9. DNN Based Mask Estimation for Supervised Speech Separation

    • Jitong Chen, DeLiang Wang
    Pages 207-235
  10. Informed Spatial Filtering Based on Constrained Independent Component Analysis

    • Hendrik Barfuss, Klaus Reindl, Walter Kellermann
    Pages 237-278
  11. Recent Advances in Multichannel Source Separation and Denoising Based on Source Sparseness

    • Nobutaka Ito, Shoko Araki, Tomohiro Nakatani
    Pages 279-300
  12. Multimicrophone MMSE-Based Speech Source Separation

    • Shmulik Markovich-Golan, Israel Cohen, Sharon Gannot
    Pages 301-331
  13. Audio-Visual Source Separation with Alternating Diffusion Maps

    • David Dov, Ronen Talmon, Israel Cohen
    Pages 365-382
  14. Back Matter

    Pages 383-385

About this book

This book provides the first comprehensive overview of the fascinating topic of audio source separation based on non-negative matrix factorization, deep neural networks, and sparse component analysis.

The first section of the book covers single channel source separation based on non-negative matrix factorization (NMF). After an introduction to the technique, two further chapters describe separation of known sources using non-negative spectrogram factorization, and temporal NMF models. In section two, NMF methods are extended to multi-channel source separation. Section three introduces deep neural network (DNN) techniques, with chapters on multichannel and single channel separation, and a further chapter on DNN based mask estimation for monaural speech separation. In section four, sparse component analysis (SCA) is discussed, with chapters on source separation using audio directional statistics modelling, multi-microphone MMSE-based techniques and diffusion map methods.

The book brings together leading researchers to provide tutorial-like and in-depth treatments on major audio source separation topics, with the objective of becoming the definitive source for a comprehensive, authoritative, and accessible treatment. This book is written for graduate students and researchers who are interested in audio source separation techniques based on NMF, DNN and SCA.

Editors and Affiliations

  • University of Tsukuba, Ibaraki, Japan

    Shoji Makino

About the editor

SHOJI MAKINO (F) received the B. E., M. E., and Ph.D. degrees from Tohoku University, Japan, in 1979, 1981, and 1993, respectively. He joined NTT in 1981. He is now a Professor at University of Tsukuba. His research interests include adaptive filtering technologies, realization of acoustic echo cancellation, blind source separation of convolutive mixtures of speech, and acoustic signal processing for speech and audio applications.

Dr. Makino received the IEEE SPS Best Paper Award in 2014, the IEEE MLSP Competition Award in 2007, the ICA Unsupervised Learning Pioneer Award in 2006, the Commendation for Science and Technology of Japanese Government in 2015, the TELECOM System Technology Award in 2015 and 2004, the Achievement Award of the Institute of Electronics, Information, and Communication Engineers (IEICE) in 1997, and the Outstanding Technological Development Award of the Acoustical Society of Japan (ASJ) in 1995, the Paper Award of the IEICE in 2005 and 2002, the Paper Award of the ASJ in 2005 and 2002. He is the author or co-author of more than 200 articles in journals and conference proceedings and is responsible for more than 150 patents. He was a Keynote Speaker at ICA2007 and a Tutorial speaker at EMBC 2013, Interspeech 2011 and ICASSP 2007.

Dr. Makino IEEE activities include: Member, SPS Technical Directions Board (2013-14), SPS Awards Board (2006-08), SPS Conference Board (2002-04), IEEE Jack S. Kilby Signal Processing Medal Committee (2015-), IEEE James L. Flanagan Speech & Audio Processing Award Committee (2008-11) and  Member and Chair, SPS Audio and Electroacoustics Technical Committee (1993-09 and 2013-14, respectively); SPS Distinguished Lecturer (2009-10); Chair, Circuits and Systems Society Blind Signal Processing Technical Committee (2009-2010); Associate Editor, IEEE Transactions on Speech and Audio Processing (2002-05) and EURASIP Journal on Advances in Signal Processing (2005-2012). He was the Vice President, Engineering Sciences Society of the IEICE (2007-08) and Chair, Engineering Acoustics Technical Committee of the IEICE (2006-08). He is a Member, International IWAENC Standing committee and International ICA Steering Committee; General Chair, WASPAA2007 and IWAENC2003; Organizing Chair, ICA2003; and Plenary Chair, ICASSP2012.

Dr. Makino is an IEEE Fellow, an IEICE Fellow, a Board member of the ASJ, and a member of EURASIP and ISCA.

Bibliographic Information

  • Book Title: Audio Source Separation

  • Editors: Shoji Makino

  • Series Title: Signals and Communication Technology

  • DOI: https://doi.org/10.1007/978-3-319-73031-8

  • Publisher: Springer Cham

  • eBook Packages: Engineering, Engineering (R0)

  • Copyright Information: Springer International Publishing AG 2018

  • Hardcover ISBN: 978-3-319-73030-1Published: 12 March 2018

  • Softcover ISBN: 978-3-030-10303-3Published: 25 January 2019

  • eBook ISBN: 978-3-319-73031-8Published: 01 March 2018

  • Series ISSN: 1860-4862

  • Series E-ISSN: 1860-4870

  • Edition Number: 1

  • Number of Pages: VIII, 385

  • Number of Illustrations: 67 b/w illustrations, 74 illustrations in colour

  • Topics: Signal, Image and Speech Processing, Acoustics, Engineering Acoustics

Buy it now

Buying options

eBook USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access