Name: Speech Coding
ISBN: 978-3-319-50204-5

Overview

Authors:

Tom Bäckström
1. International Audio Laboratories Erlangen (AudioLabs), Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany

Provides a unified theoretical framework for analysis, treatment and development of speech coding methods.
Covers state-of-the-art speech coding methods, including those used in the 3GPP Enhanced Voice Services codec.
Includes pedagogical and thorough expositions of the theoretical foundations of speech coding methods.
Includes supplementary material: sn.pub/extras

Part of the book series: Signals and Communication Technology (SCT)



Table of contents (15 chapters)

Front Matter
  Pages i-xxi

1. Introduction
  Tom Bäckström
  Pages 1-8

Basic Properties of Speech Signals
  Front Matter
  Page 9

2. Speech Production and Modelling
  Tom Bäckström
  Pages 11-30
3. Principles of Entropy Coding with Perceptual Quality Evaluation
  Tom Bäckström
  Pages 31-44

Core Tools
  Front Matter
  Page 45

4. Spectral Envelope and Perceptual Masking Models
  Tom Bäckström
  Pages 47-76
5. Windowing and the Zero Input Response
  Tom Bäckström
  Pages 77-90
6. Fundamental Frequency
  Tom Bäckström
  Pages 91-96
7. Residual Coding
  Tom Bäckström
  Pages 97-116
8. Signal Gain and Harmonics to Noise Ratio
  Tom Bäckström
  Pages 117-120

Advanced Tools and Extensions
  Front Matter
  Page 121

9. Pre- and Postfiltering
  Tom Bäckström
  Pages 123-130
10. Frequency Domain Coding
  Tom Bäckström
  Pages 131-150
11. Bandwidth Extension
  Sascha Disch, Tom Bäckström
  Pages 151-160
12. Packet Loss and Concealment
  Jérémie Lecomte, Tom Bäckström
  Pages 161-184
13. Voice Activity Detection
  Christian Uhle, Tom Bäckström
  Pages 185-203
14. Relaxed Code-Excited Linear Prediction (RCELP)
  Guillaume Fuchs, Tom Bäckström
  Pages 205-215

Standards and Specifications
  Front Matter
  Page 217

15. Quality Evaluation
  Tom Bäckström
  Pages 219-235

About this book

This book provides a scientific understanding of the most central techniques used in speech coding, for both advanced students and professionals with a background in speech, audio, and/or digital signal processing. It provides a clear connection between the whys, hows, and whats, such that the necessity, purpose, and solutions provided by each tool are always in sight, along with their strengths and weaknesses in each respect. Equivalently, this book sheds light on the following perspectives for each technology presented:

Objective: What do we want to achieve and especially why is this goal important?

Resource / Information: What information is available and how can it be useful?

Resource / Platform: What kind of platforms are we working with and what are the capabilities/restrictions of those platforms? This includes properties such as computational, memory, acoustic and transmission capacity of devices used.

Solutions: Which solutions have been proposed and how can they be used to reach the stated goals?

Strengths and weaknesses: In which ways do the solutions fulfill the objectives and where are they insufficient? Are resources used efficiently?

This book concentrates solely on code-excited linear prediction and its derivatives, since mainstream speech codecs are based on linear prediction. It also concentrates exclusively on time-domain techniques, because frequency-domain tools are to a large extent shared with audio codecs.

Authors and Affiliations

International Audio Laboratories Erlangen (AudioLabs), Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany

Tom Bäckström

About the author

Tom Bäckström is Professor for Speech Coding at the University of Erlangen-Nuremberg and a member of the International Audio Labs Erlangen, funded by Fraunhofer IIS. His research concerns mathematical methods for the modeling of voice and audio. He is particularly interested in developing the mathematical side of the field further, at the intersection of digital signal processing, matrix and polynomial algebra, and functional analysis.

Bibliographic Information

Book Title: Speech Coding
Book Subtitle: with Code-Excited Linear Prediction
Authors: Tom Bäckström
Series Title: Signals and Communication Technology
DOI: https://doi.org/10.1007/978-3-319-50204-5
Publisher: Springer Cham
eBook Packages: Engineering, Engineering (R0)
Copyright Information: Springer International Publishing AG 2017
Hardcover ISBN: 978-3-319-50202-1 (Published: 07 April 2017)
Softcover ISBN: 978-3-319-84344-5 (Published: 20 July 2018)
eBook ISBN: 978-3-319-50204-5 (Published: 29 March 2017)
Series ISSN: 1860-4862
Series E-ISSN: 1860-4870
Edition Number: 1
Number of Pages: XXI, 240
Number of Illustrations: 76 b/w illustrations
Topics: Signal, Image and Speech Processing, Information Systems and Communication Service, Communications Engineering, Networks
