Overview
- Provides a unified theoretical framework for analysis, treatment and development of speech coding methods.
- Covers state-of-the-art speech coding methods, including those used in the 3GPP Enhanced Voice Services codec.
- Includes pedagogical and thorough expositions of the theoretical foundations of speech doing methods
- Includes supplementary material: sn.pub/extras
Part of the book series: Signals and Communication Technology (SCT)
Access this book
Tax calculation will be finalised at checkout
Other ways to access
Table of contents (15 chapters)
-
Basic Properties of Speech Signals
-
Advanced Tools and Extensions
-
Standards and Specifications
Keywords
About this book
This book provides scientific understanding of the most central techniques used in speech coding both for advanced students as well as professionals with a background in speech audio and or digital signal processing. It provides a clear connection between the Why’s?, How’s?, and What’s, such that the necessity, purpose and solutions provided by tools should be always within sight, as well as their strengths and weaknesses in each respect. Equivalently, this book sheds light on the following perspectives for each technology presented:
Objective: What do we want to achieve and especially why is this goal important?
Resource / Information: What information is available and how can it be useful?
Resource / Platform: What kind of platforms are we working with and what are the capabilities/restrictions of those platforms? This includes properties such as computational, memory, acoustic and transmission capacity of devices used.
Solutions: Which solutions have been proposed and how can they be used to reach the stated goals?
Strengths and weaknesses: In which ways do the solutions fulfill the objectives and where are they insufficient? Are resources used efficiently?
This book concentrates solely on code excited linear prediction and its derivatives since mainstream speech codecs are based on linear prediction It also concentrates exclusively on time domain techniques because frequency domain tools are to a large extent common with audio codecs.
Authors and Affiliations
About the author
Tom Bäckström is Professor for Speech Coding at University of Erlangen-Nuremberg and Member of the International Audio Labs Erlangen, funded by Fraunhofer IIS. He is active as a researcher in mathematical methods in the modeling of the voice and audio. His interests are in developing the mathematical side even more in the intersection of digital signal processing, matrix and polynomial algebra and functional analysis.
Bibliographic Information
Book Title: Speech Coding
Book Subtitle: with Code-Excited Linear Prediction
Authors: Tom Bäckström
Series Title: Signals and Communication Technology
DOI: https://doi.org/10.1007/978-3-319-50204-5
Publisher: Springer Cham
eBook Packages: Engineering, Engineering (R0)
Copyright Information: Springer International Publishing AG 2017
Hardcover ISBN: 978-3-319-50202-1Published: 07 April 2017
Softcover ISBN: 978-3-319-84344-5Published: 20 July 2018
eBook ISBN: 978-3-319-50204-5Published: 29 March 2017
Series ISSN: 1860-4862
Series E-ISSN: 1860-4870
Edition Number: 1
Number of Pages: XXI, 240
Number of Illustrations: 76 b/w illustrations
Topics: Signal, Image and Speech Processing, Information Systems and Communication Service, Communications Engineering, Networks