Name: Advances in Non-Linear Modeling for Speech Processing
ISBN: 978-1-4614-1505-3

Authors:

Raghunath S. Holambe ⁰,
Mangesh S. Deshpande ¹

Raghunath S. Holambe
1. , Department of Instrumentation, SGGS Institute of Engineering & Technolo, Vishnupuri, Nanded, India
View author publications

You can also search for this author in PubMed Google Scholar
Mangesh S. Deshpande
1. , Department of E&TC Engineering, SRES College of Engineering, Kopargaon, India
View author publications

You can also search for this author in PubMed Google Scholar

Nonlinear aspects of speech signals are covered in depth
Covers nonlinear modeling techniques from the context of speaker identification
New insight is explored to combine the speech production and speech perception systems
Includes supplementary material: sn.pub/extras

Part of the book series: SpringerBriefs in Speech Technology (BRIEFSSPEECHTECH)

4047 Accesses
13 Citations

Buy it now

eBook USD 39.99

Price excludes VAT (USA)

Softcover Book USD 54.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Learn about institutional subscriptions

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (6 chapters)

Front Matter

Pages i-xiii

PDF
Introduction
- Raghunath S. Holambe, Mangesh S. Deshpande
Pages 1-9
Nonlinearity Framework in Speech Processing
- Raghunath S. Holambe, Mangesh S. Deshpande
Pages 11-25
Linear and Dynamic System Model
- Raghunath S. Holambe, Mangesh S. Deshpande
Pages 27-44
Nonlinear Measurement and Modeling Using Teager Energy Operator
- Raghunath S. Holambe, Mangesh S. Deshpande
Pages 45-59
AM-FM: Modulation and Demodulation Techniques
- Raghunath S. Holambe, Mangesh S. Deshpande
Pages 61-75
Application to Speaker Recognition
- Raghunath S. Holambe, Mangesh S. Deshpande
Pages 77-99
Back Matter

Pages 101-102

PDF

About this book

Advances in Non-Linear Modeling for Speech Processing includes advanced topics in non-linear estimation and modeling techniques along with their applications to speaker recognition.

Non-linear aeroacoustic modeling approach is used to estimate the important fine-structure speech events, which are not revealed by the short time Fourier transform (STFT). This aeroacostic modeling approach provides the impetus for the high resolution Teager energy operator (TEO). This operator is characterized by a time resolution that can track rapid signal energy changes within a glottal cycle.

The cepstral features like linear prediction cepstral coefficients (LPCC) and mel frequency cepstral coefficients (MFCC) are computed from the magnitude spectrum of the speech frame and the phase spectra is neglected. To overcome the problem of neglecting the phase spectra, the speech production system can be represented as an amplitude modulation-frequency modulation (AM-FM) model. To demodulate the speech signal, to estimation the amplitude envelope and instantaneous frequency components, the energy separation algorithm (ESA) and the Hilbert transform demodulation (HTD) algorithm are discussed.

Different features derived using above non-linear modeling techniques are used to develop a speaker identification system. Finally, it is shown that, the fusion of speech production and speech perception mechanisms can lead to a robust feature set.

Keywords

Authors and Affiliations

, Department of Instrumentation, SGGS Institute of Engineering & Technolo, Vishnupuri, Nanded, India

Raghunath S. Holambe
, Department of E&TC Engineering, SRES College of Engineering, Kopargaon, India

Mangesh S. Deshpande

Bibliographic Information

Book Title: Advances in Non-Linear Modeling for Speech Processing
Authors: Raghunath S. Holambe, Mangesh S. Deshpande
Series Title: SpringerBriefs in Speech Technology
DOI: https://doi.org/10.1007/978-1-4614-1505-3
Publisher: Springer New York, NY
eBook Packages: Engineering, Engineering (R0)
Softcover ISBN: 978-1-4614-1504-6Published: 21 February 2012
eBook ISBN: 978-1-4614-1505-3Published: 21 February 2012
Series ISSN: 2191-737X
Series E-ISSN: 2191-7388
Edition Number: 1
Number of Pages: XIII, 102
Number of Illustrations: 32 b/w illustrations
Topics: Signal, Image and Speech Processing, Natural Language Processing (NLP), Artificial Intelligence

Publish with us

Policies and ethics

Authors:

Sections

Buy it now

Buying options

Other ways to access

Table of contents (6 chapters)

Front Matter

Introduction

Nonlinearity Framework in Speech Processing

Linear and Dynamic System Model

Nonlinear Measurement and Modeling Using Teager Energy Operator

AM-FM: Modulation and Demodulation Techniques

Application to Speaker Recognition

Back Matter

About this book

Keywords

Authors and Affiliations

, Department of Instrumentation, SGGS Institute of Engineering & Technolo, Vishnupuri, Nanded, India

, Department of E&TC Engineering, SRES College of Engineering, Kopargaon, India

Bibliographic Information

Publish with us

Buy it now

Buying options

Other ways to access

Search

Navigation