Acoustical and Environmental Robustness in Automatic Speech Recognition

Acero, Alex

1993, XXI, 186 p.

Available Formats:
eBook
$159.00

(net) price for USA

ISBN 978-1-4615-3122-7

digitally watermarked, no DRM

Included Format: PDF


Hardcover
$209.00

(net) price for USA

ISBN 978-0-7923-9284-2


Softcover
$209.00

(net) price for USA

ISBN 978-1-4613-6366-8


  • About this book

The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in recent years, as more systems are finding their way into practical applications. Although the issue of environmental robustness has received only a small fraction of the attention devoted to speaker independence, even speech recognition systems that are designed to be speaker independent frequently perform very poorly when they are tested using a different type of microphone or acoustical environment from the one with which they were trained. The use of microphones other than a "close-talking" headset also tends to severely degrade speech recognition performance. Even in relatively quiet office environments, speech is degraded by additive noise from fans, slamming doors, and other conversations, as well as by the effects of unknown linear filtering arising from reverberation from surface reflections in a room, or spectral shaping by microphones or the vocal tracts of individual speakers. Speech recognition systems designed for long-distance telephone lines, or applications deployed in more adverse acoustical environments such as motor vehicles, factory floors, or outdoors, demand far greater degrees of environmental robustness. There are several different ways of building acoustical robustness into speech recognition systems. Arrays of microphones can be used to develop a directionally sensitive system that resists interference from competing talkers and other noise sources that are spatially separated from the source of the desired speech signal.

Content Level » Research

Keywords » Signal - development - digital signal processor - filtering - filters - signal processing - signal processor - speech recognition - tables

Related subjects » Electronics & Electrical Engineering - Signals & Communication

Table of contents 

List of Figures
List of Tables
Foreword
1. Introduction
2. Experimental Procedure
3. Frequency Domain Processing
4. The SDCN Algorithm
5. The CDCN Algorithm
6. Other Algorithms
7. Frequency Normalization
8. Summary of Results
9. Conclusions
Appendix I: Glossary
Appendix II: Signal Processing in Sphinx
Appendix III: The Bilinear Transform
Appendix IV: Spectral Estimation Issues
Appendix V: MMSE Estimation in the CDCN Algorithm
Appendix VI: Maximum Likelihood via the EM Algorithm
Appendix VII: Estimation of Noise and Spectral Tilt
Appendix VIII: Vocabulary and Pronunciation Dictionary
References
Index
