Overview

Authors:

Kai-Fu Lee ⁰

Kai-Fu Lee
1. Carnegie Mellon University, Pittsburgh, USA
View author publications

You can also search for this author in PubMed Google Scholar

Part of the book series: The Springer International Series in Engineering and Computer Science (SECS, volume 62)

2004 Accesses
237 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 129.00

Price excludes VAT (USA)

Softcover Book USD 169.99

Price excludes VAT (USA)

Hardcover Book USD 169.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (9 chapters)

Front Matter

Pages i-xv

Download chapter PDF
Introduction
- Kai-Fu Lee
Pages 1-16
Hidden Markov Modeling of Speech
- Kai-Fu Lee
Pages 17-43
Task and Databases
- Kai-Fu Lee
Pages 45-50
The Baseline SPHINX System
- Kai-Fu Lee
Pages 51-62
Adding Knowledge
- Kai-Fu Lee
Pages 63-89
Finding a Good Unit of Speech
- Kai-Fu Lee
Pages 91-114
Learning and Adaptation
- Kai-Fu Lee
Pages 115-127
Summary of Results
- Kai-Fu Lee
Pages 129-136
Conclusion
- Kai-Fu Lee
Pages 137-144
Back Matter

Pages 145-207

Download chapter PDF

Keywords

About this book

Speech Recognition has a long history of being one of the difficult problems in Artificial Intelligence and Computer Science. As one goes from problem solving tasks such as puzzles and chess to perceptual tasks such as speech and vision, the problem characteristics change dramatically: knowledge poor to knowledge rich; low data rates to high data rates; slow response time (minutes to hours) to instantaneous response time. These characteristics taken together increase the computational complexity of the problem by several orders of magnitude. Further, speech provides a challenging task domain which embodies many of the requirements of intelligent behavior: operate in real time; exploit vast amounts of knowledge, tolerate errorful, unexpected unknown input; use symbols and abstractions; communicate in natural language and learn from the environment. Voice input to computers offers a number of advantages. It provides a natural, fast, hands free, eyes free, location free input medium. However, there are many as yet unsolved problems that prevent routine use of speech as an input device by non-experts. These include cost, real time response, speaker independence, robustness to variations such as noise, microphone, speech rate and loudness, and the ability to handle non-grammatical speech. Satisfactory solutions to each of these problems can be expected within the next decade. Recognition of unrestricted spontaneous continuous speech appears unsolvable at present. However, by the addition of simple constraints, such as clarification dialog to resolve ambiguity, we believe it will be possible to develop systems capable of accepting very large vocabulary continuous speechdictation.

Authors and Affiliations

Carnegie Mellon University, Pittsburgh, USA

Kai-Fu Lee

Bibliographic Information

Book Title: Automatic Speech Recognition
Book Subtitle: The Development of the SPHINX System
Authors: Kai-Fu Lee
Series Title: The Springer International Series in Engineering and Computer Science
DOI: https://doi.org/10.1007/978-1-4615-3650-5
Publisher: Springer New York, NY
eBook Packages: Springer Book Archive
Copyright Information: Springer Science+Business Media New York 1989
Hardcover ISBN: 978-0-89838-296-9Published: 31 October 1988
Softcover ISBN: 978-1-4613-6624-9Published: 03 March 2013
eBook ISBN: 978-1-4615-3650-5Published: 06 December 2012
Series ISSN: 0893-3405
Edition Number: 1
Number of Pages: XV, 207
Topics: Signal, Image and Speech Processing, Artificial Intelligence

Publish with us

Policies and ethics

Automatic Speech Recognition

Overview

Access this book

Other ways to access

Table of contents (9 chapters)

Front Matter

Back Matter

Keywords

About this book

Authors and Affiliations

Carnegie Mellon University, Pittsburgh, USA

Bibliographic Information

Publish with us

Search

Navigation