Name: Latent Semantic Mapping
ISBN: 978-3-031-02556-3

Overview

Authors:

Jerome R. Bellegarda ⁰

Jerome R. Bellegarda
1. Apple Inc., USA
View author publications

You can also search for this author in PubMed Google Scholar

Part of the book series: Synthesis Lectures on Speech and Audio Processing (SLSAP)

412 Accesses
2 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 29.99

Price excludes VAT (USA)

Softcover Book USD 16.99 ~~USD 37.99~~

Discount applied Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

About this book

Latent semantic mapping (LSM) is a generalization of latent semantic analysis (LSA), a paradigm originally developed to capture hidden word patterns in a text document corpus. In information retrieval, LSA enables retrieval on the basis of conceptual content, instead of merely matching words between queries and documents. It operates under the assumption that there is some latent semantic structure in the data, which is partially obscured by the randomness of word choice with respect to retrieval. Algebraic and/or statistical techniques are brought to bear to estimate this structure and get rid of the obscuring ""noise."" This results in a parsimonious continuous parameter description of words and documents, which then replaces the original parameterization in indexing and retrieval. This approach exhibits three main characteristics: -Discrete entities (words and documents) are mapped onto a continuous vector space; -This mapping is determined by global correlation patterns; and -Dimensionality reduction is an integral part of the process. Such fairly generic properties are advantageous in a variety of different contexts, which motivates a broader interpretation of the underlying paradigm. The outcome (LSM) is a data-driven framework for modeling meaningful global relationships implicit in large volumes of (not necessarily textual) data. This monograph gives a general overview of the framework, and underscores the multifaceted benefits it can bring to a number of problems in natural language understanding and spoken language processing. It concludes with a discussion of the inherent tradeoffs associated with the approach, and some perspectives on its general applicability to data-driven information extraction. Contents: I. Principles / Introduction / Latent Semantic Mapping / LSM Feature Space / Computational Effort / Probabilistic Extensions / II. Applications/ Junk E-mail Filtering / Semantic Classification / Language Modeling / Pronunciation Modeling / Speaker Verification / TTS Unit Selection / III. Perspectives / Discussion / Conclusion / Bibliography

Table of contents (13 chapters)

Front Matter

Pages i-x

Download chapter PDF
Principles
1. Front Matter
  
  Pages 1-1
  
  Download chapter PDF
2. Introduction
  
  Jerome R. Bellegarda
  
  Pages 3-8
3. Latent Semantic Mapping
  
  Jerome R. Bellegarda
  
  Pages 9-13
4. LSM Feature Space
  
  Jerome R. Bellegarda
  
  Pages 15-19
5. Computational Effort
  
  Jerome R. Bellegarda
  
  Pages 21-24
6. Probabilistic Extensions
  
  Jerome R. Bellegarda
  
  Pages 25-29
Applications
1. Front Matter
  
  Pages 31-31
  
  Download chapter PDF
2. Junk E-Mail Filtering
  
  Jerome R. Bellegarda
  
  Pages 33-39
3. Semantic Classification
  
  Jerome R. Bellegarda
  
  Pages 41-47
4. Language Modeling
  
  Jerome R. Bellegarda
  
  Pages 49-54
5. Pronunciation Modeling
  
  Jerome R. Bellegarda
  
  Pages 55-61
6. Speaker Verification
  
  Jerome R. Bellegarda
  
  Pages 63-69
7. TTS Unit Selection
  
  Jerome R. Bellegarda
  
  Pages 71-76
Perspectives
1. Front Matter
  
  Pages 77-77
  
  Download chapter PDF
2. Discussion
  
  Jerome R. Bellegarda
  
  Pages 79-83
3. Conclusion
  
  Jerome R. Bellegarda
  
  Pages 85-87
Back Matter

Pages 89-101

Download chapter PDF

Authors and Affiliations

Apple Inc., USA

Jerome R. Bellegarda

About the author

Jerome R. Bellegarda received the Diplome dIngenieur degree (summa cum laude) from the Ecole Nationale Superieure dElectricite et de Mecanique, Nancy, France, in 1984, and the M.S. and Ph.D. degrees in Electrical Engineering from the University of Rochester, Rochester, NY, in 1984 and 1987, respectively. From 1988 to 1994, he was a Research Staff Member at the IBM T.J. Watson Research Center, Yorktown Heights, NY, working on speech and handwriting recognition, particularly acoustic and chirographic modeling. In 1994, he joined Apple Inc., Cupertino, CA, where he is currently Apple Distinguished Scientist in Speech & Language Technologies. At Apple he has worked on many facets of human language processing, including speech recognition, speech synthesis, statistical language modeling, voice authentication, speaker adaptation, dialog interaction, metadata extraction, and semantic classification. In these areas he has written close to 150 journal and conference papers, and holds over 30 patents. He has also contributed chapters to several edited books, most recently Pattern Recognition in Speech and Language Processing (New York, NY: CRC Press, 2003), and Mathematical Foundations of Speech and Language Processing (New York, NY: Springer-Verlag, 2004). His research interests include statistical modeling algorithms, voice-driven man-machine communications, multiple input/output modalities, and multimedia knowledge management. Dr. Bellegarda has served on many international scientific committees, review panels, and editorial boards.

Bibliographic Information

Book Title: Latent Semantic Mapping
Book Subtitle: Principles and Applications
Authors: Jerome R. Bellegarda
Series Title: Synthesis Lectures on Speech and Audio Processing
DOI: https://doi.org/10.1007/978-3-031-02556-3
Publisher: Springer Cham
eBook Packages: Synthesis Collection of Technology (R0), eBColl Synthesis Collection 1
Copyright Information: Springer Nature Switzerland AG 2007
Softcover ISBN: 978-3-031-01428-4Published: 31 December 2007
eBook ISBN: 978-3-031-02556-3Published: 31 May 2022
Series ISSN: 1932-121X
Series E-ISSN: 1932-1678
Edition Number: 1
Number of Pages: X, 101
Topics: Electrical Engineering, Signal, Image and Speech Processing, Engineering Acoustics

Publish with us

Policies and ethics

Latent Semantic Mapping

Overview

Access this book

Other ways to access

About this book

Table of contents (13 chapters)

Front Matter

Principles

Front Matter

Applications

Front Matter

Perspectives

Front Matter

Back Matter

Authors and Affiliations

Apple Inc., USA

About the author

Bibliographic Information

Publish with us

Search

Navigation