Multilingual Phone Recognition in Indian Languages

  • Book
  • © 2022

Overview

  • Discusses the use of a Multilingual Phone Recognition System (Multi-PRS) for decoding the phonetic units present in speech signals
  • Includes the design, development, and applications of multilingual phone recognition for several Indian languages
  • Presents applications in machine translation, speech-to-speech systems, language adaptation, language recognition and code-switching

Part of the book series: SpringerBriefs in Speech Technology (BRIEFSSPEECHTECH)

Table of contents (7 chapters)

About this book

The book presents current research and developments in multilingual speech recognition. The author presents a Multilingual Phone Recognition System (Multi-PRS) developed using a common multilingual phone-set derived from International Phonetic Alphabet (IPA) based transcriptions of six Indian languages: Kannada, Telugu, Bengali, Odia, Urdu, and Assamese. The author shows how the performance of Multi-PRS can be improved using tandem features. The book compares Monolingual Phone Recognition Systems (Mono-PRS) with Multi-PRS, and baseline systems with tandem systems. Methods are proposed to predict Articulatory Features (AFs) from spectral features using Deep Neural Networks (DNNs). Multitask learning is explored to improve the prediction accuracy of AFs. The AFs are then used to improve the performance of Multi-PRS through two methods of combination: lattice rescoring and tandem. Finally, the author develops and evaluates two approaches: Language Identification followed by Monolingual phone recognition (LID-Mono), and multilingual phone recognition based on a common multilingual phone-set.
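As a rough illustration of what a common multilingual phone-set means, the sketch below pools IPA symbols from per-language transcriptions into one shared inventory. It is not the book's procedure: the function name `build_common_phone_set` and the tiny transcriptions are hypothetical and chosen only to show the idea that phones with the same IPA symbol are shared across languages.

```python
# Toy, hypothetical IPA transcriptions per language.
# These are NOT real phone inventories and NOT data from the book.
toy_transcripts = {
    "Kannada": [["k", "a", "n"], ["m", "a"]],
    "Telugu": [["t", "e", "l"], ["k", "a"]],
    "Bengali": [["b", "a", "n"]],
}


def build_common_phone_set(transcripts):
    """Map each IPA symbol to the sorted list of languages that use it."""
    languages_by_phone = {}
    for language, utterances in transcripts.items():
        for utterance in utterances:
            for phone in utterance:
                languages_by_phone.setdefault(phone, set()).add(language)
    # Sort phones and language lists so the result is deterministic.
    return {
        phone: sorted(languages)
        for phone, languages in sorted(languages_by_phone.items())
    }


common_phone_set = build_common_phone_set(toy_transcripts)

# Phones used by more than one language are where acoustic data gets pooled.
shared_phones = [p for p, langs in common_phone_set.items() if len(langs) > 1]

for phone, languages in common_phone_set.items():
    print(phone, languages)
```

In this toy setup a single recognizer trained on the pooled inventory would see examples of a shared phone from every language that uses it, which is the intuition behind using one phone-set for all languages.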

Authors and Affiliations

  • U R Rao Satellite Centre, Indian Space Research Organisation, Bengaluru, India

    K.E Manjunath

About the author

Dr. Manjunath K E received his PhD in multilingual speech recognition from the International Institute of Information Technology, Bangalore, India, and his MS in automatic speech recognition from the Indian Institute of Technology, Kharagpur, India. He currently works as a Scientist at the U R Rao Satellite Centre, Indian Space Research Organisation (ISRO). He has published in several international conferences and journals. He has co-authored the book “Speech recognition using Articulatory and Excitation Source Features” (Springer 2017).
