Overview

Authors:

Mohamed Elmahdy (Qatar University, Doha, Qatar)
Rainer Gruhn (SVOX Deutschland GmbH, Ulm, Germany)
Wolfgang Minker (Institute of Information Technology, University of Ulm, Ulm, Germany)

Presents novel approaches that overcome the major problems in dialectal Arabic speech recognition
Investigates how to benefit from existing standard Arabic speech resources to improve speech recognition accuracy for dialectal Arabic
Explains in detail how the proposed approaches have been evaluated against conventional speech recognition techniques
Includes supplementary material: sn.pub/extras


Table of contents (7 chapters)

All chapters are by Mohamed Elmahdy, Rainer Gruhn, and Wolfgang Minker.

Front Matter
Introduction
Fundamentals
Speech Corpora
Phonemic Acoustic Modeling
Graphemic Acoustic Modeling
Phonetic Transcription Using the Arabic Chat Alphabet
Conclusions and Future Directions
Back Matter

About this book

Novel Techniques for Dialectal Arabic Speech Recognition describes approaches to improving automatic speech recognition for dialectal Arabic. Because speech resources for dialectal Arabic are scarce, the authors describe how existing Modern Standard Arabic (MSA) speech data can be applied to dialectal Arabic speech recognition, under the assumption that MSA is always a second language for all Arabic speakers.

In this book, Egyptian Colloquial Arabic (ECA) is chosen as a typical Arabic dialect. ECA ranks first among Arabic dialects by number of speakers, and a high-quality ECA speech corpus with accurate phonetic transcription has been collected. MSA acoustic models were trained on broadcast news speech. To use MSA cross-lingually in dialectal Arabic speech recognition, the authors normalized the phoneme sets of MSA and ECA. After this normalization, they applied state-of-the-art acoustic model adaptation techniques, such as Maximum Likelihood Linear Regression (MLLR) and Maximum A Posteriori (MAP) adaptation, to adapt existing phonemic MSA acoustic models using a small amount of dialectal ECA speech data. Speech recognition results indicate a significant increase in recognition accuracy over a baseline model trained on ECA data alone.
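
To make the idea of MAP adaptation more concrete, here is a minimal sketch. It is not the authors' implementation. It shows the standard MAP update for the mean of a single Gaussian component: the adapted mean is a weighted average of the prior mean (from the large source model, here assumed to be MSA) and the adaptation data (here assumed to be ECA). The function name `map_adapt_mean`, the prior weight `tau` with its default value, and the toy feature values are all assumptions made for illustration. A real recognizer would apply this update to every Gaussian in the model, with occupation probabilities obtained from forced alignment.

```python
from typing import List, Sequence


def map_adapt_mean(
    prior_mean: Sequence[float],
    frames: Sequence[Sequence[float]],
    posteriors: Sequence[float],
    tau: float = 10.0,
) -> List[float]:
    """Return the MAP-adapted mean of one Gaussian component.

    prior_mean : mean taken from the source model (hypothetically MSA)
    frames     : adaptation feature vectors (hypothetically ECA)
    posteriors : occupation probability of this Gaussian for each frame
    tau        : prior weight (hypothetical default); larger values keep
                 the result closer to prior_mean
    """
    if len(frames) != len(posteriors):
        raise ValueError("frames and posteriors must have the same length")

    dim = len(prior_mean)
    occupancy = sum(posteriors)  # soft count of frames assigned to this Gaussian

    # Posterior-weighted sum of the adaptation frames, per dimension.
    weighted_sum = [0.0] * dim
    for frame, gamma in zip(frames, posteriors):
        for d in range(dim):
            weighted_sum[d] += gamma * frame[d]

    # mu_hat = (tau * mu_prior + sum_t gamma_t * x_t) / (tau + sum_t gamma_t)
    return [
        (tau * prior_mean[d] + weighted_sum[d]) / (tau + occupancy)
        for d in range(dim)
    ]


if __name__ == "__main__":
    msa_mean = [0.0, 0.0]             # toy prior mean
    eca_frames = [[2.0, 4.0]] * 10    # ten identical toy adaptation frames
    print(map_adapt_mean(msa_mean, eca_frames, [1.0] * 10, tau=10.0))
```

With ten adaptation frames and `tau = 10.0`, the prior and the data carry equal weight, so the adapted mean lies halfway between them. With no adaptation frames the function returns the prior mean unchanged, which is why MAP behaves well when only a small amount of dialectal data is available.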

Bibliographic Information

Book Title: Novel Techniques for Dialectal Arabic Speech Recognition
Authors: Mohamed Elmahdy, Rainer Gruhn, Wolfgang Minker
DOI: https://doi.org/10.1007/978-1-4614-1906-8
Publisher: Springer New York, NY
eBook Packages: Engineering, Engineering (R0)
Copyright Information: Springer Science+Business Media New York 2012
Hardcover ISBN: 978-1-4614-1905-1
Softcover ISBN: 978-1-4899-9945-0
eBook ISBN: 978-1-4614-1906-8
Edition Number: 1
Number of Pages: XXII, 110
Topics: Signal, Image and Speech Processing, Natural Language Processing (NLP), Communications Engineering, Networks, Arabic, Computational Linguistics
