Name: Audio Processing and Speech Recognition
ISBN: 978-981-13-6098-5

Overview

Authors:

Soumya Sen,
Anjan Dutta,
Nilanjan Dey

Soumya Sen
1. A.K.Choudhury School of Information Technology, University of Calcutta, Kolkata, India
Anjan Dutta
1. Department of Information Technology, Techno India College of Technology, Kolkata, India
Nilanjan Dey
1. Department of Information Technology, Techno India College of Technology, Kolkata, India

Provides background on the concepts and models of audio processing and speech recognition systems
Offers an in-depth overview of classical audio indexing and speech recognition systems
Reports the challenges facing ASR systems and discusses relevant research directions

Part of the book series: SpringerBriefs in Applied Sciences and Technology (BRIEFSAPPLSCIENCES)

Part of the book sub series: SpringerBriefs in Computational Intelligence (BRIEFSINTELL)

5186 Accesses
32 Citations

Table of contents (5 chapters)

Front Matter

Pages i-xiv

Audio Indexing
- Soumya Sen, Anjan Dutta, Nilanjan Dey
Pages 1-11
Speech Processing and Recognition System
- Soumya Sen, Anjan Dutta, Nilanjan Dey
Pages 13-43
Feature Extraction
- Soumya Sen, Anjan Dutta, Nilanjan Dey
Pages 45-66
Audio Classification
- Soumya Sen, Anjan Dutta, Nilanjan Dey
Pages 67-93
Conclusion
- Soumya Sen, Anjan Dutta, Nilanjan Dey
Pages 95-96

About this book

This book offers an overview of audio processing, including the latest advances in the methodologies used in audio processing and speech recognition. First, it discusses the importance of audio indexing and the classical information retrieval problem, and presents two major indexing techniques, namely Large Vocabulary Continuous Speech Recognition (LVCSR) and Phonetic Search. It then offers brief insights into the human speech production system and its modeling, which are required to produce artificial speech. It also discusses the various components of an automatic speech recognition (ASR) system.

Describing the chronological developments in ASR systems, and briefly examining the statistical models used in ASR as well as the related mathematical deductions, the book summarizes a number of state-of-the-art classification techniques and their application in audio/speech classification.

By providing insights into various aspects of audio/speech processing and speech recognition, this book appeals to a wide audience, from researchers and postgraduate students to those new to the field.

Authors and Affiliations

A.K.Choudhury School of Information Technology, University of Calcutta, Kolkata, India
Soumya Sen

Department of Information Technology, Techno India College of Technology, Kolkata, India
Anjan Dutta, Nilanjan Dey

About the authors

Soumya Sen is an Assistant Professor at the A. K. Choudhury School of Information Technology, University of Calcutta. He received his Ph.D. (Tech) degree from the Department of Computer Science and Engineering at the same university in 2016. Before joining the A. K. Choudhury School of Information Technology, he worked at IBM India Pvt. Ltd and RS Software. His industrial expertise includes ERP and data warehousing. His current research interests are data warehousing and OLAP tools, data mining, big data, service engineering, distributed databases, and machine learning. He has published 1 book and 70 research papers in peer-reviewed journals and international conferences, and has registered 3 patents in the USA, Japan, and South Korea. Dr. Sen is a PC member and reviewer for numerous international conferences.

Anjan Dutta was born in Kolkata, India, in 1986. He received his B.Tech degree in Information Technology from West Bengal University of Technology in 2008 and his M.Tech in Information Technology from Calcutta University in 2011.
He served at IXIA Technologies LTD and TATA Consultancy Services Ltd. (TCSL) for a period of over 6 years. Initially he worked as a protocol developer at IXIA Technologies LTD, where he worked on 3GPP wireless protocols. Thereafter he worked as an IT Analyst at TATA Consultancy Services Ltd. (TCSL) from July 2011 to July 2017. He is now employed as an Assistant Professor in the Department of Information Technology, Techno India College of Technology, India. He is an active researcher in the fields of big data, data mining, audio processing, and audio classification.

Nilanjan Dey was born in Kolkata, India, in 1984. He received his B.Tech. degree in Information Technology from West Bengal University of Technology in 2005, his M.Tech. in Information Technology from the same university in 2011, and his Ph.D. in digital image processing from Jadavpur University, India, in 2015.
In 2011, he was appointed as an Assistant Professor in the Department of Information Technology at JIS College of Engineering, Kalyani, India, followed by Bengal College of Engineering College, Durgapur, India, in 2014. He is now employed as an Assistant Professor in the Department of Information Technology, Techno India College of Technology, India. He is a visiting fellow of the University of Reading, UK. His research interests are signal processing, machine learning, and information security.
Dr. Dey is an Associate Editor of IEEE Access and is currently the Editor-in-Chief of the International Journal of Ambient Computing and Intelligence. He is also Series Co-editor of Advances in Ubiquitous Sensing Applications for Healthcare (AUSAH), Elsevier, and Springer Tracts in Nature-Inspired Computing (STNIC).

Bibliographic Information

Book Title: Audio Processing and Speech Recognition
Book Subtitle: Concepts, Techniques and Research Overviews
Authors: Soumya Sen, Anjan Dutta, Nilanjan Dey
Series Title: SpringerBriefs in Applied Sciences and Technology
DOI: https://doi.org/10.1007/978-981-13-6098-5
Publisher: Springer Singapore
eBook Packages: Engineering, Engineering (R0)
Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2019
Softcover ISBN: 978-981-13-6097-8
Softcover Published: 20 February 2019
eBook ISBN: 978-981-13-6098-5
eBook Published: 30 January 2019
Series ISSN: 2191-530X
Series E-ISSN: 2191-5318
Edition Number: 1
Number of Pages: XIV, 96
Number of Illustrations: 38 b/w illustrations, 3 illustrations in colour
Topics: Signal, Image and Speech Processing, User Interfaces and Human Computer Interaction, Input/Output and Data Communications
