  • Book
  • © 2015

Cognitively Inspired Audiovisual Speech Filtering

Towards an Intelligent, Fuzzy Based, Multimodal, Two-Stage Speech Enhancement System

  • State-of-the-art summary of multimodal speech filtering literature
  • Novel interdisciplinary cognitive inspiration in audio and visual aspects of speech
  • A novel approach to combining audio and visual speech filtering for real-world applications

Part of the book series: SpringerBriefs in Cognitive Computation (BRIEFSCC, volume 5)

Table of contents (8 chapters)

  1. Front Matter

    Pages i-xviii
  2. Introduction

    • Andrew Abel, Amir Hussain
    Pages 1-4
  3. Audio and Visual Speech Relationship

    • Andrew Abel, Amir Hussain
    Pages 5-12
  4. The Research Context

    • Andrew Abel, Amir Hussain
    Pages 13-34
  5. A Two Stage Multimodal Speech Enhancement System

    • Andrew Abel, Amir Hussain
    Pages 35-51
  6. Experiments, Results, and Analysis

    • Andrew Abel, Amir Hussain
    Pages 53-73
  7. Towards Fuzzy Logic Based Multimodal Speech Filtering

    • Andrew Abel, Amir Hussain
    Pages 75-90
  8. Evaluation of Fuzzy Logic Proof of Concept

    • Andrew Abel, Amir Hussain
    Pages 91-110
  9. Potential Future Research Directions

    • Andrew Abel, Amir Hussain
    Pages 111-114
  10. Back Matter

    Pages 115-121

About this book

This book summarises the cognitively inspired basis of multimodal speech enhancement, covering the relationship between the audio and visual modalities in speech as well as recent research into audiovisual speech correlation. It discusses a number of audiovisual speech filtering approaches that make use of this relationship. It then presents a novel multimodal speech enhancement system that uses both visual and audio information to filter speech, and explores extending this system with fuzzy logic to demonstrate an initial implementation of an autonomous, adaptive, and context-aware multimodal system. The book also discusses the challenges of testing such a system and the limitations of many current audiovisual speech corpora, and outlines a suitable approach to developing a corpus designed to test this novel, cognitively inspired speech filtering system.

Authors and Affiliations

  • Computing Science and Mathematics, University of Stirling, Stirling, EU

    Andrew Abel, Amir Hussain
