  • Book
  • © 2016

Real-time Speech and Music Classification by Large Audio Feature Space Extraction

Authors: Florian Eyben

  • Nominated as an outstanding thesis by Technische Universität München, Germany
  • Describes the details and architecture of openSMILE, the number 1 open-source toolkit in speech emotion analytics and computational paralinguistics
  • Reports on extensive automatic classification results for over ten public speech and music databases
  • Includes supplementary material: sn.pub/extras

Part of the book series: Springer Theses

Buying options

eBook USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Table of contents (7 chapters)

  1. Front Matter (pages i-xxxviii)
  2. Introduction (Florian Eyben, pages 1-7)
  3. Acoustic Features and Modelling (Florian Eyben, pages 9-122)
  4. Standard Baseline Feature Sets (Florian Eyben, pages 123-137)
  5. Real-time Incremental Processing (Florian Eyben, pages 139-161)
  6. Real-Life Robustness (Florian Eyben, pages 163-183)
  7. Evaluation (Florian Eyben, pages 185-236)
  8. Discussion and Outlook (Florian Eyben, pages 237-245)
  9. Back Matter (pages 247-298)

About this book

This book reports on an outstanding thesis that has significantly advanced the state of the art in the automated analysis and classification of speech and music. It defines several standard acoustic parameter sets and describes their implementation in a novel, open-source audio analysis framework called openSMILE, which has been accepted and intensively used worldwide. The book offers extensive descriptions of key methods for the automatic classification of speech and music signals in real-life conditions, and reports on the evaluation of both the framework and the selected acoustic parameter sets. It is intended not only as a manual for openSMILE users, but also, and primarily, as a guide and source of inspiration for students and scientists who design speech and music analysis methods that can robustly handle real-life conditions.

Authors and Affiliations

  • Institute for Human-Machine Communication (MMK), Technische Universität München, Munich, Germany

    Florian Eyben
