Springer Theses

Real-time Speech and Music Classification by Large Audio Feature Space Extraction

Authors: Eyben, Florian

  • Nominated as an outstanding thesis by Technische Universität München, Germany
  • Describes the details and architecture of openSMILE - the number 1 open-source toolkit in speech emotion analytics and computational paralinguistics 
  • Reports on extensive automatic classification results for over ten public speech and music databases
see more benefits

Buy this book

eBook 118,99 €
price for Spain (gross)
  • ISBN 978-3-319-27299-3
  • Digitally watermarked, DRM-free
  • Included format: PDF, EPUB
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Hardcover 145,59 €
price for Spain (gross)
  • ISBN 978-3-319-27298-6
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
  • The final prices may differ from the prices shown due to specifics of VAT rules
About this book

This book reports on an outstanding thesis that has significantly advanced the state-of-the-art in the automated analysis and classification of speech and music.  It defines several standard acoustic parameter sets and describes their implementation in a novel, open-source, audio analysis framework called openSMILE, which has been accepted and intensively used worldwide. The book offers extensive descriptions of key methods for the automatic classification of speech and music signals in real-life conditions and reports on the evaluation of the framework developed and the acoustic parameter sets that were selected. It is not only intended as a manual for openSMILE users, but also and primarily as a guide and source of inspiration for students and scientists involved in the design of speech and music analysis methods that can robustly handle real-life conditions.

Table of contents (7 chapters)

Buy this book

eBook 118,99 €
price for Spain (gross)
  • ISBN 978-3-319-27299-3
  • Digitally watermarked, DRM-free
  • Included format: PDF, EPUB
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Hardcover 145,59 €
price for Spain (gross)
  • ISBN 978-3-319-27298-6
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
  • The final prices may differ from the prices shown due to specifics of VAT rules
Loading...

Recommended for you

Loading...

Bibliographic Information

Bibliographic Information
Book Title
Real-time Speech and Music Classification by Large Audio Feature Space Extraction
Authors
Series Title
Springer Theses
Copyright
2016
Publisher
Springer International Publishing
Copyright Holder
Springer International Publishing Switzerland
eBook ISBN
978-3-319-27299-3
DOI
10.1007/978-3-319-27299-3
Hardcover ISBN
978-3-319-27298-6
Series ISSN
2190-5053
Edition Number
1
Number of Pages
XXXVIII, 298
Number of Illustrations and Tables
2 b/w illustrations, 39 illustrations in colour
Topics