Skip to main content
Book cover

Privacy-Preserving Machine Learning for Speech Processing

  • Book
  • © 2013

Overview

  • Nominated as outstanding PhD thesis from Carnegie Mellon University
  • Develops an efficient computational framework, making it possible to create speech processing applications such as voice biometrics, mining and speech recognition that are privacy-preserving
  • Presents a technology solution, which would allow a user to utilize an IVR system without fear that the system could learn undesired information, such as gender or nationality, or be able to record and edit voice
  • Includes supplementary material: sn.pub/extras

Part of the book series: Springer Theses (Springer Theses)

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (13 chapters)

  1. Introduction

  2. Privacy-Preserving Speaker Verification

  3. Privacy-Preserving Speaker Identification

  4. Privacy-Preserving Speech Recognition

  5. Conclusion

Keywords

About this book

This thesis discusses the privacy issues in speech-based applications such as biometric authentication, surveillance, and external speech processing services. Author Manas A. Pathak presents solutions for privacy-preserving speech processing applications such as speaker verification, speaker identification and speech recognition. The author also introduces some of the tools from cryptography and machine learning and current techniques for improving the efficiency and scalability of the presented solutions. Experiments with prototype implementations of the solutions for execution time and accuracy on standardized speech datasets are also included in the text. Using the framework proposed  may now make it possible for a surveillance agency to listen for a known terrorist without being able to hear conversation from non-targeted, innocent civilians.

Authors and Affiliations

  • Carnegie Mellon University, Pittsburgh, USA

    Manas A. Pathak

About the author

Dr. Manas A. Pathak received the BTech degree in computer science from Visvesvaraya National Institute of Technology, Nagpur, India, in 2006, and the MS and PhD degrees from the Language Technologies Institute at Carnegie Mellon University (CMU) in 2009 and 2012 respectively. He is currently working as a research scientist at Adchemy, Inc. His research interests include intersection of data privacy, machine learning, speech processing.

Bibliographic Information

Publish with us