Computer Communications and Networks

Guide to High Performance Distributed Computing

Case Studies with Hadoop, Scalding and Spark

Authors: Srinivasa, K.G., Muppalla, Anil Kumar

  • Provides a guide to the distributed computing technologies of Hadoop and Spark, from the perspective of industry practitioners
  • Supports the theory with case studies taken from a range of disciplines, including data mining, machine learning, graph processing and image processing
  • Supplies working source code to aid understanding through step-by-step implementation

Buy this book

eBook 51,16 €
price for India (gross)
  • ISBN 978-3-319-13497-0
  • Digitally watermarked, DRM-free
  • Included format: PDF
  • eBooks can be used on all reading devices
  • Immediate eBook download after purchase
Hardcover 59,99 €
price for India (gross)
  • ISBN 978-3-319-13496-3
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Softcover 59,99 €
price for India (gross)
  • ISBN 978-3-319-38347-7
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
About this Textbook

This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies. Comprehensive in scope, the book presents state-of-the-art material on building high performance distributed computing systems, providing practical guidance and best practices as well as describing theoretical software frameworks. Features: describes the fundamentals of building scalable software systems for large-scale data processing in the new paradigm of high performance distributed computing; presents an overview of the Hadoop ecosystem, followed by step-by-step instruction on its installation, programming and execution; reviews the basics of Spark, including resilient distributed datasets, and examines Hadoop streaming and working with Scalding; provides detailed case studies on approaches to clustering, data classification and regression analysis; explains the process of creating a working recommender system using Scalding and Spark.

Table of contents (8 chapters)

  • Introduction

    Srinivasa, K.G. (et al.)

    Pages 3-31

  • Getting Started with Hadoop

    Srinivasa, K.G. (et al.)

    Pages 33-72

  • Getting Started with Spark

    Srinivasa, K.G. (et al.)

    Pages 73-99

  • Programming Internals of Scalding and Spark

    Srinivasa, K.G. (et al.)

    Pages 101-154

  • Case Study I: Data Clustering using Scalding and Spark

    Srinivasa, K.G. (et al.)

    Pages 157-183



Bibliographic Information

Book Title
Guide to High Performance Distributed Computing
Book Subtitle
Case Studies with Hadoop, Scalding and Spark
Authors
Srinivasa, K.G., Muppalla, Anil Kumar
Series Title
Computer Communications and Networks
Copyright
2015
Publisher
Springer International Publishing
Copyright Holder
Springer International Publishing Switzerland
eBook ISBN
978-3-319-13497-0
DOI
10.1007/978-3-319-13497-0
Hardcover ISBN
978-3-319-13496-3
Softcover ISBN
978-3-319-38347-7
Series ISSN
1617-7975
Edition Number
1
Number of Pages
XVII, 304
Number of Illustrations and Tables
43 b/w illustrations
Topics