Logo - springer
Slogan - springer

Computer Science | Pro Hadoop

Pro Hadoop

Venner, Jason

2009, 440 p.

A product of Apress
Available Formats:

Springer eBooks may be purchased by end-customers only and are sold without copy protection (DRM free). Instead, all eBooks include personalized watermarks. This means you can read the Springer eBooks across numerous devices such as Laptops, eReaders, and tablets.

You can pay for Springer eBooks with Visa, Mastercard, American Express or Paypal.

After the purchase you can directly download the eBook file or read it online in our Springer eBook Reader. Furthermore your eBook will be stored in your MySpringer account. So you can always re-download your eBooks.


(net) price for USA

ISBN 978-1-4302-1943-9

digitally watermarked, no DRM

Included Format: PDF

download immediately after purchase

learn more about Springer eBooks

add to marked items


Softcover (also known as softback) version.

You can pay for Springer Books with Visa, Mastercard, American Express or Paypal.

Standard shipping is free of charge for individual customers.


(net) price for USA

ISBN 978-1-4302-1942-2

free shipping for individuals worldwide

usually dispatched within 3 to 5 business days

add to marked items

  • The first full book to market of any type on Hadoop.
  • Cloud computing is a very hot new area, Hadoop is almost certain to be a part of its rise, and for any hip cloud computing programmer, learning Pro Hadoop is the best bet at getting in on it.
  • Merrill Lynch’s analysts predicted in August 2008 that the annual global market for cloud computing will be $95 billion by 2013.
  • Major cloud campaigns from Google, Microsoft, Yahoo, Amazon will soon become very evident, stimulating huge interest.
  • Author, Jason Venner, is uniquely qualified to write a professional-level Hadoop book. As principal engineer of a startup specializing in Hadoop since Hadoop’s first public release, Jason Venner’s had opportunities to commit, and fix, the most common beginner mistakes and to learn the most important early lessons in using Hadoop

You've heard the hype about Hadoop: it runs petabyte–scale data mining tasks insanely fast, it runs gigantic tasks on clouds for absurdly cheap, it's been heavily committed to by tech giants like IBM, Yahoo!, and the Apache Project, and it's completely open-source (thus free). But what exactly is it, and more importantly, how do you even get a Hadoop cluster up and running?

From Apress, the name you've come to trust for hands–on technical knowledge, Pro Hadoop brings you up to speed on Hadoop. You learn the ins and outs of MapReduce; how to structure a cluster, design, and implement the Hadoop file system; and how to build your first cloud–computing tasks using Hadoop. Learn how to let Hadoop take care of distributing and parallelizing your software—you just focus on the code, Hadoop takes care of the rest.

Best of all, you'll learn from a tech professional who's been in the Hadoop scene since day one. Written from the perspective of a principal engineer with down–in–the–trenches knowledge of what to do wrong with Hadoop, you learn how to avoid the common, expensive first errors that everyone makes with creating their own Hadoop system or inheriting someone else's.

Skip the novice stage and the expensive, hard–to–fix mistakes...go straight to seasoned pro on the hottest cloud–computing framework with Pro Hadoop. Your productivity will blow your managers away.

Content Level » Popular/general

Keywords » Debugging - Open Source - design - productivity - software

Related subjects » Computer Science

Table of contents 

  1. Getting Started with Hadoop Core
  2. The Basics of a MapReduce Job
  3. The Basics of Multimachine Clusters
  4. HDFS Details for Multimachine Clusters
  5. MapReduce Details for Multimachine Clusters
  6. Tuning Your MapReduce Jobs
  7. Unit Testing and Debugging
  8. Advanced and Alternate MapReduce Techniques
  9. Solving Problems with Hadoop
  10. Projects Based On Hadoop and Future Directions

Popular Content within this publication 



Read this Book on Springerlink

Services for this book

New Book Alert

Get alerted on new Springer publications in the subject area of Computer Science (general).