  • Book
  • © 2018

Computing with Data

An Introduction to the Data Industry

  • Links to an online, user-friendly website to edit and run the book’s plethora of code examples
  • Presents principles that can be employed in applications ranging from software simulations to real-world web applications that serve millions of users
  • Explains prevalent programming languages and data processing systems that are commonly used to address engineering challenges
  • Introduces programmers to data science concepts and practices through tools such as R programming and data processing techniques
  • Explores new tools and libraries to use in big data projects
  • Helps readers hone their existing skills and introduces the new skills needed to be a more productive data scientist, analyst, or engineer

Table of contents (15 chapters)

  1. Front Matter
  2. Introduction: How to Use This Book?

    • Guy Lebanon, Mohamed El-Geish
  3. Essential Knowledge: Hardware

    • Guy Lebanon, Mohamed El-Geish
  4. Essential Knowledge: Operating Systems

    • Guy Lebanon, Mohamed El-Geish
  5. Learning C++

    • Guy Lebanon, Mohamed El-Geish
  6. Learning Java

    • Guy Lebanon, Mohamed El-Geish
  7. Learning Python and a Few More Things

    • Guy Lebanon, Mohamed El-Geish
  8. Learning R

    • Guy Lebanon, Mohamed El-Geish
  9. Visualizing Data in R and Python

    • Guy Lebanon, Mohamed El-Geish
  10. Processing Data in R and Python

    • Guy Lebanon, Mohamed El-Geish
  11. Essential Knowledge: Parallel Programming

    • Guy Lebanon, Mohamed El-Geish
  12. Essential Knowledge: Testing

    • Guy Lebanon, Mohamed El-Geish
  13. A Few More Things About Programming

    • Guy Lebanon, Mohamed El-Geish
  14. Essential Knowledge: Data Stores

    • Guy Lebanon, Mohamed El-Geish
  15. Thoughts on System Design for Big Data

    • Guy Lebanon, Mohamed El-Geish
  16. Thoughts on Software Craftsmanship

    • Guy Lebanon, Mohamed El-Geish

About this book

This book introduces basic computing skills designed for industry professionals without a strong computer science background. Written in an easily accessible manner, and accompanied by a user-friendly website, it serves as a self-study guide to survey data science and data engineering for those who aspire to start a computing career, or expand on their current roles, in areas such as applied statistics, big data, machine learning, data mining, and informatics.

The authors draw on their combined experience working at software and social network companies, developing big data products at several major online retailers, and building big data systems for an AI startup. Spanning topics from the basic inner workings of a computer to advanced data manipulation techniques, this book opens doors for readers to quickly explore and enhance their computing knowledge.

Computing with Data comprises a wide range of computational topics essential for data scientists, analysts, and engineers, providing them with the necessary tools to be successful in any role that involves computing with data. The introduction is self-contained, and chapters progress from basic hardware concepts to operating systems, programming languages, graphing and processing data, testing and programming tools, big data frameworks, and cloud computing.

The book is written with several audiences in mind. Readers without a strong educational background in CS, or those who need a refresher, will find the chapters on hardware, operating systems, and programming languages particularly useful. Readers with a strong educational background in CS, but without significant industry background, will find the following chapters especially beneficial: learning R; testing; programming; visualizing and processing data in Python and R; system design for big data; data stores; and software craftsmanship.

Reviews

“The intended audience is too broad. ... readers looking for direction in the data industry will find the process useful.” (Serge Berger, Computing Reviews, October 11, 2019)

Authors and Affiliations

  • Amazon, Menlo Park, USA

    Guy Lebanon

  • Voicera, Santa Clara, USA

    Mohamed El-Geish
