Skip to main content
Book cover

Statistics for Data Scientists

An Introduction to Probability, Statistics, and Data Analysis

  • Textbook
  • © 2022

Overview

  • Provides an accessible introduction to applied statistics by combining hands-on exercises with mathematical theory
  • Introduces statistical inference in a natural way, using finite samples and real data
  • Contains modern statistical methods including Bayesian decision theory, equivalence testing and statistical modelling

Part of the book series: Undergraduate Topics in Computer Science (UTICS)

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (9 chapters)

Keywords

About this book

This book provides an undergraduate introduction to analysing data for data science, computer science, and quantitative social science students. It uniquely combines a hands-on approach to data analysis – supported by numerous real data examples and reusable [R] code – with a rigorous treatment of probability and statistical principles. 

Where contemporary undergraduate textbooks in probability theory or statistics often miss applications and an introductory treatment of modern methods (bootstrapping, Bayes, etc.), and where applied data analysis books often miss a rigorous theoretical treatment, this book provides an accessible but thorough introduction into data analysis, using statistical methods combining the two viewpoints. The book further focuses on methods for dealing with large data-sets and streaming-data and hence provides a single-course introduction of statistical methods for data science.

Reviews

“Having taught data analytics at the introductory graduate level, I welcome the authors’ textbook as an essential resource for training well-grounded entry-level data scientists. … A data scientist shall provide competent data science professional services to a client. … Training in both the theory and practice of data analytics is a requirement for such competence. The authors’ textbook definitely provides a valuable resource for such training.” (Harry J. Foxwell, Computing Reviews, July 7, 2022)

Authors and Affiliations

  • Tilburg University, Tilburg, The Netherlands

    Maurits Kaptein

  • Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands

    Edwin van den Heuvel

About the authors

Prof. Dr. Maurits Kaptein works on statistical methods for sequential experimentation. He has extensive experience in research and education in the fields of statistics, machine learning, and research methodology. Maurits works for the Jheronimus Academy of Data Science and for the University of Tilburg. His work has been published in influential journals such as Bayesian Analysis and the Journal of Interactive Marketing.



Prof. Dr. Edwin van den Heuvel works on statistical methods for analyzing cross-sectional and longitudinal data from experimental and observational studies in the domain of health and life sciences. He has been teaching many different topics on statistics to (PhD, master, and bachelor) students from different backgrounds (medicine, engineering, mathematics, etc.) He is full-time professor in statistics at Eindhoven University of Technology and has affiliations at other universities. He publishes mostly in peer-reviewed influential statistical, epidemiological, and medical journals. 

Bibliographic Information

Publish with us