Skip to main content
  • Book
  • © 2006

Graphics of Large Datasets

Visualizing a Million

  • Brings together a number of views on statistical graphics for large data sets
  • Provides a useful account of work in the area by experts

Part of the book series: Statistics and Computing (SCO)

Buy it now

Buying options

eBook USD 59.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (11 chapters)

  1. Front Matter

    Pages i-xiii
  2. Introduction

    1. Introduction

      • Antony Unwin
      Pages 1-27
  3. Basics

    1. Front Matter

      Pages 29-29
    2. Statistical Graphics

      • Martin Theus
      Pages 31-54
    3. Scaling Up Graphics

      • Martin Theus
      Pages 55-72
    4. Interacting with Graphics

      • Antony Unwin
      Pages 73-101
  4. Applications

    1. Front Matter

      Pages 103-103
    2. Rotating Plots

      • Dianne Cook, Leslie Miller
      Pages 125-141
    3. Multivariate Continuous Data — Parallel Coordinates

      • Rida Moustafa, Ed Wegman
      Pages 143-155
    4. Networks

      • Graham Wills
      Pages 157-175
    5. Trees

      • Simon Urbanek
      Pages 177-202
    6. Transactions

      • Bárbara González-Arévalo, Félix Hernández-Campos, Steve Marron, Cheolwoo Park
      Pages 203-226
    7. Graphics of a Large Dataset

      • Antony Unwin, Martin Theus
      Pages 227-249
  5. Back Matter

    Pages 251-271

About this book

Graphics are great for exploring data, but how can they be used for looking at the large datasets that are commonplace to-day? This book shows how to look at ways of visualizing large datasets, whether large in numbers of cases or large in numbers of variables or large in both. Data visualization is useful for data cleaning, exploring data, identifying trends and clusters, spotting local patterns, evaluating modeling output, and presenting results. It is essential for exploratory data analysis and data mining. Data analysts, statisticians, computer scientists-indeed anyone who has to explore a large dataset of their own-should benefit from reading this book.

New approaches to graphics are needed to visualize the information in large datasets and most of the innovations described in this book are developments of standard graphics. There are considerable advantages in extending displays which are well-known and well-tried, both in understanding how best to make use of them in your work and in presenting results to others. It should also make the book readily accessible for readers who already have a little experience of drawing statistical graphics. All ideas are illustrated with displays from analyses of real datasets and the authors emphasize the importance of interpreting displays effectively. Graphics should be drawn to convey information and the book includes many insightful examples.

From the reviews:

"Anyone interested in modern techniques for visualizing data will be well rewarded by reading this book. There is a wealth of important plotting types and techniques." Paul Murrell for the Journal of Statistical Software, December 2006

"This fascinating book looks at the question of visualizing large datasets from many different perspectives. Different authors are responsible for different chapters and this approach works well in giving the reader alternative viewpoints of the same problem. Interestingly the authors havecleverly chosen a definition of 'large dataset'. Essentially they focus on datasets with the order of a million cases. As the authors point out there are now many examples of much larger datasets but by limiting to ones that can be loaded in their entirety in standard statistical software they end up with a book that has great utility to the practitioner rather than just the theorist. Another very attractive feature of the book is the many colour plates, showing clearly what can now routinely be seen on the computer screen. The interactive nature of data analysis with large datasets is hard to reproduce in a book but the authors make an excellent attempt to do just this." P. Marriott for the Short Book Reviews of the ISI

Reviews

From the reviews:

"Anyone interested in modern techniques for visualizing data will be well rewarded by reading this book. There is a wealth of important plotting types and techniques." Paul Murrell for the Journal of Statistical Software, December 2006

"This fascinating book looks at the question of visualizing large datasets from many different perspectives. Different authors are responsible for different chapters and this approach works well in giving the reader alternative viewpoints of the same problem. Interestingly the authors have cleverly chosen a definition of 'large dataset'. Essentially they focus on datasets with the order of a million cases. As the authors point out there are now many examples of much larger datasets but by limiting to ones that can be loaded in their entirety in standard statistical software they end up with a book that has great utility to the practitioner rather than just the theorist. Another very attractive feature of the book is the many colour plates, showing clearly what can now routinely be seen on the computer screen. The interactive nature of data analysis with large datasets is hard to reproduce in a book but the authors make an excellent attempt to do just this." P. Marriott for the Short Book Reviews of the ISI

"The intended readership of this book is anyone who is willing to explore a large dataset of their own as well as statisticians, computer scientists, and data analysts. … This is a valuable book for all the researchers who need practical guidance to explore their large datasets by the help of a variety of visualization methods. … We recommend this book for everyone interested in discovering structures hidden in the large datasets by a variety of state-of-the-art visualization techniques." (Tulay Koru-Sengul, Technometrics, Vol. 49 (3), August, 2007)

"The chief attractions of the book are its focus and clarity while dealing with a diverse range of topics andits simplicity of presentation....Thus, we can safely say that we recommend the book to anyone interested in the field of datat visualization. " (Rajesh Natarajan, Lev: Book Reviews, Interfaces 37(5), 2007)

"Data visualization is useful for data cleaning, exploring data, identifying trends and clusters, spotting local patterns, evaluating modelling output, and presenting results. It is also essential for exploratory data analysis and data mining. Given its subject matter, the book is well addressed to data analysts, statisticians, computer scientists, and anyone who has to explore a large dataset." (Christina Diakaki, Zentralblatt MATH, Vol. 1118 (20), 2007)

"I found the book informative, easy to read, and on target addressing an important and evolving need - the visualization of large datasets. The book contains many important points supported by examples, datasets, softare, and websites, which are accessible both intellectually and electronically." (Thomas Bradstreet, The American Statistician, Vol. 62 (2), 2008)

"The book has a Web site with files and code-settings for figures, links to software, and the most important datasets used. The style is very clear and makes pleasant reading. This book is full of inspiring ideas for all practicing statisticians, even for those who do not usually deal with large datasets." (Ricardo Maronna, Statistical Papers, Vol. 50, 2009)

Authors and Affiliations

  • Department of Computer Oriented Statistics and Data Analysis, University of Augsburg, Augsburg, Germany

    Antony Unwin, Martin Theus

  • Department of Statistics, Iowa State University, Ames

    Heike Hofmann

Bibliographic Information

Buy it now

Buying options

eBook USD 59.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access