Authors:

Boris Mirkin, Department of Computer Science, University of London, London, United Kingdom

Provides an in-depth understanding of a few basic techniques in data analysis rather than covering the broad spectrum of approaches developed to date.
Explores methodological innovations in summarization and correlation techniques from a cognitive perspective.
Includes worked examples, case studies, projects and questions, ideal for class and self-study.

Part of the book series: Undergraduate Topics in Computer Science (UTICS)

Table of contents (8 chapters)

Front Matter (pages i-xx)
Introduction: What Is Core (pages 1-30)
1D Analysis: Summarization and Visualization of a Single Feature (pages 31-65)
2D Analysis: Correlation and Visualization of Two Features (pages 67-112)
Learning Multivariate Correlations in Data (pages 113-172)
Principal Component Analysis and SVD (pages 173-219)
K-Means and Related Clustering Methods (pages 221-281)
Hierarchical Clustering (pages 283-313)
Approximate and Spectral Clustering for Network and Affinity Data (pages 315-356)
Back Matter (pages 357-390)

About this book

Core Concepts in Data Analysis: Summarization, Correlation and Visualization provides in-depth descriptions of those data analysis approaches that either summarize data (principal component analysis and clustering, including hierarchical and network clustering) or correlate different aspects of data (decision trees, linear rules, neural networks, and Bayes rule).

Boris Mirkin takes an unconventional approach and introduces the concept of multivariate data summarization as a counterpart to conventional machine learning prediction schemes, utilizing techniques from statistics, data analysis, data mining, machine learning, computational intelligence, and information retrieval.

His in-depth analysis of the models underlying summarization techniques leads to innovations that are applied to challenging issues such as choosing the number of clusters, standardizing mixed-scale data, and interpreting the solutions. It also reveals relations between seemingly unrelated concepts: goodness-of-fit functions for classification trees and data standardization, spectral clustering and additive clustering, and correlation and visualization of contingency data.

The mathematical detail is encapsulated in the so-called “formulation” parts, whereas most material is delivered through “presentation” parts that explain the methods by applying them to small real-world data sets. Concise “computation” parts address algorithmic and coding issues.

Four layers of active learning and self-study exercises are provided: worked examples, case studies, projects and questions.

Reviews

From the reviews:

“Oriented toward undergraduate students in the computer science field, this work offers a unique approach to data analysis by focusing primarily on summarization, correlation, and visualization techniques instead of more broad-based approaches. Summarization is the more prevalent topic in this book, with detailed coverage of clustering and principal component analysis--two important areas of summarization often treated as heuristics. … Summing Up: Highly recommended. Upper-division undergraduates and faculty.” (D. J. Gougeon, Choice, Vol. 49 (2), October, 2011)

“This textbook follows an unconventional way to present the main aspects regarding data analysis. … the reader is led in a friendly way through different data analysis areas … . this book represents an exciting text, covering the main topics of the data analysis area. It can be successfully used as a textbook for BS and MS students in computer science, on the one hand, and for researchers in datamining and related fields, on the other hand.” (Florin Gorunescu, Zentralblatt MATH, Vol. 1219, 2011)

“Core concepts in data analysis is clean and devoid of any fuzziness. The author presents his theses with a refreshing clarity seldom seen in a text of this sophistication. The entire text is rich in solved examples, case studies, projects, and introspective questions. … To single out just one of the text’s many successes: I doubt readers will ever encounter again such a detailed and excellent treatment of correlation concepts. … statisticians will also find it refreshing and engaging.” (James Van Speybroeck, ACM Computing Reviews, June, 2011)

Bibliographic Information

Book Title: Core Concepts in Data Analysis: Summarization, Correlation and Visualization
Authors: Boris Mirkin
Series Title: Undergraduate Topics in Computer Science
DOI: https://doi.org/10.1007/978-0-85729-287-2
Publisher: Springer London
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: Springer-Verlag London Ltd., part of Springer Nature 2011
eBook ISBN: 978-0-85729-287-2
Published: 05 April 2011
Series ISSN: 1863-7310
Series E-ISSN: 2197-1781
Edition Number: 1
Number of Pages: XX, 390
Number of Illustrations: 129 b/w illustrations
Topics: Discrete Mathematics in Computer Science, Probability and Statistics in Computer Science, Math Applications in Computer Science, Artificial Intelligence, Pattern Recognition
