Undergraduate Topics in Computer Science

Core Data Analysis: Summarization, Correlation, and Visualization

Authors: Mirkin, Boris

  • Focuses on the encoder-decoder interpretation of summarization methods, such as Principal Component Analysis and K-means clustering
  • Supplies an in-depth description of K-means partitioning including a data-driven mathematical theory
  • Covers novel topics such as Google PageRank ranking and Consensus clustering as interlaced within the general framework
  • Includes a multitude of worked examples, case studies and questions (with answers) 

Buy this book

eBook $59.99
price for USA in USD (gross)
  • ISBN 978-3-030-00271-8
  • Digitally watermarked, DRM-free
  • Included format: PDF, EPUB
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover $74.99
price for USA in USD
  • ISBN 978-3-030-00270-1
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
About this Textbook

This text examines the goals of data analysis with respect to enhancing knowledge, and identifies data summarization and correlation analysis as the core issues. Data summarization, both quantitative and categorical, is treated within the encoder-decoder paradigm, bringing forward a number of mathematically supported insights into the methods and the relations between them. Two chapters describe methods for categorical summarization (partitioning, divisive clustering, and separate cluster finding), and another explains the methods for quantitative summarization, Principal Component Analysis and PageRank.

Features:

  • An in-depth presentation of K-means partitioning, including a corresponding Pythagorean decomposition of the data scatter.
  • Advice regarding such issues as clustering of categorical and mixed scale data, similarity and network data, interpretation aids, anomalous clusters, the number of clusters, etc.
  • Thorough attention to data-driven modelling, with a number of mathematically stated relations between statistical and geometrical concepts, such as those between goodness-of-fit criteria for decision trees and data standardization, similarity and consensus clustering, and modularity clustering and uniform partitioning.

New edition highlights:

  • Inclusion of ranking issues such as Google PageRank, linear stratification, and tied rankings median, along with consensus clustering, semi-average clustering, and one-cluster clustering.
  • Restructured to make the logic more straightforward and the sections self-contained.

Core Data Analysis: Summarization, Correlation, and Visualization is aimed at those who are eager to participate in developing the field, and it also appeals to novices and practitioners.

About the authors

Boris Mirkin holds a PhD in Computer Science (Mathematics) and a DSc in Systems Analysis (Technology) from Russian universities. Between 1991 and 2010, he held long-term visiting appointments in France, Germany, and the USA, and a teaching appointment as Professor of Computer Science at Birkbeck, University of London, UK (2000-2010).

He develops methods for clustering and interpretation of complex data within the “data recovery” perspective. Currently, these approaches are being extended to the automation of text analysis problems, including the development and use of hierarchical ontologies. He has published a hundred refereed papers and a dozen books, of which the latest are "Clustering: A Data Recovery Approach" (Chapman and Hall/CRC Press, 2012) and the textbook "Introductory Data Analysis" (in Russian, URAIT Publishers, Moscow, 2016).

Table of contents (5 chapters)

Bibliographic Information

Book Title
Core Data Analysis: Summarization, Correlation, and Visualization
Authors
Boris Mirkin
Series Title
Undergraduate Topics in Computer Science
Copyright
2019
Publisher
Springer International Publishing
Copyright Holder
Springer Nature Switzerland AG
eBook ISBN
978-3-030-00271-8
DOI
10.1007/978-3-030-00271-8
Softcover ISBN
978-3-030-00270-1
Series ISSN
1863-7310
Edition Number
2
Number of Pages
XV, 524
Number of Illustrations
107 b/w illustrations, 80 illustrations in colour