Skip to main content
  • Book
  • © 2013

Data Mining in Large Sets of Complex Data

  • Contains a survey on clustering algorithms for moderate-to-high dimensionality data
  • Includes examples of applications in breast cancer diagnosis, region detection in satellite images, assistance to climate change forecast, recommender systems for the Web, and social networks
  • Includes supplementary material: sn.pub/extras

Part of the book series: SpringerBriefs in Computer Science (BRIEFSCOMPUTER)

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (7 chapters)

  1. Front Matter

    Pages i-xi
  2. Introduction

    • Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
    Pages 1-6
  3. Related Work and Concepts

    • Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
    Pages 7-20
  4. Clustering Methods for Moderate-to-High Dimensionality Data

    • Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
    Pages 21-32
  5. Halite

    • Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
    Pages 33-67
  6. BoW

    • Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
    Pages 69-92
  7. QMAS

    • Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
    Pages 93-109
  8. Conclusion

    • Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior
    Pages 111-116

About this book

The amount and the complexity of the data gathered by current enterprises are increasing at an exponential rate. Consequently, the analysis of Big Data is nowadays a central challenge in Computer Science, especially for complex data. For example, given a satellite image database containing tens of Terabytes, how can we find regions aiming at identifying native rainforests, deforestation or reforestation? Can it be made automatically? Based on the work discussed in this book, the answers to both questions are a sound “yes”, and the results can be obtained in just minutes. In fact, results that used to require days or weeks of hard work from human specialists can now be obtained in minutes with high precision. Data Mining in Large Sets of Complex Data discusses new algorithms that take steps forward from traditional data mining (especially for clustering) by considering large, complex datasets. Usually, other works focus in one aspect, either data size or complexity. This work considers both: it enables mining complex data from high impact applications, such as breast cancer diagnosis, region classification in satellite images, assistance to climate change forecast, recommendation systems for the Web and social networks; the data are large in the Terabyte-scale, not in Giga as usual; and very accurate results are found in just minutes. Thus, it provides a crucial and well timed contribution for allowing the creation of real time applications that deal with Big Data of high complexity in which mining on the fly can make an immeasurable difference, such as supporting cancer diagnosis or detecting deforestation.

Reviews

From the reviews:

“This book is a must-read for all data mining professionals, as it explains new and superior techniques for clustering large datasets of high-dimensional data. It would also be interesting for professionals who work with large volumes of complex data and want real-time information for better decision making.” (Alexis Leon, Computing Reviews, July, 2013)

Authors and Affiliations

  • Computer Science Department - ICMC, University of São Paulo, São Carlos, Brazil

    Robson L. F. Cordeiro, Caetano Traina Júnior

  • School of Computer Science, Carnegie Mellon University, Pittsburgh, USA

    Christos Faloutsos

Bibliographic Information

  • Book Title: Data Mining in Large Sets of Complex Data

  • Authors: Robson L. F. Cordeiro, Christos Faloutsos, Caetano Traina Júnior

  • Series Title: SpringerBriefs in Computer Science

  • DOI: https://doi.org/10.1007/978-1-4471-4890-6

  • Publisher: Springer London

  • eBook Packages: Computer Science, Computer Science (R0)

  • Copyright Information: The Author(s) 2013

  • Softcover ISBN: 978-1-4471-4889-0Published: 11 January 2013

  • eBook ISBN: 978-1-4471-4890-6Published: 11 January 2013

  • Series ISSN: 2191-5768

  • Series E-ISSN: 2191-5776

  • Edition Number: 1

  • Number of Pages: XI, 116

  • Number of Illustrations: 12 b/w illustrations, 25 illustrations in colour

  • Topics: Data Mining and Knowledge Discovery, Database Management

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access