Logo - springer
Slogan - springer

Engineering - Computational Intelligence and Complexity | Astronomy and Big Data - A Data Clustering Approach to Identifying Uncertain Galaxy Morphology

Astronomy and Big Data

A Data Clustering Approach to Identifying Uncertain Galaxy Morphology

Series: Studies in Big Data, Vol. 6

Edwards, Kieran, Gaber, Mohamed Medhat

2014, XII, 105 p. 54 illus., 24 illus. in color.

Available Formats:

Springer eBooks may be purchased by end-customers only and are sold without copy protection (DRM free). Instead, all eBooks include personalized watermarks. This means you can read the Springer eBooks across numerous devices such as Laptops, eReaders, and tablets.

You can pay for Springer eBooks with Visa, Mastercard, American Express or Paypal.

After the purchase you can directly download the eBook file or read it online in our Springer eBook Reader. Furthermore your eBook will be stored in your MySpringer account. So you can always re-download your eBooks.


(net) price for USA

ISBN 978-3-319-06599-1

digitally watermarked, no DRM

Included Format: PDF

download immediately after purchase

learn more about Springer eBooks

add to marked items


Hardcover version

You can pay for Springer Books with Visa, Mastercard, American Express or Paypal.

Standard shipping is free of charge for individual customers.


(net) price for USA

ISBN 978-3-319-06598-4

free shipping for individuals worldwide

usually dispatched within 3 to 5 business days

add to marked items

  • Presents recent applications of Big Data research to Astronomy
  • Demonstrates the application of Big data to the Galaxy Zoo project, where a large collection of galaxy images are annotated by citizen scientists
  • Presents a Data Clustering Approach to Identifying Uncertain Galaxy Morphology

With the onset of massive cosmological data collection through media such as the Sloan Digital Sky Survey (SDSS), galaxy classification has been accomplished for the most part with the help of citizen science communities like Galaxy Zoo. Seeking the wisdom of the crowd for such Big Data processing has proved extremely beneficial. However, an analysis of one of the Galaxy Zoo morphological classification data sets has shown that a significant majority of all classified galaxies are labelled as “Uncertain”.

This book reports on how to use data mining, more specifically clustering, to identify galaxies that the public has shown some degree of uncertainty for as to whether they belong to one morphology type or another. The book shows the importance of transitions between different data mining techniques in an insightful workflow. It demonstrates that Clustering enables to identify discriminating features in the analysed data sets, adopting a novel feature selection algorithms called Incremental Feature Selection (IFS). The book shows the use of state-of-the-art classification techniques, Random Forests and Support Vector Machines to validate the acquired results. It is concluded that a vast majority of these galaxies are, in fact, of spiral morphology with a small subset potentially consisting of stars, elliptical galaxies or galaxies of other morphological variants.

Content Level » Research

Keywords » Astronomy - Big Data - Citizen Science - Data Clustering - Galaxy Morphology - Galaxy Zoo Project

Related subjects » Artificial Intelligence - Astronomy, Observations and Techniques - Computational Intelligence and Complexity - Database Management & Information Retrieval

Table of contents 

Introduction.- Astronomy, Galaxies and Stars: An Overview.- Astronomical Data Mining.- Adopted Data Mining Methods.- Research Methodology.- Development of Data Mining Models.- Experimentation Results.- Conclusion and Future Work.

Popular Content within this publication 



Read this Book on Springerlink

Services for this book

New Book Alert

Get alerted on new Springer publications in the subject area of Computational Intelligence.