  • Book
  • © 2020

Big Data 2.0 Processing Systems

A Systems Overview

Authors: Sherif Sakr

  • Provides readers the “big picture” and a comprehensive survey of the domain of big data processing systems and discusses various aspects of research and development
  • Describes an entire range of engines that transcend the Hadoop framework and are dedicated to specific verticals (e.g. structured data, graph data, streaming data)
  • A valuable reference guide for students, researchers and professionals in the domain of big data processing systems

Buying options

eBook USD 59.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 79.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info


Table of contents (7 chapters)

  1. Front Matter

    Pages i-xvi
  2. Introduction

    • Sherif Sakr
    Pages 1-16
  3. Large-Scale Graph Processing Systems

    • Sherif Sakr
    Pages 59-93
  4. Large-Scale Stream Processing Systems

    • Sherif Sakr
    Pages 95-115
  5. Conclusions and Outlook

    • Sherif Sakr
    Pages 127-133
  6. Back Matter

    Pages 135-145

About this book

This book provides readers with the “big picture” and a comprehensive survey of the domain of big data processing systems. For the past decade, the Hadoop framework has dominated the world of big data processing. Recently, however, academia and industry have started to recognize its limitations in several application domains, and it is now gradually being replaced by a collection of engines that are dedicated to specific verticals (e.g. structured data, graph data, and streaming data). The book explores this new wave of systems, which it refers to as Big Data 2.0 processing systems.

After Chapter 1 presents the general background of the big data phenomenon, Chapter 2 provides an overview of general-purpose big data processing systems that let users develop big data processing jobs for different application domains. In turn, Chapter 3 examines systems that have been introduced to support SQL on top of the Hadoop infrastructure and to provide competitive, scalable performance in processing large-scale structured data. Chapter 4 discusses systems designed to tackle the problem of large-scale graph processing. Chapter 5 focuses on systems designed to provide scalable solutions for processing big data streams, and on other systems introduced to support the development of data pipelines between various types of big data processing jobs and systems. Next, Chapter 6 covers the emerging frameworks and systems in the domain of scalable machine learning and deep learning processing. Lastly, Chapter 7 shares conclusions and an outlook on future research challenges. This new and considerably enlarged second edition not only contains the completely new Chapter 6, but also offers refreshed content reflecting the state of the art in all domains of big data processing over the last years.

Overall, the book offers a valuable reference guide for professionals, students, and researchers in the domain of big data processing systems. Further, its comprehensive content will hopefully encourage readers to pursue further research on the subject.


Reviews

“This short book is well written and informative. … As a survey book, the author succeeds in raising awareness for the topic and reinforcing the view of its depth. As a research tool, the book works as a stepping stone for the curious manager or researcher wanting a short introduction to a wide range of big data areas. An easy read on the topic … . Its many references provide a solid foundation for further study.” (Jean-Pierre Kuilboer, Computing Reviews, August 12, 2022)

“The book "Big Data 2.0 Processing Systems" is a valuable and up-to-date guide through this field and provides the reader with a comprehensible and concise overview of the main developments beyond the initial Map Reduce-focused version of Hadoop.” (Prof. Dr. Erhard Rahm, Universität Leipzig, Germany)

Authors and Affiliations

  • Institute of Computer Science, University of Tartu, Tartu, Estonia

    Sherif Sakr

About the author

Sherif Sakr is the Head of the Data Systems Group at the Institute of Computer Science, University of Tartu, Estonia. His research interests lie in data and information management in general, and particularly in big data processing systems, big data analytics, data science, and big data management in cloud computing platforms. He has published more than 150 refereed research publications in international journals and conferences. Sherif is an ACM Senior Member and an IEEE Senior Member, and in 2017 he was appointed to serve as an ACM Distinguished Speaker and as an IEEE Distinguished Speaker. In addition, he is serving as the Editor-in-Chief of the Springer Encyclopedia of Big Data Technologies, and is also serving as a Co-Chair for the European Big Data Value Association (BDVA) TF6-Data Technology Architectures Group. In 2019, he received the best Arab scholar award from the Abdul Hammed Shoman Foundation.

Bibliographic Information

  • Book Title: Big Data 2.0 Processing Systems

  • Book Subtitle: A Systems Overview

  • Authors: Sherif Sakr

  • DOI: https://doi.org/10.1007/978-3-030-44187-6

  • Publisher: Springer Cham

  • eBook Packages: Computer Science, Computer Science (R0)

  • Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2020

  • Hardcover ISBN: 978-3-030-44186-9 (Published: 10 July 2020)

  • Softcover ISBN: 978-3-030-44189-0 (Published: 10 July 2021)

  • eBook ISBN: 978-3-030-44187-6 (Published: 09 July 2020)

  • Edition Number: 2

  • Number of Pages: XVI, 145

  • Number of Illustrations: 51 b/w illustrations, 19 illustrations in colour

  • Additional Information: The first edition of this work was published in the series: SpringerBriefs in Computer Science, with the

  • Topics: Information Storage and Retrieval, IT in Business, Machine Learning, Database Management
