Skip to main content
  • Book
  • © 2020

Principles of Data Science

  • Introduces various techniques, methods, and algorithms adopted by Data Science experts
  • Provides a detailed explanation of data science perceptions, reinforced by practical examples
  • Presents a road map of future trends suitable for innovative data science research and practice

Buy it now

Buying options

eBook USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (13 chapters)

  1. Front Matter

    Pages i-xiv
  2. Simulation-Based Data Acquisition

    • Fabian Lorig, Ingo J. Timm
    Pages 1-15
  3. Coding of Bits for Entities by Means of Discrete Events (CBEDE): A Method of Compression and Transmission of Data

    • Reinaldo Padilha França, Yuzo Iano, Ana Carolina Borges Monteiro, Rangel Arthur
    Pages 17-30
  4. Big Biomedical Data Engineering

    • Ripon Patgiri, Sabuzima Nayak
    Pages 31-48
  5. Big Data Preprocessing: An Application on Online Social Networks

    • Androniki Sapountzi, Kostas E. Psannis
    Pages 49-78
  6. Feature Engineering

    • Sorin Soviany, Cristina Soviany
    Pages 79-103
  7. Data Summarization Using Sampling Algorithms: Data Stream Case Study

    • Rayane El Sibai, Jacques Bou Abdo, Yousra Chabchoub, Jacques Demerjian, Raja Chiky, Kablan Barbar
    Pages 105-124
  8. Fast Imputation: An Algorithmic Formalism

    • Devisha Arunadevi Tiwari
    Pages 125-153
  9. A Scientific Perspective on Big Data in Earth Observation

    • Corina Vaduva, Michele Iapaolo, Mihai Datcu
    Pages 155-188
  10. Visualizing High-Dimensional Data Using t-Distributed Stochastic Neighbor Embedding Algorithm

    • Jayesh Soni, Nagarajan Prabakar, Himanshu Upadhyay
    Pages 189-206
  11. Active and Machine Learning for Earth Observation Image Analysis with Traditional and Innovative Approaches

    • Corneliu Octavian Dumitru, Gottfried Schwarz, Gabriel Dax, Vlad Andrei, Dongyang Ao, Mihai Datcu
    Pages 207-231
  12. Applications in Financial Industry: Use-Case for Fraud Management

    • Sorin Soviany, Cristina Soviany
    Pages 233-248
  13. Stochastic Analysis for Short- and Long-Term Forecasting of Latin American Country Risk Indexes

    • JuliĂ¡n Pucheta, Gustavo Alasino, Carlos Salas, MartĂ­n Herrera, Cristian Rodriguez Rivero
    Pages 249-272
  14. Correction to: Principles of Data Science

    • Hamid R. Arabnia, Kevin Daimi, Robert Stahlbock, Cristina Soviany, Leonard Heilig, Kai BrĂ¼ssau
    Pages C1-C1
  15. Back Matter

    Pages 273-278

About this book

This book provides readers with a thorough understanding of various research areas within the field of data science. The book introduces readers to various techniques for data acquisition, extraction, and cleaning, data summarizing and modeling, data analysis and communication techniques, data science tools, deep learning, and various data science applications. Researchers can extract and conclude various future ideas and topics that could result in potential publications or thesis. Furthermore, this book contributes to Data Scientists’ preparation and to enhancing their knowledge of the field. The book provides a rich collection of manuscripts in highly regarded data science topics, edited by professors with long experience in the field of data science.
  • Introduces various techniques, methods, and algorithms adopted by Data Science experts
  • Provides a detailed explanation of data science perceptions, reinforced by practical examples
  • Presents a road map of future trends suitable for innovative data science research and practice




Editors and Affiliations

  • University of Georgia, Athens, USA

    Hamid R. Arabnia

  • University of Detroit Mercy, Detroit, USA

    Kevin Daimi

  • University of Hamburg, Hamburg, Germany

    Robert Stahlbock, Leonard Heilig, Kai BrĂ¼ssau

  • Features Analytics, Nivelles, Belgium

    Cristina Soviany

About the editors

Hamid R. Arabnia received a Ph.D. degree in Computer Science from the University of Kent (England) in 1987. He is currently a Professor (Emeritus) of Computer Science at University of Georgia (Georgia, USA), where he has been since October 1987. His research interests include parallel and distributed processing techniques and algorithms, supercomputing, Data Science (in the context of scalable HPC), imaging science, and other compute intensive problems. His most recent activities include: Studying ways to promote legislation that would prevent cyber-stalking, cyber-harassment, and cyber-bullying. As a victim of cyber-harassment and cyber-bullying, in 2017 and 2018 he won a lawsuit with damages awarded for a total of $3 Million (includes $650K awarded for attorney’s costs). Since this court case was one of the few cases of its kind in the United States, this ruling is considered to be important. Prof. Arabnia is Editor-in-Chief of The Journal of Supercomputing (Springer). He is the book series editor-in-chief of "Transactions of Computational Science and Computational Intelligence" (Springer). He is the editor of Computational Science and Computational Intelligence (IEEE CPS). He is a Senior Adviser to a number of corporations and is a Fellow and Adviser of Center of Excellence in Terrorism, Resilience, Intelligence & Organized Crime Research (CENTRIC).

Dr. Kevin Daimi received his Ph.D. from the University of Cranfield, England. He has a long mixture of academia and industry experience. He has worked as Senior Programmer/Systems Analyst, Computer Specialist, and Computer Consultant. He is currently Professor of Computer Science and Software Engineering Programs at the University of Detroit Mercy. His research interests include Data Science, Computer and Network Security with emphasis on vehicle network security, Software Engineering, and Computer Science and Software Engineering Education. Two of his publications received the BestPaper Award from two international conferences. He has been a member of the International Conference on Data Mining (DMIN) since 2004, and a member of the Program Committee for the 2018 International Conference on Data Science (ICDATA’18). He participated in a number of Data Science workshops. Kevin is a Senior Member of the Association for Computing Machinery (ACM), a Senior Member of the Institute of Electrical and Electronic Engineers (IEEE), and a Fellow of the British Computer Society (BCS). He served as a Program Committee member for many international conferences and chaired some of them. In 2103, he received the Faculty Excellence Award from the University of Detroit Mercy.

Robert Stahlbock is a lecturer and researcher at the Institute of Information Systems, University of Hamburg. He is also lecturer at the FOM University of Applied Sciences since 2003. He holds a diploma in Business Administration and a PhD from the UHH. His research interests are focused on managerial decision support and issues related to Maritime Logistics and other industries as well as Operations Research, Information Systems, Business Intelligence and Data Science. He is author of research studies published in international prestigious journals, conference proceedings and book chapters. He serves as guest editor of data science related books, as reviewer for international leading journals as well as a member of conference program committees. He is General Chair of the annual International Conference on Data Science since 2006. He also consults companies in various sectors and projects.

Kai BrĂ¼ssau is a lecturer and researcher at the Institute of Information Systems, University of Hamburg. He holds a diploma in Business Mathematics and a PhD from the UHH. In his research as well as in his courses he cooperates with Bachelor and Master students in several projects belonging to the fields of Operations Research, Data Science, and Business Analytics. Therefore, the application of optimization and Data Mining methods for solving practical problems is his main interest. In many industry projects he works together with several companies, e.g. a telecommunication provider, port logistics enterprises, and manufacturers. He also focuses on developing new approaches and implementing them in different application systems.

Leonard Heilig is a lecturer and researcher at the Institute of Information Systems, University of Hamburg. He holds a M.Sc. in Information Systems and a PhD from the UHH. His current research interest is centered around Cloud Computing, Operations Research, and Data Science with applications in Logistics and Telecommunications. He spent some time at the University of St Andrews (Scotland, UK) and at the Cloud Computing and Distributed Systems (CLOUDS) Lab at the University of Melbourne, Australia. He served as guest editor for several international journals and consults companies in various sectors and projects.

Dr. Cristina Soviany holds a MSc degree in Computer Science from Polytechnics University of Bucharest, Romania, and a PhD in Applied Sciences from Delft University of Technology, the Netherlands. She is a technologist with strong academic, R&D and more than 14 years of entrepreneurial experience. She has published in many scientific magazines and presented in several international conferences like Money 2020, MRC, MPE, RegTech Summit NY, B-Hive conf., Vendorcom events. Dr. Soviany is currently the co-founder and CEO of Features Analytics, a young AI technology company based in Belgium. She has been awarded the prize for leading the most innovative technology company in Europe in December 2011 and benefits from continuous financial support of Belgian Ministry of Economy and Scientific Research. Prior to starting Features Analytics, Cristina has worked as a senior scientist for Philips Applied Technologies, Netherlands. She then joined the Advanced Medical Diagnostics(AMD), a start-up company based in Belgium, for six years. At AMD, Cristina was in charge of leading the development of an innovative technology for cancer tissue characterization in 3D ultrasound data.

Bibliographic Information

Buy it now

Buying options

eBook USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access