Skip to main content

Integrity Constraints on Rich Data Types

  • Book
  • © 2023

Overview

  • Explores application of data dependencies and their extensions to big data analysis
  • Guides readers on selection of appropriate data dependencies based on expressive power and discovery cost
  • Identifies typical data dependency categories, their relationships, and various specific application scenarios

Part of the book series: Synthesis Lectures on Data Management (SLDM)

  • 322 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 49.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (7 chapters)

Keywords

About this book

This book examines the recent trend of extending data dependencies to adapt to rich data types in order to address variety and veracity issues in big data. Readers will be guided through the full range of rich data types where data dependencies have been successfully applied, including categorical data with equality relationships, heterogeneous data with similarity relationships, numerical data with order relationships, sequential data with timestamps, and graph data with complicated structures. The text will also discuss interesting constraints on ordering or similarity relationships contained in novel classes of data dependencies in addition to those in equality relationships, e.g., considered in functional dependencies (FDs). In addition to exploring the concepts of these data dependency notations, the book investigates the extension relationships between data dependencies, such as conditional functional dependencies (CFDs) that extend conventional functional dependencies (FDs). This forms in the book a family tree of extensions, mostly rooted in FDs, that help illuminate the expressive power of various data dependencies. Moreover, the book points to work on the discovery of dependencies from data, since data dependencies are often unlikely to be manually specified in a traditional way, given the huge volume and high variety in big data. It further outlines the applications of the extended data dependencies, in particular in data quality practice. Altogether, this book provides a comprehensive guide for readers to select proper data dependencies for their applications that have sufficient expressive power and reasonable discovery cost. Finally, the book concludes with several directions of future studies on emerging data.

Authors and Affiliations

  • School of Software, Tsinghua University, Beijing, China

    Shaoxu Song

  • Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, Hong Kong

    Lei Chen

About the authors

Shaoxu Song is an Associate Professor in the School of Software at Tsinghua University in Beijing, China. His research interests include data quality and data integration. He has published more than 50 papers in top conferences and journals such as SIGMOD, VLDB, ICDE, ACM TODS, VLDBJ, IEEE TKDE, etc. He served as a Vice Program Chair for the 2022 IEEE International Conference on Big Data (IEEE BigData 2022) and received the Distinguished Reviewer award from VLDB 2019 and an Outstanding Reviewer award from CIKM 2017.

Lei Chen is a Chaired Professor in the Department of Computer Science and Engineering at the Hong Kong University of Science and Technology and the Director of the HKUST Big Data Institute. He received the SIGMOD Test-of-Time Award in 2015 and served as the Program Committee Co-Chair of VLDB 2019 and ICDE 2023. He is currently the Editor-in-Chief of the VLDB Journal, and the Editor-in-Chief of IEEE Transactions on Knowledge and Data Engineering (TKDE). He is an IEEE Fellow and ACM Distinguished Scientist.

Bibliographic Information

Publish with us