Skip to main content
  • Book
  • © 2013

Handbook of Data Quality

Research and Practice

Editors:

  • Presents a comprehensive framework based on organizational, architectural, and computational techniques and solutions
  • Includes a separate section devoted to successful data quality approaches in industry
  • Combines contributions from leading researchers, recognized industry thought leaders, and experienced practitioners
  • Includes supplementary material: sn.pub/extras

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (17 chapters)

  1. Front Matter

    Pages i-xii
  2. Organizational Aspects of Data Quality

    1. Front Matter

      Pages 13-14
    2. Data Quality Projects and Programs

      • Danette McGilvray
      Pages 41-73
    3. Cost and Value Management for Data Quality

      • Mouzhi Ge, Markus Helfert
      Pages 75-92
  3. Architectural Aspects of Data Quality

    1. Front Matter

      Pages 119-119
    2. Data Warehouse Quality: Summary and Outlook

      • Lukasz Golab
      Pages 121-140
    3. Using Semantic Web Technologies for Data Quality Management

      • Christian Fürber, Martin Hepp
      Pages 141-161
    4. Data Glitches: Monsters in Your Data

      • Tamraparni Dasu
      Pages 163-178
  4. Computational Aspects of Data Quality

    1. Front Matter

      Pages 179-180
    2. Generic and Declarative Approaches to Data Quality Management

      • Leopoldo Bertossi, Loreto Bravo
      Pages 181-211
    3. Linking Records in Complex Context

      • Pei Li, Andrea Maurino
      Pages 213-233
    4. A Practical Guide to Entity Resolution with OYSTER

      • John R. Talburt, Yinle Zhou
      Pages 235-270
    5. Managing Quality of Probabilistic Databases

      • Reynold Cheng
      Pages 271-291
    6. Data Fusion: Resolving Conflicts from Multiple Sources

      • Xin Luna Dong, Laure Berti-Equille, Divesh Srivastava
      Pages 293-318
  5. Data Quality in Action

    1. Front Matter

      Pages 319-319
    2. Ensuring the Quality of Health Information: The Canadian Experience

      • Heather Richards, Nancy White
      Pages 321-346
    3. Shell’s Global Data Quality Journey

      • Ken Self
      Pages 347-368

About this book

The issue of data quality is as old as data itself. However, the proliferation of diverse, large-scale and often publically available data on the Web has increased the risk of poor data quality and misleading data interpretations. On the other hand, data is now exposed at a much more strategic level e.g. through business intelligence systems, increasing manifold the stakes involved for individuals, corporations as well as government agencies. There, the lack of knowledge about data accuracy, currency or completeness can have erroneous and even catastrophic results.

With these changes, traditional approaches to data management in general, and data quality control specifically, are challenged. There is an evident need to incorporate data quality considerations into the whole data cycle, encompassing managerial/governance as well as technical aspects.

Data quality experts from research and industry agree that a unified framework for data quality management should bring together organizational, architectural and computational approaches. Accordingly, Sadiq structured this handbook in four parts: Part I is on organizational solutions, i.e. the development of data quality objectives for the organization, and the development of strategies to establish roles, processes, policies, and standards required to manage and ensure data quality. Part II, on architectural solutions, covers the technology landscape required to deploy developed data quality management processes, standards and policies. Part III, on computational solutions, presents effective and efficient tools and techniques related to record linkage, lineage and provenance, data uncertainty, and advanced integrity constraints. Finally, Part IV is devoted to case studies of successful data quality initiatives that highlight the various aspects of data quality in action. The individual chapters present both an overview of the respective topic in terms of historical research and/or practice and state of the art, as well as specific techniques, methodologies and frameworks developed by the individual contributors.

Researchers and students of computer science, information systems, or business management as well as data professionals and practitioners will benefit most from this handbook by not only focusing on the various sections relevant to their research area or particular practical work, but by also studying chapters that they may initially consider not to be directly relevant to them, as there they will learn about new perspectives and approaches.

Reviews

From the reviews:

“The book is suitable for academics and students in computer science, information systems, and management. Practitioners should find matters related to their practice areas in one division or another. … This book goes a long way toward providing an integrative view of data quality, and is a useful addition to the body of literature in this area.” (R. M. Malyankar, Computing Reviews, May, 2014)

Editors and Affiliations

  • University of Queensland, Brisbane, Australia

    Shazia Sadiq

About the editor

Shazia Sadiq is professor of computer science at the University of Queensland where she teaches and conducts research on information systems with a particular focus on business processes management, governance, risk and compliance, and data quality. Shazia is a keen advocate of cross-disciplinary and industry relevant research, and she has published her results in more than 100 scientific papers so far.

Bibliographic Information

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access