Data-Centric Systems and Applications

Data and Information Quality

Dimensions, Principles and Techniques

Authors: Batini, Carlo, Scannapieco, Monica

  • Presents an extensive description of the techniques that constitute the core of data and information quality research
  • Combines concrete practical solutions, such as methodologies, benchmarks, and case studies with sound theoretical formalisms
  • Includes also necessary foundations from probability theory, statistical data analysis, and machine learning
see more benefits

Buy this book

eBook $59.99
price for USA (gross)
  • ISBN 978-3-319-24106-7
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Hardcover $79.99
price for USA
  • ISBN 978-3-319-24104-3
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
About this book

This book provides a systematic and comparative description of the vast number of research issues related to the quality of data and information. It does so by delivering a sound, integrated and comprehensive  overview of the state of the art and future development of data and information quality in databases and information systems.

To this end, it presents an extensive description of the techniques that constitute the core of data and information quality research, including record linkage (also called object identification), data integration, error localization and correction, and examines the related techniques in a comprehensive and original methodological framework. Quality dimension definitions and adopted models are also analyzed in detail, and differences between the proposed solutions are highlighted and discussed. Furthermore, while systematically describing data and information quality as an autonomous research area, paradigms and influences deriving from other areas, such as probability theory, statistical data analysis, data mining, knowledge representation, and machine learning are also included. Last not least, the book also highlights very practical solutions, such as methodologies, benchmarks for the most effective techniques, case studies, and examples.

The book has been written primarily for researchers in the fields of databases and information management or in  natural sciences who are interested in investigating properties of data and information that have an impact on the quality of experiments, processes and on real life. The material presented is also sufficiently self-contained for masters or PhD-level courses, and it covers all the fundamentals and topics without the need for other textbooks. Data and information system administrators and practitioners, who deal with systems exposed to data-quality issues and as a result need a systematization of the field and practical methods in the area, will also benefit from the combination of concrete practical approaches with sound theoretical formalisms.

About the authors

Carlo Batini is full professor of Computer Engineering since 1986, initially at Sapienza – Università di Roma, then since 2002 at University of Milano Bicocca. His research interests include eGoverment, information systems and data base modeling and design, data and information quality, and service science. From 1995 to 2003 he was a member of the board of directors of the Authority for Information Technology in Public Administration, where he headed several large scale projects for the modernization of public administration.

Monica Scannapieco is a researcher at Istat, the Italian National Institute of Statistics since 2006. She earned a University Degree in Computer Engineering with honors and a Ph.D. in Computer Engineering at Sapienza - Università di Roma. She is the author of more than 100 papers mainly on data quality, privacy preservation and data integration, published in leading conferences and journals in databases and information systems. She has been involved in several European research projects on data quality and data integration.

Reviews

“This book addresses the dimensions, principles, and techniques to ensure that data and information conform to the necessary quality requirements. … Information and communication technology (ICT) professionals who touch in any way upon data and information quality … should find this book mandatory reading. … its serious depth and breadth would seem to merit building an advanced course on data and information quality around it, so computer science students would be yet another audience.” (David G. Hill, Computing Reviews, computingreviews.com, October, 2016)


Table of contents (15 chapters)

  • Introduction to Information Quality

    Batini, Carlo (et al.)

    Pages 1-19

  • Data Quality Dimensions

    Batini, Carlo (et al.)

    Pages 21-51

  • Information Quality Dimensions for Maps and Texts

    Batini, Carlo (et al.)

    Pages 53-86

  • Data Quality Issues in Linked Open Data

    Rula, Anisa (et al.)

    Pages 87-112

  • Quality of Images

    Ciocca, Gianluigi (et al.)

    Pages 113-135

Buy this book

eBook $59.99
price for USA (gross)
  • ISBN 978-3-319-24106-7
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Hardcover $79.99
price for USA
  • ISBN 978-3-319-24104-3
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Loading...

Recommended for you

Loading...

Bibliographic Information

Bibliographic Information
Book Title
Data and Information Quality
Book Subtitle
Dimensions, Principles and Techniques
Authors
Series Title
Data-Centric Systems and Applications
Copyright
2016
Publisher
Springer International Publishing
Copyright Holder
Springer International Publishing Switzerland
eBook ISBN
978-3-319-24106-7
DOI
10.1007/978-3-319-24106-7
Hardcover ISBN
978-3-319-24104-3
Series ISSN
2197-9723
Edition Number
1
Number of Pages
XXVIII, 500
Number of Illustrations and Tables
207 b/w illustrations, 53 illustrations in colour
Topics