Synonyms
Data reconciliation; Data standardization; Minimal-change integrity maintenance
Definition
Given a set Σ of integrity constraints and a database instance D of a schema R, the problem of constraint-driven database repair is to find an instance D′ of the same schema R such that (i) D′ is consistent, i.e., D′ satisfies Σ, and (ii) D′ minimally differs from the original database D, i.e., it takes a minimal number of repair operations or incurs minimal cost to obtain D′ by updating D.
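To make the definition concrete, the following is a minimal sketch under strong simplifying assumptions: Σ is a single functional dependency, the only repair operation is tuple deletion, and "minimal" means the fewest deleted tuples. The definition itself also admits other constraint classes, repair operations, and cost models. The function names `satisfies_fd` and `min_deletion_repair` and the sample relation are hypothetical, introduced only for illustration.

```python
from itertools import combinations

def satisfies_fd(instance, lhs, rhs):
    """Return True if tuples that agree on lhs also agree on rhs."""
    seen = {}
    for t in instance:
        key = tuple(t[a] for a in lhs)
        val = tuple(t[a] for a in rhs)
        # setdefault stores val the first time key is seen, then returns it
        if seen.setdefault(key, val) != val:
            return False
    return True

def min_deletion_repair(instance, lhs, rhs):
    """Brute force: delete 0, 1, 2, ... tuples until the FD holds.

    The empty instance satisfies any FD, so the search always returns.
    """
    n = len(instance)
    for k in range(n + 1):
        for drop in combinations(range(n), k):
            candidate = [t for i, t in enumerate(instance) if i not in drop]
            if satisfies_fd(candidate, lhs, rhs):
                return candidate

# Made-up instance D that violates the assumed FD zip -> city.
D = [
    {"zip": "07974", "city": "Murray Hill"},
    {"zip": "07974", "city": "Murray Hill"},
    {"zip": "07974", "city": "Newark"},
    {"zip": "10012", "city": "New York"},
]

D_repaired = min_deletion_repair(D, lhs=["zip"], rhs=["city"])
```

Here `D_repaired` plays the role of D′: it satisfies the dependency and is obtained from `D` with as few deletions as possible. The search is exponential in the size of the instance, so it only illustrates the two conditions of the definition and is not a practical repair algorithm.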
Historical Background
Real-life data is often dirty, i.e., inconsistent, inaccurate, stale, or deliberately falsified. While the prevalent use of the Web has made it possible, on an unprecedented scale, to extract and integrate data from diverse sources, it has also increased the risks of creating and propagating dirty data. Dirty data routinely leads to misleading or biased analytical results and decisions and incurs loss of revenue, credibility, and customers. With this comes the need for...
Recommended Reading
Arenas M, Bertossi LE, Chomicki J. Consistent query answers in inconsistent databases. In: Proceedings of the 18th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems; 1999. p. 68–79.
Bohannon P, Fan W, Flaster M, Rastogi R. A cost-based model and effective heuristic for repairing constraints by value modification. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2005. p. 143–54.
Bravo L, Fan W, Ma S. Extending dependencies with conditions. In: Proceedings of the 33rd International Conference on Very Large Data Bases; 2007. p. 243–54.
Calì A, Lembo D, Rosati R. On the decidability and complexity of query answering over inconsistent and incomplete databases. In: Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems; 2003. p. 260–71.
Cao Y, Fan W, Yu W. Determining the relative accuracy of attributes. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2013. p. 565–76.
Chomicki J. Consistent query answering: five easy pieces. In: Proceedings of the 11th International Conference on Database Theory; 2007. p. 1–17.
Chomicki J, Marcinkowski J. Minimal-change integrity maintenance using tuple deletions. Inf Comput. 2005;197(1–2):90–121.
Chomicki J, Marcinkowski J. On the computational complexity of minimal-change integrity maintenance in relational databases. Inconsistency Tolerance. 2005. Lecture Notes in Computer Science 3300:119–150.
Cong G, Fan W, Geerts F, Jia X, Ma S. Improving data quality: consistency and accuracy. In: Proceedings of the 33rd International Conference on Very Large Data Bases; 2007. p. 315–26.
Fan W, Geerts F. Foundations of data quality management. Synthesis lectures on data management. Morgan & Claypool Publishers; 2012.
Fan W, Geerts F, Jia X, Kementsietsidis A. Conditional functional dependencies for capturing data inconsistencies. ACM Trans Database Syst. 2008;33(2):1–48.
Fan W, Li J, Ma S, Tang N, Yu W. Interaction between record matching and data repairing. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2011. p. 469–80.
Fan W, Li J, Ma S, Tang N, Yu W. Towards certain fixes with editing rules and master data. VLDB J. 2012;21(2):213–38.
Fellegi I, Holt D. A systematic approach to automatic edit and imputation. J Am Stat Assoc. 1976;71(353):17–35.
Lopatenko A, Bertossi LE. Complexity of consistent query answering in databases under cardinality-based and incremental repair semantics. In: Proceedings of the 11th International Conference on Database Theory; 2007. p. 179–93.
Wijsen J. Database repairing using updates. ACM Trans Database Syst. 2005;30(3):722–68.
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Fan, W. (2018). Constraint-Driven Database Repair. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_599
DOI: https://doi.org/10.1007/978-1-4614-8265-9_599
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer Science; Reference Module Computer Science and Engineering