Abstract
From statistics, sampling technics were proposed and some of them were proved to be very useful in many database applications. Rather surprisingly, it seems these works never consider the preservation of data semantics. Since functional dependencies (FDs) are known to convey most of data semantics, an interesting issue would be to construct samples preserving FDs satisfied in existing relations.
To cope with this issue, we propose in this paper to define Informative Armstrong Relations (IARs); a relation s is an IAR for a relation r if s is a subset of r and if FDs satisfied in s are exactly the same as FDs satisfied in r. Such a relation always exists since r is obviously an IAR for itself; moreover we shall point out that small IARs with interesting bounded sizes exist. Experiments on relations available in the KDD archive were conducted and highlight the interest of IARs to sample existing relations.
This work is partially supported by the AS CNRS-STIC “Data Mining”
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Reference
William Ward Armstrong and Claude Delobel. Decomposition and functional dependencies in relations. ACM Transactions on Database Systems, 5(4):404–430, 1980.
W. W. Armstrong. Dependency structures of database relationships. In Jack L. Rosenfeld, editor, International Federation for Information Processing, Amsterdam, pages 580–583, 1974.
S. D. Bay. The UCI KDD Archive [http://kdd.ics.uci.edu]. Technical report, Irvine, CA: University of California, Department of Information and Computer Science, 1999.
C. Beeri, M. Dowd, R. Fagin, and R. Statman. On the structure of Armstrong relations for functional dependencies. Journal of the ACM, 31(1):30–56, January 1984.
F. De Marchi, S. Lopes, and J-M. Petit. Informative armstrong relations: Application to database analysis. In Bases de Données Avancées, Agadir, Maroc, October 2001.
F. De Marchi, S. Lopes, and J.-M. Petit. Efficient algorithms for mining inclusion dependenciess. In International Conference on Extending Database Technology, Prague, Czech Republic. To appear, 2002.
J. Demetrovics and V.D. Thi. Some remarks on generating armstrong and inferring functional dependencies relation. Acta Cybernetica, 12(2):167–180, 1995.
R. Fagin. Armstrong databases. In IBM Symposium on Mathematical Foundations of Computer Science, Kanagawa, Japan, 1982.
R. Fagin. Horn clauses and database dependencies. Journal of the ACM, 99(4):952–985, 1982.
M. Levene and G. Loizou. A Guided Tour of Relational Databases and Beyond. Springer, 1999.
S. Lopes, J.-M. Petit, and L. Lakhal. Efficient discovery of functional dependencies and armstrong relations. In Carlo Zaniolo, Peter C. Lockemann, Marc H. Scholl, and Torsten Grust, editors, International Conference on Extending Database Technology, Konstanz, Germany, volume 1777 of Lecture Notes in Computer Science, pages 350–364. Springer, 2000.
S. Lopes, J-M. Petit, and L. Lakhal. A framework for understanding existing databases. In Michel E. Adiba, Christine Collet, and Bipin C. Desai, editors, International Database Engineering and Applications Symposium, Grenoble, France, pages 330–338, July 2001.
S. Lopes, J-M. Petit, and F. Toumani. Discovering interesting inclusion dependencies: Application to logical database tuning. Information System, 17(1):1–19, 2002.
H. Mannila and K.-J. Räihä. Design by example: An application of arm-strong relations. Journal of Computer and System Sciences, 63(2):126–141, October 1986.
H. Mannila and K. J. Räihä. Algorithms for inferring functional-dependencies from relations. Data and Knowledge Engineering, 12(1):83–99, 1994.
A. M. Silva and M. A. Melkanoff. A method for helping discover the dependencies of a relation. In Hervé Gallaire, Jean-Marie Nicolas, and Jack Minker, editors, Advances in Data Base Theory, pages 115–133, Toulouse, France, 1979.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
De Marchi, F., Lopes, S., Petit, JM. (2002). Samples for Understanding Data-Semantics in Relations. In: Hacid, MS., Raś, Z.W., Zighed, D.A., Kodratoff, Y. (eds) Foundations of Intelligent Systems. ISMIS 2002. Lecture Notes in Computer Science(), vol 2366. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48050-1_60
Download citation
DOI: https://doi.org/10.1007/3-540-48050-1_60
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43785-7
Online ISBN: 978-3-540-48050-1
eBook Packages: Springer Book Archive