Skip to main content

Samples for Understanding Data-Semantics in Relations

  • Conference paper
  • First Online:
Foundations of Intelligent Systems (ISMIS 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2366))

Included in the following conference series:

Abstract

From statistics, sampling technics were proposed and some of them were proved to be very useful in many database applications. Rather surprisingly, it seems these works never consider the preservation of data semantics. Since functional dependencies (FDs) are known to convey most of data semantics, an interesting issue would be to construct samples preserving FDs satisfied in existing relations.

To cope with this issue, we propose in this paper to define Informative Armstrong Relations (IARs); a relation s is an IAR for a relation r if s is a subset of r and if FDs satisfied in s are exactly the same as FDs satisfied in r. Such a relation always exists since r is obviously an IAR for itself; moreover we shall point out that small IARs with interesting bounded sizes exist. Experiments on relations available in the KDD archive were conducted and highlight the interest of IARs to sample existing relations.

This work is partially supported by the AS CNRS-STIC “Data Mining”

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Reference

  1. William Ward Armstrong and Claude Delobel. Decomposition and functional dependencies in relations. ACM Transactions on Database Systems, 5(4):404–430, 1980.

    Article  MATH  Google Scholar 

  2. W. W. Armstrong. Dependency structures of database relationships. In Jack L. Rosenfeld, editor, International Federation for Information Processing, Amsterdam, pages 580–583, 1974.

    Google Scholar 

  3. S. D. Bay. The UCI KDD Archive [http://kdd.ics.uci.edu]. Technical report, Irvine, CA: University of California, Department of Information and Computer Science, 1999.

    Google Scholar 

  4. C. Beeri, M. Dowd, R. Fagin, and R. Statman. On the structure of Armstrong relations for functional dependencies. Journal of the ACM, 31(1):30–56, January 1984.

    Google Scholar 

  5. F. De Marchi, S. Lopes, and J-M. Petit. Informative armstrong relations: Application to database analysis. In Bases de Données Avancées, Agadir, Maroc, October 2001.

    Google Scholar 

  6. F. De Marchi, S. Lopes, and J.-M. Petit. Efficient algorithms for mining inclusion dependenciess. In International Conference on Extending Database Technology, Prague, Czech Republic. To appear, 2002.

    Google Scholar 

  7. J. Demetrovics and V.D. Thi. Some remarks on generating armstrong and inferring functional dependencies relation. Acta Cybernetica, 12(2):167–180, 1995.

    MATH  MathSciNet  Google Scholar 

  8. R. Fagin. Armstrong databases. In IBM Symposium on Mathematical Foundations of Computer Science, Kanagawa, Japan, 1982.

    Google Scholar 

  9. R. Fagin. Horn clauses and database dependencies. Journal of the ACM, 99(4):952–985, 1982.

    Article  MathSciNet  Google Scholar 

  10. M. Levene and G. Loizou. A Guided Tour of Relational Databases and Beyond. Springer, 1999.

    Google Scholar 

  11. S. Lopes, J.-M. Petit, and L. Lakhal. Efficient discovery of functional dependencies and armstrong relations. In Carlo Zaniolo, Peter C. Lockemann, Marc H. Scholl, and Torsten Grust, editors, International Conference on Extending Database Technology, Konstanz, Germany, volume 1777 of Lecture Notes in Computer Science, pages 350–364. Springer, 2000.

    Google Scholar 

  12. S. Lopes, J-M. Petit, and L. Lakhal. A framework for understanding existing databases. In Michel E. Adiba, Christine Collet, and Bipin C. Desai, editors, International Database Engineering and Applications Symposium, Grenoble, France, pages 330–338, July 2001.

    Google Scholar 

  13. S. Lopes, J-M. Petit, and F. Toumani. Discovering interesting inclusion dependencies: Application to logical database tuning. Information System, 17(1):1–19, 2002.

    Article  Google Scholar 

  14. H. Mannila and K.-J. Räihä. Design by example: An application of arm-strong relations. Journal of Computer and System Sciences, 63(2):126–141, October 1986.

    Google Scholar 

  15. H. Mannila and K. J. Räihä. Algorithms for inferring functional-dependencies from relations. Data and Knowledge Engineering, 12(1):83–99, 1994.

    Article  MATH  Google Scholar 

  16. A. M. Silva and M. A. Melkanoff. A method for helping discover the dependencies of a relation. In Hervé Gallaire, Jean-Marie Nicolas, and Jack Minker, editors, Advances in Data Base Theory, pages 115–133, Toulouse, France, 1979.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

De Marchi, F., Lopes, S., Petit, JM. (2002). Samples for Understanding Data-Semantics in Relations. In: Hacid, MS., Raś, Z.W., Zighed, D.A., Kodratoff, Y. (eds) Foundations of Intelligent Systems. ISMIS 2002. Lecture Notes in Computer Science(), vol 2366. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48050-1_60

Download citation

  • DOI: https://doi.org/10.1007/3-540-48050-1_60

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43785-7

  • Online ISBN: 978-3-540-48050-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics