Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2682))

Abstract

Researchers convincingly argue that the ability to declaratively mine and analyze relational databases using SQL for decision support is a critical requirement for the success of the acclaimed data mining technology. Although there have been several encouraging attempts at developing methods for data mining using SQL, simplicity and efficiency still remain significant impediments for further development. In this article, we propose a significantly new approach and show that any object relational database can be mined for association rules without any restructuring or preprocessing using only basic SQL3 constructs and functions, and hence no additional machineries are necessary. In particular, we show that the cost of computing association rules for a given database does not depend on support and confidence thresholds. More precisely, the set of large items can be computed using one simple join query and an aggregation once the set of all possible meets (least fixpoint) of item set patterns in the input table is known. We believe that this is an encouraging discovery especially compared to the well known SQL based methods in the literature. Finally, we capture the functionality of our proposed mining method in a mine by SQL3 operator for general use in any relational database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: VLDB, pp. 487–499 (1994)

    Google Scholar 

  2. Beeri, C., Ramakrishnan, R.: On the power of magic. In: ACM PODS, pp. 269–283 (1987)

    Google Scholar 

  3. Brin, S., Motwani, R., Silverstein, C.: Beyond market baskets: Generalizing association rules to correlations. In: Proc. ACM SIGMOD, pp. 265–276 (1997)

    Google Scholar 

  4. Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proc. ACM SIGMOD, pp. 1–12 (2000)

    Google Scholar 

  5. Jamil, H.M.: Mining first-order knowledge bases for association rules. In: Proceedings of the 13th IEEE International Conference on Tools with Artificial Intelligence (ICTAI), Dallas, Texas, pp. 218–227. IEEE Press, Los Alamitos (2001)

    Google Scholar 

  6. Jamil, H.M.: A new indexing scheme for set-valued keys. Technical report, Department of Computer Science, MSU, USA (June 2001)

    Google Scholar 

  7. Jamil, H.M.: Ad hoc association rule mining as SQL3 queries. In: Proceedings of the IEEE International Conference on Data Mining, San Jose, California, pp. 609–612. IEEE Press, Los Alamitos (2001)

    Google Scholar 

  8. Jamil, H.M.: On the equivalence of top-down and bottom-up data mining in relational databases. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds.) DaWaK 2001. LNCS, vol. 2114, pp. 41–50. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  9. Jamil, H.M.: Bottom-up association rule mining in relational databases. Journal of Intelligent Information Systems 19(2), 191–206 (2002)

    Article  Google Scholar 

  10. Klemettinen, M., Mannila, H., Ronkainen, P., Toivonen, H., Verkamo, I.: Finding interesting rules from large sets of discovered association rules. In: CIKM, pp. 401–407 (1994)

    Google Scholar 

  11. Korn, F., Labrinidis, A., Kotidis, Y., Faloutsos, C.: Ratio rules: A new paradigm for fast, quantifiable data mining. In: Proc. of 24th VLDB, pp. 582–593 (1998)

    Google Scholar 

  12. Lent, B., Swami, A.N., Widom, J.: Clustering association rules. In: Proc. of the 3th ICDE, pp. 220–231 (1997)

    Google Scholar 

  13. Meo, R., Psaila, G., Ceri, S.: A new SQL-like operator for mining association rules. In: Proc. of 22nd VLDB, pp. 122–133 (1996)

    Google Scholar 

  14. Meo, R., Psaila, G., Ceri, S.: An extension to SQL for mining association rules. DMKD 2(2), 195–224 (1998)

    Google Scholar 

  15. Mumick, I.S., Pirahesh, H.: Implementation of magic-sets in a relational database system. In: ACM SIGMOD, pp. 103–114 (1994)

    Google Scholar 

  16. Netz, A., Chaudhuri, S., Bernhardt, J., Fayyad, U.M.: Integration of data mining with database technology. In: Proceedings of 26th VLDB, pp. 719–722 (2000)

    Google Scholar 

  17. Netz, A., Chaudhuri, S., Fayyad, U.M., Bernhardt, J.: Integrating data mining with SQL databases. In: IEEE ICDE (2001)

    Google Scholar 

  18. Ng, R.T., Lakshmanan, L.V.S., Han, J., Pang, A.: Exploratory mining and pruning optimizations of constrained association rules. In: Proc. ACM SIGMOD, pp. 13–24 (1998)

    Google Scholar 

  19. Park, J.S., Chen, M.-S., Yu, P.S.: An effective hash based algorithm for mining association rules. In: Proc. ACM SIGMOD, pp. 175–186 (1995)

    Google Scholar 

  20. Rajamani, K., Cox, A., Iyer, B., Chadha, A.: Efficient mining for association rules with relational database systems. In: IDEAS, pp. 148–155 (1999)

    Google Scholar 

  21. Sarawagi, S., Thomas, S., Agrawal, R.: Integrating mining with relational database systems: Alternatives and implications. In: Proc. ACM SIGMOD, pp. 343–354 (1998)

    Google Scholar 

  22. Savasere, A., Omiecinski, E., Navathe, S.B.: An efficient algorithm for mining association rules in large databases. In: Proc of 21th VLDB, pp. 432–444 (1995)

    Google Scholar 

  23. Shenoy, P., Haritsa, J.R., Sudarshan, S., Bhalotia, G., Bawa, M., Shah, D.: Turbo-charging vertical mining of large databases. In: ACM SIGMOD, pp. 22–33 (2000)

    Google Scholar 

  24. Silberschatz, A., Korth, H.F., Sudarshan, S.: Database System Concepts, 3rd edn. McGraw-Hill, New York (1996)

    MATH  Google Scholar 

  25. Thomas, S., Sarawagi, S.: Mining generalized association rules and sequential patterns using SQL queries. In: KDD, pp. 344–348 (1998)

    Google Scholar 

  26. Ullman, J.D.: Principles of Database and Knowledge-base Systems, Part I & II. Computer Science Press (1988)

    Google Scholar 

  27. Zaki, M.J.: Generating non-redundant association rules. In: Proc. of the 6th ACM SIGKDD Intl. Conf., Boston, MA (August 2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Jamil, H.M. (2004). Declarative Data Mining Using SQL3. In: Meo, R., Lanzi, P.L., Klemettinen, M. (eds) Database Support for Data Mining Applications. Lecture Notes in Computer Science(), vol 2682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44497-8_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-44497-8_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22479-2

  • Online ISBN: 978-3-540-44497-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics