Skip to main content

Compression-Based Measures for Mining Interesting Rules

  • Conference paper
Next-Generation Applied Intelligence (IEA/AIE 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5579))

Abstract

An interestingness measure estimates the degree of interestingness of a discovered pattern and has been actively studied in the past two decades. Several pitfalls should be avoided in the study such as a use of many parameters and a lack of systematic evaluation in the presence of noise. Compression-based measures have advantages in this respect as they are typically parameter-free and robust to noise. In this paper, we present J-measure and a measure based on an extension of the Minimum Description Length Principle (MDLP) as compression-based measures for mining interesting rules.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I., et al.: Fast discovery of association rules. In: Fayyad, U.M., et al. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 307–328. AAAI/MIT Press, Menlo Park (1996)

    Google Scholar 

  2. Barthélemy, J. P., Legrain, A., Lenca, P., Vaillant, B.: Aggregation of Valued Relations Applied to Association Rule Interestingness Measures. In: Torra, V., Narukawa, Y., Valls, A., Domingo-Ferrer, J. (eds.) MDAI 2006. LNCS, vol. 3885, pp. 203–214. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  3. Bayardo, R.J., Agrawal, R.: Mining the Most Interesting Rules. In: Proc. Fifth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining, pp. 145–154 (1999)

    Google Scholar 

  4. Brin, S., Motwani, R., Silverstein, C.: Beyond Market Baskets: Generalizing Association Rules to Correlations. In: SIGMOD 1997, Proc. ACM SIGMOD Int’l Conf. on Management of Data, pp. 265–276 (1997)

    Google Scholar 

  5. Carvalho, D.R., Freitas, A.A., Ebecken, N.F.F.: Evaluating the Correlation Between Objective Rule Interestingness Measures and Real Human Interest. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS, vol. 3721, pp. 453–461. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  6. Gras, R.: L’ Implication Statistique. La Pensée Sauvage (1996) (in French)

    Google Scholar 

  7. Jaroszewicz, S., Simovici, D.A.: Interestingness of Frequent Itemsets Using Bayesian Networks as Background Knowledge. In: Proc. Tenth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining, pp. 178–186 (2004)

    Google Scholar 

  8. Lenca, P., Meyer, P., Vaillant, B., Lallich, S.: On Selecting Interestingness Measures for Association Rules: User Oriented Description and Multiple Criteria Decision Aid. European Journal of Operational Research 184(2), 610–626 (2008)

    Article  MATH  Google Scholar 

  9. Liu, B., Hsu, W., Chen, S.: Using General Impressions to Analyze Discovered Classification Rules. In: Proc. Third Int’l Conf. on Knowledge Discovery and Data Mining (KDD), pp. 31–36 (1997)

    Google Scholar 

  10. Liu, B., Hsu, W., Mun, L.-F., Lee, H.-Y.: Finding Interesting Patterns Using User Expectations. IEEE Trans. Knowledge and Data Eng. 11(6), 817–832 (1999)

    Article  Google Scholar 

  11. Padmanabhan, B., Tuzhilin, A.: A Belief-Driven Method for Discovering Unexpected Patterns. In: Proc. Fourth Int’l Conf. Knowledge Discovery and Data Mining (KDD), pp. 94–100 (1998)

    Google Scholar 

  12. Piatetsky-Shapiro, G.: Discovery, Analysis, and Presentation of Strong Rules. In: Knowledge Discovery in Databases, pp. 229–248. AAAI/MIT Press, Menlo Park (1991)

    Google Scholar 

  13. Silberschatz, A., Tuzhilin, A.: On Subjective Measures of Interestingness in Knowledge Discovery. In: Proc. First Int’l Conf. Knowledge Discovery and Data Mining (KDD), pp. 275–281 (1995)

    Google Scholar 

  14. Silberschatz, A., Tuzhilin, A.: What Makes Patterns Interesting in Knowledge Discovery Systems. IEEE Trans. Knowledge and Data Eng. 8(6), 970–974 (1996)

    Article  Google Scholar 

  15. Smyth, P., Goodman, R.M.: An Information Theoretic Approach to Rule Induction from Databases. IEEE Trans. Knowledge and Data Engineering 4(4), 301–316 (1992)

    Article  Google Scholar 

  16. Suzuki, E.: Autonomous Discovery of Reliable Exception Rules. In: Proc. Third Int’l Conf. on Knowledge Discovery and Data Mining (KDD), pp. 259–262 (1997)

    Google Scholar 

  17. Suzuki, E.: Undirected Discovery of Interesting Exception Rules. Int’l Journal of Pattern Recognition and Artificial Intelligence 16(8), 1065–1086 (2002)

    Article  Google Scholar 

  18. Suzuki, E.: Pitfalls for Categorizations of Objective Interestingness Measures for Rule Discovery. In: Gras, R., Suzuki, E., Guillet, F., Spagnolo, F. (eds.) Statistical Implicative Analysis: Theory and Applications, pp. 383–395. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  19. Suzuki, E.: Negative Encoding Length as a Subjective Interestingness Measure for Groups of Rules. In: Proc. Thirteenth Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD (2009) (accepted for publication)

    Google Scholar 

  20. Suzuki, E., Shimura, M.: Exceptional Knowledge Discovery in Databases Based on Information Theory. In: Proc. Second Int’l Conf. Knowledge Discovery and Data Mining (KDD), pp. 275–278 (1996)

    Google Scholar 

  21. Tan, P.N., Kumar, V., Srivastava, J.: Selecting the Right Interestingness Measure for Association Patterns. In: Proc. Eighth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining (KDD), pp. 32–41 (2002)

    Google Scholar 

  22. Vaillant, B., Lallich, S., Lenca, P.: On the Behavior of the Generalizations of the Intensity of Implication: A Data-driven Comparative Study. In: Gras, R., Suzuki, E., Guillet, F., Spagnolo, F. (eds.) Statistical Implicative Analysis: Theory and Applications, pp. 421–447. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  23. Suzuki, E.: Interestingness Measures - Limits, Desiderata, and Recent Results. In: Proc. Quality Issues, Measures of Interestingness and Evaluation of Data Mining Models, QIMIE (2009) (keynote talk, accepted for publication)

    Google Scholar 

  24. Suzuki, E.: Evaluation Scheme for Exception Rule/Group Discovery. In: Intelligent Technologies for Information Analysis, pp. 89–108. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  25. Keogh, E.J., Lonardi, S., Ratanamahatana, C.A.: Towards Parameter-free Data Mining. In: Proc. Tenth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining (KDD), pp. 206–215 (2004)

    Google Scholar 

  26. Feldman, R., Dagan, I.: Knowledge Discovery in Textual Databases (KDT). In: Proc. First International Conference on Knowledge Discovery and Data Mining (KDD), pp. 112–117 (1995)

    Google Scholar 

  27. Keogh, E.J., Pazzani, M.J.: Scaling up Dynamic Time Warping to Massive Dataset. In: Żytkow, J.M., Rauch, J. (eds.) PKDD 1999. LNCS (LNAI), vol. 1704, pp. 1–11. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  28. Blachman, N.M.: The Amount of Information That y Gives About X. IEEE Transactions on Information Theory IT-14(1), 27–31 (1968)

    Article  MathSciNet  MATH  Google Scholar 

  29. Hájek, P., Havel, C.M.: The GUHA Method of Automatic Hypotheses Determination. Computing 1, 293–308 (1966)

    Article  MATH  Google Scholar 

  30. Quinlan, J.R., Rivest, R.L.: Inferring Decision Trees Using the Minimum Description Length Principle. Information and Computation 80(3), 227–248 (1989)

    Article  MathSciNet  MATH  Google Scholar 

  31. Wallace, C.S., Patrick, J.D.: Coding Decision Trees. Machine Learning 11(1), 7–22 (1993)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Suzuki, E. (2009). Compression-Based Measures for Mining Interesting Rules. In: Chien, BC., Hong, TP., Chen, SM., Ali, M. (eds) Next-Generation Applied Intelligence. IEA/AIE 2009. Lecture Notes in Computer Science(), vol 5579. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02568-6_75

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02568-6_75

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02567-9

  • Online ISBN: 978-3-642-02568-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics