Abstract
We present ELEM2, a new method for inducing classification rules from a set of examples. The method employs several new strategies in the induction and classification processes to improve the predictive performance of induced rules. In particular, a new heuristic function for evaluating attribute-value pairs is proposed. The function is defined to reflect the degree of relevance of an attribute-value pair to a target concept and leads to selection of the most relevant pairs for formulating rules. Another feature of ELEM2 is that it handles inconsistent training data by defining an unlearnable region of a concept based on the probability distribution of that concept in the training data. To further deal with imperfect data, ELEM2 makes use of the post-pruning technique to remove unreliable portions of a generated rule. A new rule quality measure is proposed for the purpose of post-pruning. The measure is defined according to the relative distribution of a rule with respect to positive and negative examples. To show whether ELEM2 achieves its objective, we report experimental results which compare ELEM2 with C4.5 and CN2 on a number of datasets.
Preview
Unable to display preview. Download preview PDF.
References
An, A. 1997. Analysis Methodologies for Integrated and Enhanced Problem Solving. Ph.D. Thesis, Dept. of Computer Science, University of Regina, Regina, Canada.
Brodley, C.E. 1993. “Addressing the Selective Superiority Problem: Automatic Algorithm/model Class Selection.” Proceedings of the 10th Machine Learning Conference. pp.17–24.
Cendrowska, J. 1988. “PRISM: An Algorithm for Inducing Modular Rules”. In Gaines, B. and Boose, J. (eds.): Knowledge Acquisition for Knowledge-Based Systems. Academic Press.
Clark, P. and Niblett, T. 1989. “The CN2 Induction Algorithm”. Machine Learning, 3, pp.261–283.
Cooper, W.S. 1973. “On Selecting a Measure of Retrieval Effectiveness.” Journal of the American Society for Information Science. Vo1.24.
Creecy, R.H., Masand, B.M., Smith, S.J. and Waltz, D.L. 1992. “Trading MIPS and Memory for Knowledge Engineering”. Communications of the ACM, 35, pp.48–64.
Domingos, P. 1995. “Rule Induction and Instance-Based Learning: A Unified Approach.” IJCAI-95. Montreal, Canada. pp.1226–1232.
Grzymala-Busse, J.W. 1992. “LERS-A System for Learning From Examples Based on Rough Sets”, in Slowinski, R.(ed.): Intelligent Decision Support: Handbook of Applications and Advances of Rough Sets Theory, Kluwer Academic Publishers, pp.318.
Hamilton, H.J., Shan, N. and Cercone, N. 1996. “RIAC: A Rule Induction Algorithm Based on Approximate Classification”. Technical Report CS-96-06, University of Regina.
Holte, R., Acker, L. and Porter, B. 1989. “Concept Learning and the Problem of Small Disjuncts”. Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, Detroit, Michigan.
Kerber, R. 1992. “ChiMerge: Discretization of Numeric Attributes”, Proceedings of the 10th National Conference on Artificial Intelligence AAAI-92, San Jose, CA.
Michalski, R.S., Mozetic, I., Hong, J. and Lavrac, N. 1986. “The Multi-Purpose Incremental Learning System AQl5 and Its Testing Application to Three Medical Domains”. Proceedings of AAAI 1986. pp.1041–1045.
Murphy, P.M. and Aha, D.W. 1994. UCI Repository of Machine Learning Databases. URL: http://www.ics.uci.edu/ mlearn/MLRepository.html. For information contact ml-repository@ics.uci.edu.
Quinlan, J.R. 1983. “Learning efficient classification procedures and their application to chess end games”. In Michalski, R.S., Carbonell, J.G. and Mitchell, T.M. (eds.): Machine Learning: An Artificial Intelligence Approach. Vol.1.
Quinlan, J.R. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers. San Mateo, CA.
Wnek, J., Sarma, J., Wahab, A. and Michalski, R. 1990. “Comparison Learning Paradigms via Diagrammatic Visualization: A Case Study in Single Concept Learning Using Symbolic, Neural Net and Genetic Algorithm Methods”. Technical Report, Computer Science Department, George Mason University.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
An, A., Cercone, N. (1998). ELEM2: A learning system for more accurate classifications. In: Mercer, R.E., Neufeld, E. (eds) Advances in Artificial Intelligence. Canadian AI 1998. Lecture Notes in Computer Science, vol 1418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64575-6_68
Download citation
DOI: https://doi.org/10.1007/3-540-64575-6_68
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64575-7
Online ISBN: 978-3-540-69349-9
eBook Packages: Springer Book Archive