ELEM2: A learning system for more accurate classifications

An, Aijun; Cercone, Nick

doi:10.1007/3-540-64575-6_68

Aijun An¹ &
Nick Cercone¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1418))

Included in the following conference series:

Conference of the Canadian Society for Computational Studies of Intelligence

203 Accesses
21 Citations

Abstract

We present ELEM2, a new method for inducing classification rules from a set of examples. The method employs several new strategies in the induction and classification processes to improve the predictive performance of induced rules. In particular, a new heuristic function for evaluating attribute-value pairs is proposed. The function is defined to reflect the degree of relevance of an attribute-value pair to a target concept and leads to selection of the most relevant pairs for formulating rules. Another feature of ELEM2 is that it handles inconsistent training data by defining an unlearnable region of a concept based on the probability distribution of that concept in the training data. To further deal with imperfect data, ELEM2 makes use of the post-pruning technique to remove unreliable portions of a generated rule. A new rule quality measure is proposed for the purpose of post-pruning. The measure is defined according to the relative distribution of a rule with respect to positive and negative examples. To show whether ELEM2 achieves its objective, we report experimental results which compare ELEM2 with C4.5 and CN2 on a number of datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

An, A. 1997. Analysis Methodologies for Integrated and Enhanced Problem Solving. Ph.D. Thesis, Dept. of Computer Science, University of Regina, Regina, Canada.
Google Scholar
Brodley, C.E. 1993. “Addressing the Selective Superiority Problem: Automatic Algorithm/model Class Selection.” Proceedings of the 10th Machine Learning Conference. pp.17–24.
Google Scholar
Cendrowska, J. 1988. “PRISM: An Algorithm for Inducing Modular Rules”. In Gaines, B. and Boose, J. (eds.): Knowledge Acquisition for Knowledge-Based Systems. Academic Press.
Google Scholar
Clark, P. and Niblett, T. 1989. “The CN2 Induction Algorithm”. Machine Learning, 3, pp.261–283.
Google Scholar
Cooper, W.S. 1973. “On Selecting a Measure of Retrieval Effectiveness.” Journal of the American Society for Information Science. Vo1.24.
Google Scholar
Creecy, R.H., Masand, B.M., Smith, S.J. and Waltz, D.L. 1992. “Trading MIPS and Memory for Knowledge Engineering”. Communications of the ACM, 35, pp.48–64.
Article Google Scholar
Domingos, P. 1995. “Rule Induction and Instance-Based Learning: A Unified Approach.” IJCAI-95. Montreal, Canada. pp.1226–1232.
Google Scholar
Grzymala-Busse, J.W. 1992. “LERS-A System for Learning From Examples Based on Rough Sets”, in Slowinski, R.(ed.): Intelligent Decision Support: Handbook of Applications and Advances of Rough Sets Theory, Kluwer Academic Publishers, pp.318.
Google Scholar
Hamilton, H.J., Shan, N. and Cercone, N. 1996. “RIAC: A Rule Induction Algorithm Based on Approximate Classification”. Technical Report CS-96-06, University of Regina.
Google Scholar
Holte, R., Acker, L. and Porter, B. 1989. “Concept Learning and the Problem of Small Disjuncts”. Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, Detroit, Michigan.
Google Scholar
Kerber, R. 1992. “ChiMerge: Discretization of Numeric Attributes”, Proceedings of the 10th National Conference on Artificial Intelligence AAAI-92, San Jose, CA.
Google Scholar
Michalski, R.S., Mozetic, I., Hong, J. and Lavrac, N. 1986. “The Multi-Purpose Incremental Learning System AQl5 and Its Testing Application to Three Medical Domains”. Proceedings of AAAI 1986. pp.1041–1045.
Google Scholar
Murphy, P.M. and Aha, D.W. 1994. UCI Repository of Machine Learning Databases. URL: http://www.ics.uci.edu/ mlearn/MLRepository.html. For information contact ml-repository@ics.uci.edu.
Google Scholar
Quinlan, J.R. 1983. “Learning efficient classification procedures and their application to chess end games”. In Michalski, R.S., Carbonell, J.G. and Mitchell, T.M. (eds.): Machine Learning: An Artificial Intelligence Approach. Vol.1.
Google Scholar
Quinlan, J.R. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers. San Mateo, CA.
Google Scholar
Wnek, J., Sarma, J., Wahab, A. and Michalski, R. 1990. “Comparison Learning Paradigms via Diagrammatic Visualization: A Case Study in Single Concept Learning Using Symbolic, Neural Net and Genetic Algorithm Methods”. Technical Report, Computer Science Department, George Mason University.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Waterloo, N2L 3G1, Waterloo, Ontario, Canada
Aijun An & Nick Cercone

Authors

Aijun An
View author publications
You can also search for this author in PubMed Google Scholar
Nick Cercone
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Robert E. Mercer Eric Neufeld

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

An, A., Cercone, N. (1998). ELEM2: A learning system for more accurate classifications. In: Mercer, R.E., Neufeld, E. (eds) Advances in Artificial Intelligence. Canadian AI 1998. Lecture Notes in Computer Science, vol 1418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64575-6_68

Download citation

DOI: https://doi.org/10.1007/3-540-64575-6_68
Published: 29 July 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64575-7
Online ISBN: 978-3-540-69349-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics