Abstract
We describe EDRL-MD, an evolutionary algorithm-based system, for learning decision rules from databases. The main novelty of our approach lies in dealing with continuous – valued attributes. Most of decision rule learners use univariate discretization methods, which search for threshold values for one attribute at the same time. In contrast to them, EDRL-MD simultaneously searches for threshold values for all continuous-valued attributes, when inducing decision rules. We call this approach multivariate discretization. Since multivariate discretization is able to capture interdependencies between attributes it may improve the accuracy of obtained rules. The evolutionary algorithm uses problem specific operators and variable-length chromosomes, which allows it to search for complete rulesets rather than single rules. The preliminary results of the experiments on some real-life datasets are presented.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Blake, C., Keogh, E., Merz, C.J.: UCI repository of machine learning databases, available on-line (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Bobrowski, L.: Piecewise-linear classifiers, formal neurons and separability of the learning sets. In: Proc. of 13th Int. Conf. on Pattern Recognition ICPR 1996, pp. 224–228. IEEE Computer Society Press, Los Alamitos (1996)
De Jong, K.A., Spears, W.M., Gordon, D.F.: Using genetic algorithm for concept learning. Machine Learning 13, 168–182 (1993)
Dougherty, J., Kohavi, R., Sahami, M.: Supervised and unsupervised discretization of continuous features. In: Machine Learning: Proc of 12th Int. Conference, pp. 194–202. Morgan Kaufmann, San Francisco (1995)
Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Proc. of IJCAI 1993, pp. 1022–1027. Morgan Kaufmann, San Francisco (1993)
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.): Advances in Knowledge Discovery and Data Mining. AAAI Press, Menlo Park (1996)
Janikow, C.Z.: A knowledge intensive genetic algorithm for supervised learning. Machine Learning 13, 192–228 (1993)
Kwedlo, W., Krętowski, M.: Discovery of decision rules from databases: an evolutionary approach. In: Żytkow, J.M. (ed.) PKDD 1998. LNCS, vol. 1510, pp. 370–378. Springer, Heidelberg (1998)
Michalewicz, Z.: Genetic Algorithms + Data Structures = Evolution Programs, 3rd edn. Springer, Heidelberg (1996)
Quinlan, J.R.: Improved use of continuous attributes in C4.5. Journal of Artificial Intelligence Research 4, 77–90 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kwedlo, W., Krętowski, M. (1999). An Evolutionary Algorithm Using Multivariate Discretization for Decision Rule Induction. In: Żytkow, J.M., Rauch, J. (eds) Principles of Data Mining and Knowledge Discovery. PKDD 1999. Lecture Notes in Computer Science(), vol 1704. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-48247-5_48
Download citation
DOI: https://doi.org/10.1007/978-3-540-48247-5_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66490-1
Online ISBN: 978-3-540-48247-5
eBook Packages: Springer Book Archive