When does overfitting decrease prediction accuracy in induced decision trees and rule sets?

Schaffer, Cullen

doi:10.1007/BFb0017014

Cullen Schaffer¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 482))

Included in the following conference series:

European Working Session on Learning

318 Accesses
5 Citations

Abstract

Researchers studying classification techniques based on induced decision trees and rule sets have found that the model which best fits training data is unlikely to yield optimal performance on fresh data. Such a model is typically overfitted, in the sense that it captures not only true regularities reflected in the training data, but also chance patterns which have no significance for classification and, in fact, reduce the model's predictive accuracy. Various simplification methods have been shown to help avoid overfitting in practice. Here, through detailed analysis of a paradigmatic example, I attempt to uncover the conditions under which these techniques work as expected. One auxilliary result of importance is identification of conditions under which overfitting does not decrease predictive accuracy and hence in which it would be a mistake to apply simplification techniques, if predictive accuracy is the key goal.

The research reported here was supported by a grant from the Robert Wood Johnson Pharmaceutical Research Institute.

Download to read the full chapter text

Chapter PDF

References

Breiman, L., Friedman, J., Olshen, R., Stone, C. Classification and Regression Trees. Wadsworth and Brooks, 1984.
Google Scholar
Niblett, T. Constructing decision trees in noisy domains. In Proceedings of the Second European Working Session on Learning, pages 67–78. Sigma Press, Bled., Yugoslavia, 1987.
Google Scholar
Spangler, S., Fayyad, U., Uthurusamy, R. Induction of decision trees from inconclusive data. In Proceedings of the Fifth International Workshop on Machine Learning, pages 146–150, 1988.
Google Scholar
Quinlan, J. The effect of noise on concept learning. In Michalski, R., Carbonell, J., Mitchell, T. Machine Learning: An Artificial Intelligence Approach, volume 2, chapter 6. Morgan Kaufmann, 1986.
Google Scholar
Quinlan, J. Simplifying decision trees. International Journal of Man-Machine Studies, 27:221–234, 1987.
Google Scholar
Weiss, S., Galen, R., Tadepalli, P. Optimizing the predictive value of diagnostic decision rules. In Proceedings of the Sixth National Conference on Artificial Intelligence, pages 521–526, 1987.
Google Scholar

Download references

Author information

Authors and Affiliations

Departments of Computer Science and Statistics, Rutgers University, 08903, New Brunswick, NJ
Cullen Schaffer

Authors

Cullen Schaffer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Yves Kodratoff

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schaffer, C. (1991). When does overfitting decrease prediction accuracy in induced decision trees and rule sets?. In: Kodratoff, Y. (eds) Machine Learning — EWSL-91. EWSL 1991. Lecture Notes in Computer Science, vol 482. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0017014

Download citation

DOI: https://doi.org/10.1007/BFb0017014
Published: 09 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-53816-5
Online ISBN: 978-3-540-46308-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics