Abstract
Outlier detection is an important task in data mining because outliers can be either useful knowledge or noise. Many statistical methods have been applied to detect outliers, but they usually assume a given distribution of data and it is difficult to deal with high dimensional data. The Statistical Learning Theory (SLT) established by Vapnik et al. provides a new way to overcome these drawbacks. According to SLT Schölkopf et al. proposed a ν-Support Vector Machine (ν-SVM) andapplied it to detect outliers. However, it is still difficult for data mining users to decide onekey parameter in ν-SVM. This paper proposes a new SVM method to detect outliers, SVM-OD, which can avoid this parameter. We provide the theoretical analysis based on SLT as well as experiments to verify the effectiveness of our method. Moreover, an experiment on synthetic data shows that SVM-OD can detect some local outliers near the cluster with some distribution while ν-SVM cannot dothat.
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Editor information
Rights and permissions
About this chapter
Cite this chapter
Wang, J., Zhang, C., Wu, X., Qi, H., Wang, J. SVM-OD: SVM Method to Detect Outliers1. In: Young Lin, T., Ohsuga, S., Liau, CJ., Hu, X. (eds) Foundations and Novel Approaches in Data Mining. Studies in Computational Intelligence, vol 9. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539827_7
Download citation
DOI: https://doi.org/10.1007/11539827_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28315-7
Online ISBN: 978-3-540-31229-1
eBook Packages: EngineeringEngineering (R0)