Avoiding the Cluster Hypothesis in SV Classification of Partially Labeled Data

Malchiodi, Dario; Legnani, Tommaso

doi:10.1007/978-3-319-04129-2_4

Dario Malchiodi⁶ &
Tommaso Legnani⁷

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 26))

2108 Accesses

Abstract

We propose a Support Vector-based methodology for learning classifiers from partially labeled data. Its novelty stands in a formulation not based on the cluster hypothesis, stating that learning algorithms should search among classifiers whose decision surface is far from the unlabeled points. On the contrary, we assume such points as specimens of uncertain labels which should lay in a region containing the decision surface. The proposed approach is tested against synthetic data sets and subsequently applied to well-known benchmarks, attaining better or at least comparable performance w.r.t. methods described in the literature.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chapelle, O., Schölkopf, B., Zien, A.: Semi-supervised learning. MIT press (2006)
Google Scholar
Castillo, C., Donato, D., Becchetti, L., Boldi, P., Leonardi, S., Santini, M., Vigna, S.: A reference collection for web spam. In: ACM Sigir Forum, vol. 40, pp. 11–24. ACM (2006)
Google Scholar
Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Statistics and Computing 14, 199–222 (2004)
Article MathSciNet Google Scholar
Pedrycz, W.: Shadowed sets: representing and processing fuzzy sets. IEEE Trans. on Systems, Man, and Cybernetics, Part B: Cybernetics 28(1), 103–109 (1998)
Article Google Scholar
Frank, A., Asuncion, A.: UC irvine machine learning repository (2010), http://archive.ics.uci.edu/ml
Bennett, K., Demiriz, A.: Semi-supervised support vector machines. In: Advances in Neural Information Processing Systems, vol. 11, pp. 368–374 (1998)
Google Scholar
Abbass, H.: An evolutionary artificial neural networks approach for breast cancer diagnosis. Artificial Intelligence in Medicine 25(3), 265–281 (2002)
Article Google Scholar
Street, N., Wolberg, W., Mangasarian, O.: Nuclear feature extraction for breast tumor diagnosis. In: IS&T/SPIE 1993 International Symposium on Electronic Imaging: Science and Technology, vol. 1905, pp. 861–870 (1993)
Google Scholar
Kononenko, I., Šimec, E., Robnik-Šikonjam, M.: Overcoming the myopia of inductive learning algorithms with relieff. Applied Intelligence 7(1), 39–55 (1997)
Article Google Scholar
Quinlan, R.: Combining instance-based and model-based learning. In: Proc. of the 10th Int. Conference on Machine Learning, pp. 236–243. Morgan Kaufmann (1993)
Google Scholar
Fung, G., Dundar, M., Bi, J., Rao, B.: A fast iterative algorithm for fisher discriminant using heterogeneous kernels. In: Proceedings of the 21st International Conference on Machine Learning, pp. 40–47. ACM Press (2004)
Google Scholar
Dietterich, T., Lathrop, R., Lozano-Pérez, T.: Solving the multiple instance problem with axis-parallel rectangles. Artificial Intelligence 89(1), 31–71 (1997)
Article MATH Google Scholar
Gorman, P., Sejnowski, T.: Analysis of hidden units in a layered network trained to classify sonar targets. Neural Networks 1(1), 75–89 (1988)
Article Google Scholar
Abdi, H., Williams, L.J.: Principal component analysis. Wiley Interdisciplinary Reviews: Computational Statistics 2, 433–459 (2010)
Article Google Scholar
Bezdek, J.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Google Scholar
Chen, Y., Wang, J.: Support vector learning for fuzzy rule-based classification systems. IEEE Trans. on Fuzzy Systems 11(6), 716–728 (2003)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Informatica, Università degli Studi di Milano, Milano, Italy
Dario Malchiodi
Dipartimento di Matematica “F. Enriques”, Università degli Studi di Milano, Milano, Italy
Tommaso Legnani

Authors

Dario Malchiodi
View author publications
You can also search for this author in PubMed Google Scholar
Tommaso Legnani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dario Malchiodi .

Editor information

Editors and Affiliations

Department of Computer Science, University of Milano, Milano, Italy
Simone Bassis
Department of Psychology, Second University of Naples, Institute for Advanced Scientific Studies (IIASS), Caserta, Italy
Anna Esposito
Department of Civil, Energy, Environmental, and Materials Engineering, DICEAM, University Mediterranea of Reggio Calabria, Reggio Calabria, Italy
Francesco Carlo Morabito

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Malchiodi, D., Legnani, T. (2014). Avoiding the Cluster Hypothesis in SV Classification of Partially Labeled Data. In: Bassis, S., Esposito, A., Morabito, F. (eds) Recent Advances of Neural Network Models and Applications. Smart Innovation, Systems and Technologies, vol 26. Springer, Cham. https://doi.org/10.1007/978-3-319-04129-2_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-04129-2_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04128-5
Online ISBN: 978-3-319-04129-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics