Skip to main content

Avoiding the Cluster Hypothesis in SV Classification of Partially Labeled Data

  • Conference paper
Recent Advances of Neural Network Models and Applications

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 26))

  • 2108 Accesses

Abstract

We propose a Support Vector-based methodology for learning classifiers from partially labeled data. Its novelty stands in a formulation not based on the cluster hypothesis, stating that learning algorithms should search among classifiers whose decision surface is far from the unlabeled points. On the contrary, we assume such points as specimens of uncertain labels which should lay in a region containing the decision surface. The proposed approach is tested against synthetic data sets and subsequently applied to well-known benchmarks, attaining better or at least comparable performance w.r.t. methods described in the literature.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chapelle, O., Schölkopf, B., Zien, A.: Semi-supervised learning. MIT press (2006)

    Google Scholar 

  2. Castillo, C., Donato, D., Becchetti, L., Boldi, P., Leonardi, S., Santini, M., Vigna, S.: A reference collection for web spam. In: ACM Sigir Forum, vol. 40, pp. 11–24. ACM (2006)

    Google Scholar 

  3. Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Statistics and Computing 14, 199–222 (2004)

    Article  MathSciNet  Google Scholar 

  4. Pedrycz, W.: Shadowed sets: representing and processing fuzzy sets. IEEE Trans. on Systems, Man, and Cybernetics, Part B: Cybernetics 28(1), 103–109 (1998)

    Article  Google Scholar 

  5. Frank, A., Asuncion, A.: UC irvine machine learning repository (2010), http://archive.ics.uci.edu/ml

  6. Bennett, K., Demiriz, A.: Semi-supervised support vector machines. In: Advances in Neural Information Processing Systems, vol. 11, pp. 368–374 (1998)

    Google Scholar 

  7. Abbass, H.: An evolutionary artificial neural networks approach for breast cancer diagnosis. Artificial Intelligence in Medicine 25(3), 265–281 (2002)

    Article  Google Scholar 

  8. Street, N., Wolberg, W., Mangasarian, O.: Nuclear feature extraction for breast tumor diagnosis. In: IS&T/SPIE 1993 International Symposium on Electronic Imaging: Science and Technology, vol. 1905, pp. 861–870 (1993)

    Google Scholar 

  9. Kononenko, I., Šimec, E., Robnik-Šikonjam, M.: Overcoming the myopia of inductive learning algorithms with relieff. Applied Intelligence 7(1), 39–55 (1997)

    Article  Google Scholar 

  10. Quinlan, R.: Combining instance-based and model-based learning. In: Proc. of the 10th Int. Conference on Machine Learning, pp. 236–243. Morgan Kaufmann (1993)

    Google Scholar 

  11. Fung, G., Dundar, M., Bi, J., Rao, B.: A fast iterative algorithm for fisher discriminant using heterogeneous kernels. In: Proceedings of the 21st International Conference on Machine Learning, pp. 40–47. ACM Press (2004)

    Google Scholar 

  12. Dietterich, T., Lathrop, R., Lozano-Pérez, T.: Solving the multiple instance problem with axis-parallel rectangles. Artificial Intelligence 89(1), 31–71 (1997)

    Article  MATH  Google Scholar 

  13. Gorman, P., Sejnowski, T.: Analysis of hidden units in a layered network trained to classify sonar targets. Neural Networks 1(1), 75–89 (1988)

    Article  Google Scholar 

  14. Abdi, H., Williams, L.J.: Principal component analysis. Wiley Interdisciplinary Reviews: Computational Statistics 2, 433–459 (2010)

    Article  Google Scholar 

  15. Bezdek, J.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)

    Google Scholar 

  16. Chen, Y., Wang, J.: Support vector learning for fuzzy rule-based classification systems. IEEE Trans. on Fuzzy Systems 11(6), 716–728 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dario Malchiodi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Malchiodi, D., Legnani, T. (2014). Avoiding the Cluster Hypothesis in SV Classification of Partially Labeled Data. In: Bassis, S., Esposito, A., Morabito, F. (eds) Recent Advances of Neural Network Models and Applications. Smart Innovation, Systems and Technologies, vol 26. Springer, Cham. https://doi.org/10.1007/978-3-319-04129-2_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-04129-2_4

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-04128-5

  • Online ISBN: 978-3-319-04129-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics