Skip to main content

Fuzzy Granular Classifier Approach for Spam Detection

  • Conference paper
  • First Online:
Computational Collective Intelligence

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9330))

Abstract

Spam email problem is a major shortcoming of email technology for computer security. In this research, a granular classifier model is proposed to discover hyperboxes in the geometry of information granules for spam detection in three steps. In the first step, the k-means clustering algorithm is applied to find the seed_points to build the granular structure of the spam and non-spam patterns. Moreover, applying the interval analysis through the high homogeneity of the patterns captures the key part of the spam and non-spam classifiers’ structure. In the second step, PSO algorithm is hybridized with the k-means to optimize the formalized information granules’ performance. The proposed model is evaluated based on the accuracy, misclassification and coverage criteria. Experimental results reveal that the performance of our proposed model is increased through applying Particle Swarm Optimization and fuzzy set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Idris, I., Selamat, A.: Improved email spam detection model with negative selection algorithm and particle swarm optimization. Applied Soft Computing 22, 11–27 (2014)

    Article  Google Scholar 

  2. Salcedo-Campos, F., Díaz-Verdejo, J., García-Teodoro, P.: Segmental parameterisation and statistical modelling of e-mail headers for spam detection. Information Sciences 195, 45–61 (2012)

    Article  Google Scholar 

  3. Méndez, J.R., Reboiro-Jato, M., Díaz, F., Díaz, E., Fdez-Riverola, F.: Grindstone4Spam: An optimization toolkit for boosting e-mail classification. Journal of Systems and Software 85, 2909–2920 (2012)

    Article  Google Scholar 

  4. Bouguila, N., Amayri, O.: A discrete mixture-based kernel for SVMs: application to spam and image categorization. Information Processing & Management 45, 631–642 (2009)

    Article  Google Scholar 

  5. Salehi, S., Selamat, A., Bostanian, M.: Enhanced genetic algorithm for spam detection in email. In: 2011 IEEE 2nd International Conference on Software Engineering and Service Science (ICSESS), pp. 594–597. IEEE (2011)

    Google Scholar 

  6. Guzella, T.S., Mota-Santos, T.A., Uchôa, J.Q., Caminhas, W.M.: Identification of SPAM messages using an approach inspired on the immune system. Biosystems 92, 215–225 (2008)

    Article  Google Scholar 

  7. Salehi, S., Selamat, A.: Hybrid simple artificial immune system (SAIS) and particle swarm optimization (PSO) for spam detection. In: 2011 5th Malaysian Conference in Software Engineering (MySEC), pp. 124–129. IEEE (2011)

    Google Scholar 

  8. Fdez-Riverola, F., Iglesias, E.L., Díaz, F., Méndez, J.R., Corchado, J.M.: SpamHunting: An instance-based reasoning system for spam labelling and filtering. Decision Support Systems 43, 722–736 (2007)

    Article  Google Scholar 

  9. Özgür, L., Güngör, T., Gürgen, F.: Adaptive anti-spam filtering for agglutinative languages: a special case for Turkish. Pattern Recognition Letters 25, 1819–1831 (2004)

    Article  Google Scholar 

  10. Velmurugan, T.: Performance based analysis between k-Means and Fuzzy C-Means clustering algorithms for connection oriented telecommunication data. Applied Soft Computing 19, 134–146 (2014)

    Article  Google Scholar 

  11. Salehi, S., Selamat, A., Fujita, H.: Systematic mapping study on Granular computing. Knowledge-Based Systems (2015)

    Google Scholar 

  12. Pedrycz, W., Park, B.-J., Oh, S.-K.: The design of granular classifiers: A study in the synergy of interval calculus and fuzzy sets in pattern recognition. Pattern Recognition 41, 3720–3735 (2008)

    Article  MATH  Google Scholar 

  13. Forrest, S., Perelson, A.S., Allen, L., Cherukuri, R.: Self-nonself discrimination in a computer. In: 2012 IEEE Symposium on Security and Privacy, pp. 202–202. IEEE Computer Society (1994)

    Google Scholar 

  14. Oda, T., White, T.: Developing an immunity to spam. In: CantĂş-Paz, E., et al. (eds.) GECCO 2003. LNCS, vol. 2723. Springer, Heidelberg (2003)

    Google Scholar 

  15. Santos, I., Laorden, C., Sanz, B., Bringas, P.G.: Enhanced topic-based vector space model for semantics-aware spam filtering. Expert Systems with applications 39, 437–444 (2012)

    Article  Google Scholar 

  16. Carnap, R.: Meaning and synonymy in natural languages. Philosophical Studies 6, 33–47 (1955)

    Article  Google Scholar 

  17. Polyvyanyy, A.: Evaluation of a novel information retrieval model: eTVSM. Master’s thesis, Hasso Plattner Institut (2007)

    Google Scholar 

  18. Salton, G., McGill, M.J.: Introduction to modern information retrieval (1983)

    Google Scholar 

  19. Laorden, C., Ugarte-Pedrero, X., Santos, I., Sanz, B., Nieves, J., Bringas, P.G.: Study on the effectiveness of anomaly detection for spam filtering. Information Sciences 277, 421–444 (2014)

    Article  Google Scholar 

  20. Zhang, Y., Wang, S., Phillips, P., Ji, G.: Binary PSO with mutation operator for feature selection using decision tree applied to spam detection. Knowledge-Based Systems 64, 22–31 (2014)

    Article  Google Scholar 

  21. DeBarr, D., Wechsler, H.: Spam detection using random boost. Pattern Recognition Letters 33, 1237–1244 (2012)

    Article  Google Scholar 

  22. Asuncion, A., Newman, D.: UCI machine learning repository (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ali Selamat .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Salehi, S., Selamat, A., Krejcar, O., Kuca, K. (2015). Fuzzy Granular Classifier Approach for Spam Detection. In: Núñez, M., Nguyen, N., Camacho, D., Trawiński, B. (eds) Computational Collective Intelligence. Lecture Notes in Computer Science(), vol 9330. Springer, Cham. https://doi.org/10.1007/978-3-319-24306-1_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24306-1_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24305-4

  • Online ISBN: 978-3-319-24306-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics