Skip to main content

Data Pre-Processing Using SMOTE Technique for Gender Classification with Imbalance Hu’s Moments Features

  • Conference paper
  • First Online:
Proceedings of the Second International Conference on the Future of ASEAN (ICoFA) 2017 – Volume 2

Abstract

Imbalance data is common in real-world applications like text categorization, face recognition for gender classification, medical diagnosis, fraud detection, oil-spills detection of satellite images. Most of the algorithms in machine learning are focusing on classification of majority class while ignoring or misclassifying minority sample. The minority samples are those that rarely occur but very important. It is commonly agreed that standard classifiers such as neural networks, support vector machines, and C4.5 are heavily biased in recognizing mostly the majority class since they are built to achieve overall accuracy to which the minority class contributes very little. In this study, we demonstrate how the synthetic minority over-sampling technique (SMOTE) can significantly improve the imbalance problem in gender classification from the data-level perspective. Hu’s moment of the face images was generated as the numerical descriptors with different imbalance ratio and classified using a supervised decision tree (J48) algorithm. The results show that prior to preprocessing the data with SMOTE, the minority group was severely misclassified as the majority group. Our claims are confirmed through the application of SMOTE in reducing the imbalance effects before inducing the decision tree.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 309.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 399.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 399.00
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Baumann, F., Ehlers, A., Vogt, K., & Rosenhahn, B. (2013). Cascaded random forest for fast object detection. In Image Analysis, (pp. 131–142). Berlin Heidelberg: Springer.

    Chapter  Google Scholar 

  • Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 321–357.

    Google Scholar 

  • Hu, M. K. (1962). Visual pattern recognition by moment invariants. Information Theory, IRE Transactions, 8(2), 179–187.

    Google Scholar 

  • Jia, H., & Martinez, A. M. (2009, June). Support vector machines in face recognition with occlusions. In IEEE conference on computer vision and pattern recognition, 2009. CVPR 2009. (pp. 136–141).

    Google Scholar 

  • Khan, M. N. A., Qureshi, S. A., & Riaz, N. (2013). Gender classification with decision trees. International Journal of Signal Processing, Image Processing and Pattern Recognition, 6, 165–176.

    Article  Google Scholar 

  • Maturana, D., Mery, D., & Soto, A. (2010). Face recognition with decision tree-based local binary patterns. In Computer Vision–ACCV 2010, (pp. 618–629). Berlin Heidelberg: Springer.

    Chapter  Google Scholar 

  • Moallem, P., & Mousavi, B. S. (2013). Gender classification by fuzzy inference system. International Journal of Advanced Robotic Systems, 10.

    Google Scholar 

  • Morgan, R. E., & Mason, B. J. (2014). Crimes against the elderly, 2003–2013. Special Report (NCJ 248339). Washington, DC: United States Department of Justice, Office of Justice Programs, Bureau of Justice Statistics.

    Google Scholar 

  • Palaniappan, R., Raveendran, P., &Omatu, S. (2000). Improved moment invariants for invariant image representation. In Invariants for pattern recognition and classification’ (pp. 167–187). Singapore: World Scientific Publishing Co.

    Chapter  Google Scholar 

  • Phillips, P. J. (1998). Support vector machines applied to face recognition(Vol. 285). US Department of Commerce, Technology Administration, National Institute of Standards and Technology.

    Google Scholar 

  • Riaz, Z., Mayer, C., Wimmer, M., & Radig, B. (2008). Model based face recognition across facial expressions. Journal of Information and Communication Technology, 2.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ahmad Haadzal Kamarulzalis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kamarulzalis, A.H., Mohd Razali, M.H., Moktar, B. (2018). Data Pre-Processing Using SMOTE Technique for Gender Classification with Imbalance Hu’s Moments Features. In: Saian, R., Abbas, M. (eds) Proceedings of the Second International Conference on the Future of ASEAN (ICoFA) 2017 – Volume 2. Springer, Singapore. https://doi.org/10.1007/978-981-10-8471-3_37

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-8471-3_37

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-8470-6

  • Online ISBN: 978-981-10-8471-3

  • eBook Packages: Social SciencesSocial Sciences (R0)

Publish with us

Policies and ethics