Abstract
Imbalance data is common in real-world applications like text categorization, face recognition for gender classification, medical diagnosis, fraud detection, oil-spills detection of satellite images. Most of the algorithms in machine learning are focusing on classification of majority class while ignoring or misclassifying minority sample. The minority samples are those that rarely occur but very important. It is commonly agreed that standard classifiers such as neural networks, support vector machines, and C4.5 are heavily biased in recognizing mostly the majority class since they are built to achieve overall accuracy to which the minority class contributes very little. In this study, we demonstrate how the synthetic minority over-sampling technique (SMOTE) can significantly improve the imbalance problem in gender classification from the data-level perspective. Hu’s moment of the face images was generated as the numerical descriptors with different imbalance ratio and classified using a supervised decision tree (J48) algorithm. The results show that prior to preprocessing the data with SMOTE, the minority group was severely misclassified as the majority group. Our claims are confirmed through the application of SMOTE in reducing the imbalance effects before inducing the decision tree.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Baumann, F., Ehlers, A., Vogt, K., & Rosenhahn, B. (2013). Cascaded random forest for fast object detection. In Image Analysis, (pp. 131–142). Berlin Heidelberg: Springer.
Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 321–357.
Hu, M. K. (1962). Visual pattern recognition by moment invariants. Information Theory, IRE Transactions, 8(2), 179–187.
Jia, H., & Martinez, A. M. (2009, June). Support vector machines in face recognition with occlusions. In IEEE conference on computer vision and pattern recognition, 2009. CVPR 2009. (pp. 136–141).
Khan, M. N. A., Qureshi, S. A., & Riaz, N. (2013). Gender classification with decision trees. International Journal of Signal Processing, Image Processing and Pattern Recognition, 6, 165–176.
Maturana, D., Mery, D., & Soto, A. (2010). Face recognition with decision tree-based local binary patterns. In Computer Vision–ACCV 2010, (pp. 618–629). Berlin Heidelberg: Springer.
Moallem, P., & Mousavi, B. S. (2013). Gender classification by fuzzy inference system. International Journal of Advanced Robotic Systems, 10.
Morgan, R. E., & Mason, B. J. (2014). Crimes against the elderly, 2003–2013. Special Report (NCJ 248339). Washington, DC: United States Department of Justice, Office of Justice Programs, Bureau of Justice Statistics.
Palaniappan, R., Raveendran, P., &Omatu, S. (2000). Improved moment invariants for invariant image representation. In Invariants for pattern recognition and classification’ (pp. 167–187). Singapore: World Scientific Publishing Co.
Phillips, P. J. (1998). Support vector machines applied to face recognition(Vol. 285). US Department of Commerce, Technology Administration, National Institute of Standards and Technology.
Riaz, Z., Mayer, C., Wimmer, M., & Radig, B. (2008). Model based face recognition across facial expressions. Journal of Information and Communication Technology, 2.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kamarulzalis, A.H., Mohd Razali, M.H., Moktar, B. (2018). Data Pre-Processing Using SMOTE Technique for Gender Classification with Imbalance Hu’s Moments Features. In: Saian, R., Abbas, M. (eds) Proceedings of the Second International Conference on the Future of ASEAN (ICoFA) 2017 – Volume 2. Springer, Singapore. https://doi.org/10.1007/978-981-10-8471-3_37
Download citation
DOI: https://doi.org/10.1007/978-981-10-8471-3_37
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8470-6
Online ISBN: 978-981-10-8471-3
eBook Packages: Social SciencesSocial Sciences (R0)