Abstract
We introduce a complete pipeline for recognizing and classifying people’s clothing in natural scenes. This has several interesting applications, including e-commerce, event and activity recognition, online advertising, etc. The stages of the pipeline combine a number of state-of-the-art building blocks such as upper body detectors, various feature channels and visual attributes. The core of our method consists of a multi-class learner based on a Random Forest that uses strong discriminative learners as decision nodes. To make the pipeline as automatic as possible we also integrate automatically crawled training data from the web in the learning process. Typically, multi-class learning benefits from more labeled data. Because the crawled data may be noisy and contain images unrelated to our task, we extend Random Forests to be capable of transfer learning from different domains. For evaluation, we define 15 clothing classes and introduce a benchmark data set for the clothing classification task consisting of over 80,000 images, which we make publicly available. We report experimental results, where our classifier outperforms an SVM baseline with 41.38 % vs 35.07 % average accuracy on challenging benchmark data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: Speeded Up Robust Features. In: ICCV (2006)
Breiman, L.: Random forests. Machine Learning, 5–32 (2001)
Caruana, R., Karampatziakis, N., Yessenalina, A.: An empirical evaluation of supervised learning methods in high dimensions. In: ICML (2008)
Chen, H., Xu, Z.J., Liu, Z.Q., Zhu, S.C.: Composite Templates for Cloth Modeling and Sketching. In: CVPR (2006)
Criminisi, A., Shotton, J., Konukoglu, E.: Decision forests for classification, regression, density estimation, manifold learning and semi-supervised learning. Technical Report MSR-TR-2011-114, Microsoft Research (2011)
Dalal, N., Triggs, B.: Histograms of Oriented Gradients for Human Detection. In: CVPR (2005)
Daumé, H.: Frustratingly easy domain adaptation. Annual Meeting-Association for Computational Linguistics 45, 256 (2007)
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: A large-scale hierarchical image database. In: CVPR (2009)
Eichner, M., Ferrari, V.: CALVIN Upper-body detector for detection in still images
Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: A library for large linear classification. JMLR 9 (2008)
Farhadi, A., Endres, I., Hoiem, D., Forsyth, D.: Describing objects by their attributes. In: CVPR (2009)
Ferrari, V., Zisserman, A.: Learning visual attributes. In: NIPS (2008)
Gallagher, A.C.: Clothing cosegmentation for recognizing people. In: CVPR (2008)
Hu, Z., Yan, H., Lin, X.: Clothing segmentation using foreground and background estimation based on the constrained Delaunay triangulation. Pattern Recognition 41 (2008)
Joachims, T.: Transductive inference for text classification using support vector machines. In: ICML (1999)
Kumar, N., Berg, A.C., Belhumeur, P.N., Nayar, S.K.: Attribute and simile classifiers for face verification. In: ICCV (2009)
Lampert, C., Nickisch, H., Harmeling, S.: Learning to detect unseen object classes by between-class attribute transfer. In: CVPR (2009)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR (2006)
Leistner, C., Saffari, A., Santner, J., Bischof, H.: Semi-Supervised Random Forests. In: ICCV (2009)
Liu, S., Song, Z., Liu, G., Xu, C., Lu, H., Yan, S.: Street-to-Shop: Cross-Scenario Clothing Retrieval via Parts Alignment and Auxiliary Set. In: CVPR (2012)
Ojala, T., Pietikainen, M., Harwood, D.: Performance evaluation of texture measures with classification based on Kullback discrimination of distributions. In: ICOR (1994)
Pan, S.J., Yang, Q.: A survey on transfer learning. TKDE (2010)
Shechtman, E., Irani, M.: Matching Local Self-Similarities across Images and Videos. In: CVPR (2007)
Song, Z., Wang, M., Hua, X.s., Yan, S.: Predicting occupation via human clothing and contexts. In: ICCV (2011)
Sorokin, A., Forsyth, D.: Utility data annotation with amazon mechanical turk. In: Workshop on Internet Vision (2008)
Stark, M., Goesele, M., Schiele, B.: A shape-based object class model for knowledge transfer. In: ICCV (2009)
Wang, N., Ai, H.: Who Blocks Who: Simultaneous clothing segmentation for grouping images. In: ICCV (2011)
Wang, X., Zhang, T.: Clothes search in consumer photos via color matching and attribute learning. In: MM. ACM Press (2011)
Yamaguchi, K., Kiapour, H., Ortiz, L., Berg, T.L.: Parsing Clothing in Fashion Photographs. In: CVPR (2012)
Yao, B., Khosla, A., Fei-Fei, L.: Combining randomization and discrimination for fine-grained image categorization. In: CVPR (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bossard, L., Dantone, M., Leistner, C., Wengert, C., Quack, T., Van Gool, L. (2013). Apparel Classification with Style. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds) Computer Vision – ACCV 2012. ACCV 2012. Lecture Notes in Computer Science, vol 7727. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37447-0_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-37447-0_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37446-3
Online ISBN: 978-3-642-37447-0
eBook Packages: Computer ScienceComputer Science (R0)