Abstract
In this paper we propose an original image retrieval model inspired from the vector space information retrieval model. We build for different features and different scales a visual concept dictionary composed by visual words intended to represent a semantic concept, and then we represent an image by the frequency of the visual words within the image. Then the image similarity is computed as in the textual domain where a textual document is represented by a vector in which each component is the frequency of occurrence of a specific textual word in that document. We then adapt the common text-based paradigm by using the TF-IDF weighting scheme to construct a WF-IIF weighting scheme in our Multi-Scale Visual Dictionary (MSVD) vector space model.
The experiments are conducted on the 2009 Visual Concept Detection ImageCLEF Campaign. We compare WF-IIF to usual direct Support-Vector Machine (SVM) algorithm. We demonstrate that SVM and WF-IIF are in average over all the concept giving the same Area Under the Curve (AUC). We then discuss the fusion process that should enhance the whole system, and of some particular properties of MSVD, that shall be less dependant of the training set size of each concept than the SVM.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Picard, R.W.: Toward a visual thesaurus. In: Springer Werlag Workshops in Computing, MIRO (1995)
Picard, R.W.: A society of models for video and image libraries (1996)
Zhang, R., Zhang, Z.M.: Hidden semantic concept discovery in region based image retrieval. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 996–1001 (2004)
Lim, J.H.: Categorizing visual contents by matching visual “keywords”. In: Huijsmans, D.P., Smeulders, A.W.M. (eds.) VISUAL 1999. LNCS, vol. 1614, pp. 367–374. Springer, Heidelberg (1999)
Fauqueur, J., Boujemaa, N.: Mental image search by boolean composition of region categories. In: Multimedia Tools and Applications, pp. 95–117 (2004)
Souvannavong, F., Hohl, L., MĂ©rialdo, B., Huet, B.: Enhancing latent semantic analysis video object retrieval with structural information. In: IEEE International Conference on Image Processing, ICIP 2004, Singapore, October 24-27 (2004)
Salton, G., Mcgill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, Inc., New York (1986)
Mitchell, T.: Machine Learning (October 1997)
Seymore, K., Chen, S., Rosenfeld, R., Chen, S., Rosenfeld, R.: Nonlinear interpolation of topic models for language model adaptation. In: Proceedings of ICSLP-1998, vol. 6, pp. 2503–2506 (1998)
Jensen, R., Shen, Q.: Fuzzy-rough data reduction with ant colony optimization. Fuzzy Sets and Systems (March 2004)
Nowak, S., Dunker, P.: Overview of the CLEF 2009 large-scale visual concept detection and annotation task. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 94–109. Springer, Heidelberg (2010)
Smach, F., Lemaître, C., Gauthier, J.P., Miteran, J., Atri, M.: Generalized fourier descriptors with applications to objects recognition in svm context. J. Math. Imaging Vis. 30(1), 43–71 (2008)
Glotin, H., Zhao, Z., Ayache, S.: Efficient image concept indexing by harmonic and arithmetic profiles entropy. In: Proceedings of 2009 IEEE International Conference on Image Processing (ICIP 2009), Cairo, Egypt, November 7-11 (2009)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
van de Sande, K., Gevers, T., Smeulders, A.: The university of Amsterdam’s concept detection system at imageCLEF 2009. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 261–268. Springer, Heidelberg (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dumont, E., Glotin, H., Paris, S., Zhao, ZQ. (2010). A Fast Visual Word Frequency - Inverse Image Frequency for Detector of Rare Concepts. In: Peters, C., et al. Multilingual Information Access Evaluation II. Multimedia Experiments. CLEF 2009. Lecture Notes in Computer Science, vol 6242. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15751-6_39
Download citation
DOI: https://doi.org/10.1007/978-3-642-15751-6_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15750-9
Online ISBN: 978-3-642-15751-6
eBook Packages: Computer ScienceComputer Science (R0)