Symbol Spotting for Document Categorization

Rusiñol, Marçal; Lladós, Josep

doi:10.1007/978-1-84996-208-7_3

Marçal Rusiñol³ &
Josep Lladós³

320 Accesses

Abstract

In this chapter, we present a method for spotting symbols in document images by using a photometric description of symbols. As a running example we present an application of logo spotting. The presented method uses a bag-of-words model in order to perform a categorization of document images such as invoices or receipts. The hypotheses validation is done in terms of spatial coherence by the use of a Hough-like voting scheme. Experiments which demonstrate the effectiveness of this system on a large set of real data are presented at the end of the chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bagdanov, A., Ballan, L., Bertini, M., Bimbo, A.D.: Trademark matching and retrieval in sports video databases. In: Proceedings of the International Workshop on Multimedia Information Retrieval, pp. 79–86. ACM, New York (2007)
Chapter Google Scholar
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(4), 509–522 (2002)
Article Google Scholar
Califano, A., Mohan, R.: Multidimensional indexing for recognizing visual shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence 16(4), 373–392 (1994)
Article Google Scholar
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proceedings of the Alvey Vision Conference, pp. 147–151 (1988)
Google Scholar
Ke, Y., Sukthankar, R.: PCA-SIFT: A more distinctive representation for local image descriptor. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 506–513. IEEE Computer Society, Los Alamitos (2004)
Google Scholar
Klein, B., Agne, S., Dengel, A.: Results of a study on invoice-reading systems in Germany. In: Document Analysis Systems VI, Lecture Notes on Computer Science, vol. 3163, pp. 451–462. Springer, Berlin (2004)
Google Scholar
Lowe, D.: Object recognition from local scale-invariant features. In: Proceedings of the Seventh IEEE International Conference on Computer Vision, pp. 1150–1157. IEEE Computer Society, Los Alamitos (1999)
Chapter Google Scholar
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Mikolajczyk, K., Schmid, C.: Indexing based on scale invariant interest points. In: Proceedings of the Eighth IEEE International Conference on Computer Vision, pp. 525–531. IEEE Computer Society, Los Alamitos (2001)
Chapter Google Scholar
Mikolajczyk, K., Schmid, C.: Scale & affine invariant interest point detectors. International Journal of Computer Vision 60(1), 63–86 (2004)
Article Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(10), 1615–1630 (2005)
Article Google Scholar
Nakai, T., Kise, K., Iwamura, M.: Camera-based document image retrieval as voting for partial signatures of projective invariants. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 379–383. IEEE Computer Society, Los Alamitos (2005)
Chapter Google Scholar
Nakai, T., Kise, K., Iwamura, M.: Use of affine invariants in locally likely arrangement hashing for camera-based document image retrieval. In: Document Analysis Systems VII, Lecture Notes on Computer Science, vol. 3872, pp. 541–552. Springer, Berlin (2006)
Chapter Google Scholar
Sivic, J., Russell, B., Efros, A., Zisserman, A., Freeman, W.: Discovering objects and their localization in images. In: Proceedings of the Tenth IEEE International Conference on Computer Vision, pp. 370–377. IEEE Computer Society, Los Alamitos (2005)
Chapter Google Scholar
Valveny, E., Dosch, P., Fornés, A., Escalera, S.: Report on the third contest on symbol recognition. In: Graphics Recognition. Recent Advances and New Opportunities, Lecture Notes on Computer Science, vol. 5046, pp. 321–328. Springer, Berlin (2008)
Chapter Google Scholar
Viola, P., Rinker, J., Law, M.: Automatic fax routing. In: Document Analysis Systems VI, Lecture Notes on Computer Science, vol. 3163, pp. 484–495. Springer, Berlin (2004)
Google Scholar
Wei, C., Li, Y., Chau, W., Li, C.: Trademark image retrieval using synthetic features for describing global shape and interior structure. Pattern Recognition 42(3), 386–394 (2009)
Article MATH Google Scholar
Zhu, G., Doerman, D.: Automatic document logo detection. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition, pp. 864–868. IEEE Computer Society, Los Alamitos (2007)
Chapter Google Scholar
Zhu, G., Doerman, D.: Logo matching for document image retrieval. In: Proceedings of the Tenth International Conference on Document Analysis and Recognition, pp. 606–610. IEEE Computer Society, Los Alamitos (2009)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Departament de Ciències de la Computació, Centre de Visió per Computador, Universitat Autònoma de Barcelona, Edifici O, Campus UAB, 08193, Bellaterra, Spain
Marçal Rusiñol & Josep Lladós

Authors

Marçal Rusiñol
View author publications
You can also search for this author in PubMed Google Scholar
Josep Lladós
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marçal Rusiñol .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rusiñol, M., Lladós, J. (2010). Symbol Spotting for Document Categorization. In: Symbol Spotting in Digital Libraries. Springer, London. https://doi.org/10.1007/978-1-84996-208-7_3

Download citation

DOI: https://doi.org/10.1007/978-1-84996-208-7_3
Published: 21 May 2010
Publisher Name: Springer, London
Print ISBN: 978-1-84996-207-0
Online ISBN: 978-1-84996-208-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics