Skip to main content

Symbol Spotting for Document Categorization

  • Chapter
  • First Online:
Symbol Spotting in Digital Libraries

Abstract

In this chapter, we present a method for spotting symbols in document images by using a photometric description of symbols. As a running example we present an application of logo spotting. The presented method uses a bag-of-words model in order to perform a categorization of document images such as invoices or receipts. The hypotheses validation is done in terms of spatial coherence by the use of a Hough-like voting scheme. Experiments which demonstrate the effectiveness of this system on a large set of real data are presented at the end of the chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bagdanov, A., Ballan, L., Bertini, M., Bimbo, A.D.: Trademark matching and retrieval in sports video databases. In: Proceedings of the International Workshop on Multimedia Information Retrieval, pp. 79–86. ACM, New York (2007)

    Chapter  Google Scholar 

  2. Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(4), 509–522 (2002)

    Article  Google Scholar 

  3. Califano, A., Mohan, R.: Multidimensional indexing for recognizing visual shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence 16(4), 373–392 (1994)

    Article  Google Scholar 

  4. Harris, C., Stephens, M.: A combined corner and edge detector. In: Proceedings of the Alvey Vision Conference, pp. 147–151 (1988)

    Google Scholar 

  5. Ke, Y., Sukthankar, R.: PCA-SIFT: A more distinctive representation for local image descriptor. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 506–513. IEEE Computer Society, Los Alamitos (2004)

    Google Scholar 

  6. Klein, B., Agne, S., Dengel, A.: Results of a study on invoice-reading systems in Germany. In: Document Analysis Systems VI, Lecture Notes on Computer Science, vol. 3163, pp. 451–462. Springer, Berlin (2004)

    Google Scholar 

  7. Lowe, D.: Object recognition from local scale-invariant features. In: Proceedings of the Seventh IEEE International Conference on Computer Vision, pp. 1150–1157. IEEE Computer Society, Los Alamitos (1999)

    Chapter  Google Scholar 

  8. Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  9. Mikolajczyk, K., Schmid, C.: Indexing based on scale invariant interest points. In: Proceedings of the Eighth IEEE International Conference on Computer Vision, pp. 525–531. IEEE Computer Society, Los Alamitos (2001)

    Chapter  Google Scholar 

  10. Mikolajczyk, K., Schmid, C.: Scale & affine invariant interest point detectors. International Journal of Computer Vision 60(1), 63–86 (2004)

    Article  Google Scholar 

  11. Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(10), 1615–1630 (2005)

    Article  Google Scholar 

  12. Nakai, T., Kise, K., Iwamura, M.: Camera-based document image retrieval as voting for partial signatures of projective invariants. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 379–383. IEEE Computer Society, Los Alamitos (2005)

    Chapter  Google Scholar 

  13. Nakai, T., Kise, K., Iwamura, M.: Use of affine invariants in locally likely arrangement hashing for camera-based document image retrieval. In: Document Analysis Systems VII, Lecture Notes on Computer Science, vol. 3872, pp. 541–552. Springer, Berlin (2006)

    Chapter  Google Scholar 

  14. Sivic, J., Russell, B., Efros, A., Zisserman, A., Freeman, W.: Discovering objects and their localization in images. In: Proceedings of the Tenth IEEE International Conference on Computer Vision, pp. 370–377. IEEE Computer Society, Los Alamitos (2005)

    Chapter  Google Scholar 

  15. Valveny, E., Dosch, P., Fornés, A., Escalera, S.: Report on the third contest on symbol recognition. In: Graphics Recognition. Recent Advances and New Opportunities, Lecture Notes on Computer Science, vol. 5046, pp. 321–328. Springer, Berlin (2008)

    Chapter  Google Scholar 

  16. Viola, P., Rinker, J., Law, M.: Automatic fax routing. In: Document Analysis Systems VI, Lecture Notes on Computer Science, vol. 3163, pp. 484–495. Springer, Berlin (2004)

    Google Scholar 

  17. Wei, C., Li, Y., Chau, W., Li, C.: Trademark image retrieval using synthetic features for describing global shape and interior structure. Pattern Recognition 42(3), 386–394 (2009)

    Article  MATH  Google Scholar 

  18. Zhu, G., Doerman, D.: Automatic document logo detection. In: Proceedings of the Ninth International Conference on Document Analysis and Recognition, pp. 864–868. IEEE Computer Society, Los Alamitos (2007)

    Chapter  Google Scholar 

  19. Zhu, G., Doerman, D.: Logo matching for document image retrieval. In: Proceedings of the Tenth International Conference on Document Analysis and Recognition, pp. 606–610. IEEE Computer Society, Los Alamitos (2009)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marçal Rusiñol .

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag London Limited

About this chapter

Cite this chapter

Rusiñol, M., Lladós, J. (2010). Symbol Spotting for Document Categorization. In: Symbol Spotting in Digital Libraries. Springer, London. https://doi.org/10.1007/978-1-84996-208-7_3

Download citation

  • DOI: https://doi.org/10.1007/978-1-84996-208-7_3

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-84996-207-0

  • Online ISBN: 978-1-84996-208-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics