Modality Classification for Medical Images Using Sparse Coded Affine-Invariant Descriptors

Gál, Viktor; Solt, Illés; Kerre, Etienne; Nachtegael, Mike

doi:10.1007/978-3-642-36778-6_1

Viktor Gál²¹,
Illés Solt²²,
Etienne Kerre²¹ &
…
Mike Nachtegael²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7769))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

1054 Accesses
1 Citations

Abstract

Modality is a key facet in medical image retrieval, as a user is likely interested in only one of e.g. radiology images, flowcharts, and pathology photos. While assessing image modality is trivial for humans, reliable automatic methods are required to deal with large un-annotated image bases, such as figures taken from the millions of scientific publications. We present a multi-disciplinary approach to tackle the classification problem by combining image features, meta-data, textual and referential information. We test our system’s accuracy on the Image- CLEF 2011 medical modality classification data set. We show that using a fully affine-invariant feature descriptor and sparse coding on these descriptors in the Bag-of-Words image representation significantly increases the classification accuracy. Our best method achieves 87.89 and outperforms the state of the art.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 49.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hersh, W.R., Müller, H., Jensen, J.R., Yang, J., Gorman, P.N., Ruch, P.: Advancing Biomedical Image Retrieval: Development and Analysis of a Test Collection. Journal of the American Medical Informatics Association 13(5), 488–496 (2006)
Article Google Scholar
Pentland, A., Picard, R.W., Sclaroff, S.: Photobook: Content-based manipulation of image databases. International Journal of Computer Vision 18(3), 233–254 (1996)
Article Google Scholar
Lakdashti, A., Moin, M.S.: A New Content-Based Image Retrieval Approach Based on Pattern Orientation Histogram. In: Gagalowicz, A., Philips, W. (eds.) MIRAGE 2007. LNCS, vol. 4418, pp. 587–595. Springer, Heidelberg (2007)
Chapter Google Scholar
Jain, A.: Image retrieval using color and shape. Pattern Recognition 29(8), 1233–1244 (1996)
Article Google Scholar
Morel, J.-M., Yu, G.: ASIFT: A New Framework for Fully Affine Invariant Image Comparison. SIAM Journal on Imaging Sciences 2(2) (April 2009)
Google Scholar
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, vol. 1, p. 22 (2004)
Google Scholar
Jarrett, K., Kavukcuoglu, K., Ranzato, M.A., LeCun, Y.: What is the best multi-stage architecture for object recognition? In: 2009 IEEE 12th International Conference on Computer Vision, pp. 2146–2153 (2009)
Google Scholar
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 1794–1801 (2009)
Google Scholar
Efron, B., Hastie, T., Johnstone, I., Tibshirani, R.: Least Angle Regression. ArXiv Mathematics e-prints (June 2004)
Google Scholar
Kalpathy-Cramer, J., Müller, H., Bedrick, S., Eggel, I., de Herrera, A.G.S., Tsikrika, T.: The CLEF 2011 medical image retrieval and classification tasks. In: CLEF 2011 Working Notes, Amsterdam, The Netherlands (2011)
Google Scholar
Veltkamp, R.C.: A survey of content-based image retrieval systems. Content-based Image and Video Retrieval (2002)
Google Scholar
Duda, R.O.: Use of the Hough transformation to detect lines and curves in pictures. Communications of the ACM (1972)
Google Scholar
Chai, D., Ngan, K.N.: Face segmentation using skin-color map in videophone applications. IEEE Transactions on Circuits and Systems for Video Technology 9(4), 551–564 (1999)
Article Google Scholar
Lowe, D.G.: Object recognition from local scale-invariant features. In: Proceedings of the International Conference on Computer Vision, ICCV 1999, pp. 1150–1157. IEEE Computer Society, Washington, DC (1999)
Google Scholar
Nister, D., Stewenius, H.: Scalable Recognition with a Vocabulary Tree. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2161–2168 (2006)
Google Scholar
Jegou, H., Harzallah, H., Schmid, C.: A contextual dissimilarity measure for accurate and efficient image search. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2007), pp. 1–8 (2007)
Google Scholar
Chum, O., Philbin, J., Sivic, J., Isard, M.: Total Recall: Automatic query expansion with a generative feature model for object retrieval. In: 2007 IEEE 11th International Conference on Computer Vision, pp. 1–8. IEEE (October 2007)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: Improving particular object retrieval in large scale image databases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2008)
Google Scholar
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
MATH Google Scholar
Gal, V., Solt, I., Gedeon, T., Nachtegael, M., Kerre, E.: Multi-disciplinary modality classication for medical images. In: CLEF 2011 Working Notes, Amsterdam, The Netherlands (2011)
Google Scholar
Sivic, J., Zisserman, A.: Video Google: A Text Retrieval Approach to Object Matching in Videos. In: 9th IEEE International Conference on Computer Vision (ICCV 2003), pp. 1470–1477. IEEE Computer Society (2003)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: Improving particular object retrieval in large scale image databases. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8 (2008)
Google Scholar
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explorations 11(1), 10–18 (2009)
Article Google Scholar
Csurka, G., Clinchant, S., Jacquet, G.: XRCE’s Participation at Medical Image Modality Classification and Ad-hoc Retrieval Tasks of Image CLEF 2011. In: CLEF 2011 Working Notes, Amsterdam, The Netherlands (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Applied Mathematics and Computer Science, Ghent University, Belgium
Viktor Gál, Etienne Kerre & Mike Nachtegael
Department of Telecommunications and Media Informatics, Budapest University of Technology and Economics, Hungary
Illés Solt

Authors

Viktor Gál
View author publications
You can also search for this author in PubMed Google Scholar
Illés Solt
View author publications
You can also search for this author in PubMed Google Scholar
Etienne Kerre
View author publications
You can also search for this author in PubMed Google Scholar
Mike Nachtegael
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ISIR, Osaka University, 8-1, Mihogaoka, Ibaraki, Osaka, Japan
Takashi Washio
Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, 1068 Xueyuan Boulevard, 518055, Shenzhen, Guangdong, China
Jun Luo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gál, V., Solt, I., Kerre, E., Nachtegael, M. (2013). Modality Classification for Medical Images Using Sparse Coded Affine-Invariant Descriptors. In: Washio, T., Luo, J. (eds) Emerging Trends in Knowledge Discovery and Data Mining. PAKDD 2012. Lecture Notes in Computer Science(), vol 7769. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36778-6_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-36778-6_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36777-9
Online ISBN: 978-3-642-36778-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics