Convolutional Neural Networks for Image Processing with Applications in Mobile Robotics

Browne, Matthew; Ghidary, Saeed Shiry; Mayer, Norbert Michael

doi:10.1007/978-3-540-75398-8_15

Matthew Browne⁴,
Saeed Shiry Ghidary⁵ &
Norbert Michael Mayer⁶

Part of the book series: Studies in Computational Intelligence ((SCI,volume 83))

2236 Accesses
7 Citations

Convolutional neural networks (CNNs) represent an interesting method for adaptive image processing, and form a link between general feed-forward neural networks and adaptive filters. Two-dimensional CNNs are formed by one or more layers of two-dimensional filters, with possible non-linear activation functions and/or down-sampling. Convolutional neural networks (CNNs) impose constraints on the weights and connectivity of the network, providing a framework well suited to the processing of spatially or temporally distributed data. CNNs possess key properties of translation invariance and spatially local connections (receptive fields). The socalled “weight-sharing” property of CNNs limits the number of free parameters. Although CNNs have been applied to face and character recognition, it is fair to say that the full potential of CNNs has not yet been realised. This chapter presents a description of the convolutional neural network architecture, and reports some of our work applying CNNs to theoretical and real-world image processing problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fukushima K, Miyake S, Ito T (1983) “Neocognitron: a neural model for a mechanism of visual pattern recognition,” IEEE Transactions on Systems, Man, and Cybernetics, 13:826-834.
Google Scholar
Fukushima K (1988) “Neocognitron: A hierachical neural network capable of visual pattern recognition,” Neural Networks, 1(2):119-130.
Article Google Scholar
Fukushima K (1979) “Neural-network model for a mechanism of pattern recog-nition unaffected by shift in position,” Trans. IECE Japan, 62-A(10):658-665.
Google Scholar
Lovell DR, Simon D, Tsoi AC (1993) “Improving the performance of the neocognitron,” In Leong P, Jabri M (Eds.) Proceedings of the Fourth Australian Conference on Neural Networks, pp. 202-205.
Google Scholar
Lovell DR, Downs T, Tsoi AC (1997) “An evaluation of the neocognitron,” IEEE Trans. on Neural Networks, 8(5):1090-1105
Article Google Scholar
Rumelhart DE, Hinton GE, Williams RJ (1986) “Learning internal represen-tation by error propagation,” In Rumelhart DE, McClelland JL (Eds.) Par-allel Distributed Processing: Explorations in the Microstructure of Cognition, 1:318-362. MIT, Cambridge, MA.
Google Scholar
Le Cun YB, Boser JS, Denker D, Henderson RE, Howard W, Hubbard W, Jackel LD (1988) “Backpropagation applied to handwritten zip code recognition,” Neural Computation, 4(1):541-551.
Google Scholar
Lang KJ, Hinton GE (1990) “Dimensionality reduction and prior knowledge in e-set recognition,” In Touretzky, DS (Ed.) Advances in Neural Information Processing Systems, 178-185. Morgan Kauffman, San Marteo, CA.
Google Scholar
Le Cun Y, Bengio Y (1995) “Convolutional networks for images, speech, and time series,” In Arbib, MA (Ed.) The Handbook of Brain Theory and Neural Networks, 255-258. MIT, Cambridge, MA.
Google Scholar
Lawrence S, Giles CL, Tsoi AC, Back AD (1997) “Face recognition: A convolutional neural network approach,” IEEE Transactions on Neural Networks 8(1):98-113.
Article Google Scholar
Fasel B (2002) “Robust face analysis using convolutional neural networks,” In Proceedings of the International Conference on Pattern Recognition (ICPR 2002), Quebec, Canada.
Google Scholar
Sackinger E, Boser B, Bromley J, LeCun Y (1992) “Application of the ANNA neural network chip to high-speed character recognition,” IEEE Transactions on Neural Networks, 3:498-505.
Article Google Scholar
Le Cun Y (1989) “Generalization and network design strategies,” Tech. Rep. CRG-TR-89-4, Department of Computer Science, University of Toronto.
Google Scholar
Bengio Y, Le Cun Y, Henderson D (1994) “Globally trained handwritten word recognizer using spatial representation, convolutional neural networks, and Hidden Markov Models,” In Cowan JD, Tesauro G, Alspector J (Eds.) Advances in Neural Information Processing Systems, 6:937-944. Morgan Kaufmann, San Marteo, CA.
Google Scholar
Fasel B (2002) “Facial expression analysis using shape and motion information extracted by convolutional neural networks,” In Proc. of the International IEEE Workshop on Neural Networks for Signal Processing (NNSP 2002), Martigny, Switzerland.
Google Scholar
Kirchner F and Hertzberg J (1997) “A prototype study of an autonomous robot platform for sewerage system maintenance,” Autonomous Robots, 4(4):319-331.
Article Google Scholar
Browne M, Shiry S, Dorn M, Ouellette R (2002) “Visual feature extraction via PCA-based parameterization of wavelet density functions,” In International Symposium on Robots and Automation, pp. 398-402, Toluca, Mexico.
Google Scholar
Browne M, Shiry Ghidary S (2003) “Convolutional neural networks for image processing: an application in robot vision,” Lecture Notes in Computer Science, Springer, Berlin Heidelberg New York 2903:641-652.
Google Scholar
Shiry Ghidary S, Browne M (2003) “Convolutional neural networks for robot vision: numerical studies and implementation on a sewer robot,” In Proceedings of the 8th Australian and New Zealand Intelligent Information Systems Conference, 653-665, Sydney, Australia.
Google Scholar
Cooper TPD, Taylor N (1998) “Towards the recovery of extrinsic camera param-eters from video records of sewer surveys,” Machine Vision and Applications, 11:53-63.
Article Google Scholar
del Solar JR, K-Pen, R (1996) “Sewer pipe image segmentation using a neural based architecture,” Pattern Recognition Letters 17:363-368.
Article Google Scholar
Hertzberg J, Kirchner F (1996) “Landmark-based autonomous navigation in sewerage pipes,” In Proceedings of First Euromicro Workshop on Advanced Mobile Robots (EUROBOT ‘96):68-73. Kaiserslautern, IEEE Press.
Google Scholar
Paletta ERL, Pinz A (1999) “Visual object detection for autonomous sewer robots,” In Proceedings of 1999 IEEE/RSJ International Conference on Intel-ligent Robots and Systems (IROS ’99), 2:1087-1093. Piscataway NJ, IEEE Press.
Google Scholar
Simard PY, Steinkraus D, Platt JC (2003) “Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis,” In Proceedigs of the Seventh International Conference on Document Analysis and Recognition (ICDAR ’03):958-963. Washington, DC, USA.
Google Scholar
Hubel DH, Wiesel TN (1959) “Receptive fields of single neurons in the cat’s striate cortex,” Journal of Physiology, 148:574-591.
Google Scholar

Download references

Author information

Authors and Affiliations

CSIRO Mathematical and Information Sciences, Cleveland, Australia
Matthew Browne
Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran
Saeed Shiry Ghidary
Department of Adaptive Machine Systems, Osaka University, Osaka, Japan
Norbert Michael Mayer

Authors

Matthew Browne
View author publications
You can also search for this author in PubMed Google Scholar
Saeed Shiry Ghidary
View author publications
You can also search for this author in PubMed Google Scholar
Norbert Michael Mayer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer and Information Sciences, Florida A&M University, Tallahassee, FL 32307, USA
Bhanu Prasad
Department of Electronics and Communication Engineering, Indian Institute of Technology Guwahati, Guwahati, India
S. R. Mahadeva Prasanna

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Browne, M., Ghidary, S.S., Mayer, N.M. (2008). Convolutional Neural Networks for Image Processing with Applications in Mobile Robotics. In: Prasad, B., Prasanna, S.R.M. (eds) Speech, Audio, Image and Biomedical Signal Processing using Neural Networks. Studies in Computational Intelligence, vol 83. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75398-8_15

Download citation

DOI: https://doi.org/10.1007/978-3-540-75398-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75397-1
Online ISBN: 978-3-540-75398-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics