Fast Gradient Based Off-Line Training of Multilayer Perceptrons

McLoone, Seán; Irwin, George

doi:10.1007/978-1-4471-3066-6_9

Seán McLoone⁵ &
George Irwin⁵

Part of the book series: Advances in Industrial Control ((AIC))

261 Accesses
6 Citations

Abstract

Fast off-line training of Multilayer Perceptrons (MLPs) using gradient based algorithms is discussed. Simple Back Propagation and Batch Back Propagation, follow by viewing training as an unconstrained optimization problem. The inefficiencies of these methods are demonstrated with the aid of a number of test problems and used to justify the investigation of more powerful, second-order optimization techniques such as Conjugate Gradient (CG), Full Memory BFGS (FM) and Limited Memory BFGS (LM). Training is then at least an order of magnitude faster than with standard BBP, with the FM algorithm proving to be vastly superior to the others giving speed-ups of between 100 and 1000, depending on the size of the problem and the convergence criterion used.

Possibilities of parallelisation are investigated for both FM and LM based training. Parallel versions of these routines are proposed and shown to give significant speed-ups over the sequential versions for large problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

G. Lightbody, “Identification and Control Using Neural Networks”, PhD thesis, Queen’s University of Belfast, Control Engineering Research Group, May 1993.
Google Scholar
J.D. Morningred et al., “An Adaptive Nonlinear Predictive Controller”, Proc. ACC 90, Vol.2, pp. 1614–1619, May 1990.
Google Scholar
D.E. Rumelhart, G. Hinton and R. Williams, “Learning internal representations by error propagation”, in D.E Rumelhart, J.L. McClelland, (editors), Parallel Distributed Processing, Vol.1 pp 318–364 MIT Press, 1986
Google Scholar
J.J McKeown, D. Meegan and D. Sprevak, “An Introduction to Unconstrained Optimization”, Adam Hilger, Bristol, 1990.
MATH Google Scholar
P.E. Gill, W. Murray and M.H. Wrights, “Practical Optimization”, Academic Press, London.
Google Scholar
R. Fletcher, “Practical Methods of Optimization”, Vol. 1, Wiley & Sons, pp.51.
Google Scholar
S. McLoone, G.W. Irwin, “Insights into multilayer perceptrons and their training”, Proc. Irish DSP and Control Colloquium, 1994, pp.61–68.
Google Scholar
G. Lightbody, G.W. Irwin, “A parallel Algorithm for Training Neural Network Based Nonlinear Models”, Proc. 2nd IFAC Workshop on Algorithms and Architectures for Realtime Control, 1992, pp. 99–104.
Google Scholar
A. Beguelin, J.J. Dongarra, G.A. Geist, W. Jiang, R. Manchek, K. Moore and V.S. Sunderam, “PVM 3 User’s Guide and Reference Manual”, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, 1993.
Google Scholar
G. Lightbody, G.W. Irwin, A. Taylor, K. Kelly and J. McCormick, “Neural Network Modelling of a Polymerisation Reactor”, Proc. IEE Int. Conf., Control ‘94, Vol.1, pp. 237–242.
Google Scholar
M.D. Brown, G.W. Irwin, B.W. Hogg and E. Swidenbank, “Modelling and Control of Generating Units using Neural Network Techniques”, 3rd IEEE Control Applications Conference, Glasgow, August 1994, Vol.1, pp. 735–740.
Google Scholar

Download references

Author information

Authors and Affiliations

Control Engineering Research Group Department of Electrical and Electronic Engineering, The Queen’s University of Belfast, Belfast, BT9 5AH, UK
Seán McLoone & George Irwin

Authors

Seán McLoone
View author publications
You can also search for this author in PubMed Google Scholar
George Irwin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Daimler-Benz AG, Alt Moabit 96 a, D-10559, Berlin, Germany
Kenneth J. Hunt
Department of Electrical and Electronic Engineering, Queen’s University of Belfast, Belfast, BT9 5AH, UK
George R. Irwin
Department of Cybernetics, School of Engineering and Information Sciences, University of Reading, P.O. Box 225, Whiteknights, Reading, RG6 2AY, UK
Kevin Warwick

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

McLoone, S., Irwin, G. (1995). Fast Gradient Based Off-Line Training of Multilayer Perceptrons. In: Hunt, K.J., Irwin, G.R., Warwick, K. (eds) Neural Network Engineering in Dynamic Control Systems. Advances in Industrial Control. Springer, London. https://doi.org/10.1007/978-1-4471-3066-6_9

Download citation

DOI: https://doi.org/10.1007/978-1-4471-3066-6_9
Publisher Name: Springer, London
Print ISBN: 978-1-4471-3068-0
Online ISBN: 978-1-4471-3066-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics