Fast Gradient Based Off-Line Training of Multilayer Perceptrons

  • Chapter
Neural Network Engineering in Dynamic Control Systems

Part of the book series: Advances in Industrial Control (AIC)

Abstract

Fast off-line training of Multilayer Perceptrons (MLPs) using gradient-based algorithms is discussed. Simple Back Propagation and Batch Back Propagation (BBP) follow from viewing training as an unconstrained optimization problem. The inefficiencies of these methods are demonstrated on a number of test problems and used to justify the investigation of more powerful, second-order optimization techniques such as Conjugate Gradient (CG), Full Memory BFGS (FM) and Limited Memory BFGS (LM). Training with these techniques is at least an order of magnitude faster than with standard BBP, and the FM algorithm proves vastly superior to the others, giving speed-ups of between 100 and 1000 depending on the size of the problem and the convergence criterion used.
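
To make the optimization view concrete, the following is a minimal sketch rather than the chapter's implementation. It assumes a toy 1-3-1 tanh network, a hypothetical sine-fitting data set, a sum-of-squares cost, and a textbook full-memory BFGS update with a backtracking line search. All names (`cost_and_grad`, `batch_gd`, `bfgs`, `DATA`, `H`) are illustrative. The weights are flattened into one vector `w`, so batch back propagation is plain gradient descent on E(w), while BFGS reuses the same gradient to build an inverse-Hessian estimate.

```python
import math

H = 3  # number of hidden tanh units (assumed for this toy example)
# Hypothetical training set: samples of sin(x) on [-2, 2]
DATA = [(k / 4.0, math.sin(k / 4.0)) for k in range(-8, 9)]

def cost_and_grad(w):
    """Batch sum-of-squares cost E(w) and its gradient via back propagation.
    Layout of w: input weights (H), hidden biases (H), output weights (H), output bias."""
    a, b, c, d = w[0:H], w[H:2 * H], w[2 * H:3 * H], w[3 * H]
    E, g = 0.0, [0.0] * len(w)
    for x, t in DATA:
        h = [math.tanh(a[j] * x + b[j]) for j in range(H)]
        e = sum(c[j] * h[j] for j in range(H)) + d - t  # output error
        E += 0.5 * e * e
        for j in range(H):
            dh = e * c[j] * (1.0 - h[j] * h[j])  # error propagated back to hidden unit j
            g[j] += dh * x
            g[H + j] += dh
            g[2 * H + j] += e * h[j]
        g[3 * H] += e
    return E, g

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def batch_gd(w, lr=0.005, iters=200):
    """Batch back propagation: fixed-step gradient descent on E(w)."""
    for _ in range(iters):
        _, g = cost_and_grad(w)
        w = [wi - lr * gi for wi, gi in zip(w, g)]
    return w

def bfgs(w, iters=200):
    """Full-memory BFGS: keeps a dense estimate Hinv of the inverse Hessian."""
    n = len(w)
    Hinv = identity(n)
    E, g = cost_and_grad(w)
    for _ in range(iters):
        p = [-dot(Hinv[i], g) for i in range(n)]  # quasi-Newton search direction
        slope = dot(g, p)
        if slope >= 0.0:  # not a descent direction: fall back to steepest descent
            Hinv = identity(n)
            p = [-gi for gi in g]
            slope = -dot(g, g)
        step = 1.0
        while True:  # backtracking line search with the Armijo condition
            w_new = [wi + step * pi for wi, pi in zip(w, p)]
            E_new, g_new = cost_and_grad(w_new)
            if E_new <= E + 1e-4 * step * slope or step < 1e-12:
                break
            step *= 0.5
        if E_new > E:  # line search failed: keep the current weights
            break
        s = [step * pi for pi in p]
        y = [gn - go for gn, go in zip(g_new, g)]
        sy = dot(s, y)
        if sy > 1e-12:  # curvature condition keeps Hinv positive definite
            rho = 1.0 / sy
            Hy = [dot(Hinv[i], y) for i in range(n)]
            k = rho * rho * dot(y, Hy) + rho
            for i in range(n):
                for j in range(n):
                    Hinv[i][j] += k * s[i] * s[j] - rho * (s[i] * Hy[j] + Hy[i] * s[j])
        w, E, g = w_new, E_new, g_new
    return w

if __name__ == "__main__":
    w0 = [0.1 * (i + 1) * (-1) ** i for i in range(3 * H + 1)]
    print("initial cost:           ", cost_and_grad(w0)[0])
    print("batch gradient descent: ", cost_and_grad(batch_gd(w0))[0])
    print("full-memory BFGS:       ", cost_and_grad(bfgs(w0))[0])
```

The dense `Hinv` matrix needs storage quadratic in the number of weights, which is the usual motivation for limited-memory variants on larger networks. The learning rate and iteration counts here are arbitrary choices for the toy problem.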

Possibilities of parallelisation are investigated for both FM and LM based training. Parallel versions of these routines are proposed and shown to give significant speed-ups over the sequential versions for large problems.
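
The abstract does not say how the parallel routines are organised, so the following is only a sketch of one common decomposition, assumed here for illustration: because the batch cost and its gradient are sums over training patterns, the data can be split across workers and the partial results added. The 1-1-1 tanh network, the function names and the use of Python threads are all hypothetical. Threads show the decomposition only and are not expected to give a real speed-up in CPython.

```python
import math
from concurrent.futures import ThreadPoolExecutor

def partial_cost_grad(w, chunk):
    """Cost and gradient of a 1-1-1 tanh MLP summed over one chunk of patterns."""
    a, b, c, d = w
    E, g = 0.0, [0.0, 0.0, 0.0, 0.0]
    for x, t in chunk:
        h = math.tanh(a * x + b)
        e = c * h + d - t               # output error for this pattern
        E += 0.5 * e * e
        dh = e * c * (1.0 - h * h)      # error propagated back to the hidden unit
        g[0] += dh * x
        g[1] += dh
        g[2] += e * h
        g[3] += e
    return E, g

def parallel_cost_grad(w, data, workers=4):
    """Split the patterns across workers, then sum the partial costs and gradients."""
    chunks = [data[i::workers] for i in range(workers)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = list(pool.map(lambda ch: partial_cost_grad(w, ch), chunks))
    E = sum(p[0] for p in parts)
    g = [sum(p[1][k] for p in parts) for k in range(len(w))]
    return E, g

if __name__ == "__main__":
    data = [(k / 10.0, math.sin(k / 10.0)) for k in range(-20, 21)]
    w = [0.5, -0.1, 0.8, 0.2]
    print(parallel_cost_grad(w, data))
```

The summed result matches the sequential gradient up to floating-point rounding, so an optimizer such as BFGS can call the parallel routine unchanged.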

Copyright information

© 1995 Springer-Verlag London Limited

About this chapter

Cite this chapter

McLoone, S., Irwin, G. (1995). Fast Gradient Based Off-Line Training of Multilayer Perceptrons. In: Hunt, K.J., Irwin, G.R., Warwick, K. (eds) Neural Network Engineering in Dynamic Control Systems. Advances in Industrial Control. Springer, London. https://doi.org/10.1007/978-1-4471-3066-6_9

  • DOI: https://doi.org/10.1007/978-1-4471-3066-6_9

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-3068-0

  • Online ISBN: 978-1-4471-3066-6

  • eBook Packages: Springer Book Archive
