Skip to main content

Managing Large Computer Systems with Change Objects

  • Conference paper
Sustained Simulation Performance 2015
  • 413 Accesses

Abstract

The management of large, complex computer systems is unimaginable without automation tools which help setting up and keeping the system in the desired state. We discuss the design and implementation of a distributed automated management system able to manage the complex state of large Linux installations. In the context of autonomic computing our approach belongs into the class of self-configuring systems. Popular self-configuration systems like CFEngine, Puppet, Chef, etc. focus on describing the configuration of each of the servers in a large computer system by a set of high-level policies called promises, manifests or recipes, and implement them in some mostly well-defined order. We go beyond the server focused approach of these tools and implement mechanisms for handling complex dependencies between managed components that cross server boundaries. The targeted system configuration and system state is described by a global, distributed graph of change objects (COBs) and their dependencies. Autonomous cobd agents running on each server are cooperating and driving the system towards the desired distributed state. The system can deal with reverting changes and correctly implementing a new generation of COBs where managed components are added or removed from the system, without needing to reinstall the servers. We apply the automated system to the configuration and management of a cluster of highly available file and storage servers delivering the Lustre parallel file system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kephart, J.O., Chess, D.M.: The vision of autonomic computing. Computer 36(1), 41–50 (2003)

    Article  MathSciNet  Google Scholar 

  2. CFEngine configuration management and automation framework. http://cfengine.com/product/ Cited 1 July 2015

  3. Burgess, M.: An approach to understanding policy based on autonomy and voluntary cooperation. In: Schönwälder, J., Serrat, J. (eds.) Ambient Networks. Lecture Notes in Computer Science, vol. 3775, pp. 97–108. Springer, Berlin/Heidelberg (2005)

    Google Scholar 

  4. Wikipedia: Promise theory. http://en.wikipedia.org/wiki/Promise_theory Cited 1 July 2015

  5. OSCAR: Open Source Cluster Application Resources. http://oscar.openclustergroup.org Cited 1 July 2015

  6. The Warewulf Project. http://warewulf.lbl.gov/trac Cited 1 July 2015

  7. SystemImager. http://www.systemimager.org Cited 1 July 2015

  8. The Puppet Configuration Management System. https://puppetlabs.com/ Cited 1 July 2015

  9. Ansible open-source software platform for configuring and managing computers. http://www.ansible.com/home Cited 1 July 2015

  10. Chef: Automation for Web-Scale IT. http://www.getchef.com/ Cited 1 July 2015

  11. SaltStack open source configuration management and remote execution application. http://saltstack.com/ Cited 1 July 2015

  12. Pacemaker: A scalable High Availability cluster resource manager. http://clusterlabs.org/ Cited 1 July 2015

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Erich Focht .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Focht, E., Mangold, O. (2015). Managing Large Computer Systems with Change Objects. In: Resch, M., Bez, W., Focht, E., Kobayashi, H., Qi, J., Roller, S. (eds) Sustained Simulation Performance 2015. Springer, Cham. https://doi.org/10.1007/978-3-319-20340-9_2

Download citation

Publish with us

Policies and ethics