Skip to main content

A Theory of Fault Recovery for Component-Based Models

  • Conference paper
Stabilization, Safety, and Security of Distributed Systems (SSS 2012)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7596))

Included in the following conference series:

Abstract

This paper introduces a theory of fault recovery for component-based models. We specify a model in terms of a set of atomic components incrementally composed and synchronized by a set of glue operators. We define what it means for such models to provide a recovery mechanism, so that the model converges to its normal behavior in the presence of faults (e.g., in self-stabilizing systems). We identify corrector components whose presence in a model is essential to guarantee recovery after the occurrence of faults. We also formalize component-based models that effectively separate recovery from functional concerns. We also show that any model that provides fault recovery can be transformed into an equivalent model, where functional and recovery tasks are modularized in different components.

This work is partially sponsored by Canada NSERC Discovery Grant 418396-2012.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abdellatif, T., Combaz, J., Sifakis, J.: Model-based implementation of real-time applications. In: ACM International Conference on Embedded Software (EMSOFT), pp. 229–238 (2010)

    Google Scholar 

  2. Arora, A., Gouda, M.G.: Closure and convergence: A foundation of fault-tolerant computing. IEEE Transactions on Software Engineering 19(11), 1015–1027 (1993)

    Article  Google Scholar 

  3. Arora, A., Kulkarni, S.S.: Detectors and correctors: A theory of fault-tolerance components. In: International Conference on Distributed Computing Systems (ICDCS), pp. 436–443 (1998)

    Google Scholar 

  4. Basu, A., Bonakdarpour, B., Bozga, M., Sifakis, J.: Systematic Correct Construction of Self-stabilizing Systems: A Case Study. In: Dolev, S., Cobb, J., Fischer, M., Yung, M. (eds.) SSS 2010. LNCS, vol. 6366, pp. 4–18. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  5. Bonakdarpour, B., Bozga, M., Jaber, M., Quilbeuf, J., Sifakis, J.: A framework for automated distributed implementation of component-based models. Springer Journal on Distributed Computing, DC (to appear, 2012)

    Google Scholar 

  6. Bonakdarpour, B., Kulkarni, S.S.: Compositional verification of real-time fault-tolerant programs. In: ACM International Conference on Embedded Software (EMSOFT), pp. 29–38 (2009)

    Google Scholar 

  7. Bonakdarpour, B., Kulkarni, S.S., Arora, A.: Disassembling real-time fault-tolerant programs. In: ACM International Conference on Embedded Software (EMSOFT), pp. 169–178 (2008)

    Google Scholar 

  8. Bozga, M., Sfyrla, V., Sifakis, J.: Modeling synchronous systems in BIP. In: ACM International Conference on Embedded Software (EMSOFT), pp. 77–86 (2009)

    Google Scholar 

  9. Brukman, O., Dolev, S.: Recovery oriented programming: runtime monitoring of safety and liveness. Springer Journal on Software Tools for Technology Transfer (STTT) 13(4), 377–395 (2011)

    Google Scholar 

  10. Elmqvist, J., Nadjm-Tehrani, S., Minea, M.: Safety Interfaces for Component-Based Systems. In: Winther, R., Gran, B.A., Dahll, G. (eds.) SAFECOMP 2005. LNCS, vol. 3688, pp. 246–260. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  11. Gärtner, F.C.: Transformational approaches to the specification and verification of fault-tolerant systems: Formal background and classification. Journal of Universal Computer Science 5(10), 668–692 (1999)

    MATH  Google Scholar 

  12. Gössler, G., Sifakis, J.: Composition for component-based modeling. Sci. Comput. Program. 55(1-3), 161–183 (2005)

    Article  MATH  Google Scholar 

  13. Lamport, L.: The temporal logic of actions. ACM Transactions on Programming Languages and Systems (TOPLAS) 16, 872–923 (1994)

    Article  Google Scholar 

  14. Leal, W., Arora, A.: Scalable self-stabilization via composition. In: Distributed Computing Systems (ICDCS), pp. 12–21 (2004)

    Google Scholar 

  15. Liu, Z., Joseph, M.: Transformation of programs for fault-tolerance. Formal Aspects of Computing 4(5), 442–469 (1992)

    Article  MATH  Google Scholar 

  16. Liu, Z., Joseph, M.: Specification and verification of fault-tolerance, timing, and scheduling. ACM Transactions on Programming Languages and Systems (TOPLAS) 21(1), 46–89 (1999)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bonakdarpour, B., Bozga, M., Gössler, G. (2012). A Theory of Fault Recovery for Component-Based Models. In: Richa, A.W., Scheideler, C. (eds) Stabilization, Safety, and Security of Distributed Systems. SSS 2012. Lecture Notes in Computer Science, vol 7596. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33536-5_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33536-5_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33535-8

  • Online ISBN: 978-3-642-33536-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics