Skip to main content

Consensus in Asynchronous Distributed Systems: A Concise Guided Tour

  • Chapter
  • First Online:
Advances in Distributed Systems

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1752))

Abstract

It is now recognized that the Consensus problem is a fundamental problem when one has to design and implement reliable asynchronous distributed systems. This chapter is on the Consensus problem. It studies Consensus in two failure models, namely, the Crash/no Recovery model and the Crash/Recovery model. The assumptions related to the detection of failures that are required to solve Consensus in a given model are particularly emphasized.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aguilera M.K., Chen W. and Toueg S., Failure Detection and Consensus in the Crash-Recovery Model. In Proc. 11th Int. Symposium on Distributed Computing (DISC’98, formerly WDAG), Springer-Verlag, LNCS 1499, pp. 231–245, Andros, Greece, September 1998.

    Google Scholar 

  2. Bollo R., Le Narzul J.-P., Raynal M. and Tronel F., Probabilistic Analysis of a Group Failure Detection Protocol. Proc. 4th Workshop on Object-oriented Real-time Distributed Systems (WORDS’99), Santa-Barbara, January 1999.

    Google Scholar 

  3. Chandra T. and Toueg S., Unreliable Failure Detectors for Reliable Distributed Systems. Journal of the ACM, 43(1):225–267, March 1996 (A preliminary version appeared in Proc. of the 10th ACM Symposium on Principles of Distributed Computing, pp. 325–340, 1991).

    Article  MATH  MathSciNet  Google Scholar 

  4. Chandra T., Hadzilacos V. and Toueg S., The Weakest Failure Detector for Solving Consensus. Journal of the ACM, 43(4):685–722, July 1996 (A preliminary version appeared in Proc. of the 11th ACM Symposium on Principles of Distributed Computing, pp. 147–158, 1992).

    Article  MATH  MathSciNet  Google Scholar 

  5. Défago X, Schiper A., Sergent N., Semi-Passive Replication. Proc. 17th IEEE Symp. on Reliable Distributed Systems, West Lafayette, Indiana, USA, October 1997, pp. 43–50.

    Google Scholar 

  6. Dolev D., Dwork C. and Stockmeyer L., On the Minimal Synchronism Needed for Distributed Consensus. Journal of the ACM, 34(1):77–97, January 1987.

    Article  MATH  MathSciNet  Google Scholar 

  7. Dolev D., Friedman R., Keidar I. and Malkhi D., Failure Detectors in Omission Failure Environments. Technical Report 96-1608, Department of Computer Science, Cornell University, Ithaca, NY, September 1996.

    Google Scholar 

  8. Dwork C., Lynch N. and Stockmeyer L., Consensus in the Presence of Partial Synchrony. Journal of the ACM, 35(2):288–323, April 1988.

    Article  MathSciNet  Google Scholar 

  9. Fischer M.J., Lynch N. and Paterson M.S., Impossibility of Distributed Consensus with One Faulty Process. Journal of the ACM, 32(2):374–382, April 1985.

    Article  MATH  MathSciNet  Google Scholar 

  10. Fritzke U., Ingels Ph., Mostefaoui A. and Raynal M., Fault-Tolerant Total Order Multicast to Asynchronous Groups. Proc. 17th IEEE Symposium on Reliable Distributed Systems, Purdue University (IN), pp.228–234, October 1998.

    Google Scholar 

  11. Fromentin E., Raynal M. and Tronel F., On Classes of Problems in Asynchronous Distributed Systems with Process Crashes. Proc. 19th IEEE Int. Conf. on Distibuted Computing Systems (ICDCS-19), Austin, TX, pp. 470–477, June 1999.

    Google Scholar 

  12. Gray J.N. and Reuter A., Transaction Processing: Concepts and Techniques, Morgan Kaufmann, 1070 pages, 1993.

    Google Scholar 

  13. Guerraoui R., Revisiting the Relationship between Non-Blocking Atomic Commitment and Consensus. Proc. 9th Int. Workshop on Distributed Algorithms (WDAG95), Springer-Verlag LNCS 972 (J.M. Hélary and M. Raynal Eds), Sept. 1995, pp. 87–100.

    Google Scholar 

  14. Guerraoui R., Larrea M. and Schiper A., Reducing the Cost for Non-Blocking in Atomic Commitment. Proc. IEEE 16th Intl. Conf. Distributed Computing Systems, Hong-Kong, May 1996, pp. 692–697.

    Google Scholar 

  15. Guerraoui R., Oliveira R. and Schiper A., Stubborn Communication Channels. Research Report, Département d’informatique, EPFL, Lausanne, Switzerland, July 1997.

    Google Scholar 

  16. Guerraoui R., Raynal M. and Schiper A., Atomic Commit And Consensus: a Unifield View. (In French) Technique et Science Informatiques, 17(3):279–298, 1998.

    Google Scholar 

  17. Guerraoui R. and Schiper A., The Decentralized Non-Blocking Atomic Commitment Protocol. Proc. of the 7th IEEE Symposium on Parallel and Distributed Systems, San Antonio, TX, 1995, pp. 2–9.

    Google Scholar 

  18. Guerraoui R. and Schiper A., Total Order Multicast to Multiple Groups. Proc. 17th IEEE Int. Conf. on Distributed Computing Systems (ICDCS-17), Baltimore, MD, 1997, pp. 578–585.

    Google Scholar 

  19. Guerraoui R. and Schiper A., Consensus: the Big Misunderstanding. Proc of the Sixth IEEE Workshop on Future Trends of Distributed Computing Systems, Tunis, 1997, pp. 183–186.

    Google Scholar 

  20. Guerraoui R. and Schiper A., The Generic Consensus Service. Research Report 98-282, EPFL, Lausanne, Suisse, 1998. A previous version appeared in Proc. IEEE 26th Int Symp on Fault-Tolerant Computing (FTCS-26), June 1996, pp. 168–177.

    Google Scholar 

  21. Hadzilacos V. and Toueg S., Reliable Broadcast and Related Problems. In Distributed Systems (Second Edition), acm Press (S. Mullender Ed.), New-York, 1993, pp. 97–145.

    Google Scholar 

  22. Hurfin M. and Raynal M., A Simple and Fast Asynchronous Consensus Protocol Based on a Weak Failure Detector. Distributed Computing, 12(4):209–223, 1999.

    Article  Google Scholar 

  23. Hurfin M., Mostefaoui A. and Raynal M., Consensus in Asynchronous Systems Where Processes Can Crash and Recover. Proc. 17th IEEE Symposium on Reliable Distributed Systems, Purdue University (IN), pp. 280–286, October 1998.

    Google Scholar 

  24. Hurfin M., Macedo R., Raynal M. and Tronel F., A General Framework to Solve Agreement Problems. Proc. 18th IEEE Symposium on Reliable Distributed Systems, Lausanne, October 1999.

    Google Scholar 

  25. Mostefaoui A., Raynal M. and Takizawa M., Consistent Lamport’s Clocks for Asynchronous Groups with Process Crashes. Proc. 5th Int. Conference on Parallel Computing Technologies (PACT’99), St-Petersburg, Springer Verlag LNCS 1662, pp. 98–107, 1999.

    Google Scholar 

  26. Mostefaoui A. and Raynal M., Solving Consensus Using Chandra-Toueg’s Unreliable Failure Detectors: a General Quorum-Based Approach. Proc. 13th Int. Symposium on Distributed Computing (DICS’99, formerly WDAG), Springer-Verlag LNCS 1693, pp. 49–63, Bratislava (Slovakia), September 1999.

    Google Scholar 

  27. Oliveira R., Guerraoui R. and Schiper A., Consensus in the Crash/Recovery Model. Research Report 97–239, EPFL, Lausanne, Suisse, 1997.

    Google Scholar 

  28. Oliveira R., Solving Asynchronous Consensus with the Crash and Recovery of Processes. PhD Thesis, EPFL Département d’Informatique, 1999 (to appear).

    Google Scholar 

  29. Pedone F. and Schiper A., Generic Broadcast. Proc. 13th Int. Symposium on Distributed Computing (DICS’99, formerly WDAG), Springer-Verlag LNCS 1693, pp. 94–108, Bratislava (Slovakia), September 1999.

    Google Scholar 

  30. Raynal M., Consensus-Based Management of Distributed and Replicated Data. IEEE Bulletin of the TC on Data Engineering, 21(4):31–37, December 1998.

    Google Scholar 

  31. Raynal M., Non-Blocking Atomic Commitment in Distributed Systems: A Tutorial Based on a Generic Protocol. Journal of Computer Systems Science and Engineering, Vol.14, 1999.

    Google Scholar 

  32. Schiper A., Early Consensus in an Asynchronous System with a Weak Failure Detector. Distributed Computing, 10:149–157, 1997.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Guerraoui, R., Hurfinn, M., Mostefaoui, A., Oliveira, R., Raynal, M., Schiper, A. (2000). Consensus in Asynchronous Distributed Systems: A Concise Guided Tour. In: Krakowiak, S., Shrivastava, S. (eds) Advances in Distributed Systems. Lecture Notes in Computer Science, vol 1752. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46475-1_2

Download citation

  • DOI: https://doi.org/10.1007/3-540-46475-1_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67196-1

  • Online ISBN: 978-3-540-46475-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics