A Data-Centric Approach for Scalable State Machine Replication

Chockler, Gregory; Malkhi, Dahlia; Dolev, Danny

doi:10.1007/3-540-37795-6_29

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2584)

Abstract

Data replication is a key design principle for achieving reliability, high availability, survivability, and load balancing in distributed computing systems. The common denominator of all existing replication systems is the need to keep replicas consistent. The main paradigm for supporting replicated data is active replication, in which all replicas execute the same sequence of methods on the object in order to remain consistent. This paradigm led to the definition of State Machine Replication (SMR) [8, 13]. The necessary building block of SMR is an engine that delivers operations at each site in the same total order without gaps, thus keeping the replica states consistent.
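
The idea of delivering operations in the same total order without gaps can be illustrated with a minimal sketch. This is not the protocol from the chapter; it only shows, under simplifying assumptions, why gap-free in-order application keeps replica states identical. The `Replica` class, its `deliver` method, and the key-value operations are hypothetical names introduced for illustration, and the sequence numbers are assumed to come from some ordering engine that is not modeled here.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """Hypothetical replica of a key-value object (illustration only)."""
    state: dict = field(default_factory=dict)
    next_seq: int = 0                             # next position to apply
    pending: dict = field(default_factory=dict)   # buffered out-of-order operations

    def deliver(self, seq, op):
        """Accept operation `op` assigned position `seq` in the total order."""
        if seq < self.next_seq:
            return  # duplicate of an operation that was already applied
        self.pending[seq] = op
        # Apply buffered operations strictly in order, stopping at the first gap.
        while self.next_seq in self.pending:
            key, value = self.pending.pop(self.next_seq)
            self.state[key] = value
            self.next_seq += 1


# The same ordered log reaches two replicas in different arrival orders.
log = [(0, ("x", 1)), (1, ("y", 2)), (2, ("x", 3))]
a, b = Replica(), Replica()
for seq, op in log:
    a.deliver(seq, op)
for seq, op in reversed(log):
    b.deliver(seq, op)
```

Both replicas apply the operations in sequence-number order regardless of arrival order, so they end in the same state. A replica that has not yet received position 0 applies nothing, which is the "without gaps" requirement.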

This work was supported in part by the Israeli Ministry of Science grant #1230-3-01.

References

1. R. Boichat, P. Dutta, S. Frolund and R. Guerraoui. Deconstructing Paxos. Technical Report DSC ID:200106, Communication Systems Department (DSC), École Polytechnique Fédérale de Lausanne (EPFL), January 2001. Available at http://dscwww.epfl.ch/EN/publications/documents/tr01006.pdf.
2. T. D. Chandra and S. Toueg. Unreliable failure detectors for reliable distributed systems. Journal of the ACM 43(2):225–267, March 1996.
3. G. Chockler and D. Malkhi. Active disk Paxos with infinitely many processes. In Proceedings of the 21st ACM Symposium on Principles of Distributed Computing (PODC’02), July 2002. To appear.
4. G. Chockler, D. Malkhi and M. K. Reiter. Backoff protocols for distributed mutual exclusion and ordering. In Proceedings of the 21st International Conference on Distributed Computing Systems, pages 11–20, April 2001.
5. M. J. Fischer, N. A. Lynch, and M. S. Paterson. Impossibility of distributed consensus with one faulty process. Journal of the ACM 32(2):374–382, April 1985.
6. E. Gafni and L. Lamport. Disk Paxos. In Proceedings of 14th International Symposium on Distributed Computing (DISC’2000), pages 330–344, October 2000.
7. P. Jayanti, T. Chandra, and S. Toueg. Fault-tolerant wait-free shared objects. Journal of the ACM 45(3):451–500, May 1998.
8. L. Lamport. Time, clocks, and the ordering of events in distributed systems. Communications of the ACM 21(7):558–565, July 1978.
9. L. Lamport. The Part-time parliament. ACM Transactions on Computer Systems 16(2):133–169, May 1998.
10. W. K. Lo and V. Hadzilacos. Using failure detectors to solve consensus in asynchronous shared-memory systems. In Proceedings of the 8th International Workshop on Distributed Algorithms (WDAG), Springer-Verlag LNCS 857:280–295, Berlin, 1994.
11. D. Malkhi and M. K. Reiter. An architecture for survivable coordination in large-scale systems. IEEE Transactions on Knowledge and Data Engineering 12(2):187–202, March/April 2000.
12. J. P. Martin, L. Alvisi and M. Dahlin. Minimal Byzantine Storage. In Proceedings of the 16th International Conference on DIStributed Computing (DISC’02), pages 311–325, October 2002.
13. F. B. Schneider. Implementing fault-tolerant services using the state machine approach: A tutorial. ACM Computing Surveys 22(4):299–319, December 1990.

Author information

Authors and Affiliations

School of Computer Science and Engineering, The Hebrew University of Jerusalem, 91904, Jerusalem, Israel
Gregory Chockler, Dahlia Malkhi & Danny Dolev
IBM Haifa Research Labs (Tel-Aviv Annex), Haifa
Gregory Chockler

Editor information

Editors and Affiliations

Faculté Informatique et Communication, École Polytechnique Fédérale de Lausanne, IN-Ecublens, 1015, Lausanne, Switzerland
André Schiper
Computer Science and Engineering, University of Connecticut, Unit 3155, 06269, Storrs, CT, USA
Alex A. Shvartsman
Computer Science Division, University of California at Berkeley, 447/443 Soda Hall, 94704-1776, Berkeley, CA, USA
Hakim Weatherspoon & Ben Y. Zhao

About this chapter

Cite this chapter

Chockler, G., Malkhi, D., Dolev, D. (2003). A Data-Centric Approach for Scalable State Machine Replication. In: Schiper, A., Shvartsman, A.A., Weatherspoon, H., Zhao, B.Y. (eds) Future Directions in Distributed Computing. Lecture Notes in Computer Science, vol 2584. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-37795-6_29

DOI: https://doi.org/10.1007/3-540-37795-6_29
Published: 14 March 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00912-2
Online ISBN: 978-3-540-37795-5
eBook Packages: Springer Book Archive
