Skip to main content

Experiences Gained from Building a Services-Based Distributed Operating System

  • Conference paper
Algorithms and Architectures for Parallel Processing (ICA3PP 2010)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6082))

  • 699 Accesses

Abstract

The goal of this paper is to present the experiences gained over 15 years of research into the design and development of a services-based distributed operating system. The lessons learnt over this period, we hope, will be of value to researchers involved in the design and development of operating systems that wish to harness the collective resources of ever-expanding distributed systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cheriton, D.R.: The V Kernel: A Software Base for Distributed Systems. IEEE Software 1, 19–42 (1984)

    Article  Google Scholar 

  2. Tanenbaum, A.S., Van Renesse, R.: Distributed operating systems. ACM Computer Surveys 17, 419–470 (1985)

    Article  Google Scholar 

  3. Goscinski, A.: Distributed operating systems, the logical design. Addison-Wesley, Reading (1991)

    Google Scholar 

  4. De Paoli, D., Goscinski, A., Hobbs, M., Wickham, G.: The RHODOS Microkernel, Kernel Servers and Their Cooperation. In: IEEE 1st Intl. Conf. on Algorithms and Architectures for Parallel Processing, vol. 1, pp. 345–354. IEEE, Brisbane (1995)

    Chapter  Google Scholar 

  5. Tanenbaum, A.S.: LINUX is obsolete. comp.os.minix, Google Groups (1992), http://groups.google.com/group/comp.os.minix/browse_thread/thread/c25870d7a41696d2/ (last accessed 20/1/2010)

  6. Ritchie, D.M., Thompson, K.: The UNIX time-sharing system. Communications of the ACM 17, 10 (1974)

    Article  Google Scholar 

  7. Torvalds, L.B.: What would you like to see most in minix? comp.os.minix, Google Groups (1991), http://groups.google.com/group/comp.os.minix/msg/b813d52cbc5a044b (last accessed 20/1/2010)

  8. Golub, D.B., Julin, D.P., Rashid, R.F., Draves, R.P., Dean, R.W., Forin, A., Barrera, J., Tokuda, H., Malan, G., Bohman, D.: Microkernel operating system architecture and mach. In: Proc. USENIX Workshop on Micro-Kernels and Other Kernel Architectures, pp. 11–30 (1992)

    Google Scholar 

  9. Hildebrand, D.: An Architectural Overview of QNX. In: Proceedings of the Workshop on Micro-kernels and Other Kernel Architectures, pp. 113–126. USENIX Assoc., CA (1992)

    Google Scholar 

  10. Liedtke, J.: On μ-Kernel Construction. In: Proc. 15th ACM symposium on Operating Systems Principles (SOSP), pp. 237–250 (1995)

    Google Scholar 

  11. Engler, D.R., Kaashoek, M.F., O’Toole, J.: Exokernel: an operating system architecture for application-level resource management. In: Fifteenth ACM Symposium on Operating Systems Principles, pp. 251–266. ACM, New York (1995)

    Chapter  Google Scholar 

  12. Ousterhout, J.K., Cherenson, A.R., Douglis, F., Nelson, M.N., Welch, B.B.: The Sprite Network Operating System. Computer 21, 23–36 (1988)

    Article  Google Scholar 

  13. Presotto, D., Pike, R., Thompson, K., Trickey, H.: Plan 9, A Distributed System. In: Proceedings of the Spring 1991 EurOpen Conference (1991)

    Google Scholar 

  14. Mullender, S.J., Rossum, G.v., Tanenbaum, A.S., Renesse, R.v., Staveren, H.v.: Amoeba: A distributed operating system for the 1990s. IEEE Computer 23, 44–53 (1990)

    Google Scholar 

  15. Kon, F., Campbell, R., Mickunas, M.D., Nahrstedt, K., Ballesteros, F.J.: 2K: A Distributed Operating System for Dynamic Heterogeneous Environments. In: 9th IEEE Intl. Symposium on High Performance Distributed Computing, pp. 201–209. IEEE, Pittsburgh (2000)

    Google Scholar 

  16. Bershad, B.N., Savage, S., Pardyak, P., Sirer, E.G., Fiuczynski, M.E., Becker, D., Chambers, C., Eggers, S.: Extensibility safety and performance in the SPIN operating system. In: Proc. of the 15th ACM symposium on Operating systems principles, pp. 267–283. ACM, Copper Mountain (1995)

    Google Scholar 

  17. Barak, A., La’adan, O.: The MOSIX Multicomputer Operating System for High Performance Cluster Computing. J. of Future Generation Comp. Systems 13, 361–372 (1998)

    Article  Google Scholar 

  18. Morin, C., Lottiaux, R., Valle, G., Gallard, P., Margery, D., Berthou, J.-Y., Scherson, I.D.: Kerrighed and data parallelism: cluster computing on single system image operating systems. In: Sixth IEEE Intl. Conf. on Cluster Computing. IEEE, San Diego (2004)

    Google Scholar 

  19. Goscinski, A.: A single system image operating system for next generation application software. In: Glowacz, P.Z. (ed.) International Conference on Modern Directions in Electrotechnics, Automatics, Computer Sciece, Electronics and Telecommunication, pp. 147–152. University of Mining and Metallurgy, Crakov (2002)

    Google Scholar 

  20. Horn, P.: Autonomic computing: Ibm’s prospective on the state of information technology. IBM Corp. (2001), http://www.research.ibm.com/autonomic/ (last accessed 20/1/2010)

  21. Goscinski, A., Silcock, J., Hobbs, M.: Building Autonomic Clusters: A Response to IBM’s Autonomic Computing Challenge. In: Wyrzykowski, R., Dongarra, J., Paprzycki, M., Waśniewski, J. (eds.) PPAM 2004. LNCS, vol. 3019, pp. 27–35. Springer, Heidelberg (2004)

    Google Scholar 

  22. Goscinski, A., Hobbs, M., Silcock, J.: The Genesis Cluster Operating System Supporting Parallel Processing. In: High Performance Computing Systems and Applications, pp. 301–313. Kluwer Academic Publishers, The Netherlands (2002)

    Google Scholar 

  23. Goscinski, A., Jeffers, P., Silcock, J.: Data Collection for Global Scheduling in the GENESIS System. In: International Symposium on Parallel Architectures, Algorithms and Networks I-SPAN’02, pp. 193–198. IEEE Computer Society, Makati City (2002)

    Chapter  Google Scholar 

  24. Dines, E., Goscinski, A.: Toward self discovery for an autonomic cluster. In: Hobbs, M., Goscinski, A.M., Zhou, W. (eds.) ICA3PP 2005. LNCS, vol. 3719, pp. 125–131. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  25. De Paoli, D., Goscinski, A.: The Rhodos Migration Facility. Journal of Systems and Software 40, 51–65 (1998)

    Article  Google Scholar 

  26. Rough, J., Goscinski, A.: Exploiting Operating System Services to Efficiently Checkpoint Parallel Applications in GENESIS. In: Wanlei Zhou, X.-b.C., Goscinski, A., Li, G.-j. (eds.) The Fifth International Conference on Algorithms and Architectures for Parallel Processing, pp. 261–268. The IEEE Computer Society, Beijing (2002)

    Chapter  Google Scholar 

  27. Maloney, A., Goscinski, A.: The Cost of Storing Checkpoints to Multiple Volatile Storage Locations Using at-least-k Semantics. In: Michael Hobbs, Y.H., Kuo, S.-Y., Zhou, W. (eds.) 13th IEEE International Symposium on Pacific Rim Dependable Computing (PRDC 2007), pp. 330–333. IEEE Computer Society, Melbourne (2007)

    Chapter  Google Scholar 

  28. Hobbs, M., Goscinski, A.: The GENESIS parallelism management system employing concurrent process-creation services. Microprocessors and Microsystems 24, 415–427 (2000)

    Article  Google Scholar 

  29. Rough, J., Goscinski, A., De Paoli, D.: PVM on the RHODOS Distributed Operating System, pp. 208–215. Springer, Heidelberg (1997)

    Google Scholar 

  30. Maloney, A., Goscinski, A., Hobbs, M.: An MPI Implementation Supported by Process Migration and Load Balancing. In: Dongarra, J., Laforenza, D., Orlando, S. (eds.) EuroPVM/MPI 2003. LNCS, vol. 2840, pp. 414–423. Springer, Heidelberg (2003)

    Google Scholar 

  31. Silcock, J., Goscinski, A.: A Comprehensive Distributed Shared Memory System that is Easy to Use and Program. Distributed Systems Engineering 6, 121–128 (1999)

    Article  Google Scholar 

  32. Goscinski, A., Hobbs, M., Silcock, J.: GENESIS: an efficient, transparent and easy to use cluster operating system. Parallel Computing 28, 557–606 (2002)

    Article  MATH  Google Scholar 

  33. Hobbs, M., Wickham, G., Paoli, D.D., Goscinski, A.: Generic Memory Object for Supporting Distributed Systems. In: International Conference on Automation, pp. 363–366. Allied Publishers, Indore (1995)

    Google Scholar 

  34. Joyce, P., De Paoli, D., Goscinski, A., Hobbs, M.: Implementation and Performance of the Interprocess Communications Facility in Rhodos. In: Intl. Conference on Networks/Intl. Conference on Information Engineering, pp. 571–575. IEEE, Singapore (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Goscinski, A., Hobbs, M. (2010). Experiences Gained from Building a Services-Based Distributed Operating System. In: Hsu, CH., Yang, L.T., Park, J.H., Yeo, SS. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2010. Lecture Notes in Computer Science, vol 6082. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13136-3_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-13136-3_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-13135-6

  • Online ISBN: 978-3-642-13136-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics