Using the SMiLE Monitoring Infrastructure to Detect and Lower the Inefficiency of Parallel Applications

  • Conference paper
High Performance Computing and Networking (HPCN-Europe 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1823))

Abstract

High computational demands are one of the main reasons for the use of parallel architectures like clusters of PCs. Many parallel programs, however, suffer from severe inefficiencies when executed on such a loosely coupled architecture for a variety of reasons. One of the most important is the frequent access to remote memories. In this article, we present a hybrid event-driven monitoring system which uses a hardware monitor to observe all of the underlying transactions on the network and to deliver information about the run-time behavior of parallel programs to tools for performance analysis and debugging. This monitoring system is targeted towards cluster architectures with NUMA characteristics.
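As a rough illustration of the kind of analysis such monitoring data could feed, the following sketch counts remote memory accesses per page from a stream of observed transactions. It is not the SMiLE monitor's interface: the `RemoteAccess` record layout, the field names, and the helper functions are all hypothetical and chosen only for this example.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class RemoteAccess:
    """One observed network transaction (hypothetical record layout)."""
    source_node: int  # node that issued the access
    home_node: int    # node whose local memory holds the page
    page: int         # page number of the accessed address


def remote_access_histogram(events):
    """Count accesses per (home_node, page), skipping purely local ones."""
    counts = Counter()
    for ev in events:
        if ev.source_node != ev.home_node:
            counts[(ev.home_node, ev.page)] += 1
    return counts


def hottest_pages(events, top=3):
    """Return the most frequently remotely accessed pages, busiest first."""
    return remote_access_histogram(events).most_common(top)


# Small made-up trace, for illustration only.
trace = [
    RemoteAccess(0, 1, 42),
    RemoteAccess(0, 1, 42),
    RemoteAccess(2, 1, 42),
    RemoteAccess(1, 1, 42),  # local access, not counted
    RemoteAccess(0, 2, 7),
]
print(hottest_pages(trace))  # [((1, 42), 3), ((2, 7), 1)]
```

Pages that dominate such a histogram are natural candidates for a closer look with performance analysis tools, since frequent remote accesses are named above as one of the most important sources of inefficiency.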

Jie Tao is a staff member of Jilin University, China, and is currently pursuing her Ph.D. at Technische Universität München, Germany.

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tao, J., Karl, W., Schulz, M. (2000). Using the SMiLE Monitoring Infrastructure to Detect and Lower the Inefficiency of Parallel Applications. In: Bubak, M., Afsarmanesh, H., Hertzberger, B., Williams, R. (eds) High Performance Computing and Networking. HPCN-Europe 2000. Lecture Notes in Computer Science, vol 1823. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45492-6_27

  • DOI: https://doi.org/10.1007/3-540-45492-6_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67553-2

  • Online ISBN: 978-3-540-45492-2

  • eBook Packages: Springer Book Archive
