Abstract
Clusters of SMPs connected by fast networks, such as Myricom's Myrinet, have emerged as important platforms for high-performance computing. Although their advertised peak performance is very high, the performance they actually deliver may be much lower for many applications. To achieve high performance, we need to take advantage of both the SMP and the cluster architecture. Based on the HPM model for parallel computing, we analyze the performance of clusters of SMPs and propose principles for optimizing parallel algorithms with respect to both parallelism and locality. The influence of the memory hierarchy on performance is emphasized. Some practical examples on the commercial clusters of SMPs Dawning D2000-2 and D3000 are also given.
Supported by the National Natural Science Foundation of China (Grant 69933020)
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Qiao, X. (2002). Optimization of Parallel Algorithms on Cluster of SMP’s. In: Fagerholm, J., Haataja, J., Järvinen, J., Lyly, M., Råback, P., Savolainen, V. (eds) Applied Parallel Computing. PARA 2002. Lecture Notes in Computer Science, vol 2367. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48051-X_49
DOI: https://doi.org/10.1007/3-540-48051-X_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43786-4
Online ISBN: 978-3-540-48051-8
eBook Packages: Springer Book Archive