Utilization of cache area in on-chip multiprocessor

Oi, Hitoshi; Ranganathan, N.

doi:10.1007/BFb0094939

Hitoshi Oi¹ &
N. Ranganathan²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1615))

Included in the following conference series:

International Symposium on High Performance Computing

114 Accesses
5 Citations

Abstract

On-chip multiprocessor can be an alternative to wide-issue superscalar processor to exploit increasing number of transistors on a silicon chip. Utilization of cache has more performance impact due to its higher penalty for remote (off-chip) communication than board-level implementation. We examine two options for better utilizing cache resource: (1) private data is only cached at L1 and L2 is used only for shared data, (2) dividing cache area into L2 and remote victim cache or just a large L2 cache. Results of execution-driven simulations showed that the first option improved the performance up to 10%. For the second option, four out of six benchmark programs showed that large L2 is more effective than the combination of L2 and remote victim cache.

Supported in part by a National Science Foundation Grant No. MIPS 9522265

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

K. Olukotun et al., “The Case for a Single-Chip Multiprocessor” in Proceedings of 7th International Conference on Architectural Support for Programming Languages and Operating Systems, ACM Press, New York, 2–11, October 1996.
Chapter Google Scholar
Y. Nunomura, T. Shimizu and O. Tomisawa, “M32R/D-Integrating DRAM and Microprocessor”, IEEE Micro, Vol. 17, No. 6, 40–48, November/December 1997.
Article Google Scholar
A-T. Nguyen, M. Michael, A. Sharma and J. Torrellas, “The Augmint Multiprocessor Simulation Toolkit for Intel x86 Architectures”, in Proceedings of 1996 International Conference on Computer Design, 486–490, October 1996.
Google Scholar
S. C. Woo et.al., “The SPLASH-2 Programs: Characterization and Methodological Considerations”, in Proceedings of the 22nd International Symposium on Computer Architecture, 24–36, June 1995.
Google Scholar
Z. Zhang and J. Torrellas, “Reducing Remote Conflict Misses: NUMA with Remote Cache COMA”, in Proceedings of International Symposium on High Performance Computer Architecture, 272–281, February 1997.
Google Scholar
A. Moga and M. Dubois, “The Effectiveness of SRAM Network Caches in Clustered DSMs”, in Proceedings of The Fourth International Symposium on High Performance Computer Architecture, 103–112, February 1998.
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science and Engineering, University of South Florida, 33620, Tampa, FL
Hitoshi Oi
Dept. of Electrical and Computer Engineering, The University of Texas at El Paso, 79968, El Paso, Texas
N. Ranganathan

Authors

Hitoshi Oi
View author publications
You can also search for this author in PubMed Google Scholar
N. Ranganathan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Constantine Polychronopoulos Kazuki Joe Akira Fukuda Shinji Tomita

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Oi, H., Ranganathan, N. (1999). Utilization of cache area in on-chip multiprocessor. In: Polychronopoulos, C., Fukuda, K.J.A., Tomita, S. (eds) High Performance Computing. ISHPC 1999. Lecture Notes in Computer Science, vol 1615. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0094939

Download citation

DOI: https://doi.org/10.1007/BFb0094939
Published: 19 October 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65969-3
Online ISBN: 978-3-540-48821-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics