Leveraging On-Chip Networks for Data Cache Migration in Chip Multiprocessors

被引:12
作者
Eisley, Noel [1 ]
Peh, Li-Shiuan [1 ]
Shang, Li
机构
[1] Princeton Univ, Dept EE, Princeton, NJ 08544 USA
来源
PACT'08: PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES | 2008年
关键词
Chip-multiprocessor; CMP; Interconnection network; NoC; Migration; Network-driven computing;
D O I
10.1145/1454115.1454144
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, chip multiprocessors (CMPs) have arisen as the de facto design for modern high-performance processors, with increasing core counts. An important property of CMPs is that remote, but on-chip, L2 cache accesses are less costly than off-chip accesses; this is in contrast to earlier chip-to-chip or board-to-board multi-processors, where an access to a remote node is just as costly if no more so than a main memory access. This motivates on-chip cache migration as a means to retain more data on-chip. However, previously proposed techniques do not scale to high core counts: they do not leverage the on-chip caches of all cores nor have a scalable migration mechanism. In this paper we propose ascalable in-ne work migration technique which uses hints embedded within the router microarchitecture to steer L2 cache evictions towards free/invalid cache slots in any on-chip core cache, rather than evicting it off-chip. We show that our technique can provide an average of a 19% reduction in the number of off-chip memory accesses over the state-of-the-art, beating the performance of a pseudo-optimal migration technique. This can be done with negligible area overhead and a manageable traffic overhead of 13.4%.
引用
收藏
页码:197 / 207
页数:11
相关论文
共 21 条
[1]  
[Anonymous], 2005, INT C SUPERCOMPUTING
[2]  
Beckmann BM, 2006, INT SYMP MICROARCH, P443
[3]   Memory Bandwidth Limitations of Future Microprocessors [J].
Burger, D. ;
Goodman, J. R. ;
Kaegi, A. .
Computer Architecture News, 1996, 24 (02)
[4]  
CAIN H, 2006, P 5 WORKSH COMP ARCH, P13
[5]  
Chang JC, 2006, CONF PROC INT SYMP C, P264, DOI 10.1145/1150019.1136509
[6]  
CHEN J, 2005, DASCMP
[7]   Optimizing replication, communication, and capacity allocation in CMPs [J].
Chishti, Z ;
Powell, MD ;
Vijaykumar, TN .
32ND INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2005, :357-368
[8]  
Eisley N, 2006, INT SYMP MICROARCH, P321
[9]  
GOODMAN JR, 1988, P 15 ANN INT S COMP, P422
[10]   The Stanford Hydra CMP [J].
Hammond, L ;
Hubbert, BA ;
Siu, M ;
Prabhu, MK ;
Chen, M ;
Olukotun, K .
IEEE MICRO, 2000, 20 (02) :71-84