Leveraging On-Chip Networks for Data Cache Migration in Chip Multiprocessors

被引：12

作者：

Eisley, Noel ^{[1
]}

Peh, Li-Shiuan ^{[1
]}

Shang, Li

机构：

[1] Princeton Univ, Dept EE, Princeton, NJ 08544 USA

来源：

PACT'08: PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES | 2008年

关键词：

Chip-multiprocessor; CMP; Interconnection network; NoC; Migration; Network-driven computing;

D O I：

10.1145/1454115.1454144

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, chip multiprocessors (CMPs) have arisen as the de facto design for modern high-performance processors, with increasing core counts. An important property of CMPs is that remote, but on-chip, L2 cache accesses are less costly than off-chip accesses; this is in contrast to earlier chip-to-chip or board-to-board multi-processors, where an access to a remote node is just as costly if no more so than a main memory access. This motivates on-chip cache migration as a means to retain more data on-chip. However, previously proposed techniques do not scale to high core counts: they do not leverage the on-chip caches of all cores nor have a scalable migration mechanism. In this paper we propose ascalable in-ne work migration technique which uses hints embedded within the router microarchitecture to steer L2 cache evictions towards free/invalid cache slots in any on-chip core cache, rather than evicting it off-chip. We show that our technique can provide an average of a 19% reduction in the number of off-chip memory accesses over the state-of-the-art, beating the performance of a pseudo-optimal migration technique. This can be done with negligible area overhead and a manageable traffic overhead of 13.4%.

引用

页码：197 / 207

页数：11

共 21 条

[1]

[Anonymous], 2005, INT C SUPERCOMPUTING

[2]

Beckmann BM, 2006, INT SYMP MICROARCH, P443

[3] Memory Bandwidth Limitations of Future Microprocessors [J].

Burger, D. ;

Goodman, J. R. ;

Kaegi, A. .

Computer Architecture News, 1996, 24 (02)

[4]

CAIN H, 2006, P 5 WORKSH COMP ARCH, P13

[5]

Chang JC, 2006, CONF PROC INT SYMP C, P264, DOI 10.1145/1150019.1136509

[6]

CHEN J, 2005, DASCMP

[7] Optimizing replication, communication, and capacity allocation in CMPs [J].

Chishti, Z ;

Powell, MD ;

Vijaykumar, TN .

32ND INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2005, :357-368

[8]

Eisley N, 2006, INT SYMP MICROARCH, P321

[9]

GOODMAN JR, 1988, P 15 ANN INT S COMP, P422

[10] The Stanford Hydra CMP [J].

Hammond, L ;

Hubbert, BA ;

Siu, M ;

Prabhu, MK ;

Chen, M ;

Olukotun, K .

IEEE MICRO, 2000, 20 (02) :71-84

← 1 2 3 →