The Nexus approach to integrating multithreading and communication

Cited by: 106
Authors
Foster, I [1]
Kesselman, C [1]
Tuecke, S [1]
Affiliations
[1] CALTECH, BECKMAN INST, PASADENA, CA 91125 USA
Funding
National Science Foundation (USA)
DOI
10.1006/jpdc.1996.0108
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Lightweight threads have an important role to play in parallel systems: they can be used to exploit shared-memory parallelism, to mask communication and I/O latencies, to implement remote memory access, and to support task-parallel and irregular applications. In this paper, we address the question of how to integrate threads and communication in high-performance distributed-memory systems. We propose an approach based on global pointer and remote service request mechanisms, and explain how these mechanisms support dynamic communication structures, asynchronous messaging, dynamic thread creation and destruction, and a global memory model via interprocessor references. We also explain how these mechanisms can be implemented in various environments. Our global pointer and remote service request mechanisms have been incorporated in a runtime system called Nexus that is used as a compiler target for parallel languages and as a substrate for higher-level communication libraries. We report the results of performance studies conducted using a Nexus implementation; these results indicate that Nexus mechanisms can be implemented efficiently on commodity hardware and software systems. © 1996 Academic Press, Inc.
Pages: 70-82
Page count: 13