Prefix computations on symmetric multiprocessors

被引:20
作者
Helman, DR [1 ]
JáJá, J
机构
[1] Univ Maryland, Inst Adv Comp Studies, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
parallel algorithms; list ranking; prefix computations; symmetric multiprocessors; parallel performance;
D O I
10.1006/jpdc.2000.1678
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We introduce a new prefix computation algorithm on linked lists which builds upon the sparse ruling set approach of Reid-Miller and Blelloch. Besides being somewhat simpler and requiring nearly half the number of memory accesses, we can bound our complexity with high probability instead of merely on average. Moreover, whereas Reid-Miller and Blelloch targeted their algorithm for implementation on a vector multiprocessor architecture, we develop our algorithm for implementation on the symmetric multiprocessor architecture (SMP). These symmetric multiprocessors dominate the high-end server market and are currently the primary candidate for constructing large scale multiprocessor systems. Our prefix computation algorithm was implemented in C using POSIX threads and run on four symmetric multiprocessors: the HP-Convex Exemplar (S-Class), the IBM SP-2 (High Node), the SGI Power Challenge, and the DEC AlphaServer. We ran our code using a variety of benchmarks which we identified to examine the dependence of our algorithm on memory access patterns. For some problems, our algorithm actually matched or exceeded the performance of the standard sequential solution using only a single thread. Moreover, in spite of the fact that the processors must compete for access to main memory, our algorithm still achieved scalable performance with up to 16 processors, which was the largest platform available to us, (C) 2001 Academic Press.
引用
收藏
页码:265 / 278
页数:14
相关论文
共 13 条
  • [1] AGGARWAL A, 1987, P 28 ANN IEEE S FDN, P204, DOI DOI 10.1109/SFCS.1987.31
  • [2] THE UNIFORM MEMORY-HIERARCHY MODEL OF COMPUTATION
    ALPERN, B
    CARTER, L
    FEIG, E
    SELKER, T
    [J]. ALGORITHMICA, 1994, 12 (2-3) : 72 - 109
  • [3] ANDERSON R, 1988, P 3 AEG WORKSH COMP, P81
  • [4] [Anonymous], THESIS CORNELL U ITH
  • [5] Accounting for memory bank contention and delay in high-bandwidth multiprocessors
    Blelloch, GE
    Gibbons, PB
    Matias, Y
    Zagha, M
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1997, 8 (09) : 943 - 958
  • [6] GIBBONS PB, IN PRESS SIAM J COMP
  • [7] A GUIDED TOUR OF CHERNOFF BOUNDS
    HAGERUP, T
    RUB, C
    [J]. INFORMATION PROCESSING LETTERS, 1990, 33 (06) : 305 - 308
  • [8] HELMAN DR, 1998, CSTR3915 UMIACS U MA
  • [9] HELMAN DR, 1998, UMIACSTR9838 U MAR E
  • [10] ON THE DISTRIBUTION OF THE NUMBER OF SUCCESSES IN INDEPENDENT TRIALS
    HOEFFDING, W
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1956, 27 (03): : 713 - 721