Parallel global optimization with the particle swarm algorithm

被引:250
作者
Schutte, JF
Reinbolt, JA
Fregly, BJ
Haftka, RT
George, AD
机构
[1] Univ Florida, Dept Mech & Aerosp Engn, Gainesville, FL 32611 USA
[2] Univ Florida, Dept Biomed Engn, Gainesville, FL 32611 USA
[3] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
关键词
particle swarm; parallel global optimization; cluster computing;
D O I
10.1002/nme.1149
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Present day engineering optimization problems often impose large computational demands, resulting in long solution times even on a modern high-end processor. To obtain enhanced computational throughput and global search capability, we detail the coarse-grained parallelization of an increasingly popular global search method, the particle swarm optimization (PSO) algorithm. Parallel PSO performance was evaluated using two categories of optimization problems possessing multiple local minima-large-scale analytical test problems with computationally cheap function evaluations and medium-scale biomechanical system identification problems with computationally expensive function evaluations. For load-balanced analytical test problems formulated using 128 design variables, speedup was close to ideal and parallel efficiency above 95% for up to 32 nodes on a Beowulf cluster. In contrast, for load-imbalanced biomechanical system identification problems with 12 design variables, speedup plateaued and parallel efficiency decreased almost linearly with increasing number of nodes. The primary factor affecting parallel performance was the synchronization requirement of the parallel algorithm, which dictated that each iteration must wait for completion of the slowest fitness evaluation. When the analytical problems were solved using a fixed number of swarm iterations, a single population of 128 particles produced a better convergence rate than did multiple independent runs performed using sub-populations (8 runs with 16 particles, 4 runs with 32 particles, or 2 runs with 64 particles). These results suggest that (1) parallel PSO exhibits excellent parallel performance under load-balanced conditions, (2) an asynchronous implementation would be valuable for real-life problems subject to load imbalance, and (3) larger population sizes should be considered when multiple processors are available. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:2296 / 2315
页数:20
相关论文
共 28 条
[1]  
ANDERSON FC, 1995, J BIOCMECHANICAL ENG, V117, P300
[2]  
Carlisle A, 2000, IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, P429
[3]  
Carlisle A., 2001, Proceedings of the Workshop on Particle Swarm Optimization, P1
[4]   A SOLIDIFICATION PROCEDURE TO FACILITATE KINEMATIC ANALYSES BASED ON VIDEO SYSTEM DATA [J].
CHEZE, L ;
FREGLY, BJ ;
DIMNET, J .
JOURNAL OF BIOMECHANICS, 1995, 28 (07) :879-884
[5]  
Clerc M., 2002, Proceedings of the 1999 Congress on Evolutionary Computation, DOI DOI 10.1109/CEC.1999.785513
[6]   MINIMIZING MULTIMODAL FUNCTIONS OF CONTINUOUS-VARIABLES WITH THE SIMULATED ANNEALING ALGORITHM [J].
CORANA, A ;
MARCHESI, M ;
MARTINI, C ;
RIDELLA, S .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1987, 13 (03) :262-280
[7]  
Eberhart RC, 2001, IEEE C EVOL COMPUTAT, P81, DOI 10.1109/CEC.2001.934374
[8]  
Eberhart RC, 2000, IEEE C EVOL COMPUTAT, P84, DOI 10.1109/CEC.2000.870279
[9]  
EBERHART RC, 2001, P IEEE C EVOL COMP
[10]  
FOURIE PC, 2001, P 4 WORLD C STRUCT M, P52