DIALIGN P: Fast pair-wise and multiple sequence alignment using parallel processors

被引:26
作者
Schmollinger, M
Nieselt, K
Kaufmann, M
Morgenstern, B
机构
[1] Univ Gottingen, Inst Microbiol & Genet, D-37077 Gottingen, Germany
[2] Wilhelm Schickard Inst Informat, D-72076 Tubingen, Germany
[3] Ctr Bioinformat Tubingen, D-72076 Tubingen, Germany
关键词
D O I
10.1186/1471-2105-5-128
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Parallel computing is frequently used to speed up computationally expensive tasks in Bioinformatics. Results: Herein, a parallel version of the multi-alignment program DIALIGN is introduced. We propose two ways of dividing the program into independent sub-routines that can be run on different processors: (a) pair-wise sequence alignments that are used as a first step to multiple alignment account for most of the CPU time in DIALIGN. Since alignments of different sequence pairs are completely independent of each other, they can be distributed to multiple processors without any effect on the resulting output alignments. (b) For alignments of large genomic sequences, we use a heuristics by splitting up sequences into sub-sequences based on a previously introduced anchored alignment procedure. For our test sequences, this combined approach reduces the program running time of DIALIGN by up to 97%. Conclusions: By distributing sub-routines to multiple processors, the running time of DIALIGN can be crucially improved. With these improvements, it is possible to apply the program in large-scale genomics and proteomics projects that were previously beyond its scope.
引用
收藏
页数:6
相关论文
共 30 条
[1]  
ABDEDDAIM S, 2001, LECT NOTES COMPUTER, V2066, P1
[2]  
Amdahl G., 1967, AFIPS C P, V30, P483, DOI DOI 10.1145/1465482.1465560
[3]  
[Anonymous], 1997, MPI 2 EXT MESS PASS
[4]   Fast and sensitive multiple alignment of large genomic sequences -: art. no. 66 [J].
Brudno, M ;
Chapman, M ;
Göttgens, B ;
Batzoglou, S ;
Morgenstern, B .
BMC BIOINFORMATICS, 2003, 4 (1)
[5]  
Chain Patrick, 2003, Briefings in Bioinformatics, V4, P105, DOI 10.1093/bib/4.2.105
[6]   Comparative and functional analyses of LYL1 loci establish marsupial sequences as a model for phylogenetic footprinting [J].
Chapman, MA ;
Charchar, FJ ;
Kinston, S ;
Bird, CP ;
Grafham, D ;
Rogers, J ;
Grützner, F ;
Graves, JAM ;
Green, AR ;
Göttgens, B .
GENOMICS, 2003, 81 (03) :249-259
[7]   Rapid development of nucleic acid diagnostics [J].
Fitch, JP ;
Gardner, SN ;
Kuczmarski, TA ;
Kurtz, S ;
Myers, R ;
Ott, LL ;
Slezak, TR ;
Vitalis, EA ;
Zemla, AT ;
McCready, PM .
PROCEEDINGS OF THE IEEE, 2002, 90 (11) :1708-1721
[8]   Independent Hox-cluster duplications in lampreys [J].
Fried, C ;
Prohaska, SJ ;
Stadler, PF .
JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION, 2003, 299B (01) :18-25
[9]   Transcriptional regulation of the stem cell leukemia gene (SCL) -: Comparative analysis of five vertebrate SCL loci [J].
Göttgens, B ;
Barton, LM ;
Chapman, MA ;
Sinclair, AM ;
Knudsen, B ;
Grafham, D ;
Gilbert, JGR ;
Rogers, J ;
Bentley, DR ;
Green, AR .
GENOME RESEARCH, 2002, 12 (05) :749-759
[10]   Analysis of vertebrate SCL loci identifies conserved enhancers [J].
Göttgens, B ;
Barton, LM ;
Gilbert, JGR ;
Bench, AJ ;
Sanchez, MJ ;
Bahn, S ;
Mistry, S ;
Grafham, D ;
McMurray, A ;
Vaudin, M ;
Amaya, E ;
Bentley, DR ;
Green, AR .
NATURE BIOTECHNOLOGY, 2000, 18 (02) :181-186