BLAZE (TM) - AN IMPLEMENTATION OF THE SMITH-WATERMAN SEQUENCE COMPARISON ALGORITHM ON A MASSIVELY-PARALLEL COMPUTER

被引:24
作者
BRUTLAG, DL
DAUTRICOURT, JP
DIAZ, R
FIER, J
MOXON, B
STAMM, R
机构
[1] INTELLIGENET INC, MT VIEW, CA 94040 USA
[2] MASPAR INC 749, SUNNYVALE, CA 94086 USA
来源
COMPUTERS & CHEMISTRY | 1993年 / 17卷 / 02期
关键词
D O I
10.1016/0097-8485(93)85011-Z
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We have implemented the Smith and Waterman dynamic programming algorithm on the massively parallel MP1104 computer from MasPar and compared its ability to detect remote protein sequence homologies with that of other commonly used database search algorithms. Dynamic programming algorithms are normally too computer intensive to permit full databases search, however on the MP1104 a search of the Swiss-Prot database takes about 15 s. This nearly interactive speed of database searching permits one to optimize the parameters for each query. Most of the common database search methods (FASTA, FASTDB and BLAST) gain their speed by using approximations such as word matching or eliminating gaps from the alignments which prevents them from detecting remote homologies. By using queries from protein super families containing a large number of family members of diverse similarities, we have measured the ability of each of these algorithms to detect the remotest members of each super family. Using these super families, we have found that the algorithms, in order of decreasing sensitivity are BLAZE, FASTDB, FASTA and BLAST. Hence the massively parallel computers allow one to have maximal sensitivity and search speed simultaneously.
引用
收藏
页码:203 / 207
页数:5
相关论文
共 17 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] BARSALOU T, 1991, M D COMPUT, V8, P144
  • [3] BRUTLAG DL, 1990, COMPUT APPL BIOSCI, V6, P237
  • [4] APPLICATIONS OF PARALLEL PROCESSING ALGORITHMS FOR DNA-SEQUENCE ANALYSIS
    COLLINS, JF
    COULSON, AFW
    [J]. NUCLEIC ACIDS RESEARCH, 1984, 12 (01) : 181 - 192
  • [5] DESHPANDE AS, 1991, COMPUT APPL BIOSCI, V7, P237
  • [6] HOW BIG IS THE UNIVERSE OF EXONS
    DORIT, RL
    SCHOENBACH, L
    GILBERT, W
    [J]. SCIENCE, 1990, 250 (4986) : 1377 - 1382
  • [7] GALPER AR, 1990, KSL9074 STANF U REP
  • [8] AN IMPROVED ALGORITHM FOR MATCHING BIOLOGICAL SEQUENCES
    GOTOH, O
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1982, 162 (03) : 705 - 708
  • [9] A LARGE FAMILY OF BACTERIAL ACTIVATOR PROTEINS
    HENIKOFF, S
    HAUGHN, GW
    CALVO, JM
    WALLACE, JC
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1988, 85 (18) : 6602 - 6606
  • [10] HUNKAPILLER T, 1990, HUMAN GENOME 1989 90, P101