Local weighting schemes for protein multiple sequence alignment

被引:32
作者
Heringa, J [1 ]
机构
[1] Natl Inst Med Res, MRC, Div Math Biol, London NW7 1AA, England
来源
COMPUTERS & CHEMISTRY | 2002年 / 26卷 / 05期
关键词
weighting schemes; multiple sequence alignment; profile;
D O I
10.1016/S0097-8485(02)00008-6
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper describes three weighting schemes for improving the accuracy of progressive multiple sequence alignment methods: (1) global profile pre-processing, to capture for each sequence information about other sequences in a profile before the actual multiple alignment takes place; (2) local pre-processing; which incorporates a new protocol to only use non-overlapping local sequence regions to construct the pre-processed profiles; and (3) local-global alignment, a weighting scheme based on the double dynamic programming (DDP) technique to softly bias global alignment to local sequence motifs. The first two schemes allow the compilation of residue-specific multiple alignment reliability indices, which can be used in an iterative fashion. The schemes have been implemented with associated iterative modes in the PRALINE multiple sequence alignment method, and have been evaluated using the BAliBASE benchmark alignment database. These tests indicate that PRALINE is a toolbox able to build alignments with very high quality. We found that local profile pre-processing raises the alignment quality by 5.5% compared to PRALINE alignments generated under default conditions. Iteration enhances the quality by a further percentage point. The implications of multiple alignment scoring functions and iteration in relation to alignment quality and benchmarking are discussed. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:459 / 477
页数:19
相关论文
共 53 条
  • [1] Do aligned sequences share the same fold?
    Abagyan, RA
    Batalov, S
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 273 (01) : 355 - 368
  • [2] WEIGHTS FOR DATA RELATED BY A TREE
    ALTSCHUL, SF
    CARROLL, RJ
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1989, 207 (04) : 647 - 653
  • [3] A SENSITIVE PROCEDURE TO COMPARE AMINO-ACID-SEQUENCES
    ARGOS, P
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1987, 193 (02) : 385 - 396
  • [4] BENNER SA, 1992, SCIENCE, V257, P609
  • [5] A flexible motif search technique based on generalized profiles
    Bucher, P
    Karplus, K
    Moeri, N
    Hofmann, K
    [J]. COMPUTERS & CHEMISTRY, 1996, 20 (01): : 3 - 23
  • [6] BUCHER P, 1996, P 4 INT C INT SYST M, P44
  • [7] CARILLO H, 1988, SIAM J APPL MATH, V48, P1073
  • [8] DAYHOFF MO, 1978, ATLAS PROTEIN STR S3, V4
  • [9] Profile hidden Markov models
    Eddy, SR
    [J]. BIOINFORMATICS, 1998, 14 (09) : 755 - 763
  • [10] EHOGEWEG P, 1984, J MOL EVOL, V20, P175