APDB: a novel measure for benchmarking sequence alignment methods without reference alignments

被引:19
作者
O'Sullivan, Orla [2 ]
Zehnder, Mark [3 ]
Higgins, Des [2 ]
Bucher, Philipp [3 ]
Grosdidier, Aurelien [3 ]
Notredame, Cedric [1 ,3 ]
机构
[1] CNRS, UMR 1889, F-13402 Marseille, France
[2] Natl Univ Ireland Univ Coll Cork, Dept Biochem, Cork, Ireland
[3] Swiss Inst Bioinformat, CH-1066 Epalinges, Switzerland
关键词
D O I
10.1093/bioinformatics/btg1029
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: We describe APDB, a novel measure for evaluating the quality of a protein sequence alignment, given two or more PDB structures. This evaluation does not require a reference alignment or a structure superposition. APDB is designed to efficiently and objectively benchmark multiple sequence alignment methods. Results: Using existing collections of reference multiple sequence alignments and existing alignment methods, we show that APDB gives results that are consistent with those obtained using conventional evaluations. We also show that APDB is suitable for evaluating sequence alignments that are structurally equivalent. We conclude that APDB provides an alternative to more conventional methods used for benchmarking sequence alignment packages.
引用
收藏
页码:i215 / i221
页数:7
相关论文
共 13 条
  • [1] A STRATEGY FOR THE RAPID MULTIPLE ALIGNMENT OF PROTEIN SEQUENCES - CONFIDENCE LEVELS FROM TERTIARY STRUCTURE COMPARISONS
    BARTON, GJ
    STERNBERG, MJE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1987, 198 (02) : 327 - 337
  • [2] A study of quality measures for protein threading models
    Cristobal, Susana
    Zemla, Adam
    Fischer, Daniel
    Rychlewski, Leszek
    Elofsson, Arne
    [J]. BMC BIOINFORMATICS, 2001, 2 (1)
  • [3] DURET L, 2000, BIOINFORMATICS SEQUE
  • [4] Evaluation of protein multiple alignments by SAM-T99 using the BAliBASE multiple alignment test set
    Karplus, K
    Hu, BR
    [J]. BIOINFORMATICS, 2001, 17 (08) : 713 - 720
  • [5] ProSup: a refined tool for protein structure alignment
    Lackner, P
    Koppensteiner, WA
    Sippl, MJ
    Domingues, FS
    [J]. PROTEIN ENGINEERING, 2000, 13 (11): : 745 - 752
  • [6] HOMSTRAD: A database of protein structure alignments for homologous families
    Mizuguchi, K
    Deane, CM
    Blundell, TL
    Overington, JP
    [J]. PROTEIN SCIENCE, 1998, 7 (11) : 2469 - 2471
  • [7] SAGA: Sequence alignment by genetic algorithm
    Notredame, C
    Higgins, DG
    [J]. NUCLEIC ACIDS RESEARCH, 1996, 24 (08) : 1515 - 1524
  • [8] Recent progress in multiple sequence alignment: a survey
    Notredame, C
    [J]. PHARMACOGENOMICS, 2002, 3 (01) : 131 - 144
  • [9] T-Coffee: A novel method for fast and accurate multiple sequence alignment
    Notredame, C
    Higgins, DG
    Heringa, J
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2000, 302 (01) : 205 - 217
  • [10] A DATA-BANK MERGING RELATED PROTEIN STRUCTURES AND SEQUENCES
    PASCARELLA, S
    ARGOS, P
    [J]. PROTEIN ENGINEERING, 1992, 5 (02): : 121 - 137