Conservative, Non-Conservative and Average Pairwise Statistical Significance of Local Sequence Alignment

Cited by: 2
Authors
Agrawal, Ankit [1]
Huang, Xiaoqiu [1]
Affiliations
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
Source
2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS | 2008
DOI
10.1109/BIBM.2008.19
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Estimating the statistical significance of a pairwise alignment is an important problem in sequence comparison. Recently, it was shown that pairwise statistical significance does better in practice than database statistical significance in terms of retrieval accuracy of homologs. In this paper, we introduce the concepts of conservative, non-conservative, and average pairwise statistical significance, which can be easily derived from the original pairwise statistical significance estimates and which use more information specific to the sequence pair under consideration by drawing on multiple shuffle spaces. Experimental results for homology detection show that the proposed measures give retrieval accuracy at least comparable to, or significantly better than, that of the original pairwise statistical significance and the database statistical significance reported by BLAST, PSI-BLAST, and SSEARCH. The proposed measures are further shown to be especially useful with sequence-specific substitution matrices.
Pages: 433-436
Page count: 4