The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions

Cited by: 74
Authors
Yu, YK [1]
Altschul, SF [1]
Affiliations
[1] National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
DOI
10.1093/bioinformatics/bti070
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Motivation: Amino acid substitution matrices play a central role in protein alignment methods. Standard log-odds matrices, such as those of the PAM and BLOSUM series, are constructed from large sets of protein alignments and therefore carry implicit background amino acid frequencies. However, these matrices are frequently used to compare proteins with markedly different amino acid compositions, such as transmembrane proteins or proteins from organisms with strongly biased nucleotide compositions. It has been argued elsewhere that standard matrices are not ideal for such comparisons, and a rationale has been presented for transforming a standard matrix for use in a non-standard compositional context.
Results: This paper presents the mathematical details underlying the compositional adjustment of amino acid or DNA substitution matrices.
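To make the phrase "implicit background frequencies" concrete, the following is a minimal sketch of the standard log-odds form s_ij = (1/λ) ln(q_ij / (p_i p_j)), where the background frequencies p_i are the marginals of the joint target frequencies q_ij. It is not the compositional adjustment procedure of the paper, which this record does not describe. The function name `log_odds_matrix`, the scale `lam`, and the two-letter toy frequencies are assumptions chosen purely for illustration.

```python
import math


def log_odds_matrix(q, lam=1.0):
    """Return (scores, background) for a joint target-frequency matrix q.

    q[i][j] is the probability of seeing letters i and j aligned;
    all entries are assumed positive and to sum to 1.
    """
    n = len(q)
    # The implicit background frequencies are the row marginals of q.
    p = [sum(row) for row in q]
    # Log-odds score: aligned-pair probability versus chance pairing.
    scores = [
        [math.log(q[i][j] / (p[i] * p[j])) / lam for j in range(n)]
        for i in range(n)
    ]
    return scores, p


# Hypothetical two-letter alphabet, for illustration only.
q = [[0.4, 0.1],
     [0.1, 0.4]]
scores, background = log_odds_matrix(q)
```

Because the scores depend on p only through q, a matrix built this way silently assumes that the sequences being compared share the composition `background`; the abstract's concern is what happens when they do not.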
Pages: 902-911
Page count: 10