PHAT: a transmembrane-specific substitution matrix

Cited by: 106
Authors
Ng, PC
Henikoff, JG
Henikoff, S
Affiliations
[1] Fred Hutchinson Canc Res Ctr, Seattle, WA 98109 USA
[2] Univ Washington, Ctr Bioengn, Seattle, WA 98195 USA
Funding
National Science Foundation (USA); National Institutes of Health (USA)
DOI
10.1093/bioinformatics/16.9.760
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: Database searching algorithms for proteins use scoring matrices based on average protein properties, and thus are dominated by globular proteins. However, since transmembrane regions of a protein are in a distinctly different environment than globular proteins, one would expect generalized substitution matrices to be inappropriate for transmembrane regions. Results: We present the PHAT (predicted hydrophobic and transmembrane) matrix, which significantly outperforms generalized matrices and a previously published transmembrane matrix in searches with transmembrane queries. We conclude that a better matrix can be constructed by using background frequencies characteristic of the twilight zone, where low-scoring true positives have scores indistinguishable from high-scoring false positives, rather than the amino acid frequencies of the database. The PHAT matrix may help improve the accuracy of sequence alignments and evolutionary trees of membrane proteins.
Pages: 760-766
Number of pages: 7
Related papers
16 in total
[1]   AMINO-ACID SUBSTITUTION MATRICES FROM AN INFORMATION THEORETIC PERSPECTIVE [J].
ALTSCHUL, SF .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 219 (03) :555-565
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]   The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :49-54
[4]  
Dayhoff M.O., 1978, ATLAS PROTEIN SEQ ST, V5
[5]  
Henikoff S, 1997, PROTEIN SCI, V6, P698
[6]   PERFORMANCE EVALUATION OF AMINO-ACID SUBSTITUTION MATRICES [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1993, 17 (01) :49-61
[7]   Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations [J].
Henikoff, S ;
Henikoff, JG ;
Pietrokovski, S .
BIOINFORMATICS, 1999, 15 (06) :471-479
[8]   AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) :10915-10919
[9]   AUTOMATED ASSEMBLY OF PROTEIN BLOCKS FOR DATABASE SEARCHING [J].
HENIKOFF, S ;
HENIKOFF, JG .
NUCLEIC ACIDS RESEARCH, 1991, 19 (23) :6565-6572
[10]   The PROSITE database, its status in 1999 [J].
Hofmann, K ;
Bucher, P ;
Falquet, L ;
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :215-219