Using an alignment of fragment strings for comparing protein structures

被引:30
作者
Friedberg, Iddo [1 ]
Harder, Tim
Kolodny, Rachel
Sitbon, Einat
Li, Zhanwen
Godzik, Adam
机构
[1] Burnham Inst Med Res, Program Bioinformat & Syst Biol, La Jolla, CA USA
[2] Columbia Univ, Dept Biochem & Mol Biophys, New York, NY USA
[3] Weizmann Inst Sci, Dept Mol Genet, IL-76100 Rehovot, Israel
关键词
D O I
10.1093/bioinformatics/btl310
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Most methods that are used to compare protein structures use three-dimensional (3D) structural information. At the same time, it has been shown that a 1D string representation of local protein structure retains a degree of structural information. This type of representation can be a powerful tool for protein structure comparison and classification, given the arsenal of sequence comparison tools developed by computational biology. However, in order to do so, there is a need to first understand how much information is contained in various possible 1D representations of protein structure. Results: Here we describe the use of a particular structure fragment library, denoted here as KL-strings, for the 1D representation of protein structure. Using KL-strings, we develop an infrastructure for comparing protein structures with a 1D representation. This study focuses on the added value gained from such a description. We show the new local structure language adds resolution to the traditional three-state (helix, strand and coil) secondary structure description, and provides a high degree of accuracy in recognizing structural similarities when used with a pairwise alignment benchmark. The results of this study have immediate applications towards fast structure recognition, and for fold prediction and classification.
引用
收藏
页码:E219 / E224
页数:6
相关论文
共 29 条
[1]   The ASTRAL compendium for protein structure and sequence analysis [J].
Brenner, SE ;
Koehl, P ;
Levitt, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :254-256
[2]   Connecting the protein structure universe by using sparse recurring fragments [J].
Friedberg, I ;
Godzik, A .
STRUCTURE, 2005, 13 (08) :1213-1224
[3]   The interplay of fold recognition and experimental structure determination in structural genomics [J].
Friedberg, I ;
Jaroszewski, L ;
Ye, Y ;
Godzik, A .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2004, 14 (03) :307-312
[4]   RECURRING LOCAL SEQUENCE MOTIFS IN PROTEINS [J].
HAN, KF ;
BAKER, D .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 251 (01) :176-187
[5]   Quantifying the similarities within fold space [J].
Harrison, A ;
Pearl, F ;
Mott, R ;
Thornton, J ;
Orengo, C .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 323 (05) :909-926
[6]   Reducing the computational complexity of protein folding via fragment folding and assembly [J].
Haspel, N ;
Tsai, CJ ;
Wolfson, H ;
Nussinov, R .
PROTEIN SCIENCE, 2003, 12 (06) :1177-1187
[7]  
HASPEL N, 2002, PROTEIN STRUCTURE PR
[8]   AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) :10915-10919
[9]  
HENIKOFF S, 1995, GENE, V163, P17
[10]   Touring protein fold space with Dali/FSSP [J].
Holm, L ;
Sander, C .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :316-319