Long membrane helices and short loops predicted less accurately

被引:19
作者
Chen, CP
Rost, B [1 ]
机构
[1] Columbia Univ, CUBIC, Dept Biochem & Mol Biophys, New York, NY 10032 USA
[2] Columbia Univ, Ctr Computat Biol & Bioinformat C2B2, New York, NY 10032 USA
[3] Columbia Univ, NE Struct Genomics Consortium, Dept Biochem & Mol Biophys, New York, NY 10032 USA
关键词
membrane proteins; protein structure prediction; predicting transmembrane helices; bioinformatics;
D O I
10.1110/ps.0214602
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Low-resolution experiments suggest that most membrane helices span over 17-25 residues and that most loops between two helices are longer than 15 residues. Both constraints have been used explicitly in the development of prediction methods. Here, we compared the largest possible sequence-unique data sets from high- and low-resolution experiments. For the high-resolution data, we found that only half of the helices fall into the expected length interval and that half of the loops were shorter than 10 residues. We compared the accuracy of detecting short loops and long helices for 28 advanced and simple prediction methods: All methods predicted short loops less accurately than longer ones. In particular, loops shorter than 7 residues appeared to be very difficult to detect by current methods. Similarly, all methods tended to be more accurate for longer than for shorter helices. However, helices with more than 32 residues were predicted less accurately than all other helices. Our findings may suggest particular strategies for improving predictions of membrane helices.
引用
收藏
页码:2766 / 2773
页数:8
相关论文
共 64 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Arkin IT, 1997, PROTEINS, V28, P465, DOI 10.1002/(SICI)1097-0134(199708)28:4<465::AID-PROT1>3.0.CO
[3]  
2-9
[4]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[5]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[6]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[7]   Helix packing in membrane proteins [J].
Bowie, JU .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 272 (05) :780-789
[8]   SURFACE-TENSION OF AMINO-ACID SOLUTIONS - HYDROPHOBICITY SCALE OF AMINO-ACID RESIDUES [J].
BULL, HB ;
BREESE, K .
ARCHIVES OF BIOCHEMISTRY AND BIOPHYSICS, 1974, 161 (02) :665-670
[9]  
Casadio R, 1996, EUR BIOPHYS J BIOPHY, V24, P165, DOI 10.1007/BF00180274
[10]  
CHEN CP, 2002, PROTEIN SCI