SLiM on Diet: finding short linear motifs on domain interaction interfaces in Protein Data Bank

被引:10
作者
Hugo, Willy [1 ,2 ]
Song, Fushan [1 ]
Aung, Zeyar [2 ]
Ng, See-Kiong [2 ]
Sung, Wing-Kin [1 ,3 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117548, Singapore
[2] Inst Infocomm Res, Data Min Dept, Singapore, Singapore
[3] Genome Inst Singapore, Dept Informat & Math Sci, Singapore, Singapore
关键词
GLYCERALDEHYDE-3-PHOSPHATE DEHYDROGENASE; SITES; CLASSIFICATION; LIBRARY; DISEASE; COMPLEX; SYSTEMS;
D O I
10.1093/bioinformatics/btq065
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: An important class of protein interactions involves the binding of a protein's domain to a short linear motif (SLiM) on its interacting partner. Extracting such motifs, either experimentally or computationally, is challenging because of their weak binding and high degree of degeneracy. Recent rapid increase of available protein structures provides an excellent opportunity to study SLiMs directly from their 3D structures. Results: Using domain interface extraction (Diet), we characterized 452 distinct SLiMs from the Protein Data Bank (PDB), of which 155 are validated in varying degrees - 40 have literature validation, 54 are supported by at least one domain - peptide structural instance, and another 61 have overrepresentation in high- throughput PPI data. We further observed that the lacklustre coverage of existing computational SLiM detection methods could be due to the common assumption that most SLiMs occur outside globular domain regions. 198 of 452 SLiM that we reported are actually found on domain domain interface; some of them are implicated in autoimmune and neurodegenerative diseases. We suggest that these SLiMs would be useful for designing inhibitors against the pathogenic protein complexes underlying these diseases. Our. ndings show that 3D structure- based SLiM detection algorithms can provide a more complete coverage of SLiM- mediated protein interactions than current sequence- based approaches.
引用
收藏
页码:1036 / 1042
页数:7
相关论文
共 42 条
[1]  
Alexandrov NN, 1996, PROTEINS, V25, P354, DOI 10.1002/(SICI)1097-0134(199607)25:3<354::AID-PROT7>3.3.CO
[2]  
2-W
[3]   Structural systems biology: modelling protein interactions [J].
Aloy, P ;
Russell, RB .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2006, 7 (03) :188-197
[4]   Data growth and its impact on the SCOP database: new developments [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chandonia, John-Marc ;
Brenner, Steven E. ;
Hubbard, Tim J. P. ;
Chothia, Cyrus ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D419-D425
[5]  
Aung Zeyar, 2006, Journal of Bioinformatics and Computational Biology, V4, P1197, DOI 10.1142/S0219720006002417
[6]   Minimotif Miner: a tool for investigating protein function [J].
Balla, S ;
Thapar, V ;
Verma, S ;
Luong, T ;
Faghri, T ;
Huang, CH ;
Rajasekaran, S ;
del Campo, JJ ;
Shinn, JH ;
Mohler, WA ;
Maciejewski, MW ;
Gryk, MR ;
Piccirillo, B ;
Schiller, SR ;
Schiller, MR .
NATURE METHODS, 2006, 3 (03) :175-177
[7]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[8]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[9]  
Berry MD, 2000, J NEUROSCI RES, V60, P150, DOI 10.1002/(SICI)1097-4547(20000415)60:2<150::AID-JNR3>3.0.CO
[10]  
2-4