LigASite -: a database of biologically relevant binding sites in proteins with known apo-structures

被引:72
作者
Dessailly, Benoit H. [1 ,2 ]
Lensink, Marc F. [1 ]
Orengo, Christine A. [2 ]
Wodak, Shoshana J. [1 ,3 ]
机构
[1] Univ Libre Bruxelles, Ctr Struct Biol & Bioinformat, B-1050 Brussels, Belgium
[2] UCL, Biomol Struct & Modelling Unit, London WC1E 6BT, England
[3] Hosp Sick Children, Struct Biol & Biochem Program, Toronto, ON M5G 6BT, Canada
关键词
D O I
10.1093/nar/gkm839
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Better characterization of binding sites in proteins and the ability to accurately predict their location and energetic properties are major challenges which, if addressed, would have many valuable practical applications. Unfortunately, reliable benchmark datasets of binding sites in proteins are still sorely lacking. Here, we present LigASite ('LIGand Attachment SITE'), a gold-standard dataset of binding sites in 550 proteins of known structures. LigASite consists exclusively of biologically relevant binding sites in proteins for which at least one apo- and one holo-structure are available. In defining the binding sites for each protein, information from all holo-structures is combined, considering in each case the quaternary structure defined by the PQS server. LigASite is built using simple criteria and is automatically updated as new structures become available in the PDB, thereby guaranteeing optimal data coverage over time. Both a redundant and a culled non-redundant version of the dataset is available at http://www.scmbb.ulb.ac.be/Users/benoit/LigASite. The website interface allows users to search the dataset by PDB identifiers, ligand identifiers, protein names or sequence, and to look for structural matches as defined by the CATH homologous superfamilies. The datasets can be downloaded from the website as Schema-validated XML files or comma-separated flat files.
引用
收藏
页码:D667 / D673
页数:7
相关论文
共 29 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   Relating destabilizing regions to known functional sites in proteins [J].
Dessailly, Benoit H. ;
Lensink, Marc F. ;
Wodak, Shoshana J. .
BMC BIOINFORMATICS, 2007, 8 (1)
[4]   A method for localizing ligand binding pockets in protein structures [J].
Glaser, F ;
Morris, RJ ;
Najmanovich, RJ ;
Laskowski, RA ;
Thornton, JM .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 62 (02) :479-488
[5]   SitesBase: a database for structure-based protein-ligand binding site comparisons [J].
Gold, Nicola D. ;
Jackson, Richard M. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D231-D234
[6]   The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution [J].
Greene, Lesley H. ;
Lewis, Tony E. ;
Addou, Sarah ;
Cuff, Alison ;
Dallman, Tim ;
Dibley, Mark ;
Redfern, Oliver ;
Pearl, Frances ;
Nambudiry, Rekha ;
Reid, Adam ;
Sillitoe, Ian ;
Yeats, Corin ;
Thornton, Janet M. ;
Orengo, Christine A. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D291-D297
[7]   PQS: a protein quaternary structure file server [J].
Henrick, K ;
Thornton, JM .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) :358-361
[8]   PDBSite: a database of the 3D structure of protein functional sites [J].
Ivanisenko, VA ;
Pintus, SS ;
Grigorovich, DA ;
Kolchanov, NA .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D183-D187
[9]   Searching for functional sites in protein structures [J].
Jones, S ;
Thornton, JM .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2004, 8 (01) :3-7
[10]   sc-PDB: an annotated database of druggable binding sites from the protein data bank [J].
Kellenberger, E ;
Muller, P ;
Schalon, C ;
Bret, G ;
Foata, N ;
Rognan, D .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) :717-727