Improving the Representation of Peptide-Like Inhibitor and Antibiotic Molecules in the Protein Data Bank

被引:25
作者
Dutta, Shuchismita [1 ]
Dimitropoulos, Dimitris [2 ,3 ]
Feng, Zukang [1 ]
Persikova, Irina [1 ]
Sen, Sanchayita [4 ]
Shao, Chenghua [1 ]
Westbrook, John [1 ]
Young, Jasmine [1 ]
Zhuravleva, Marina A. [1 ]
Kleywegt, Gerard J. [4 ]
Berman, Helen M. [1 ]
机构
[1] Rutgers State Univ, RCSB Prot Data Bank, Dept Chem & Chem Biol, Piscataway, NJ 08854 USA
[2] Univ Calif San Diego, RCSB Prot Data Bank, San Diego Supercomp Ctr, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Skaggs Sch Pharm & Pharmaceut Sci, La Jolla, CA 92093 USA
[4] European Bioinformat Inst, European Mol Biol Lab, Prot Data Bank Europe PDBe, Cambridge CB10 1SD, England
基金
英国惠康基金; 英国生物技术与生命科学研究理事会; 美国国家科学基金会;
关键词
peptide-like inhibitor; peptide-like antibiotic; Protein Data Bank; CRYSTAL-STRUCTURE; RECOGNITION; COMPLEX;
D O I
10.1002/bip.22434
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
With the accumulation of a large number and variety of molecules in the Protein Data Bank (PDB) comes the need on occasion to review and improve their representation. The Worldwide PDB (wwPDB) partners have periodically updated various aspects of structural data representation to improve the integrity and consistency of the archive. The remediation effort described here was focused on improving the representation of peptide-like inhibitor and antibiotic molecules so that they can be easily identified and analyzed. Peptide-like inhibitors or antibiotics were identified in over 1000 PDB entries, systematically reviewed and represented either as peptides with polymer sequence or as single components. For the majority of the single-component molecules, their peptide-like composition was captured in a new representation, called the subcomponent sequence. A novel concept called group was developed for representing complex peptide-like antibiotics and inhibitors that are composed of multiple polymer and nonpolymer components. In addition, a reference dictionary was developed with detailed information about these peptide-like molecules to aid in their annotation, identification and analysis. Based on the experience gained in this remediation, guidelines, procedures, and tools were developed to annotate new depositions containing peptide-like inhibitors and antibiotics accurately and consistently. (c) 2013 Wiley Periodicals, Inc. Biopolymers 101: 659-668, 2014.
引用
收藏
页码:659 / 668
页数:10
相关论文
共 23 条
[1]   The Cambridge Structural Database: a quarter of a million crystal structures and rising [J].
Allen, FH .
ACTA CRYSTALLOGRAPHICA SECTION B-STRUCTURAL SCIENCE, 2002, 58 (3 PART 1) :380-388
[2]  
Arndt HD, 2002, ANGEW CHEM INT EDIT, V41, P4062, DOI 10.1002/1521-3773(20021104)41:21<4062::AID-ANIE4062>3.0.CO
[3]  
2-U
[4]   The Universal Protein Resource (UniProt) [J].
Bairoch, Amos ;
Bougueleret, Lydie ;
Altairac, Severine ;
Amendolia, Valeria ;
Auchincloss, Andrea ;
Puy, Ghislaine Argoud ;
Axelsen, Kristian ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte ;
Bollondi, Laurent ;
Boutet, Emmanuel ;
Quintaje, Silvia Braconi ;
Breuza, Lionel ;
Bridge, Alan ;
Saux, Virginie Bulliard-Le ;
decastro, Edouard ;
Ciampina, Luciane ;
Coral, Danielle ;
Coudert, Elisabeth ;
Cusin, Isabelle ;
David, Fabrice ;
Delbard, Gwennaelle ;
Dornevil, Dolnide ;
Duek-Roggli, Paula ;
Duvaud, Severine ;
Estreicher, Anne ;
Famiglietti, Livia ;
Farriol-Mathis, Nathalie ;
Ferro, Serenella ;
Feuermann, Marc ;
Gasteiger, Elisabeth ;
Gateau, Alain ;
Gehant, Sebastian ;
Gerritsen, Vivienne ;
Gos, Arnaud ;
Gruaz-Gumowski, Nadine ;
Hinz, Ursula ;
Hulo, Chantal ;
Hulo, Nicolas ;
Innocenti, Alessandro ;
James, Janet ;
Jain, Eric ;
Jimenez, Silvia ;
Jungo, Florence ;
Junker, Vivien ;
Keller, Guillaume ;
Lachaize, Corinne ;
Lane-Guermonprez, Lydie ;
Langendijk-Genevaux, Petra .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D190-D195
[5]   Announcing the worldwide Protein Data Bank [J].
Berman, H ;
Henrick, K ;
Nakamura, H .
NATURE STRUCTURAL BIOLOGY, 2003, 10 (12) :980-980
[6]   Structure of the macrocycle thiostrepton solved using the anomalous dispersion contribution of sulfur [J].
Bond, CS ;
Shaw, MP ;
Alphey, MS ;
Hunter, WN .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2001, 57 :755-758
[7]   Macromolecular crystallographic information file [J].
Bourne, PE ;
Berman, HM ;
McMahon, B ;
Watenpaugh, KD ;
Westbrook, JD ;
Fitzgerald, PMD .
MACROMOLECULAR CRYSTALLOGRAPHY, PT B, 1997, 277 :571-590
[8]   Retrieval of crystallographically-derived molecular geometry information [J].
Bruno, IJ ;
Cole, JC ;
Kessler, M ;
Luo, J ;
Motherwell, WDS ;
Purkis, LH ;
Smith, BR ;
Taylor, R ;
Cooper, RI ;
Harris, SE ;
Orpen, AG .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (06) :2133-2144
[9]   NORINE:: a database of nonribosomal peptides [J].
Caboche, Segolene ;
Pupin, Maude ;
Leclere, Valerie ;
Fontaine, Arnaud ;
Jacques, Philippe ;
Kucherov, Gregory .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D326-D331
[10]  
Callaway J., 1996, Protein Data Bank contents guide: Atomic coordinate entry format description