MODBASE, a database of annotated comparative protein structure models and associated resources

被引:127
作者
Pieper, Ursula [1 ,2 ]
Eswar, Narayanan [1 ,2 ]
Webb, Ben M. [1 ,2 ]
Eramian, David [1 ,2 ,3 ]
Kelly, Libusha [1 ,2 ,4 ]
Barkan, David T. [1 ,2 ,4 ]
Carter, Hannah [5 ]
Mankoo, Parminder [5 ]
Karchin, Rachel [5 ]
Marti-Renom, Marc A. [6 ]
Davis, Fred P. [7 ]
Sali, Andrej [1 ,2 ]
机构
[1] Univ Calif San Francisco, Dept Pharmaceut Chem, Dept Bioengn & Therapeut Sci, San Francisco, CA 94158 USA
[2] Univ Calif San Francisco, Calif Inst Quantitat Biosci, San Francisco, CA 94158 USA
[3] Univ Calif San Francisco, Grad Grp Biophys, San Francisco, CA 94158 USA
[4] Univ Calif San Francisco, Grad Grp Bioinformat, San Francisco, CA 94158 USA
[5] Johns Hopkins Univ, Inst Computat Med, Dept Biomed Engn, Baltimore, MD 21218 USA
[6] CIPF, Bioinformat & Genom Dept, Struct Genom Unit, Valencia 46012, Spain
[7] Howard Hughes Med Inst, Ashburn, VA 20147 USA
基金
美国国家卫生研究院;
关键词
PREDICTION; SITES; RECOGNITION; SEQUENCES; SYSTEM; GENOME; ENZYME; GENES;
D O I
10.1093/nar/gkn791
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
MODBASE (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by MODPIPE, an automated modeling pipeline that relies primarily on MODELLER for fold assignment, sequence-structure alignment, model building and model assessment (http:/salilab.org/modeller). MODBASE currently contains 5 152 695 reliable models for domains in 1 593 209 unique protein sequences; only models based on statistically significant alignments and/or models assessed to have the correct fold are included. MODBASE also allows users to calculate comparative models on demand, through an interface to the MODWEB modeling server (http://salilab.org/modweb). Other resources integrated with MODBASE include databases of multiple protein structure alignments (DBAli), structurally defined ligand binding sites (LIGBASE), predicted ligand binding sites (AnnoLyze), structurally defined binary domain interfaces (PIBASE) and annotated single nucleotide polymorphisms and somatic mutations found in human proteins (LS-SNP, LS-Mut). MODBASE models are also available through the Protein Model Portal (http://www.proteinmodelportal.org/).
引用
收藏
页码:D347 / D354
页数:8
相关论文
共 56 条
[1]  
Acland A, 2013, NUCLEIC ACIDS RES, V41, pD8, DOI [10.1093/nar/gkx1095, 10.1093/nar/gks1189, 10.1093/nar/gkq1172]
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   The universal protein resource (UniProt) [J].
Bairoch, Amos ;
Bougueleret, Lydie ;
Altairac, Severine ;
Amendolia, Valeria ;
Auchincloss, Andrea ;
Puy, Ghislaine Argoud ;
Axelsen, Kristian ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte ;
Bollondi, Laurent ;
Boutet, Emmanuel ;
Quintaje, Silvia Braconi ;
Breuza, Lionel ;
Bridge, Alan ;
deCastro, Edouard ;
Coral, Danielle ;
Coudert, Elisabeth ;
Cusin, Isabelle ;
Dobrokhotov, Pavel ;
Dornevil, Dolnide ;
Duvaud, Severine ;
Estreicher, Anne ;
Famiglietti, Livia ;
Feuermann, Marc ;
Gehant, Sebastian ;
Farriol-Mathis, Nathalie ;
Ferro, Serenella ;
Gasteiger, Elisabeth ;
Gateau, Alain ;
Gerritsen, Vivienne ;
Gos, Arnaud ;
Gruaz-Gumowski, Nadine ;
Hinz, Ursula ;
Hulo, Chantal ;
Hulo, Nicolas ;
Ioannidis, Vassilios ;
Ivanyi, Ivan ;
James, Janet ;
Jain, Eric ;
Jimenez, Silvia ;
Jungo, Florence ;
Junker, Vivien ;
Keller, Guillaume ;
Lachaize, Corinne ;
Lane-Guermonprez, Lydie ;
Langendijk-Genevaux, Petra ;
Lara, Vicente ;
Lemercier, Philippe ;
Le Saux, Virginie .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D193-D197
[4]   Protein structure prediction and structural genomics [J].
Baker, D ;
Sali, A .
SCIENCE, 2001, 294 (5540) :93-96
[5]  
Benson DA, 2017, NUCLEIC ACIDS RES, V45, pD37, DOI [10.1093/nar/gkl986, 10.1093/nar/gkw1070, 10.1093/nar/gkg057, 10.1093/nar/gks1195, 10.1093/nar/gkp1024, 10.1093/nar/gkq1079, 10.1093/nar/gkr1202, 10.1093/nar/gkx1094, 10.1093/nar/gkn723]
[6]  
Brenner SE, 2000, PROTEIN SCI, V9, P197
[7]   High-throughput computational and experimental techniques in structural genomics [J].
Chance, MR ;
Fiser, A ;
Sali, A ;
Pieper, U ;
Eswar, N ;
Xu, GP ;
Fajardo, JE ;
Radhakannan, T ;
Marinkovic, N .
GENOME RESEARCH, 2004, 14 (10B) :2145-2154
[8]   OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups [J].
Chen, Feng ;
Mackey, Aaron J. ;
Stoeckert, Christian J., Jr. ;
Roos, David S. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D363-D368
[9]   Learning from the genome sequence of Mycobacterium tuberculosis H37Rv [J].
Cole, ST .
FEBS LETTERS, 1999, 452 (1-2) :7-10
[10]   PIBASE: a comprehensive database of structurally defined protein interfaces [J].
Davis, FP ;
Sali, A .
BIOINFORMATICS, 2005, 21 (09) :1901-1907