iProClass: an integrated, comprehensive and annotated protein classification database

Cited by: 23
Authors
Wu, CH [1 ]
Xiao, CL [1 ]
Hou, ZL [1 ]
Huang, HZ [1 ]
Barker, WC [1 ]
Affiliations
[1] Georgetown Univ, Med Ctr, Natl Biomed Res Fdn, Prot Informat Resource, Washington, DC 20007 USA
DOI
10.1093/nar/29.1.52
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The iProClass database is an integrated resource that provides comprehensive family relationships and structural and functional features of proteins, with rich links to various databases. It is extended from ProClass, a protein family database that integrates PIR superfamilies and PROSITE motifs. The iProClass currently consists of more than 200 000 non-redundant PIR and SWISS-PROT proteins organized with more than 28 000 superfamilies, 2600 domains, 1300 motifs, 280 post-translational modification sites and links to more than 30 databases of protein families, structures, functions, genes, genomes, literature and taxonomy. Protein and family summary reports provide rich annotations, including membership information with length, taxonomy and keyword statistics, full family relationships, comprehensive enzyme and PDB cross-references and graphical feature display. The database facilitates classification-driven annotation for protein sequence databases and complete genomes, and supports structural and functional genomic research. The iProClass is implemented in the Oracle 8i object-relational system and is available for sequence search and report retrieval at http://pir.georgetown.edu/iproclass/.
Pages: 52-54
Page count: 3
Related papers
20 in total
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] PRINTS-S: the database formerly known as PRINTS
    Attwood, TK
    Croning, MDR
    Flower, DR
    Lewis, AP
    Mabey, JE
    Scordis, P
    Selley, JN
    Wright, W
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 225 - 227
  • [3] The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000
    Bairoch, A
    Apweiler, R
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 45 - 48
  • [4] Barker WC, 1996, METHOD ENZYMOL, V266, P59
  • [5] Protein Information Resource: a community resource for expert annotation of protein data
    Barker, WC
    Garavelli, JS
    Hou, ZL
    Huang, HZ
    Ledley, RS
    McGarvey, PB
    Mewes, HW
    Orcutt, BC
    Pfeiffer, F
    Tsugita, A
    Vinayaka, CR
    Xiao, CL
    Yeh, LSL
    Wu, C
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 29 - 32
  • [6] Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI 10.1093/nar/gkh121
  • [7] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [8] The PDB data uniformity project
    Bhat, TN
    Bourne, P
    Feng, ZK
    Gilliland, G
    Jain, S
    Ravichandran, V
    Schneider, B
    Schneider, K
    Thanki, N
    Weissig, H
    Westbrook, J
    Berman, HM
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 214 - 218
  • [9] Corpet F, 2000, NUCLEIC ACIDS RES, V28, P267
  • [10] The RESID Database of protein structure modifications: 2000 update
    Garavelli, JS
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 209 - 211