The iProClass integrated database for protein functional analysis

被引:59
作者
Wu, CH
Huang, HZ
Nikolskaya, A
Hu, ZZ
Barker, WC
机构
[1] Georgetown Univ, Med Ctr, Washington, DC 20057 USA
[2] Natl Biomed Res Fdn, Washington, DC 20057 USA
关键词
protein functional analysis; human genome; bioinformatics;
D O I
10.1016/j.compbiolchem.2003.10.003
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Increasingly, scientists have begun to tackle gene functions and other complex regulatory processes by studying organisms at the global scales for various levels of biological organization, ranging from genomes to metabolomes and physiomes. Meanwhile, new bioinformatics methods have been developed for inferring protein function using associative analysis of functional properties to complement the traditional sequence homology-based methods. To fully exploit the value of the high-throughput system biology data and to facilitate protein functional studies requires bioinformatics infrastructures that support both data integration and associative analysis. The iProClass database, designed to serve as a framework for data integration in a distributed networking environment, provides comprehensive descriptions of all proteins, with rich links to over 50 databases of protein family, function, pathway, interaction, modification, structure, genome, ontology, literature, and taxonomy. In particular, the database is organized with PIRSF family classification and maps to other family, function, and structure classification schemes. Coupled with the underlying taxonomic information for complete genomes, the iProClass system (http://pir.georgetown.edu/iproclass/) supports associative studies of protein family, domain, function, and structure. A case study of the phosphoglycerate mutases illustrates a systematic approach for protein family and phylogenetic analysis. Such studies may serve as a basis for further analysis of protein functional evolution, and its relationship to the co-evolution of metabolic pathways, cellular networks, and organisms. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:87 / 96
页数:10
相关论文
共 37 条
  • [1] The new Virgil database: a service of rich links
    Achard, F
    Cussat-Blanc, C
    Viara, E
    Barillot, E
    [J]. BIOINFORMATICS, 1998, 14 (04) : 342 - 348
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] APWEILER R, 2004, IN PRESS NUCL ACIDS, V32
  • [4] Ashburner M, 2001, GENOME RES, V11, P1425
  • [5] ProteomeWeb: A web-based interface for the display and interrogation of proteomes
    Babnigg, G
    Giometti, CS
    [J]. PROTEOMICS, 2003, 3 (05) : 584 - 600
  • [6] Barker WC, 1996, METHOD ENZYMOL, V266, P59
  • [7] Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
  • [8] The Molecular Biology Database Collection: 2003 update
    Baxevanis, AD
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 1 - 12
  • [9] Mechanistic implications for Escherichia coli cofactor-dependent phosphoglycerate mutase based on the high-resolution crystal structure of a vanadate complex
    Bond, CS
    White, MF
    Hunter, WN
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2002, 316 (05) : 1071 - 1081
  • [10] Comprehensive analysis of the mouse metabolome based on the transcriptome
    Bono, H
    Nikaido, I
    Kasukawa, T
    Hayashizaki, Y
    Okazaki, Y
    [J]. GENOME RESEARCH, 2003, 13 (6B) : 1345 - 1349