PANDORA: keyword-based analysis of protein sets by integration of annotation sources

被引:25
作者
Kaplan, N
Vaaknin, A
Linial, M [1 ]
机构
[1] Hebrew Univ Jerusalem, Inst Life Sci, Dept Biol Chem, IL-91904 Jerusalem, Israel
[2] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
关键词
D O I
10.1093/nar/gkg769
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recent advances in high-throughput methods and the application of computational tools for automatic classification of proteins have made it possible to carry out large-scale proteomic analyses. Biological analysis and interpretation of sets of proteins is a time-consuming undertaking carried out manually by experts. We have developed PANDORA (Protein ANnotation Diagram ORiented Analysis), a web-based tool that provides an automatic representation of the biological knowledge associated with any set of proteins. PANDORA uses a unique approach of keyword-based graphical analysis that focuses on detecting subsets of proteins that share unique biological properties and the intersections of such sets. PANDORA currently supports SwissProt keywords, NCBI Taxonomy, InterPro entries and the hierarchical classification terms from ENZYME, SCOP and GO databases. The integrated study of several annotation sources simultaneously allows a representation of biological relations of structure, function, cellular location, taxonomy, domains and motifs. PANDORA is also integrated into the ProtoNet system, thus allowing testing thousands of automatically generated clusters. We illustrate how PANDORA enhances the biological understanding of large, non-uniform sets of proteins originating from experimental and computational sources, without the need for prior biological knowledge on individual proteins.
引用
收藏
页码:5617 / 5626
页数:10
相关论文
共 43 条
[1]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[2]   The ENZYME database in 2000 [J].
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :304-305
[3]  
Baxter S M, 2001, Curr Opin Drug Discov Devel, V4, P291
[4]   Involvement of superoxide dismutases in the response of Escherichia coli to selenium oxides [J].
Bébien, M ;
Lagniel, G ;
Garin, J ;
Touati, D ;
Verméglio, A ;
Labarre, J .
JOURNAL OF BACTERIOLOGY, 2002, 184 (06) :1556-1564
[5]   The advantage of functional prediction based on clustering of yeast genes and its correlation with non-sequence based classifications [J].
Bilu, Y ;
Linial, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) :193-210
[6]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[7]   Predicting function: From genes to genomes and back [J].
Bork, P ;
Dandekar, T ;
Diaz-Lazcoz, Y ;
Eisenhaber, F ;
Huynen, M ;
Yuan, YP .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 283 (04) :707-725
[8]   Aerobic glycolysis by proliferating cells: A protective strategy against reactive oxygen species [J].
Brand, KA ;
Hermfisse, U .
FASEB JOURNAL, 1997, 11 (05) :388-395
[9]   New technologies for high-throughput screening [J].
Burbaum, JJ ;
Sigal, NH .
CURRENT OPINION IN CHEMICAL BIOLOGY, 1997, 1 (01) :72-78
[10]   The gene ontology annotation (GOA) project: Implementation of GO in SWISS-PROT, TrEMBL, and InterPro [J].
Camon, E ;
Magrane, M ;
Barrell, D ;
Binns, D ;
Fleischmann, W ;
Kersey, P ;
Mulder, N ;
Oinn, T ;
Maslen, J ;
Cox, A ;
Apweiler, R .
GENOME RESEARCH, 2003, 13 (04) :662-672