Computational protein function prediction: Are we making progress?

被引:39
作者
Godzik, A. [1 ]
Jambon, M. [1 ]
Friedberg, I. [1 ]
机构
[1] Burnham Inst Med Res, La Jolla, CA 92037 USA
关键词
protein function prediction; bioinformatics; CASP; AFP; aspartate dehydrogenase; aspartate oxidase; non-orthologous replacement; NAD synthesis;
D O I
10.1007/s00018-007-7211-y
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The computational prediction of gene and protein function is rapidly gaining ground as a central undertaking in computational biology. Making sense of the flood of genomic data requires fast and reliable annotation. Many ingenious algorithms have been devised to infer a protein's function from its amino acid sequence, 3D structure and chromosomal location of the encoding genes. However, there are significant challenges in assessing how well these programs perform. In this article we explore those challenges and review our own attempt at assessing the performance of those programs. We conclude that the task is far from complete and that a critical assessment of the performance of function prediction programs is necessary to make true progress in computational function prediction.
引用
收藏
页码:2505 / 2511
页数:7
相关论文
共 30 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]  
Bartlett Gail J, 2003, Methods Biochem Anal, V44, P387
[4]  
Bateman A, 2002, NUCLEIC ACIDS RES, V30, P276, DOI [10.1093/nar/gkr1065, 10.1093/nar/gkp985, 10.1093/nar/gkh121]
[5]  
Biswas Margaret, 2002, Brief Bioinform, V3, P285, DOI 10.1093/bib/3.3.285
[6]   Phydbac "Gene Function Predictor": a gene annotation tool based on genomic context analysis [J].
Enault, F ;
Suhre, K ;
Claverie, JM .
BMC BIOINFORMATICS, 2005, 6 (1)
[7]   Annotation of bacterial genomes using improved phylogenomic profiles [J].
Enault, F. ;
Suhre, K. ;
Abergel, C. ;
Poirot, O. ;
Claverie, J. -M. .
BIOINFORMATICS, 2003, 19 :i105-i107
[8]   Automated protein function prediction - the genomic challenge [J].
Friedberg, Iddo .
BRIEFINGS IN BIOINFORMATICS, 2006, 7 (03) :225-242
[9]   Enhanced automated function prediction using distantly related sequences and contextual association by PFP [J].
Hawkins, Troy ;
Luban, Stanislav ;
Kihara, Daisuke .
PROTEIN SCIENCE, 2006, 15 (06) :1550-1556
[10]   Automatic rule generation for protein annotation with the C4.5 data mining algorithm applied on SWISS-PROT [J].
Kretschmann, E ;
Fleischmann, W ;
Apweiler, R .
BIOINFORMATICS, 2001, 17 (10) :920-926