Automatic prediction of protein function

被引:169
作者
Rost, B
Liu, J
Nair, R
Wrzeszczynski, KO
Ofran, Y
机构
[1] Columbia Univ, Dept Biochem & Mol Biophys, NE Struct Genom Consortium, New York, NY 10032 USA
[2] Columbia Univ, Ctr Computat Biol & Bioinformat C2B2, New York, NY 10032 USA
[3] Columbia Univ, Dept Pharmacol, New York, NY 10032 USA
[4] Columbia Univ, Dept Phys, New York, NY 10027 USA
[5] Columbia Univ, Dept Biomed Informat, New York, NY 10032 USA
关键词
genome analysis; protein function prediction; ab initio prediction; neural networks; multiple alignments; sequence analysis; subcellular localization; post-translational modifications; protein-protein interactions; bioinformatics;
D O I
10.1007/s00018-003-3114-8
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Most methods annotating protein function utilise sequence homology to proteins of experimentally known function. Such a homology-based annotation transfer is problematic and limited in scope. Therefore, computational biologists have begun to develop ab initio methods that predict aspects of function, including subcellular localization, post-translational modifications, functional type and protein-protein interactions. For the first two cases, the most accurate approaches rely on identifying short signalling motifs, while the most general methods utilise tools of artificial intelligence. An outstanding new method predicts classes of cellular function directly from sequence. Similarly, promising methods have been developed predicting protein-protein interaction partners at acceptable levels of accuracy for some pairs in entire proteomes. No matter how difficult the task, successes over the last few years have clearly paved the way for ab initio prediction of protein function.
引用
收藏
页码:2637 / 2650
页数:14
相关论文
共 180 条
[31]   Conservation of gene order: a fingerprint of proteins that physically interact [J].
Dandekar, T ;
Snel, B ;
Huynen, M ;
Bork, P .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) :324-328
[32]   Unraveling hot spots in binding interfaces: progress and challenges [J].
DeLano, WL .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (01) :14-20
[33]  
Devos D, 2000, PROTEINS, V41, P98, DOI 10.1002/1097-0134(20001001)41:1<98::AID-PROT120>3.0.CO
[34]  
2-S
[35]   Intrinsic errors in genome annotation [J].
Devos, D ;
Valencia, A .
TRENDS IN GENETICS, 2001, 17 (08) :429-431
[36]   A Bayesian system integrating expression data with sequence patterns for localizing proteins: Comprehensive application to the yeast genome [J].
Drawid, A ;
Gerstein, M .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 301 (04) :1059-1075
[37]   Protein function in the post-genomic era [J].
Eisenberg, D ;
Marcotte, EM ;
Xenarios, I ;
Yeates, TO .
NATURE, 2000, 405 (6788) :823-826
[38]   Post-translational GPI lipid anchor modification of proteins in kingdoms of life: analysis of protein sequence data from complete genomes [J].
Eisenhaber, B ;
Bork, P ;
Eisenhaber, F .
PROTEIN ENGINEERING, 2001, 14 (01) :17-25
[39]   Wanted: subcellular localization of proteins based on sequence [J].
Eisenhaber, F ;
Bork, P .
TRENDS IN CELL BIOLOGY, 1998, 8 (04) :169-170
[40]   Predicting subcellular localization of proteins based on their N-terminal amino acid sequence [J].
Emanuelsson, O ;
Nielsen, H ;
Brunak, S ;
von Heijne, G .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 300 (04) :1005-1016