Improved performance in protein secondary structure prediction by inhomogeneous score combination

被引:185
作者
Guermeur, Y
Geourjon, C
Gallinari, P
Deléage, G
机构
[1] Ecole Normale Super Lyon, LIP, F-69364 Lyon 07, France
[2] Inst Biol & Chim Prot, F-69367 Lyon, France
[3] Univ Paris 06, LIP6, F-75252 Paris 05, France
关键词
D O I
10.1093/bioinformatics/15.5.413
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In many fields of pattern recognition combination has proved efficient to increase the generalization performance of individual prediction methods. Numerous systems have been developed for protein secondary structure prediction, based on different principles. Finding better ensemble methods for this task may thus become crucial. Furthermore, efforts need to be made to help the biologist in the post-processing of the outputs. Results: An ensemble method has been designed to post-process the outputs of discriminant models, in order to obtain an improvement in prediction accuracy while generating class posterior probability estimates. Experimental results establish that it can increase the recognition rate of protein secondary structure prediction methods that provide inhomogeneous scores, even though their individual prediction successes are largely different. This combination thus constitutes a help for the biologist who can use it confidently on top of any set of prediction methods. Moreover the resulting estimates can be used in various ways, for instance to determine which areas in the sequence are predicted with a given level of reliability.
引用
收藏
页码:413 / 421
页数:9
相关论文
共 34 条
[1]   COMBINATION OF FORECASTS [J].
BATES, JM ;
GRANGER, CWJ .
OPERATIONAL RESEARCH QUARTERLY, 1969, 20 (04) :451-&
[2]   SECONDARY STRUCTURE PREDICTION - COMBINATION OF 3 DIFFERENT METHODS [J].
BIOU, V ;
GIBRAT, JF ;
LEVIN, JM ;
ROBSON, B ;
GARNIER, J .
PROTEIN ENGINEERING, 1988, 2 (03) :185-191
[3]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[4]  
Breiman L, 1996, MACH LEARN, V24, P49
[5]   LIMITS FOR THE PRECISION AND VALUE OF INFORMATION FROM DEPENDENT SOURCES [J].
CLEMEN, RT ;
WINKLER, RL .
OPERATIONS RESEARCH, 1985, 33 (02) :427-442
[6]   SOME COMMENTS ON COMBINATION OF FORECASTS [J].
DICKINSON, JP .
OPERATIONAL RESEARCH QUARTERLY, 1975, 26 (01) :205-210
[7]   SOME STATISTICAL RESULTS IN COMBINATION OF FORECASTS [J].
DICKINSON, JP .
OPERATIONAL RESEARCH QUARTERLY, 1973, 24 (02) :253-260
[8]  
EISENBERG D, 1987, PROTEIN-STRUCT FUNCT, P425
[9]   PROTEIN-STRUCTURE PREDICTION - RECOGNITION OF PRIMARY, SECONDARY, AND TERTIARY STRUCTURAL FEATURES FROM AMINO-ACID-SEQUENCE [J].
EISENHABER, F ;
PERSSON, B ;
ARGOS, P .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (01) :1-94
[10]  
FLETCHER R., 1991, PRACTICAL METHODS OP