STATISTICAL SIGNIFICANCE OF SEQUENCE PATTERNS IN PROTEINS

被引:57
作者
KARLIN, S
机构
[1] Department of Mathematics, Stanford University, Standford
关键词
D O I
10.1016/0959-440X(95)80098-0
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
I discuss three recent developments in sequence analysis by the statistical method of scores. First is the identification of segments of high aggregate score in a single protein sequence. Charge clusters and hyper-charge runs are prime examples. Proteins containing hyper-charge runs are principally associated with DNA and RNA processing, chromatin structure, ion storage and exchange, and protein complex assembly. Second is the protein sequence comparisons identifying common segments having high total similarity scores. These are illustrated by comparisons within the family of prokaryotic heat shock 70 kDa proteins. Third is the scoring protocols applied to the inverse folding problem.
引用
收藏
页码:360 / 371
页数:12
相关论文
共 47 条
[21]   STATISTICAL STUDIES OF BIOMOLECULAR SEQUENCES - SCORE-BASED METHODS [J].
KARLIN, S .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1994, 344 (1310) :391-402
[22]  
KARLIN S, 1992, SCIENCE, V257, P38
[23]  
KARLIN S, 1990, STRUCTURE METHODS, V2, P171
[24]  
KARLIN S, 1990, METHOD ENZYMOL, V183, P388
[25]  
LATHROP R, 1993, ARTIF INTELL, P210
[26]   DETECTING SUBTLE SEQUENCE SIGNALS - A GIBBS SAMPLING STRATEGY FOR MULTIPLE ALIGNMENT [J].
LAWRENCE, CE ;
ALTSCHUL, SF ;
BOGUSKI, MS ;
LIU, JS ;
NEUWALD, AF ;
WOOTTON, JC .
SCIENCE, 1993, 262 (5131) :208-214
[27]  
LUTHY R, 1994, PROTEIN SCI, V3, P139
[28]  
Margolis R. L., 1994, American Journal of Human Genetics, V55, pA230
[29]   A NEW SUBSTITUTION MATRIX FOR PROTEIN-SEQUENCE SEARCHES BASED ON CONTACT FREQUENCIES IN PROTEIN STRUCTURES [J].
MIYAZAWA, S ;
JERNIGAN, RL .
PROTEIN ENGINEERING, 1993, 6 (03) :267-278
[30]  
MORIMOTO RI, 1994, BIOL HEAT SHOCL PROT