Scoring residue conservation

被引:488
作者
Valdar, WSJ [1 ]
机构
[1] UCL, Dept Biochem & Mol Biol, Biomol Struct & Modelling Unit, London, England
关键词
protein sequence analysis; amino acid; variability; evolutionary conservation; multiple sequence alignment;
D O I
10.1002/prot.10146
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The importance of a residue for maintaining the structure and function of a protein can usually be inferred from how conserved it appears in a multiple sequence alignment of that protein and its homologues. A reliable metric for quantifying residue conservation is desirable. Over the last two decades many such scores have been proposed, but none has emerged as a generally accepted standard. This work surveys the range of scores that biologists, biochemists, and, more recently, bioinformatics workers have developed, and reviews the intrinsic problems associated with developing and evaluating such a score. A general formula is proposed that may be used to compare the properties of different particular conservation scores or as a measure of conservation in its own right. (C) 2002 Wiley-Liss, Inc.
引用
收藏
页码:227 / 241
页数:15
相关论文
共 50 条
[31]   Accurate formula for p-values of gapped local sequence and profile alignments [J].
Mott, R .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 300 (03) :649-659
[32]  
Page RDM, 1998, MOL EVOLUTION PHYLOG
[33]   Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods [J].
Park, J ;
Karplus, K ;
Barrett, C ;
Hughey, R ;
Haussler, D ;
Hubbard, T ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 284 (04) :1201-1210
[34]   The variable and conserved interfaces of modeled olfactory receptor proteins [J].
Pilpel, Y ;
Lancet, D .
PROTEIN SCIENCE, 1999, 8 (05) :969-977
[35]   Evolutionary conservation in protein folding kinetics [J].
Plaxco, KW ;
Larson, S ;
Ruczinski, I ;
Riddle, DS ;
Thayer, EC ;
Buchwitz, B ;
Davidson, AR ;
Baker, D .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 298 (02) :303-312
[36]   DATABASE OF HOMOLOGY-DERIVED PROTEIN STRUCTURES AND THE STRUCTURAL MEANING OF SEQUENCE ALIGNMENT [J].
SANDER, C ;
SCHNEIDER, R .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1991, 9 (01) :56-68
[37]   Information content of individual genetic sequences [J].
Schneider, TD .
JOURNAL OF THEORETICAL BIOLOGY, 1997, 189 (04) :427-441
[38]   A MATHEMATICAL THEORY OF COMMUNICATION [J].
SHANNON, CE .
BELL SYSTEM TECHNICAL JOURNAL, 1948, 27 (03) :379-423
[39]   INFORMATION-THEORETICAL ENTROPY AS A MEASURE OF SEQUENCE VARIABILITY [J].
SHENKIN, PS ;
ERMAN, B ;
MASTRANDREA, LD .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1991, 11 (04) :297-313
[40]   WEIGHTING ALIGNED PROTEIN OR NUCLEIC-ACID SEQUENCES TO CORRECT FOR UNEQUAL REPRESENTATION [J].
SIBBALD, PR ;
ARGOS, P .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 216 (04) :813-818