Human non-synonymous SNPs: server and survey

被引:1886
作者
Ramensky, V
Bork, P
Sunyaev, S
机构
[1] European Mol Biol Lab, D-69117 Heidelberg, Germany
[2] Max Delbruck Ctr Mol Med, D-13122 Berlin, Germany
[3] VA Engelhardt Mol Biol Inst, Moscow 119991, Russia
关键词
D O I
10.1093/nar/gkf493
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
Human single nucleotide polymorphisms (SNPs) represent the most frequent type of human population DNA variation. One of the main goals of SNP research is to understand the genetics of the human phenotype variation and especially the genetic basis of human complex diseases. Non-synonymous coding SNPs (nsSNPs) comprise a group of SNPs that, together with SNPs in regulatory regions, are believed to have the highest impact on phenotype. Here we present a World Wide Web server to predict the effect of an nsSNP on protein structure and function. The prediction method enabled analysis of the publicly available SNP database HGVbase, which gave rise to a dataset of nsSNPs with predicted functionality. The dataset was further used to compare the effect of various structural and functional characteristics of amino acid substitutions responsible for phenotypic display of nsSNPs. We also studied the dependence of selective pressure on the structural and functional properties of proteins. We found that in our dataset the selection pressure against deleterious SNPs depends on the molecular function of the protein, although it is insensitive to several other protein features considered. The strongest selective pressure was detected for proteins involved in transcription regulation.
引用
收藏
页码:3894 / 3900
页数:7
相关论文
共 37 条
[1]
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[2]
Protein sequence databases [J].
Apweiler, R .
ADVANCES IN PROTEIN CHEMISTRY, VOL 54, 2000, 54 :31-71
[3]
Ashburner M, 2001, GENOME RES, V11, P1425
[4]
The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]
Sequence diversity in 36 candidate genes for cardiovascular disorders [J].
Cambien, F ;
Poirier, O ;
Nicaud, V ;
Herrmann, SM ;
Mallet, C ;
Ricard, S ;
Behague, I ;
Hallet, V ;
Blanc, H ;
Loukaci, V ;
Thillet, J ;
Evans, A ;
Ruidavets, JB ;
Arveiler, D ;
Luc, G ;
Tiret, L .
AMERICAN JOURNAL OF HUMAN GENETICS, 1999, 65 (01) :183-191
[6]
Characterization of single-nucleotide polymorphisms in coding regions of human genes [J].
Cargill, M ;
Altshuler, D ;
Ireland, J ;
Sklar, P ;
Ardlie, K ;
Patil, N ;
Lane, CR ;
Lim, EP ;
Kalyanaraman, N ;
Nemesh, J ;
Ziaugra, L ;
Friedland, L ;
Rolfe, A ;
Warrington, J ;
Lipshutz, R ;
Daley, GQ ;
Lander, ES .
NATURE GENETICS, 1999, 22 (03) :231-238
[7]
Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: Structure-based assessment of amino acid variation [J].
Chasman, D ;
Adams, RM .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (02) :683-706
[8]
INFORMATION ENHANCEMENT METHODS FOR LARGE-SCALE SEQUENCE-ANALYSIS [J].
CLAVERIE, JM ;
STATES, DJ .
COMPUTERS & CHEMISTRY, 1993, 17 (02) :191-201
[9]
SNP association studies in Alzheimer's disease highlight problems for complex disease analysis [J].
Emahazion, T ;
Feuk, L ;
Jobs, M ;
Sawyer, SL ;
Fredman, D ;
St Clair, D ;
Prince, JA ;
Brookes, AJ .
TRENDS IN GENETICS, 2001, 17 (07) :407-413
[10]
Fay JC, 2001, GENETICS, V158, P1227