Sequence variation in G-protein-coupled receptors: analysis of single nucleotide polymorphisms

被引:31
作者
Balasubramanian, S
Xia, Y
Freinkman, E
Gerstein, M
机构
[1] Yale Univ, Dept Biochem & Mol Biophys, New Haven, CT 06520 USA
[2] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
关键词
D O I
10.1093/nar/gki311
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We assessed the disease-causing potential of single nucleotide polymorphisms (SNPs) based on a simple set of sequence-based features. We focused on SNPs from the dbSNP database in G-protein-coupled receptors (GPCRs), a large class of important transmembrane (TM) proteins. Apart from the location of the SNP in the protein, we evaluated the predictive power of three major classes of features to differentiate between disease-causing mutations and neutral changes: (i) properties derived from amino-acid scales, such as volume and hydrophobicity; (ii) position-specific phylogenetic features reflecting evolutionary conservation, such as normalized site entropy, residue frequency and SIFT score; and (iii) substitution-matrix scores, such as those derived from the BLOSUM62, GRANTHAM and PHAT matrices. We validated our approach using a control dataset consisting of known disease-causing mutations and neutral variations. Logistic regression analyses indicated that position-specific phylogenetic features that describe the conservation of an amino acid at a specific site are the best discriminators of disease mutations versus neutral variations, and integration of all our features improves discrimination power. Overall, we identify 115 SNPs in GPCRs from dbSNP that are likely to be associated with disease and thus are good candidates for genotyping in association studies.
引用
收藏
页码:1710 / 1721
页数:12
相关论文
共 59 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
ARKIN JT, 1998, BIOCHIM BIOPHYS ACTA, V1429, P113
[3]  
Becker OM, 2003, CURR OPIN DRUG DISC, V6, P353
[4]   High-throughput modeling of human G-protein coupled receptors: Amino acid sequence alignment, three-dimensional model building, and receptor library screening [J].
Bissantz, C ;
Logean, A ;
Rognan, D .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (03) :1162-1176
[5]   Protein-based virtual screening of chemical databases. II. Are homology models of G-protein coupled receptors suitable targets? [J].
Bissantz, C ;
Bernard, P ;
Hibert, M ;
Rognan, D .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 50 (01) :5-25
[6]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[7]   Bayesian approach to discovering pathogenic SNPs in conserved protein domains [J].
Cai, ZH ;
Tsung, EF ;
Marinescu, VD ;
Ramoni, MF ;
Riva, A ;
Kohane, IS .
HUMAN MUTATION, 2004, 24 (02) :178-184
[8]   Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: Structure-based assessment of amino acid variation [J].
Chasman, D ;
Adams, RM .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 307 (02) :683-706
[9]  
Dorn GW, 2000, MOL PHARMACOL, V57, P278
[10]   Complex promoter and coding region β2-adrenergic receptor haplotypes alter receptor expression and predict in vivo responsiveness [J].
Drysdale, CM ;
McGraw, DW ;
Stack, CB ;
Stephens, JC ;
Judson, RS ;
Nandabalan, K ;
Arnold, K ;
Ruano, G ;
Liggett, SB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (19) :10483-10488