Genome bioinformatic analysis of nonsynonymous SNPs

被引:47
作者
Burke, David F.
Worth, Catherine L.
Priego, Eva-Maria
Cheng, Tammy
Smink, Luc J.
Todd, John A.
Blundell, Tom L.
机构
[1] Univ Cambridge, Dept Biochem, Cambridge CB2 1GA, England
[2] Univ Cambridge, Cambridge Inst Med Res, Wellcome Trust Diab & Inflammat Lab, Juvenile Diab Res Fdn, Cambridge CB2 2XY, England
基金
英国惠康基金;
关键词
D O I
10.1186/1471-2105-8-301
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Genome-wide association studies of common diseases for common, low penetrance causal variants are underway. A proportion of these will alter protein sequences, the most common of which is the non-synonymous single nucleotide polymorphism (nsSNP). It would be an advantage if the functional effects of an nsSNP on protein structure and function could be predicted, both for the final identification process of a causal variant in a disease-associated chromosome region, and in further functional analyses of the nsSNP and its disease-associated protein. Results: In the present report we have compared and contrasted structure-and sequence-based methods of prediction to over 5500 genes carrying nearly 24,000 nsSNPs, by employing an automatic comparative modelling procedure to build models for the genes. The nsSNP information came from two sources, the OMIM database which are rare (minor allele frequency, MAF, < 0.01) and are known to cause penetrant, monogenic diseases. Secondly, nsSNP information came from dbSNPI25, for which the vast majority of nsSNPs, mostly MAF > 0.05, have no known link to a disease. For over 40% of the nsSNPs, structure-based methods predicted which of these sequence changes are likely to either disrupt the structure of the protein or interfere with the function or interactions of the protein. For the remaining 60%, we generated sequence-based predictions. Conclusion: We show that, in general, the prediction tools are able distinguish disease causing mutations from those mutations which are thought to have a neutral affect. We give examples of mutations in genes that are predicted to be deleterious and may have a role in disease. Contrary to previous reports, we also show that rare mutations are consistently predicted to be deleterious as often as commonly occurring nsSNPs.
引用
收藏
页数:15
相关论文
共 44 条
[1]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[2]  
[Anonymous], 1998, CATALOG HUMAN GENES
[3]   TLR4 mutations are associated with endotoxin hyporesponsiveness in humans [J].
Arbour, NC ;
Lorenz, E ;
Schutte, BC ;
Zabner, J ;
Kline, JN ;
Jones, M ;
Frees, K ;
Watt, JL ;
Schwartz, DA .
NATURE GENETICS, 2000, 25 (02) :187-+
[4]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[5]  
BURKE DF, 1999, PROTEIN-STRUCT FUNCT, P1
[6]   Predicting protein stability changes from sequences using support vector machines [J].
Capriotti, E ;
Fariselli, P ;
Calabrese, R ;
Casadio, R .
BIOINFORMATICS, 2005, 21 :54-58
[7]   A large-scale genetic association study confirms IL12B and leads to the identification of IL23R as psoriasis-risk genes [J].
Cargill, Michele ;
Schrodi, Steven J. ;
Chang, Monica ;
Garcia, Veronica E. ;
Brandon, Rhonda ;
Callis, Kristina P. ;
Matsunami, Nori ;
Ardlie, Kristin G. ;
Civello, Daniel ;
Catanese, Joseph J. ;
Leong, Diane U. ;
Panko, Jackie M. ;
McAllister, Linda B. ;
Hansen, Christopher B. ;
Papenfuss, Jason ;
Prescott, Stephen M. ;
White, Thomas J. ;
Leppert, Mark F. ;
Krueger, Gerald G. ;
Begovich, Ann B. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (02) :273-290
[8]   Distinguishing structural and functional restraints in evolution in order to identify interaction sites [J].
Chelliah, V ;
Chen, L ;
Blundell, TL ;
Lovell, SC .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 342 (05) :1487-1504
[9]   Prediction of protein stability changes for single-site mutations using support vector machines [J].
Cheng, JL ;
Randall, A ;
Baldi, P .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 62 (04) :1125-1132
[10]   Population structure, differential bias and genomic control in a large-scale, case-control association study [J].
Clayton, DG ;
Walker, NM ;
Smyth, DJ ;
Pask, R ;
Cooper, JD ;
Maier, LM ;
Smink, LJ ;
Lam, AC ;
Ovington, NR ;
Stevens, HE ;
Nutland, S ;
Howson, JMM ;
Faham, M ;
Moorhead, M ;
Jones, HB ;
Falkowski, M ;
Hardenbol, P ;
Willis, TD ;
Todd, JA .
NATURE GENETICS, 2005, 37 (11) :1243-1246