PolyDoms: a whole genome database for the identification of non-synonymous coding SNPs with the potential to impact disease

被引:44
作者
Jegga, Anil G.
Gowrisankar, Sivakumar
Chen, Jing
Aronow, Bruce J.
机构
[1] Childrens Hosp, Med Ctr, Div Biomed Informat, Cincinnati, OH 45229 USA
[2] Univ Cincinnati, Coll Med, Dept Pediat, Cincinnati, OH 45229 USA
[3] Univ Cincinnati, Dept Biomed Engn, Cincinnati, OH 45229 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1093/nar/gkl826
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As knowledge of human genetic polymorphisms grows, so does the opportunity and challenge of identifying those polymorphisms that may impact the health or disease risk of an individual person. A critical need is to organize large-scale polymorphism analyses and to prioritize candidate nonsynonymous coding SNPs (nsSNPs) that should be tested in experimental and epidemiological studies to establish their context-specific impacts on protein function. In addition, with emerging high-resolution clinical genetics testing, new polymorphisms must be analyzed in the context of all available protein feature knowledge including other known mutations and polymorphisms. To approach this, we developed PolyDoms (http://polydoms.cchmc.org/) as a database to integrate the results of multiple algorithmic procedures and functional criteria applied to the entire Entrez dbSNP dataset. In addition to predicting structural and functional impacts of all nsSNPs, filtering functions enable group-based identification of potentially harmful nsSNPs among multiple genes associated with specific diseases, anatomies, mammalian phenotypes, gene ontologies, pathways or protein domains. PolyDoms, thus, provides a means to derive a list of candidate SNPs to be evaluated in experimental or epidemiological studies for impact on protein functions and disease risk associations. PolyDoms will continue to be curated to improve its usefulness.
引用
收藏
页码:D700 / D706
页数:7
相关论文
共 37 条
[1]   Accurate prediction of solvent accessibility using neural networks-based regression [J].
Adamczak, R ;
Porollo, A ;
Meller, J .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 56 (04) :753-767
[2]  
Becker KG, 2004, NAT GENET, V36, P431, DOI 10.1038/ng0504-431
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   It's raining SNPs, hallelujah? [J].
Chakravarti, A .
NATURE GENETICS, 1998, 19 (03) :216-217
[5]   Variations on a theme: Cataloging human DNA sequence variation [J].
Collins, FS ;
Guyer, MS ;
Chakravarti, A .
SCIENCE, 1997, 278 (5343) :1580-1581
[6]   PMUT:: a web-based tool for the annotation of pathological mutations on proteins [J].
Ferrer-Costa, C ;
Gelpí, JL ;
Zamakola, L ;
Parraga, I ;
de la Cruz, X ;
Orozco, M .
BIOINFORMATICS, 2005, 21 (14) :3176-3178
[7]   Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders [J].
Hamosh, A ;
Scott, AF ;
Amberger, J ;
Bocchini, C ;
Valle, D ;
McKusick, VA .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :52-55
[8]   AUTOMATED ASSEMBLY OF PROTEIN BLOCKS FOR DATABASE SEARCHING [J].
HENIKOFF, S ;
HENIKOFF, JG .
NUCLEIC ACIDS RESEARCH, 1991, 19 (23) :6565-6572
[9]   A gene network for navigating the literature [J].
Hoffmann, R ;
Valencia, A .
NATURE GENETICS, 2004, 36 (07) :664-664
[10]  
HORII A, 1992, CANCER RES, V52, P3231