A SNP-centric database for the investigation of the human genome

被引:57
作者
Riva, A [1 ]
Kohane, IS [1 ]
机构
[1] Childrens Hosp, Childrens Hosp Informat Program, Boston, MA 02115 USA
关键词
D O I
10.1186/1471-2105-5-33
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Single Nucleotide Polymorphisms ( SNPs) are an increasingly important tool for genetic and biomedical research. Although current genomic databases contain information on several million SNPs and are growing at a very fast rate, the true value of a SNP in this context is a function of the quality of the annotations that characterize it. Retrieving and analyzing such data for a large number of SNPs often represents a major bottleneck in the design of large-scale association studies. Description: SNPper is a web-based application designed to facilitate the retrieval and use of human SNPs for high-throughput research purposes. It provides a rich local database generated by combining SNP data with the Human Genome sequence and with several other data sources, and offers the user a variety of querying, visualization and data export tools. In this paper we describe the structure and organization of the SNPper database, we review the available data export and visualization options, and we describe how the architecture of SNPper and its specialized data structures support high-volume SNP analysis. Conclusions: The rich annotation database and the powerful data manipulation and presentation facilities it offers make SNPper a very useful online resource for SNP research. Its success proves the great need for integrated and interoperable resources in the field of computational biology, and shows how such systems may play a critical role in supporting the large-scale computational analysis of our genome.
引用
收藏
页数:8
相关论文
共 11 条
[1]  
BROOKES A, 1999, ESSENCE SNPS, P177
[2]   ALFRED: an allele frequency database for diverse populations and DNA polymorphisms [J].
Cheung, KH ;
Osier, MV ;
Kidd, JR ;
Pakstis, AJ ;
Miller, PL ;
Kidd, KK .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :361-363
[3]   HGVbase:: a human sequence variation database emphasizing data quality and a broad spectrum of data sources [J].
Fredman, D ;
Siegfried, M ;
Yuan, YP ;
Bork, P ;
Lehväslaiho, H ;
Brookes, AJ .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :387-391
[4]   The UCSC Genome Browser Database [J].
Karolchik, D ;
Baertsch, R ;
Diekhans, M ;
Furey, TS ;
Hinrichs, A ;
Lu, YT ;
Roskin, KM ;
Schwartz, M ;
Sugnet, CW ;
Thomas, DJ ;
Weber, RJ ;
Haussler, D ;
Kent, WJ .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :51-54
[5]   Single nucleotide polymorphisms in innate immunity genes: abundant variation and potential role in complex human disease [J].
Lazarus, R ;
Vercelli, D ;
Palmer, LJ ;
Klimecki, WJ ;
Silverman, EK ;
Richter, B ;
Riva, A ;
Ramoni, M ;
Martinez, FD ;
Weiss, ST ;
Kwiatkowski, DJ .
IMMUNOLOGICAL REVIEWS, 2002, 190 (01) :9-25
[6]  
Riva A, 2002, AMIA 2002 SYMPOSIUM, PROCEEDINGS, P662
[7]   SNPper: retrieval and analysis of human SNPs [J].
Riva, A ;
Kohane, IS .
BIOINFORMATICS, 2002, 18 (12) :1681-1685
[8]   Single nucleotide polymorphisms and the future of genetic epidemiology [J].
Schork, NJ ;
Fallin, D ;
Lanchbury, JS .
CLINICAL GENETICS, 2000, 58 (04) :250-264
[9]   dbSNP: the NCBI database of genetic variation [J].
Sherry, ST ;
Ward, MH ;
Kholodov, M ;
Baker, J ;
Phan, L ;
Smigielski, EM ;
Sirotkin, K .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :308-311
[10]   Integrating biological databases [J].
Stein, LD .
NATURE REVIEWS GENETICS, 2003, 4 (05) :337-345