Single nucleotide polymorphisms associated with rat expressed sequences

被引:34
作者
Guryev, V
Berezikov, E
Malik, R
Plasterk, RHA
Cuppen, E
机构
[1] Netherlands Inst Dev Biol, Hubrecht Lab, NL-3584 CT Utrecht, Netherlands
[2] Univ Utrecht, Dept Math & Comp Sci, NL-3584 CH Utrecht, Netherlands
关键词
D O I
10.1101/gr.2154304
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Single nucleotide polymorphisms (SNPs) are the most common source of genetic variation in populations and are thus most likely to account for the majority of phenotypic and behavioral differences between individuals or strains. Although the rat is extensively studied for the latter, data on naturally occurring polymorphisms are mostly lacking. We have used publicly available sequences consisting of whole-genome shotgun (WGS), expressed sequence tag (EST), and mRNA data as a source for the in silico identification of SNPs in gene-coding regions and have identified a large collection of 33,305 high-quality candidate SNPs. Experimental verification of 471 candidate SNPs using a limited set of rat isolates revealed a confirmation rate of similar to50%. Although the majority of SNPs were identified between Sprague-Dawley (EST data) and Brown Norway (WGS data) strains, we found that 66% of the verified variations are common among different rat strains. All SNPs were extensively annotated, including chromosomal and genetic map information, and nonsynonymous SNPs were analyzed by SIFT and PolyPhen prediction programs for their potential deleterious effect on protein function. Interestingly, we retrieved three SNPs from the database that result in the introduction of a premature stop codon and that could be confirmed experimentally. Two of these "in silico-identified knockouts" reside in interesting QTL regions. Data are publicly available via a Web interface (http://cascad.niob.knaw.nl), allowing simple and advanced search queries.
引用
收藏
页码:1438 / 1443
页数:6
相关论文
共 21 条
[1]   Genealogies of mouse inbred strains [J].
Beck, JA ;
Lloyd, S ;
Hafezparast, M ;
Lennon-Pierce, M ;
Eppig, JT ;
Festing, MFW ;
Fisher, EMC .
NATURE GENETICS, 2000, 24 (01) :23-+
[2]   BIOCHEMICAL MARKERS IN INBRED STRAINS OF THE RAT (RATTUS-NORVEGICUS) [J].
BENDER, K ;
ADAMS, M ;
BAVERSTOCK, PR ;
DENBIEMAN, M ;
BISSBORT, S ;
BRDICKA, R ;
BUTCHER, GW ;
CRAMER, DV ;
VONDEIMLING, O ;
FESTING, MFW ;
GUNTHER, E ;
GUTTMANN, RD ;
HEDRICH, HJ ;
KENDALL, PB ;
KLUGE, R ;
MOUTIER, R ;
SIMON, B ;
WOMACK, JE ;
YAMADA, J ;
VANZUTPHEN, B .
IMMUNOGENETICS, 1984, 19 (03) :257-266
[3]   GENOTRACE: cDNA-based local GENOme assembly from TRACE archives [J].
Berezikov, E ;
Plasterk, RHA ;
Cuppen, E .
BIOINFORMATICS, 2002, 18 (10) :1396-1397
[4]   EST analysis online: WWW tools for detection of SNPs and alternative splice forms [J].
Brett, D ;
Lehmann, G ;
Hanke, J ;
Gross, S ;
Reich, J ;
Bork, P .
TRENDS IN GENETICS, 2000, 16 (09) :416-418
[5]   Reliable identification of large numbers of candidate SNPs from public EST data [J].
Buetow, KH ;
Edmonson, MN ;
Cassidy, AB .
NATURE GENETICS, 1999, 21 (03) :323-325
[6]  
Deutsch S, 2001, GENOME RES, V11, P300
[7]   Genome-wide analysis of single-nucleotide polymorphisms in human expressed sequences [J].
Irizarry, K ;
Kustanovich, V ;
Li, C ;
Brown, N ;
Nelson, S ;
Wong, W ;
Lee, CJ .
NATURE GENETICS, 2000, 26 (02) :233-236
[8]   MEGA2: molecular evolutionary genetics analysis software [J].
Kumar, S ;
Tamura, K ;
Jakobsen, IB ;
Nei, M .
BIOINFORMATICS, 2001, 17 (12) :1244-1245
[9]   A general approach to single-nucleotide polymorphism discovery [J].
Marth, GT ;
Korf, I ;
Yandell, MD ;
Yeh, RT ;
Gu, ZJ ;
Zakeri, H ;
Stitziel, NO ;
Hillier, L ;
Kwok, PY ;
Gish, WR .
NATURE GENETICS, 1999, 23 (04) :452-456
[10]   SIFT: predicting amino acid changes that affect protein function [J].
Ng, PC ;
Henikoff, S .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3812-3814