High-throughput variation detection and genotyping using microarrays

被引:221
作者
Cutler, DJ [1 ]
Zwick, ME
Carrasquillo, MM
Yohn, CT
Tobin, KP
Kashuk, C
Mathews, DJ
Shah, NA
Eichler, EE
Warrington, JA
Chakravarti, A
机构
[1] Johns Hopkins Univ, Sch Med, McKusick Nathans Inst Genet Med, Baltimore, MD 21287 USA
[2] Affymetrix Inc, Santa Clara, CA 95051 USA
[3] Case Western Reserve Univ, Dept Genet, Cleveland, OH 44106 USA
关键词
D O I
10.1101/gr.197201
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The genetic dissection of complex traits may ultimately require a large number of SNPs to be a genotyped in multiple individuals who exhibit phenotypic variation in a trait of interest. Microarray technology can enable rapid genotyping of variation specific to study samples. To facilitate their use, we have developed an automated statistical method (ABACUS) to analyze microarray hybridization data and applied this method to Affymetrix Variation Detection Arrays (VDAS). ABACUS provides a quality score to individual genotypes, allowing investigators to focus their attention on sites that give accurate information. We have applied ABACUS to an experiment encompassing 32 autosomal and eight X-linked genomic regions, each consisting of similar to 50 kb of unique sequence spanning a 100-kb region, in 40 humans. At sufficiently high-quality scores, we are able to read similar to 80% of all sites. To assess the accuracy of SNP detection, 108 of 108 SNPs have been experimentally confirmed; an additional 371 SNPs have been confirmed electronically. To access the accuracy of diploid genotypes at segregating autosomal sites, we confirmed 1515 of 1515 homozygous calls, and 420 of 423 (99.29%) heterozygotes. In replicate experiments, consisting of independent amplification of identical samples followed by hybridization to distinct microarrays of the same design, genotyping is highly repeatable. In an autosomal replicate experiment, 813,295 of 813,295 genotypes are called identically (including 351 heterozygotes); at an X-linked focus in males (haploid), 841,236 of 841,236 sites are called identically.
引用
收藏
页码:1913 / 1925
页数:13
相关论文
共 41 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   An SNP map of the human genome generated by reduced representation shotgun sequencing [J].
Altshuler, D ;
Pollara, VJ ;
Cowles, CR ;
Van Etten, WJ ;
Baldwin, J ;
Linton, L ;
Lander, ES .
NATURE, 2000, 407 (6803) :513-516
[3]   Characterization of single-nucleotide polymorphisms in coding regions of human genes [J].
Cargill, M ;
Altshuler, D ;
Ireland, J ;
Sklar, P ;
Ardlie, K ;
Patil, N ;
Lane, CR ;
Lim, EP ;
Kalyanaraman, N ;
Nemesh, J ;
Ziaugra, L ;
Friedland, L ;
Rolfe, A ;
Warrington, J ;
Lipshutz, R ;
Daley, GQ ;
Lander, ES .
NATURE GENETICS, 1999, 22 (03) :231-238
[4]  
CHAKRAVARTI A, NAT GENET, V21, P56
[5]   Accessing genetic information with high-density DNA arrays [J].
Chee, M ;
Yang, R ;
Hubbell, E ;
Berno, A ;
Huang, XC ;
Stern, D ;
Winkler, J ;
Lockhart, DJ ;
Morris, MS ;
Fodor, SPA .
SCIENCE, 1996, 274 (5287) :610-614
[6]   A DNA polymorphism discovery resource for research on human genetic variation [J].
Collins, FS ;
Brooks, LD ;
Chakravarti, A .
GENOME RESEARCH, 1998, 8 (12) :1229-1231
[7]   CONTRIBUTING SOFTWARE TO THE INTERNET - THE AMPLIFY PROGRAM [J].
ENGELS, WR .
TRENDS IN BIOCHEMICAL SCIENCES, 1993, 18 (11) :448-450
[8]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[9]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185
[10]   A computer program for aligning a cDNA sequence with a genomic DNA sequence [J].
Florea, L ;
Hartzell, G ;
Zhang, Z ;
Rubin, GM ;
Miller, W .
GENOME RESEARCH, 1998, 8 (09) :967-974