Identification of disease causing loci using an array-based genotyping approach on pooled DNA

Cited by: 55
Authors
Craig, DW
Huentelman, MJ
Hu-Lince, D
Zismann, VL
Kruer, MC
Lee, AM
Puffenberger, EG
Pearson, JM
Stephan, DA [1]
Affiliations
[1] Translat Genom Res Inst TGen, Neurogenom Div, Phoenix, AZ 85004 USA
[2] Clin Special Children, Strasburg, PA 17579 USA
DOI
10.1186/1471-2164-6-138
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: Pooling genomic DNA samples within clinical classes of disease, followed by genotyping on whole-genome SNP microarrays, allows for rapid and inexpensive genome-wide association studies. Key to the success of these studies are the accuracy of the allelic frequency calculations, the ability to identify false positives arising from assay variability, and the ability to better resolve association signals through analysis of neighbouring SNPs.
Results: We report the accuracy of allelic frequency measurements on pooled genomic DNA samples by comparing these measurements to the known allelic frequencies as determined by individual genotyping. We describe modifications to the calculation of k-correction factors from relative allele signal (RAS) values that remove biases and result in more accurate allelic frequency predictions. Our results show that the least accurate SNPs, those most likely to give false positives in an association study, are identifiable by comparing their frequencies both to those from a known database of individual genotypes and to those of the pooled replicates. In a disease with a previously identified genetic mutation, we demonstrate that one can identify the disease locus by comparing the predicted allelic frequencies in case and control pools. Furthermore, we demonstrate improved resolution of association signals using the mean of individual test statistics for consecutive SNPs windowed across the genome. A database of k-correction factors for predicting allelic frequencies for each SNP, derived from several thousand individually genotyped samples, is provided. Lastly, a Perl script for calculating RAS values for the Affymetrix platform is provided.
Conclusion: Our results illustrate that pooling of DNA samples is an effective initial strategy to identify a genetic locus. However, it is important to eliminate inaccurate SNPs prior to analysis by comparing them to a database of individually genotyped samples as well as to replicates of the pool. Lastly, detection of association signals can be improved by incorporating data from neighbouring SNPs.
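The two calculations named in the abstract, k-corrected allele frequencies and windowed means of test statistics, can be illustrated with a minimal Python sketch. It rests on assumptions not taken from this record: it uses the commonly cited heterozygote-based correction, in which k is the A/B signal ratio observed in heterozygotes and the pooled frequency is A / (A + k·B), rather than the modified calculation the authors describe; RAS is taken to be A / (A + B); and the function names and the simple unweighted sliding window are hypothetical choices made for illustration.

```python
from typing import List


def k_from_heterozygote_ras(ras_het: float) -> float:
    """Assumed definition: k is the A/B signal ratio in heterozygotes.

    With RAS = A / (A + B), that ratio equals RAS / (1 - RAS).
    """
    if not 0.0 < ras_het < 1.0:
        raise ValueError("heterozygote RAS must lie strictly between 0 and 1")
    return ras_het / (1.0 - ras_het)


def corrected_frequency(ras_pool: float, k: float) -> float:
    """Estimate the allele A frequency in a pool as A / (A + k * B).

    Dividing numerator and denominator by (A + B) expresses this in RAS terms.
    """
    if not 0.0 <= ras_pool <= 1.0:
        raise ValueError("pool RAS must lie between 0 and 1")
    if k <= 0.0:
        raise ValueError("k must be positive")
    return ras_pool / (ras_pool + k * (1.0 - ras_pool))


def windowed_mean(stats: List[float], window: int) -> List[float]:
    """Mean test statistic over each run of `window` consecutive SNPs.

    `stats` is assumed to be ordered by genomic position.
    """
    if window < 1 or window > len(stats):
        raise ValueError("window must be between 1 and len(stats)")
    total = sum(stats[:window])
    means = [total / window]
    # Slide the window one SNP at a time, updating the running sum.
    for i in range(window, len(stats)):
        total += stats[i] - stats[i - window]
        means.append(total / window)
    return means
```

Under these assumptions, a pool whose RAS equals the heterozygote RAS is estimated at a frequency of 0.5, which is the behaviour the correction is meant to produce. The windowed mean returns one value per window position, so a single noisy SNP contributes only a fraction of any window's score.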
Pages: 9