The Next Generation of Molecular Markers From Massively Parallel Sequencing of Pooled DNA Samples

被引:275
作者
Futschik, Andreas [1 ]
Schloetterer, Christian [2 ]
机构
[1] Univ Vienna, Dept Stat, A-1010 Vienna, Austria
[2] Vet Med Univ Wien, Inst Populat Genet, A-1210 Vienna, Austria
关键词
ALLELE FREQUENCIES; DISCOVERY;
D O I
10.1534/genetics.110.114397
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Next generation sequencing (NGS) is about to revolutionize genetic analysis. Currently NGS techniques are mainly used to sequence individual genomes. Due to the high sequence coverage required, the costs for population-scale analyses are still too high to allow an extension to nonmodel organisms. Here, we show that NGS of pools of individuals is often more effective in SNP discovery and provides more accurate allele frequency estimates, even when taking sequencing errors into account. We modify the population genetic estimators Tajima's pi and Watterson's theta to obtain unbiased estimates from NGS pooling data. Given the same sequencing effort, the resulting estimators often show a better performance than those obtained from individual sequencing. Although our analysis also shows that NGS of pools of individuals will not be preferable under all circumstances, it provides a cost-effective approach to estimate allele frequencies on a genome-wide scale.
引用
收藏
页码:207 / 218
页数:12
相关论文
共 15 条
[1]   Testing for neutrality in samples with sequencing errors [J].
Achat, Guillaume .
GENETICS, 2008, 179 (03) :1409-1424
[2]  
Durrett R, 2008, PROBAB APPL SER, P1, DOI 10.1007/978-0-387-78168-6_1
[3]  
Eberle MA, 2000, GENET EPIDEMIOL, V19, pS29, DOI 10.1002/1098-2272(2000)19:1+<::AID-GEPI5>3.0.CO
[4]  
2-P
[5]   DNA Sudoku-harnessing high-throughput sequencing for multiplexed specimen analysis [J].
Erlich, Yaniv ;
Chang, Kenneth ;
Gordon, Assaf ;
Ronen, Roy ;
Navon, Oron ;
Rooks, Michelle ;
Hannon, Gregory J. .
GENOME RESEARCH, 2009, 19 (07) :1243-1253
[6]   On the inadmissibility of Watterson's estimator [J].
Futschik, Andreas ;
Gach, Florian .
THEORETICAL POPULATION BIOLOGY, 2008, 73 (02) :212-221
[7]   Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA [J].
Holt, Kathryn E. ;
Teo, Yik Y. ;
Li, Heng ;
Nair, Satheesh ;
Dougan, Gordon ;
Wain, John ;
Parkhill, Julian .
BIOINFORMATICS, 2009, 25 (16) :2074-2075
[8]   Generating samples under a Wright-Fisher neutral model of genetic variation [J].
Hudson, RR .
BIOINFORMATICS, 2002, 18 (02) :337-338
[9]   Population Genetic Inference From Resequencing Data [J].
Jiang, Rong ;
Tavare, Simon ;
Marjoram, Paul .
GENETICS, 2009, 181 (01) :187-197
[10]   Accurate and fast methods to estimate the population mutation rate from error prone sequences [J].
Knudsen, Bjarne ;
Miyamoto, Michael M. .
BMC BIOINFORMATICS, 2009, 10 :247