High-throughput genotyping by whole-genome resequencing

被引:656
作者
Huang, Xuehui [1 ,2 ]
Feng, Qi [1 ,2 ,3 ]
Qian, Qian [4 ]
Zhao, Qiang [1 ,2 ,3 ]
Wang, Lu [1 ,2 ]
Wang, Ahong [1 ,2 ]
Guan, Jianping [1 ,2 ]
Fan, Danlin [1 ,2 ]
Weng, Qijun [1 ,2 ]
Huang, Tao [1 ,2 ]
Dong, Guojun [4 ]
Sang, Tao [1 ,2 ,5 ]
Han, Bin [1 ,2 ,6 ]
机构
[1] Chinese Acad Sci, Natl Ctr Gene Res, Shanghai 200233, Peoples R China
[2] Chinese Acad Sci, Inst Plant Physiol & Ecol, Shanghai Inst Biol Sci, Shanghai 200233, Peoples R China
[3] Shanghai Jiao Tong Univ, Coll Life Sci & Biotechnol, Shanghai 200240, Peoples R China
[4] Chinese Acad Agr Sci, State Key Lab Rice Biol, China Natl Rice Res Inst, Hangzhou 310006, Zhejiang, Peoples R China
[5] Michigan State Univ, Dept Plant Biol, E Lansing, MI 48824 USA
[6] Chinese Acad Sci, Beijing Inst Genom, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
OPEN SOFTWARE; GENE; MAP;
D O I
10.1101/gr.089516.108
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The next-generation sequencing technology coupled with the growing number of genome sequences opens the opportunity to redesign genotyping strategies for more effective genetic mapping and genome analysis. We have developed a high-throughput method for genotyping recombinant populations utilizing whole-genome resequencing data generated by the Illumina Genome Analyzer. A sliding window approach is designed to collectively examine genome-wide single nucleotide polymorphisms for genotype calling and recombination breakpoint determination. Using this method, we constructed a genetic map for 150 rice recombinant inbred lines with an expected genotype calling accuracy of 99.94% and a resolution of recombination breakpoints within an average of 40 kb. In comparison to the genetic map constructed with 287 PCR-based markers for the rice population, the sequencing-based method was; 203 faster in data collection and 353 more precise in recombination breakpoint determination. Using the sequencing-based genetic map, we located a quantitative trait locus of large effect on plant height in a 100-kb region containing the rice "green revolution'' gene. Through computer simulation, we demonstrate that the method is robust for different types of mapping populations derived from organisms with variable quality of genome sequences and is feasible for organisms with large genome sizes and low polymorphisms. With continuous advances in sequencing technologies, this genome-based method may replace the conventional marker-based genotyping approach to provide a powerful tool for large-scale gene discovery and for addressing a wide range of biological questions.
引用
收藏
页码:1068 / 1076
页数:9
相关论文
共 19 条
[1]   Identification of genetic variants using bar-coded multiplexed sequencing [J].
Craig, David W. ;
Pearson, John V. ;
Szelinger, Szabolcs ;
Sekar, Aswin ;
Redman, Margot ;
Corneveaux, Jason J. ;
Pawlowski, Traci L. ;
Laub, Trisha ;
Nunn, Gary ;
Stephan, Dietrich A. ;
Homer, Nils ;
Huentelman, Matthew J. .
NATURE METHODS, 2008, 5 (10) :887-893
[2]   Multiplex sequencing of plant chloroplast genomes using Solexa sequencing-by-synthesis technology [J].
Cronn, Richard ;
Liston, Aaron ;
Parks, Matthew ;
Gernandt, David S. ;
Shen, Rongkun ;
Mockler, Todd .
NUCLEIC ACIDS RESEARCH, 2008, 36 (19)
[3]   Substantial biases in ultra-short read data sets from high-throughput DNA sequencing [J].
Dohm, Juliane C. ;
Lottaz, Claudio ;
Borodina, Tatiana ;
Himmelbauer, Heinz .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
[4]   Development and evaluation of a high-throughput, low-cost genotyping platform based on oligonucleotide microarrays in rice [J].
Edwards, Jeremy D. ;
Janda, Jaroslav ;
Sweeney, Megan T. ;
Gaikwad, Ambika B. ;
Liu, Bin ;
Leung, Hei ;
Galbraith, David W. .
PLANT METHODS, 2008, 4 (1)
[5]   A sequence-based variation map of 8.27 million SNPs in inbred mouse strains [J].
Frazer, Kelly A. ;
Eskin, Eleazar ;
Kang, Hyun Min ;
Bogue, Molly A. ;
Hinds, David A. ;
Beilharz, Erica J. ;
Gupta, Robert V. ;
Montgomery, Julie ;
Morenzoni, Matt M. ;
Nilsen, Geoffrey B. ;
Pethiyagoda, Charit L. ;
Stuve, Laura L. ;
Johnson, Frank M. ;
Daly, Mark J. ;
Wade, Claire M. ;
Cox, David R. .
NATURE, 2007, 448 (7157) :1050-U8
[6]   Versatile and open software for comparing large genomes [J].
Kurtz, S ;
Phillippy, A ;
Delcher, AL ;
Smoot, M ;
Shumway, M ;
Antonescu, C ;
Salzberg, SL .
GENOME BIOLOGY, 2004, 5 (02)
[7]   Mapping short DNA sequencing reads and calling variants using mapping quality scores [J].
Li, Heng ;
Ruan, Jue ;
Durbin, Richard .
GENOME RESEARCH, 2008, 18 (11) :1851-1858
[8]  
Lincoln S. E., 1993, MAPMAKER EXP 3 0 MAP
[9]   The map-based sequence of the rice genome [J].
Matsumoto, T ;
Wu, JZ ;
Kanamori, H ;
Katayose, Y ;
Fujisawa, M ;
Namiki, N ;
Mizuno, H ;
Yamamoto, K ;
Antonio, BA ;
Baba, T ;
Sakata, K ;
Nagamura, Y ;
Aoki, H ;
Arikawa, K ;
Arita, K ;
Bito, T ;
Chiden, Y ;
Fujitsuka, N ;
Fukunaka, R ;
Hamada, M ;
Harada, C ;
Hayashi, A ;
Hijishita, S ;
Honda, M ;
Hosokawa, S ;
Ichikawa, Y ;
Idonuma, A ;
Iijima, M ;
Ikeda, M ;
Ikeno, M ;
Ito, K ;
Ito, S ;
Ito, T ;
Ito, Y ;
Ito, Y ;
Iwabuchi, A ;
Kamiya, K ;
Karasawa, W ;
Kurita, K ;
Katagiri, S ;
Kikuta, A ;
Kobayashi, H ;
Kobayashi, N ;
Machita, K ;
Maehara, T ;
Masukawa, M ;
Mizubayashi, T ;
Mukai, Y ;
Nagasaki, H ;
Nagata, Y .
NATURE, 2005, 436 (7052) :793-800
[10]   Genotyping pooled DNA using 100K SNP microarrays: a step towards genomewide association scans [J].
Meaburn, E ;
Butcher, LM ;
Schalkwyk, LC ;
Plomin, R .
NUCLEIC ACIDS RESEARCH, 2006, 34 (04)