Whole genome sequencing of peach (Prunus persica L.) for SNP identification and selection

被引:60
作者
Ahmad, Riaz [1 ]
Parfitt, Dan E. [1 ]
Fass, Joseph
Ogundiwin, Ebenezer [1 ]
Dhingra, Amit [2 ]
Gradziel, Thomas M. [1 ]
Lin, Dawei
Joshi, Nikhil A.
Martinez-Garcia, Pedro J. [1 ]
Crisosto, Carlos H. [1 ]
机构
[1] Univ Calif Davis, Dept Plant Sci, Davis, CA 95616 USA
[2] Washington State Univ, Dept Hort & Landscape Architecture, Pullman, WA 99164 USA
基金
美国食品与农业研究所;
关键词
VELVET; MAP;
D O I
10.1186/1471-2164-12-569
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 [微生物学]; 090105 [作物生产系统与生态工程];
摘要
Background: The application of next generation sequencing technologies and bioinformatic scripts to identify high frequency SNPs distributed throughout the peach genome is described. Three peach genomes were sequenced using Roche 454 and Illumina/Solexa technologies to obtain long contigs for alignment to the draft 'Lovell' peach sequence as well as sufficient depth of coverage for 'in silico' SNP discovery. Description: The sequences were aligned to the 'Lovell' peach genome released April 01, 2010 by the International Peach Genome Initiative (IPGI). 'Dr. Davis', 'F8, 1-42' and 'Georgia Belle' were sequenced to add SNPs segregating in two breeding populations, Pop DF ('Dr. Davis' x 'F8, 1-42') and Pop DG ('Dr. Davis' x 'Georgia Belle'). Roche 454 sequencing produced 980,000 total reads with 236 Mb sequence for 'Dr. Davis' and 735,000 total reads with 172 Mb sequence for 'F8, 1-42'. 84 bp x 84 bp paired end Illumina/Solexa sequences yielded 25.5, 21.4, 25.5 million sequences for 'Dr. Davis','F8, 1-42' and 'Georgia Belle', respectively. BWA/SAMtools were used for alignment of raw reads and SNP detection, with custom PERL scripts for SNP filtering. Velvet's Columbus module was used for sequence assembly. Comparison of aligned and overlapping sequences from both Roche 454 and Illumina/Solexa resulted in the selection of 6654 high quality SNPs for 'Dr. Davis' vs. 'F8, 1-42' and 'Georgia Belle', distributed on eight major peach genome scaffolds as defined from the 'Lovell' assembly. Conclusion: The eight scaffolds contained about 215-225 Mb of peach genomic sequences with one SNP/similar to 40,000 bases. All sequences from Roche 454 and Illumina/Solexa have been submitted to NCBI for public use in the Short Read Archive database. SNPs have been deposited in the NCBI SNP database.
引用
收藏
页数:7
相关论文
共 23 条
[1]
Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[2]
Quality scores and SNP detection in sequencing-by-synthesis systems [J].
Brockman, William ;
Alvarez, Pablo ;
Young, Sarah ;
Garber, Manuel ;
Giannoukos, Georgia ;
Lee, William L. ;
Russ, Carsten ;
Lander, Eric S. ;
Nusbaum, Chad ;
Jaffe, David B. .
GENOME RESEARCH, 2008, 18 (05) :763-770
[3]
progressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and Rearrangement [J].
Darling, Aaron E. ;
Mau, Bob ;
Perna, Nicole T. .
PLOS ONE, 2010, 5 (06)
[4]
Substantial biases in ultra-short read data sets from high-throughput DNA sequencing [J].
Dohm, Juliane C. ;
Lottaz, Claudio ;
Borodina, Tatiana ;
Himmelbauer, Heinz .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
[5]
EDWARD KJ, 2008, SNP DISCOVERY PLANTS
[6]
Isolation of Arabidopsis nuclei and measurement of gene transcription rates using nuclear run-on assays [J].
Folta, Kevin M. ;
Kaufman, Lon S. .
NATURE PROTOCOLS, 2006, 1 (06) :3094-3100
[7]
Gupta PK, 2008, P NATL A SCI INDIA B, V78, P91
[8]
Evaluation of next generation sequencing platforms for population targeted sequencing studies [J].
Harismendy, Olivier ;
Ng, Pauline C. ;
Strausberg, Robert L. ;
Wang, Xiaoyun ;
Stockwell, Timothy B. ;
Beeson, Karen Y. ;
Schork, Nicholas J. ;
Murray, Sarah S. ;
Topol, Eric J. ;
Levy, Samuel ;
Frazer, Kelly A. .
GENOME BIOLOGY, 2009, 10 (03)
[9]
Candidate gene database and transcript map for peach, a model species for fruit trees [J].
Horn, R ;
Lecouls, AC ;
Callahan, A ;
Dandekar, A ;
Garay, L ;
McCord, P ;
Howad, W ;
Chan, H ;
Verde, I ;
Main, D ;
Jung, S ;
Georgi, L ;
Forrest, S ;
Mook, J ;
Zhebentyayeva, T ;
Yu, YS ;
Kim, HR ;
Jesudurai, C ;
Sosinski, B ;
Arús, P ;
Baird, V ;
Parfitt, D ;
Reighard, G ;
Scorza, R ;
Tomkins, J ;
Wing, R ;
Abbott, AG .
THEORETICAL AND APPLIED GENETICS, 2005, 110 (08) :1419-1428
[10]
Sequencing breakthroughs for genomic ecology and evolutionary biology [J].
Hudson, Matthew E. .
MOLECULAR ECOLOGY RESOURCES, 2008, 8 (01) :3-17