Local De Novo Assembly of RAD Paired-End Contigs Using Short Sequencing Reads

被引:169
作者
Etter, Paul D. [1 ]
Preston, Jessica L. [1 ]
Bassham, Susan [2 ]
Cresko, William A. [2 ]
Johnson, Eric A. [1 ]
机构
[1] Univ Oregon, Inst Mol Biol, Eugene, OR 97403 USA
[2] Univ Oregon, Ctr Ecol & Evolutionary Biol, Eugene, OR 97403 USA
来源
PLOS ONE | 2011年 / 6卷 / 04期
关键词
PARALLEL; GENOME;
D O I
10.1371/journal.pone.0018561
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Despite the power of massively parallel sequencing platforms, a drawback is the short length of the sequence reads produced. We demonstrate that short reads can be locally assembled into longer contigs using paired-end sequencing of restriction-site associated DNA (RAD-PE) fragments. We use this RAD-PE contig approach to identify single nucleotide polymorphisms (SNPs) and determine haplotype structure in threespine stickleback and to sequence E. coli and stickleback genomic DNA with overlapping contigs of several hundred nucleotides. We also demonstrate that adding a circularization step allows the local assembly of contigs up to 5 kilobases (kb) in length. The ease of assembly and accuracy of the individual contigs produced from each RAD site sequence suggests RAD-PE sequencing is a useful way to convert genome-wide short reads into individually-assembled sequences hundreds or thousands of nucleotides long.
引用
收藏
页数:10
相关论文
共 13 条
[1]   Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers [J].
Baird, Nathan A. ;
Etter, Paul D. ;
Atwood, Tressa S. ;
Currey, Mark C. ;
Shiver, Anthony L. ;
Lewis, Zachary A. ;
Selker, Eric U. ;
Cresko, William A. ;
Johnson, Eric A. .
PLOS ONE, 2008, 3 (10)
[2]   A common coding variant in CASP8 is associated with breast cancer risk [J].
Cox, Angela ;
Dunning, Alison M. ;
Garcia-Closas, Montserrat ;
Balasubramanian, Sabapathy ;
Reed, Malcolm W. R. ;
Pooley, Karen A. ;
Scollen, Serena ;
Baynes, Caroline ;
Ponder, Bruce A. J. ;
Chanock, Stephen ;
Lissowska, Jolanta ;
Brinton, Louise ;
Peplonska, Beata ;
Southey, Melissa C. ;
Hopper, John L. ;
McCredie, Margaret R. E. ;
Giles, Graham G. ;
Fletcher, Olivia ;
Johnson, Nichola ;
dos Santos Silva, Isabel ;
Gibson, Lorna ;
Bojesen, Stig E. ;
Nordestgaard, Borge G. ;
Axelsson, Christen K. ;
Torres, Diana ;
Hamann, Ute ;
Justenhoven, Christina ;
Brauch, Hiltrud ;
Chang-Claude, Jenny ;
Kropp, Silke ;
Risch, Angela ;
Wang-Gohrke, Shan ;
Schuermann, Peter ;
Bogdanova, Natalia ;
Doerk, Thilo ;
Fagerholm, Rainer ;
Aaltonen, Kirsimari ;
Blomqvist, Carl ;
Nevanlinna, Heli ;
Seal, Sheila ;
Renwick, Anthony ;
Stratton, Michael R. ;
Rahman, Nazneen ;
Sangrajrang, Suleeporn ;
Hughes, David ;
Odefrey, Fabrice ;
Brennan, Paul ;
Spurdle, Amanda B. ;
Chenevix-Trench, Georgia ;
Beesley, Jonathan .
NATURE GENETICS, 2007, 39 (03) :352-358
[3]   Resolving postglacial phylogeography using high-throughput sequencing [J].
Emerson, Kevin J. ;
Merz, Clayton R. ;
Catchen, Julian M. ;
Hohenlohe, Paul A. ;
Cresko, William A. ;
Bradshaw, William E. ;
Holzapfel, Christina M. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (37) :16196-16200
[4]   Highly parallel SNP genotyping [J].
Fan, JB ;
Oliphant, A ;
Shen, R ;
Kermani, BG ;
Garcia, F ;
Gunderson, KL ;
Hansen, M ;
Steemers, F ;
Butler, SL ;
Deloukas, P ;
Galver, L ;
Hunt, S ;
McBride, C ;
Bibikova, M ;
Rubano, T ;
Chen, J ;
Wickham, E ;
Doucet, D ;
Chang, W ;
Campbell, D ;
Zhang, B ;
Kruglyak, S ;
Bentley, D ;
Haas, J ;
Rigault, P ;
Zhou, L ;
Stuelpnagel, J ;
Chee, MS .
COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY, 2003, 68 :69-78
[5]   Highly parallel genomic assays [J].
Fan, Jian-Bing ;
Chee, Mark S. ;
Gunderson, Kevin L. .
NATURE REVIEWS GENETICS, 2006, 7 (08) :632-644
[6]  
Gabriel Stacey, 2009, Curr Protoc Hum Genet, VChapter 2, DOI 10.1002/0471142905.hg0212s60
[7]  
Hiatt JB, 2010, NAT METHODS, V7, P119, DOI [10.1038/NMETH.1416, 10.1038/nmeth.1416]
[8]   Population Genomics of Parallel Adaptation in Threespine Stickleback using Sequenced RAD Tags [J].
Hohenlohe, Paul A. ;
Bassham, Susan ;
Etter, Paul D. ;
Stiffler, Nicholas ;
Johnson, Eric A. ;
Cresko, William A. .
PLOS GENETICS, 2010, 6 (02)
[9]   The sequence and de novo assembly of the giant panda genome [J].
Li, Ruiqiang ;
Fan, Wei ;
Tian, Geng ;
Zhu, Hongmei ;
He, Lin ;
Cai, Jing ;
Huang, Quanfei ;
Cai, Qingle ;
Li, Bo ;
Bai, Yinqi ;
Zhang, Zhihe ;
Zhang, Yaping ;
Wang, Wen ;
Li, Jun ;
Wei, Fuwen ;
Li, Heng ;
Jian, Min ;
Li, Jianwen ;
Zhang, Zhaolei ;
Nielsen, Rasmus ;
Li, Dawei ;
Gu, Wanjun ;
Yang, Zhentao ;
Xuan, Zhaoling ;
Ryder, Oliver A. ;
Leung, Frederick Chi-Ching ;
Zhou, Yan ;
Cao, Jianjun ;
Sun, Xiao ;
Fu, Yonggui ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Wang, Bo ;
Hou, Rong ;
Shen, Fujun ;
Mu, Bo ;
Ni, Peixiang ;
Lin, Runmao ;
Qian, Wubin ;
Wang, Guodong ;
Yu, Chang ;
Nie, Wenhui ;
Wang, Jinhuan ;
Wu, Zhigang ;
Liang, Huiqing ;
Min, Jiumeng ;
Wu, Qi ;
Cheng, Shifeng ;
Ruan, Jue ;
Wang, Mingwei .
NATURE, 2010, 463 (7279) :311-317
[10]   Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers [J].
Miller, Michael R. ;
Dunham, Joseph P. ;
Amores, Angel ;
Cresko, William A. ;
Johnson, Eric A. .
GENOME RESEARCH, 2007, 17 (02) :240-248