Whole-genome sequencing and variant discovery in C-elegans

被引:271
作者
Hillier, LaDeana W. [1 ]
Marth, Gabor T. [2 ]
Quinlan, Aaron R. [2 ]
Dooling, David [1 ]
Fewell, Ginger [1 ]
Barnett, Derek [2 ]
Fox, Paul [1 ]
Glasscock, Jarret I. [1 ]
Hickenbotham, Matthew [1 ]
Huang, Weichun [2 ]
Magrini, Vincent J. [1 ]
Richt, Ryan J. [1 ]
Sander, Sacha N. [1 ]
Stewart, Donald A. [2 ]
Stromberg, Michael [2 ]
Tsung, Eric F. [2 ]
Wylie, Todd [1 ]
Schedl, Tim [1 ]
Wilson, Richard K. [1 ]
Mardis, Elaine R. [1 ]
机构
[1] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63108 USA
[2] Boston Coll, Dept Biol, Chestnut Hill, MA 02467 USA
关键词
D O I
10.1038/NMETH.1179
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Massively parallel sequencing instruments enable rapid and inexpensive DNA sequence data production. Because these instruments are new, their data require characterization with respect to accuracy and utility. To address this, we sequenced a Caernohabditis elegans N2 Bristol strain isolate using the Solexa Sequence Analyzer, and compared the reads to the reference genome to characterize the data and to evaluate coverage and representation. Massively parallel sequencing facilitates strain-to-reference comparison for genome-wide sequence variant discovery. Owing to the short-read-length sequences produced, we developed a revised approach to determine the regions of the genome to which short reads could be uniquely mapped. We then aligned Solexa reads from C. elegans strain CB4858 to the reference, and screened for single-nucleotide polymorphisms (SNPs) and small indels. This study demonstrates the utility of massively parallel short read sequencing for whole genome resequencing and for accurate discovery of genome-wide polymorphisms.
引用
收藏
页码:183 / 188
页数:6
相关论文
共 17 条
  • [1] Automating resequencing-based detection of insertion-deletion polymorphisms
    Bhangale, Tushar R.
    Stephens, Matthew
    Nickerson, Deborah A.
    [J]. NATURE GENETICS, 2006, 38 (12) : 1457 - 1462
  • [2] WormBase:: new content and better access
    Bieri, Tamberlyn
    Blasiar, Darin
    Ozersky, Philip
    Antoshechkin, Igor
    Bastiani, Carol
    Canaran, Payan
    Chan, Juancarlos
    Chen, Nansheng
    Chen, Wen J.
    Davis, Paul
    Fiedler, Tristan J.
    Girard, Lisa
    Han, Michael
    Harris, Todd W.
    Kishore, Ranjana
    Lee, Raymond
    McKay, Sheldon
    Muller, Hans-Michael
    Nakamura, Cecilia
    Petcherski, Andrei
    Rangarajan, Arun
    Rogers, Anthony
    Schindelman, Gary
    Schwarz, Erich M.
    Spooner, Will
    Tuli, Mary Ann
    Van Auken, Kimberly
    Wang, Daniel
    Wang, Xiaodong
    Williams, Gary
    Durbin, Richard
    Stein, Lincoln D.
    Sternberg, Paul W.
    Spieth, John
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 : D506 - D510
  • [3] Genome sequence of the nematode C-elegans:: A platform for investigating biology
    不详
    [J]. SCIENCE, 1998, 282 (5396) : 2012 - 2018
  • [4] Finishing the euchromatic sequence of the human genome
    Collins, FS
    Lander, ES
    Rogers, J
    Waterston, RH
    [J]. NATURE, 2004, 431 (7011) : 931 - 945
  • [5] Phylogenetics in Caenorhabditis elegans:: An analysis of divergence and outcrossing
    Denver, DR
    Morris, K
    Thomas, WK
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (03) : 393 - 400
  • [6] Consed: A graphical tool for sequence finishing
    Gordon, D
    Abajian, C
    Green, P
    [J]. GENOME RESEARCH, 1998, 8 (03) : 195 - 202
  • [7] WormBase: a multi-species resource for nematode biology and genomics
    Harris, TW
    Chen, NS
    Cunningham, F
    Tello-Ruiz, M
    Antoshechkin, I
    Bastiani, C
    Bieri, T
    Blasiar, D
    Bradnam, K
    Chan, J
    Chen, CK
    Chen, WJ
    Davis, P
    Kenny, E
    Kishore, R
    Lawson, D
    Lee, R
    Muller, HM
    Nakamura, C
    Ozersky, P
    Petcherski, A
    Rogers, A
    Sabo, A
    Schwarz, EM
    Van Auken, K
    Wang, QH
    Durbin, R
    Spieth, J
    Sternberg, PW
    Stein, LD
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D411 - D417
  • [8] Hodgkin J, 1997, GENETICS, V146, P149
  • [9] Single nucleotide polymorphisms in wild isolates of Caenorhabditis elegans
    Koch, R
    van Luenen, HGAM
    van der Horst, M
    Thijssen, KL
    Plasterk, RHA
    [J]. GENOME RESEARCH, 2000, 10 (11) : 1690 - 1696
  • [10] Initial sequencing and analysis of the human genome
    Lander, ES
    Int Human Genome Sequencing Consortium
    Linton, LM
    Birren, B
    Nusbaum, C
    Zody, MC
    Baldwin, J
    Devon, K
    Dewar, K
    Doyle, M
    FitzHugh, W
    Funke, R
    Gage, D
    Harris, K
    Heaford, A
    Howland, J
    Kann, L
    Lehoczky, J
    LeVine, R
    McEwan, P
    McKernan, K
    Meldrim, J
    Mesirov, JP
    Miranda, C
    Morris, W
    Naylor, J
    Raymond, C
    Rosetti, M
    Santos, R
    Sheridan, A
    Sougnez, C
    Stange-Thomann, N
    Stojanovic, N
    Subramanian, A
    Wyman, D
    Rogers, J
    Sulston, J
    Ainscough, R
    Beck, S
    Bentley, D
    Burton, J
    Clee, C
    Carter, N
    Coulson, A
    Deadman, R
    Deloukas, P
    Dunham, A
    Dunham, I
    Durbin, R
    French, L
    [J]. NATURE, 2001, 409 (6822) : 860 - 921