Sequencing from compomers:: Using mass spectrometry for DNA de novo sequencing of 200+ nt

被引:21
作者
Böcker, S [1 ]
机构
[1] Univ Bielefeld, Tech Fak, AG Genominformat, D-35501 Bielefeld, Germany
关键词
DNA de novo sequencing; compomers; base-specific cleavage; mass spectrometry;
D O I
10.1089/cmb.2004.11.1110
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
One of the main endeavors in today's life science remains the efficient sequencing of long DNA molecules. Today, most de novo sequencing of DNA is still performed using the electrophoresis-based Sanger concept of 1977, in spite of certain restrictions of this method. Methods using mass spectrometry to acquire the Sanger sequencing data are limited by short sequencing lengths of 15 - 25 nt. We propose a new method for DNA sequencing using base-specific cleavage and mass spectrometry that appears to be a promising alternative to classical DNA sequencing approaches. A single stranded DNA or RNA molecule is cleaved by a base-specific (bio-) chemical reaction using, for example, RNAses. The cleavage reaction is modified such that not all, but only a certain percentage of bases are cleaved. The resulting mixture of fragments is then analyzed using MALDI-TOF mass spectrometry, whereby we acquire the molecular masses of fragments. For every peak in the mass spectrum, we calculate those base compositions that will potentially create a peak of the observed mass and, repeating the cleavage reaction for all four bases, finally try to uniquely reconstruct the underlying sequence from these observed spectra. This leads us to the combinatorial problem of sequencing from compomers and, finally, to the graph-theoretical problem of finding a walk in a subgraph of the de Bruijn graph. Application of this method to simulated data indicates that it might be capable of sequencing DNA molecules with 200+ nt.
引用
收藏
页码:1110 / 1134
页数:25
相关论文
共 32 条
  • [1] An SNP map of the human genome generated by reduced representation shotgun sequencing
    Altshuler, D
    Pollara, VJ
    Cowles, CR
    Van Etten, WJ
    Baldwin, J
    Linton, L
    Lander, ES
    [J]. NATURE, 2000, 407 (6803) : 513 - 516
  • [2] [Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
  • [3] A NOVEL METHOD FOR NUCLEIC-ACID SEQUENCE DETERMINATION
    BAINS, W
    SMITH, GC
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 1988, 135 (03) : 303 - 307
  • [4] Böcker S, 2003, LECT N BIOINFORMAT, V2812, P476
  • [5] BOCKER S, 2004, P FUN ALG 2004, P132
  • [6] SNP and mutation discovery using base-specific cleavage and MALDI-TOF mass spectrometry
    Boecker, Sebastian
    [J]. BIOINFORMATICS, 2003, 19 : i44 - i53
  • [7] De novo peptide sequencing via tandem mass spectrometry
    Dancík, V
    Addona, TA
    Clauser, KR
    Vath, JE
    Pevzner, PA
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) : 327 - 342
  • [8] de Bruijn Nicolaas G., 1946, INDAGATIONES MATH, V8, P461
  • [9] DRMANAC R, 1989, GENOMICS, V4, P114
  • [10] A review of DNA sequencing techniques
    França, LTC
    Carrilho, E
    Kist, TBL
    [J]. QUARTERLY REVIEWS OF BIOPHYSICS, 2002, 35 (02) : 169 - 200