Simple tools for assembling and searching high-density picolitre pyrophosphate sequence data

Cited by: 3
Authors
Parker, Nicolas J. [1 ]
Parker, Andrew G. [2 ]
Affiliations
[1] 10 Lockhart Close, Kenilworth CV8 1RB, Warwick, England
[2] IAEA, FAO IAEA Agr & Biotechnol Lab, Agcys Labs Seibersdorf, Entomol Unit, A-1400 Vienna, Austria
Source
SOURCE CODE FOR BIOLOGY AND MEDICINE | 2008 / Vol. 3 / Issue 01
Keywords
DOI
10.1186/1751-0473-3-5
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: The advent of pyrophosphate sequencing makes large volumes of sequencing data available at a lower cost than previously possible. However, the short reads are difficult to assemble and the large data set is difficult to handle. During the sequencing of a virus from the tsetse fly, Glossina pallidipes, we found a need for tools that quickly search a set of reads for near-exact text matches.
Methods: A set of tools is provided to search a large data set of pyrophosphate sequence reads under a "live" CD version of Linux on a standard PC. The tools can be used by anyone without prior knowledge of Linux and without installing Linux on the computer. They permit de novo assembly of short lengths of sequence, checking of existing assembled sequences, selection and display of reads from the data set, and gathering counts of sequences in the reads.
Results: Demonstrations are given of the use of the tools to help with checking an assembly against the fragment data set; investigating homopolymer lengths, repeat regions and polymorphisms; and resolving inserted bases caused by incomplete chain extension.
Conclusion: The additional information contained in a pyrophosphate sequencing data set beyond a basic assembly is difficult to access due to a lack of tools. The set of simple tools presented here would allow anyone with basic computer skills and a standard PC to access this information.
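To make the idea of searching reads for near-exact text matches concrete, the following is a minimal Python sketch. It is not the authors' tool set, which this record does not describe in detail. The function names (`reverse_complement`, `near_matches`, `search_reads`), the in-memory dictionary of reads, and the `max_mismatches` parameter are all assumptions made for illustration. The sketch counts substitutions only, so it would not by itself catch the homopolymer-length differences mentioned in the abstract.

```python
from typing import Iterator

# Translation table for complementing DNA bases (N stays N).
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def near_matches(read: str, query: str,
                 max_mismatches: int = 1) -> Iterator[tuple[int, int]]:
    """Yield (offset, mismatches) for every window of `read` that differs
    from `query` by at most `max_mismatches` substitutions."""
    n = len(query)
    for i in range(len(read) - n + 1):
        mismatches = 0
        for a, b in zip(read[i:i + n], query):
            if a != b:
                mismatches += 1
                if mismatches > max_mismatches:
                    break  # too many differences, abandon this window
        else:
            # Loop finished without break: window is within tolerance.
            yield i, mismatches


def search_reads(reads: dict[str, str], query: str,
                 max_mismatches: int = 1) -> list[tuple[str, str, int, int]]:
    """Search every read on both strands.

    Returns a list of (read_name, strand, offset, mismatches) tuples.
    `reads` is a hypothetical name -> sequence mapping, for example
    parsed from a FASTA file.
    """
    query = query.upper()
    hits = []
    for name, seq in reads.items():
        seq = seq.upper()
        for strand, q in (("+", query), ("-", reverse_complement(query))):
            for offset, mismatches in near_matches(seq, q, max_mismatches):
                hits.append((name, strand, offset, mismatches))
    return hits
```

Counting the hits returned for a query gives a rough analogue of "gathering counts of sequences in the reads". A linear scan like this is simple but slow on a large data set, where an indexed search would normally be preferred.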
Pages: 10