Rapid and accurate pyrosequencing of angiosperm plastid genomes

被引:194
作者
Moore, Michael J.
Dhingra, Amit
Soltis, Pamela S.
Shaw, Regina
Farmerie, William G.
Folta, Kevin M.
Soltis, Douglas E.
机构
[1] Univ Florida, Dept Bot, Gainesville, FL 32611 USA
[2] Univ Florida, Florida Museum Nat Hist, Gainesville, FL 32611 USA
[3] Univ Florida, Hort Sci Dept, Gainesville, FL 32611 USA
[4] Univ Florida, ICBR Genome Sequencing Serv Lab, Gainesville, FL 32610 USA
关键词
D O I
10.1186/1471-2229-6-17
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Background: Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 ( GS 20) System (454 Life Sciences Corporation), to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae) and Platanus occidentalis (Platanaceae). Results: More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6 x in Nandina and 17.3 x in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with similar to 60% of all errors associated with homopolymer runs of 5 or more nucleotides and similar to 50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion: Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. More importantly, the high accuracy observed in the GS 20 plastid genome sequence was generated for a significant reduction in time and cost over traditional shotgun-based genome sequencing techniques, although with approximately half the coverage of previously reported GS 20 de novo genome sequence. The GS 20 should be broadly applicable to angiosperm plastid genome sequencing, and therefore promises to expand the scale of plant genetic and phylogenetic research dramatically.
引用
收藏
页数:13
相关论文
共 40 条
[1]   Dating phylogenetically basal eudicots using rbcL sequences and multiple fossil reference points [J].
Anderson, CL ;
Bremer, K ;
Friis, EM .
AMERICAN JOURNAL OF BOTANY, 2005, 92 (10) :1737-1748
[2]   Photosynthetic evolution in parasitic plants: insight from the chloroplast genome [J].
Bungard, RA .
BIOESSAYS, 2004, 26 (03) :235-247
[3]   PCR amplification introduces errors into mononucleotide and dinucleotide repeat sequences [J].
Clarke, LA ;
Rebelo, CS ;
Gonçalves, J ;
Boavida, MG ;
Jordan, P .
JOURNAL OF CLINICAL PATHOLOGY-MOLECULAR PATHOLOGY, 2001, 54 (05) :351-353
[4]   The highly rearranged chloroplast genome of Trachelium caeruleum (Campanulaceae): Multiple inversions, inverted repeat expansion and contraction, transposition, insertions/deletions, and several repeat families [J].
Cosner, ME ;
Jansen, RK ;
Palmer, JD ;
Downie, SR .
CURRENT GENETICS, 1997, 31 (05) :419-429
[5]   Rapid amplification of plasmid and phage DNA using phi29 DNA polymerase and multiply-primed rolling circle amplification [J].
Dean, FB ;
Nelson, JR ;
Giesler, TL ;
Lasken, RS .
GENOME RESEARCH, 2001, 11 (06) :1095-1099
[6]   ASAP: Amplification, sequencing & annotation of plastomes [J].
Dhingra, A ;
Folta, KM .
BMC GENOMICS, 2005, 6 (1)
[7]  
DHINGRA A, 2005, ARABIDOPSIS PROTOCOL, V323, P525
[8]   Using pyrosequencing to shed light on deep mine microbial ecology [J].
Edwards, Robert A. ;
Rodriguez-Brito, Beltran ;
Wegley, Linda ;
Haynes, Matthew ;
Breitbart, Mya ;
Peterson, Dean M. ;
Saar, Martin O. ;
Alexander, Scott ;
Alexander, E. Calvin, Jr. ;
Rohwer, Forest .
BMC GENOMICS, 2006, 7 (1)
[9]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[10]   A Sanger/pyrosequencing hybrid approach tor the generation of high-quality draft assemblies of marine microbial genomes [J].
Goldberg, Susanne M. D. ;
Johnson, Justin ;
Busam, Dana ;
Feldblyum, Tamara ;
Ferriera, Steve ;
Friedman, Robert ;
Halpern, Aaron ;
Khouri, Hoda ;
Kravitz, Saul A. ;
Lauro, Federico M. ;
Li, Kelvin ;
Rogers, Yu-Hui ;
Strausberg, Robert ;
Sutton, Granger ;
Tallon, Luke ;
Thomas, Torsten ;
Venter, Eli ;
Frazier, Marvin ;
Venter, J. Craig .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (30) :11240-11245