Transcriptome sequencing of the Microarray Quality Control (MAQC) RNA reference samples using next generation sequencing

Cited by: 61
Authors
Mane, Shrinivasrao P. [2 ]
Evans, Clive [2 ]
Cooper, Kristal L. [2 ]
Crasta, Oswald R. [2 ]
Folkerts, Otto [2 ]
Hutchison, Stephen K. [3 ]
Harkins, Timothy T. [4 ]
Thierry-Mieg, Danielle [5 ]
Thierry-Mieg, Jean [5 ]
Jensen, Roderick V. [1 ]
Affiliations
[1] Virginia Tech, Dept Biol Sci, Blacksburg, VA 24061 USA
[2] Virginia Tech, Virginia Bioinformat Inst, Blacksburg, VA 24061 USA
[3] 454 Life Sci Inc, Branford, CT 06405 USA
[4] Roche Appl Sci, Indianapolis, IN 46250 USA
[5] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
Source
BMC GENOMICS | 2009 / Vol. 10
Keywords
GENE-EXPRESSION; CELL TRANSCRIPTOME; DISCOVERY;
DOI
10.1186/1471-2164-10-264
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC) reference RNA samples using Roche's 454 Genome Sequencer FLX.
Results: We generated more than 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well-annotated genes with E-values <= 10^-20. We measured gene expression levels in the A and B samples by counting the number of reads that mapped to individual RefSeq genes in multiple sequencing runs. We used these counts to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy, and compared the results with DNA microarrays and quantitative RT-PCR (QRTPCR) from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs, with an average 90% sequence similarity, to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database.
Conclusion: Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome, including the discovery of thousands of new splice variants.
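The read-counting step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: it assumes BLAST hits are available as 12-column tab-separated rows (query, subject, ..., E-value, bit score), assigns each read to its single best-scoring hit, and normalizes to reads per million. The function names, the best-hit rule, and the normalization are all assumptions made for illustration; only the E-value cutoff of 10^-20 comes from the abstract.

```python
from collections import Counter

E_VALUE_CUTOFF = 1e-20  # threshold quoted in the abstract


def count_reads_per_gene(blast_lines, cutoff=E_VALUE_CUTOFF):
    """Count reads per RefSeq accession from BLAST tabular rows.

    Assumption: 12 tab-separated columns with the E-value in column 11
    and the bit score in column 12. Each read is assigned to its
    highest-scoring hit only (a simplification, not the paper's rule).
    """
    best = {}  # read id -> (bit score, subject accession)
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        read_id, subject = fields[0], fields[1]
        evalue, bitscore = float(fields[10]), float(fields[11])
        if evalue > cutoff:
            continue  # hit is too weak to count
        if read_id not in best or bitscore > best[read_id][0]:
            best[read_id] = (bitscore, subject)
    return Counter(subject for _, subject in best.values())


def reads_per_million(counts):
    """Scale raw counts by the total number of counted reads (hypothetical normalization)."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gene: 1e6 * n / total for gene, n in counts.items()}
```

Running the same counting on the A and B samples separately would give two count tables whose per-gene ratios could then be compared against microarray or QRTPCR measurements, which is the kind of comparison the abstract reports.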
Pages: 12