Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing

Cited by: 525
Authors
Vera, J. Cristobal [1]
Wheat, Christopher W. [1, 2]
Fescemyer, Howard W. [1]
Frilander, Mikko J. [3]
Crawford, Douglas L. [4]
Hanski, Ilkka [2]
Marden, James H. [1]
Affiliations
[1] Penn State Univ, Mueller Lab 208, Dept Biol, University Pk, PA 16802 USA
[2] Univ Helsinki, Dept Biol & Environm Sci, FIN-00014 Helsinki, Finland
[3] Univ Helsinki, Inst Biotechnol, FIN-00014 Helsinki, Finland
[4] Univ Miami, Rosenstiel Sch Marine & Atmospher Sci, Miami, FL 33149 USA
Keywords
bioinformatics; biotechnology; functional genomics; metapopulation; polymorphism; transcriptomics;
DOI
10.1111/j.1365-294X.2008.03666.x
Chinese Library Classification (CLC) codes
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
We present a de novo assembly of a eukaryote transcriptome using 454 pyrosequencing data. The Glanville fritillary butterfly (Melitaea cinxia; Lepidoptera: Nymphalidae) is a prominent species in population biology but had no previous genomic data. Sequencing runs using two normalized complementary DNA collections from a genetically diverse pool of larvae, pupae, and adults yielded 608 053 expressed sequence tags (mean length = 110 nucleotides), which assembled into 48 354 contigs (sets of overlapping DNA segments) and 59 943 singletons. BLAST comparisons confirmed the accuracy of the sequencing and assembly, and indicated the presence of c. 9000 unique genes, along with > 6000 additional microarray-confirmed unannotated contigs. Average depth of coverage was 6.5-fold for the longest 4800 contigs (348-2849 bp in length), sufficient for detecting large numbers of single nucleotide polymorphisms. Oligonucleotide microarray probes designed from the assembled sequences showed highly repeatable hybridization intensity and revealed biological differences among individuals. We conclude that 454 sequencing, when performed to provide sufficient coverage depth, allows de novo transcriptome assembly and a fast, cost-effective, and reliable method for development of functional genomic tools for nonmodel species. This development narrows the gap between approaches based on model organisms with rich genetic resources vs. species that are most tractable for ecological and evolutionary studies.
Pages: 1636-1647
Page count: 12