Comparative high-throughput transcriptome sequencing and development of SiESTa, the Silene EST annotation database

被引:21
作者
Blavet, Nicolas [1 ]
Charif, Delphine [2 ]
Oger-Desfeux, Christine [3 ]
Marais, Gabriel A. B. [2 ]
Widmer, Alex [1 ]
机构
[1] ETH, Inst Integrat Biol IBZ, CH-8092 Zurich, Switzerland
[2] Univ Lyon 1, CNRS, Lab Biometrie & Biol Evolut, UMR 5558, F-69622 Villeurbanne, France
[3] Univ Lyon 1, DTAMB PRABI, IFR41, F-69622 Villeurbanne, France
来源
BMC GENOMICS | 2011年 / 12卷
关键词
cDNA library; database; EST; SNP; Silene; REPRODUCTIVE ISOLATION; SEX-CHROMOSOMES; GENOME SIZE; ARABIDOPSIS; DNA; LATIFOLIA; GENES; RETROTRANSPOSONS; POLYMORPHISM; EXPRESSION;
D O I
10.1186/1471-2164-12-376
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The genus Silene is widely used as a model system for addressing ecological and evolutionary questions in plants, but advances in using the genus as a model system are impeded by the lack of available resources for studying its genome. Massively parallel sequencing cDNA has recently developed into an efficient method for characterizing the transcriptomes of non-model organisms, generating massive amounts of data that enable the study of multiple species in a comparative framework. The sequences generated provide an excellent resource for identifying expressed genes, characterizing functional variation and developing molecular markers, thereby laying the foundations for future studies on gene sequence and gene expression divergence. Here, we report the results of a comparative transcriptome sequencing study of eight individuals representing four Silene and one Dianthus species as outgroup. All sequences and annotations have been deposited in a newly developed and publicly available database called SiESTa, the Silene EST annotation database. Results: A total of 1,041,122 EST reads were generated in two runs on a Roche GS-FLX 454 pyrosequencing platform. EST reads were analyzed separately for all eight individuals sequenced and were assembled into contigs using TGICL. These were annotated with results from BLASTX searches and Gene Ontology (GO) terms, and thousands of single-nucleotide polymorphisms (SNPs) were characterized. Unassembled reads were kept as singletons and together with the contigs contributed to the unigenes characterized in each individual. The high quality of unigenes is evidenced by the proportion (49%) that have significant hits in similarity searches with the A. thaliana proteome. The SiESTa database is accessible at http://www.siesta.ethz.ch. Conclusion: The sequence collections established in the present study provide an important genomic resource for four Silene and one Dianthus species and will help to further develop Silene as a plant model system. The genes characterized will be useful for future research not only in the species included in the present study, but also in related species for which no genomic resources are yet available. Our results demonstrate the efficiency of massively parallel transcriptome sequencing in a comparative framework as an approach for developing genomic resources in diverse groups of non-model organisms.
引用
收藏
页数:11
相关论文
共 53 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], CODON USAGE DATABASE
[3]   SNP discovery via 454 transcriptome sequencing [J].
Barbazuk, W. Brad ;
Emrich, Scott J. ;
Chen, Hsin D. ;
Li, Li ;
Schnable, Patrick S. .
PLANT JOURNAL, 2007, 51 (05) :910-918
[4]   NUCLEAR-DNA AMOUNTS IN ANGIOSPERMS [J].
BENNETT, MD ;
SMITH, JB .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1991, 334 (1271) :309-345
[5]   Silene as a model system in ecology and evolution [J].
Bernasconi, G. ;
Antonovics, J. ;
Biere, A. ;
Charlesworth, D. ;
Delph, L. F. ;
Filatov, D. ;
Giraud, T. ;
Hood, M. E. ;
Marais, G. A. B. ;
McCauley, D. ;
Pannell, J. R. ;
Shykoff, J. A. ;
Vyskot, B. ;
Wolfe, L. M. ;
Widmer, A. .
HEREDITY, 2009, 103 (01) :5-14
[6]   Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events [J].
Bowers, JE ;
Chapman, BA ;
Rong, JK ;
Paterson, AH .
NATURE, 2003, 422 (6930) :433-438
[7]  
Byng JW, 2016, BOT J LINN SOC, V181, P1, DOI [10.1111/boj.12385, 10.1111/j.1095-8339.2009.00996.x]
[8]   Survey of repetitive sequences in Silene latifolia with respect to their distribution on sex chromosomes [J].
Cermak, Tomas ;
Kubat, Zdenek ;
Hobza, Roman ;
Koblizkova, Andrea ;
Widmer, Alex ;
Macas, Jiri ;
Vyskot, Boris ;
Kejnovsky, Eduard .
CHROMOSOME RESEARCH, 2008, 16 (07) :961-976
[9]   Sequencing Medicago truncatula expressed sequenced tags using 454 Life Sciences technology [J].
Cheung, Foo ;
Haas, Brian J. ;
Goldberg, Susanne M. D. ;
May, Gregory D. ;
Xiao, Yongli ;
Town, Christopher D. .
BMC GENOMICS, 2006, 7 (1)
[10]   Blast2GO:: a universal tool for annotation, visualization and analysis in functional genomics research [J].
Conesa, A ;
Götz, S ;
García-Gómez, JM ;
Terol, J ;
Talón, M ;
Robles, M .
BIOINFORMATICS, 2005, 21 (18) :3674-3676