ESTExplorer: an expressed sequence tag (EST) assembly and annotation platform

Cited by: 44
Authors
Nagaraj, Shivashankar H. [1]
Deshpande, Nandan [1]
Gasser, Robin B. [2]
Ranganathan, Shoba [1,3]
Affiliations
[1] Macquarie Univ, Dept Chem & Biomol Sci, Sydney, NSW 2109, Australia
[2] Univ Melbourne, Dept Vet Sci, Werribee, Vic 3030, Australia
[3] Natl Univ Singapore, Yong Loo Lin Sch Med, Dept Biochem, Singapore 119260, Singapore
Funding
Australian Research Council
DOI
10.1093/nar/gkm378
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The analysis of expressed sequence tag (EST) datasets offers a rapid and cost-effective approach to elucidating the transcriptome of an organism, but it requires several computational methods for assembly and annotation. ESTExplorer is a comprehensive workflow system for EST data management and analysis. The pipeline uses a 'distributed control approach' in which the most appropriate bioinformatics tools are implemented over different dedicated processors. Species-specific repeat masking and conceptual translation are built in. ESTExplorer accepts a set of ESTs in FASTA format, which can be analysed using programs selected by the user. After pre-processing and assembly, the dataset is annotated at the nucleotide level and, following conceptual translation, at the protein level. Users may optionally provide ESTExplorer with assembled contigs for annotation purposes. Functionally annotated contigs/ESTs can be analysed individually. The overall outputs are gene ontologies and protein functional identifications in terms of mapping to protein domains and metabolic pathways. ESTExplorer has been applied successfully to annotate large EST datasets from parasitic nematodes and to identify novel genes as potential targets for parasite intervention. ESTExplorer runs on a Linux cluster and is freely available to the academic community at http://estexplorer.biolinfo.org.
Pages: W143-W147
Page count: 5
Related papers
24 in total
[1]   SEQUENCE IDENTIFICATION OF 2,375 HUMAN BRAIN GENES [J].
ADAMS, MD ;
DUBNICK, M ;
KERLAVAGE, AR ;
MORENO, R ;
KELLEY, JM ;
UTTERBACK, TR ;
NAGLE, JW ;
FIELDS, C ;
VENTER, JC .
NATURE, 1992, 355 (6361) :632-634
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   WormBase: new content and better access [J].
Bieri, Tamberlyn ;
Blasiar, Darin ;
Ozersky, Philip ;
Antoshechkin, Igor ;
Bastiani, Carol ;
Canaran, Payan ;
Chan, Juancarlos ;
Chen, Nansheng ;
Chen, Wen J. ;
Davis, Paul ;
Fiedler, Tristan J. ;
Girard, Lisa ;
Han, Michael ;
Harris, Todd W. ;
Kishore, Ranjana ;
Lee, Raymond ;
McKay, Sheldon ;
Muller, Hans-Michael ;
Nakamura, Cecilia ;
Petcherski, Andrei ;
Rangarajan, Arun ;
Rogers, Anthony ;
Schindelman, Gary ;
Schwarz, Erich M. ;
Spooner, Will ;
Tuli, Mary Ann ;
Van Auken, Kimberly ;
Wang, Daniel ;
Wang, Xiaodong ;
Williams, Gary ;
Durbin, Richard ;
Stein, Lincoln D. ;
Sternberg, Paul W. ;
Spieth, John .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D506-D510
[5]   DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS [J].
BOGUSKI, MS ;
LOWE, TMJ ;
TOLSTOSHEV, CM .
NATURE GENETICS, 1993, 4 (04) :332-333
[6]   Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research [J].
Conesa, A ;
Götz, S ;
García-Gómez, JM ;
Terol, J ;
Talón, M ;
Robles, M .
BIOINFORMATICS, 2005, 21 (18) :3674-3676
[7]   Construction of gender-enriched cDNA archives for adult Oesophagostomum dentatum by suppressive-subtractive hybridization and a microarray analysis of expressed sequence tags [J].
Cottee, P. A. ;
Nisbet, A. J. ;
El-Osta, Y. G. Abs ;
Webster, T. L. ;
Gasser, R. B. .
PARASITOLOGY, 2006, 132 :691-708
[8]   ParPEST: a pipeline for EST data analysis based on parallel computing [J].
D'Agostino, N ;
Aversano, M ;
Chiusano, ML .
BMC BIOINFORMATICS, 2005, 6 (Suppl 4)
[9]   ESTAnnotator: a tool for high throughput EST annotation [J].
Hotz-Wagenblatt, A ;
Hankeln, T ;
Ernst, P ;
Glatting, KH ;
Schmidt, ER ;
Suhai, S .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3716-3719
[10]   CAP3: A DNA sequence assembly program [J].
Huang, XQ ;
Madan, A .
GENOME RESEARCH, 1999, 9 (09) :868-877