ESTpass: a web-based server for processing and annotating expressed sequence tag (EST) sequences

Cited by: 22
Authors
Lee, Byungwook [1, 2]
Hong, Taehui [1 ]
Byun, Sang Jin [3 ]
Woo, Taeha [1 ]
Choi, Yoon Jeong [1 ]
Affiliations
[1] KRIBB, Korean Bioinformat Ctr, Taejon 305817, South Korea
[2] Korea Adv Inst Sci & Technol, Dept BioSyst, Taejon 305701, South Korea
[3] Bioneer, Bioinformat Team, Taejon 306220, South Korea
DOI
10.1093/nar/gkm369
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
We present a web-based server, called ESTpass, for processing and annotating sequence data from expressed sequence tag (EST) projects. ESTpass accepts a FASTA-formatted EST file and its quality file as inputs, and it then executes a back-end EST analysis pipeline consisting of three consecutive steps. The first is cleansing the input EST sequences. The second is clustering and assembling the cleansed EST sequences using d2_cluster and CAP3 programs and producing putative transcripts. From the CAP3 output, ESTpass detects chimeric EST sequences which are confirmed through comparison with the nr database. The last step is annotating the putative transcript sequences using RefSeq, InterPro, GO and KEGG gene databases according to user-specified options. The major advantages of ESTpass are the integration of cleansing and annotating processes, rigorous chimeric EST detection, exhaustive annotation, and email reporting to inform the user about the progress and to send the analysis results. The ESTpass results include three reports (summary, cleansing and annotation) and download function, as well as graphic statistics. They can be retrieved and downloaded using a standard web browser. The server is available at http://estpass.kobic.re.kr/.
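The input stage described in the abstract (a FASTA-formatted EST file plus its quality file) can be illustrated with a minimal Python sketch. This is not ESTpass code: the function names, the end-trimming rule and the quality threshold of 20 are all assumptions made for illustration, and the abstract does not specify how the cleansing step is actually implemented.

```python
def parse_fasta_like(text):
    """Return {record_id: [body lines]} for FASTA-style text (sequence or quality)."""
    records = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            parts = line[1:].split()
            current = parts[0] if parts else ""
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return records


def load_ests(fasta_text, qual_text):
    """Pair each EST sequence with its per-base quality scores (hypothetical helper)."""
    seqs = {k: "".join(v) for k, v in parse_fasta_like(fasta_text).items()}
    quals = {
        k: [int(x) for x in " ".join(v).split()]
        for k, v in parse_fasta_like(qual_text).items()
    }
    paired = {}
    for est_id, seq in seqs.items():
        scores = quals.get(est_id)
        if scores is None or len(scores) != len(seq):
            raise ValueError(f"quality scores missing or mismatched for {est_id}")
        paired[est_id] = (seq, scores)
    return paired


def trim_low_quality_ends(seq, scores, min_q=20):
    """Drop bases below min_q from both ends (assumed rule, not ESTpass's own)."""
    start, end = 0, len(seq)
    while start < end and scores[start] < min_q:
        start += 1
    while end > start and scores[end - 1] < min_q:
        end -= 1
    return seq[start:end]


if __name__ == "__main__":
    fasta = ">est1\nACGTACGT\n>est2\nGGCC\n"
    qual = ">est1\n5 10 30 30 30 30 8 4\n>est2\n25 25 25 25\n"
    for est_id, (seq, scores) in load_ests(fasta, qual).items():
        print(est_id, trim_low_quality_ends(seq, scores))
```

In a real pipeline the trimmed sequences would then be passed to the clustering and assembly step; that hand-off is omitted here because the abstract gives no interface details.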
Pages: W159-W162
Page count: 4