ESTAnnotator: a tool for high throughput EST annotation

Cited by: 33
Authors
Hotz-Wagenblatt, A
Hankeln, T
Ernst, P
Glatting, KH
Schmidt, ER
Suhai, S
Affiliations
[1] German Canc Res Ctr DKFZ, Dept Mol Biophys, D-69120 Heidelberg, Germany
[2] GENterprise GmbH, D-55099 Mainz, Germany
DOI
10.1093/nar/gkg566
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
In high throughput sequence analysis, it is often necessary to combine the results of several current bioinformatics tools, because no single tool computes all the required information. ESTAnnotator is a tool for the high throughput annotation of expressed sequence tags (ESTs) that automatically runs a collection of bioinformatics applications. In the first step, a quality check is performed, and repeats, vector parts and low quality sequences are masked. Successive steps of database searching and EST clustering follow. Known transcripts present in mRNA and genomic DNA reference databases are identified. Subsequently, tools for clustering anonymous ESTs, and for further database searches at the protein level, are applied. Finally, the outputs of the individual tools are gathered and the relevant results are presented in a descriptive summary. ESTAnnotator has already been applied successfully to the systematic identification and characterisation of novel human genes involved in cartilage/bone formation, growth, differentiation and homeostasis. ESTAnnotator is available at http://genome.dkfz-heidelberg.de, contact: genome@dkfz.de.
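The staged workflow described in the abstract (mask, search known transcripts, pass anonymous ESTs on to clustering, then summarise) can be illustrated with a minimal Python sketch. This is not ESTAnnotator's implementation: the names `Est`, `mask_vector`, `search_known`, `flag_anonymous`, `annotate` and `summarize`, the vector fragment and the toy reference sequence are all hypothetical, and exact substring matching merely stands in for a real database search.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Est:
    name: str
    seq: str
    hit: str = ""  # name of a matching known transcript, if any
    log: List[str] = field(default_factory=list)


# Hypothetical stand-ins for a vector library and an mRNA reference database.
VECTOR_FRAGMENT = "GAATTC"
KNOWN_TRANSCRIPTS: Dict[str, str] = {"toy_mRNA_1": "ATGGCCAAGT"}


def mask_vector(est: Est) -> Est:
    """Toy quality step: replace vector-derived bases with N."""
    masked = est.seq.replace(VECTOR_FRAGMENT, "N" * len(VECTOR_FRAGMENT))
    if masked != est.seq:
        est.log.append("vector masked")
    est.seq = masked
    return est


def search_known(est: Est) -> Est:
    """Toy search step: exact substring match instead of a real database search."""
    for name, ref in KNOWN_TRANSCRIPTS.items():
        if ref in est.seq:
            est.hit = name
            est.log.append(f"matches {name}")
            break
    return est


def flag_anonymous(est: Est) -> Est:
    """ESTs without a known transcript would go on to clustering."""
    if not est.hit:
        est.log.append("anonymous: pass to clustering")
    return est


# The steps run in a fixed order, each consuming the previous step's output.
PIPELINE: List[Callable[[Est], Est]] = [mask_vector, search_known, flag_anonymous]


def annotate(est: Est) -> Est:
    for step in PIPELINE:
        est = step(est)
    return est


def summarize(ests: List[Est]) -> List[str]:
    """Gather the per-step notes into one summary line per EST."""
    return [f"{e.name}: {'; '.join(e.log) or 'no findings'}" for e in ests]
```

Calling `summarize([annotate(Est("est1", "GAATTCATGGCCAAGTTT"))])` would run all three steps on one toy sequence and return a single summary line. Keeping each stage as a separate function mirrors the idea of chaining independent tools and collecting their outputs at the end.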
Pages: 3716-3719
Page count: 4