An overview of the wcd EST clustering tool

被引:25
作者
Hazelhurst, Scott [1 ]
Hide, Winston [2 ]
Liptak, Zsuzsanna [3 ]
Nogueira, Ramon [1 ]
Starfield, Richard [1 ]
机构
[1] Univ Witwatersrand, Wits Bioinformat, ZA-2050 Wits, South Africa
[2] Univ Western Cape, South African Natl Bioinformat Inst, ZA-7535 Bellville, South Africa
[3] Univ Bielefeld, AG Genominformat, Tech Fak, D-33501 Bielefeld, Germany
基金
新加坡国家研究基金会; 英国医学研究理事会;
关键词
D O I
10.1093/bioinformatics/btn203
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The wcd system is an open source tool for clustering expressed sequence tags (EST) and other DNA and RNA sequences. wcd allows efficient all-versus-all comparison of ESTs using either the d(2) distance function or edit distance, improving existing implementations of d(2). It supports merging, refinement and reclustering of clusters. It is drop in compatible with the StackPack clustering package. wcd supports parallelization under both shared memory and cluster architectures. It is distributed with an EMBOSS wrapper allowing wcd to be installed as part of an EMBOSS installation (and so provided by a web server).
引用
收藏
页码:1542 / 1546
页数:5
相关论文
共 11 条
[1]  
HAZELHURST S, 2008, S AFRICAN COMPUT J, V40
[2]  
HAZELHURST S, 2003, TRWITSCS20031 U WITW
[3]  
Hide W, 1994, J Comput Biol, V1, P199, DOI 10.1089/cmb.1994.1.199
[4]   CAP3: A DNA sequence assembly program [J].
Huang, XQ ;
Madan, A .
GENOME RESEARCH, 1999, 9 (09) :868-877
[5]   Integrative annotation of 21,037 human genes validated by full-length cDNA clones [J].
Imanishi, T ;
Itoh, T ;
Suzuki, Y ;
O'Donovan, C ;
Fukuchi, S ;
Koyanagi, KO ;
Barrero, RA ;
Tamura, T ;
Yamaguchi-Kabata, Y ;
Tanino, M ;
Yura, K ;
Miyazaki, S ;
Ikeo, K ;
Homma, K ;
Kasprzyk, A ;
Nishikawa, T ;
Hirakawa, M ;
Thierry-Mieg, J ;
Thierry-Mieg, D ;
Ashurst, J ;
Jia, LB ;
Nakao, M ;
Thomas, MA ;
Mulder, N ;
Karavidopoulou, Y ;
Jin, LH ;
Kim, S ;
Yasuda, T ;
Lenhard, B ;
Eveno, E ;
Suzuki, Y ;
Yamasaki, C ;
Takeda, J ;
Gough, C ;
Hilton, P ;
Fujii, Y ;
Sakai, H ;
Tanaka, S ;
Amid, C ;
Bellgard, M ;
Bonaldo, MD ;
Bono, H ;
Bromberg, SK ;
Brookes, AJ ;
Bruford, E ;
Carninci, P ;
Chelala, C ;
Couillault, C ;
de Souza, SJ ;
Debily, MA .
PLOS BIOLOGY, 2004, 2 (06) :856-875
[6]   Space and time efficient parallel algorithms and software for EST clustering [J].
Kalyanaraman, A ;
Aluru, S ;
Brendel, V ;
Kothari, S .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2003, 14 (12) :1209-1221
[7]   Fast sequence clustering using a suffix array algorithm [J].
Malde, K ;
Coward, E ;
Jonassen, I .
BIOINFORMATICS, 2003, 19 (10) :1221-1226
[8]   A comprehensive approach to clustering of expressed human gene sequence: The sequence tag alignment and consensus knowledge base [J].
Miller, RT ;
Christoffels, AG ;
Gopalakrishnan, C ;
Burke, J ;
Ptitsyn, AA ;
Broveak, TR ;
Hide, WA .
GENOME RESEARCH, 1999, 9 (11) :1143-1155
[9]   A hitchhiker's guide to expressed sequence tag (EST) analysis [J].
Nagaraj, Shivashankar H. ;
Gasser, Robin B. ;
Ranganathan, Shoba .
BRIEFINGS IN BIOINFORMATICS, 2007, 8 (01) :6-21
[10]  
REED G, 2001, BRIEF BIOINFORM, V2, P388