Parallel creation of non-redundant gene indices from partial mRNA transcripts

被引:14
作者
Trivedi, N
Bischof, J
Davis, S
Pedretti, K
Scheetz, TE
Braun, TA
Roberts, CA
Robinson, NL
Sheffield, VC
Soares, AB
Casavant, TL [1 ]
机构
[1] Univ Iowa, Dept Elect & Comp Engn, Parallel Proc Lab, Iowa City, IA 52242 USA
[2] Univ Iowa, Dept Elect & Comp Engn, Coordinated Lab Computat Genom, Iowa City, IA 52242 USA
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2002年 / 18卷 / 06期
关键词
parallel cluster application; expressed sequence tag; genome project;
D O I
10.1016/S0167-739X(02)00059-6
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper describes the UIcluster software tool, which partitions expressed sequence tag (EST) sequences and other genetic sequences into "clusters" based on sequence similarity. Ideally, each cluster will contain sequences that all represent the same gene. UIcluster has been developed over the course of 4 years to solve this problem efficiently and accurately for large data sets consisting of tens or hundreds of thousands of EST sequences. The latest version of the application has been parallelized using the MPI standard. Both the computation and memory requirements of the program can be distributed among multiple (possibly distributed) UNIX processes. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:863 / 870
页数:8
相关论文
共 8 条
[1]  
ADAMS MD, 1995, NATURE, V377, P3
[2]   Normalization and subtraction: Two approaches to facilitate gene discovery [J].
Bonaldo, MDF ;
Lennon, G ;
Soares, MB .
GENOME RESEARCH, 1996, 6 (09) :791-806
[3]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921
[4]   A comprehensive approach to clustering of expressed human gene sequence: The sequence tag alignment and consensus knowledge base [J].
Miller, RT ;
Christoffels, AG ;
Gopalakrishnan, C ;
Burke, J ;
Ptitsyn, AA ;
Broveak, TR ;
Hide, WA .
GENOME RESEARCH, 1999, 9 (11) :1143-1155
[5]  
PARSONS JD, 1992, COMPUT APPL BIOSCI, V8, P461
[6]   Pieces of the puzzle: expressed sequence tags and the catalog of human genes [J].
Schuler, GD .
JOURNAL OF MOLECULAR MEDICINE-JMM, 1997, 75 (10) :694-698
[7]  
*U TN, 1994, CS94230 U TENN
[8]   The sequence of the human genome [J].
Venter, JC ;
Adams, MD ;
Myers, EW ;
Li, PW ;
Mural, RJ ;
Sutton, GG ;
Smith, HO ;
Yandell, M ;
Evans, CA ;
Holt, RA ;
Gocayne, JD ;
Amanatides, P ;
Ballew, RM ;
Huson, DH ;
Wortman, JR ;
Zhang, Q ;
Kodira, CD ;
Zheng, XQH ;
Chen, L ;
Skupski, M ;
Subramanian, G ;
Thomas, PD ;
Zhang, JH ;
Miklos, GLG ;
Nelson, C ;
Broder, S ;
Clark, AG ;
Nadeau, C ;
McKusick, VA ;
Zinder, N ;
Levine, AJ ;
Roberts, RJ ;
Simon, M ;
Slayman, C ;
Hunkapiller, M ;
Bolanos, R ;
Delcher, A ;
Dew, I ;
Fasulo, D ;
Flanigan, M ;
Florea, L ;
Halpern, A ;
Hannenhalli, S ;
Kravitz, S ;
Levy, S ;
Mobarry, C ;
Reinert, K ;
Remington, K ;
Abu-Threideh, J ;
Beasley, E .
SCIENCE, 2001, 291 (5507) :1304-+