Pooled library tissue tags for EST-based gene discovery

被引:15
作者
Gavin, AJ
Scheetz, TE
Roberts, CA
O'Leary, B
Braun, TA
Sheffield, VC
Soares, MB
Robinson, JP
Casavant, TL [1 ]
机构
[1] Univ Iowa, Dept Elect & Comp Engn, Iowa City, IA 52242 USA
[2] Univ Iowa, Dept Pediat, Iowa City, IA 52242 USA
[3] Univ Iowa, Dept Physiol & Biophys, Iowa City, IA 52242 USA
关键词
D O I
10.1093/bioinformatics/18.9.1162
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In gene discovery projects based on EST sequencing, effective post-sequencing identification methods are important in determining tissue sources of ESTs within pooled cDNA libraries. In the past, such identification efforts have been characterized by higher than necessary failure rates due to the presence of errors within the subsequence containing the oligo tag intended to define the tissue source for each EST. Results: A large-scale EST-based gene discovery program at The University of Iowa has led to the creation of a unique software method named UITagCreator usable in the creation of large sets of synthetic tissue identification tags. The identification tags provide error detection and correction capability and, in conjunction with automated annotation software, result in a substantial improvement in the accurate identification of the tissue source in the presence of sequencing and base-calling errors. These identification rates are favorable, relative to past paradigms.
引用
收藏
页码:1162 / 1166
页数:5
相关论文
共 8 条
[1]   Normalization and subtraction: Two approaches to facilitate gene discovery [J].
Bonaldo, MDF ;
Lennon, G ;
Soares, MB .
GENOME RESEARCH, 1996, 6 (09) :791-806
[2]   LEXICOGRAPHIC CODES - ERROR-CORRECTING CODES FROM GAME-THEORY [J].
CONWAY, JH ;
SLOANE, NJA .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1986, 32 (03) :337-348
[3]  
Gusfield D, 1997, ALGORITHMS STRINGS T
[4]  
LEVENSHTEIN V, 1966, SOV PHYS DOKL, V6, P707
[5]   A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS [J].
NEEDLEMAN, SB ;
WUNSCH, CD .
JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) :443-+
[6]  
SETUBAL JC, 1997, INTRO COMPUTATIONAL, P296
[7]  
TRIVEDI N, 2001, TRECE20011213
[8]  
WELDON EJ, 1972, ERROR CORRECTING COD, P560