DNA sequence-based "Bar codes" for tracking the origins of expressed sequence tags from a maize cDNA library constructed using multiple mRNA sources

被引:29
作者
Qiu, F
Guo, L
Wen, TJ
Liu, F
Ashlock, DA
Schnable, PS [1 ]
机构
[1] Iowa State Univ, Dept Agron, Ames, IA 50011 USA
[2] Iowa State Univ, Dept Math, Ames, IA 50011 USA
[3] Iowa State Univ, Interdept Grad Program Bioinformat & Computat Bio, Ames, IA 50011 USA
[4] Iowa State Univ, Interdept Genet Grad Programs, Ames, IA 50011 USA
[5] Iowa State Univ, Ctr Plant Genom, Ames, IA 50011 USA
关键词
D O I
10.1104/pp.103.025015
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
To enhance gene discovery, expressed sequence tag (EST) projects often make use of cDNA libraries produced using diverse mixtures of mRNAs. As such, expression data are lost because the origins of the resulting ESTs cannot be determined. Alternatively, multiple libraries can be prepared, each from a more restricted source of mRNAs. Although this approach allows the origins of ESTs to be determined, it requires the production of multiple libraries. A hybrid approach is reported here. A cDNA library was prepared using 21 different pools of maize (Zea mays) mRNAs. DNA sequence "bar codes" were added during first-strand cDNA synthesis to uniquely identify the mRNA source pool from which individual cDNAs were derived. Using a decoding algorithm that included error correction, it was possible to identify the source mRNA pool of more than 97% of the ESTs. The frequency at which a bar code is represented in an EST contig should be proportional to the abundance of the corresponding mRNA in the source pool. Consistent with this, all ESTs derived from several genes (zein and adh1) that are known to be exclusively expressed in kernels or preferentially expressed under anaerobic conditions, respectively, were exclusively tagged with bar codes associated with mRNA pools prepared from kernel and anaerobically treated seedlings, respectively. Hence, by allowing for the retention of expression data, the bar coding of cDNA libraries can enhance the value of EST projects.
引用
收藏
页码:475 / 481
页数:7
相关论文
共 17 条
[1]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[2]  
Ashlock D, 2002, IEEE C EVOL COMPUTAT, P1296, DOI 10.1109/CEC.2002.1004430
[3]   Both 5' and 3' sequences of maize adh1 mRNA are required for enhanced translation under low-oxygen conditions [J].
BaileySerres, J ;
Dawe, RK .
PLANT PHYSIOLOGY, 1996, 112 (02) :685-695
[4]   Normalization and subtraction: Two approaches to facilitate gene discovery [J].
Bonaldo, MDF ;
Lennon, G ;
Soares, MB .
GENOME RESEARCH, 1996, 6 (09) :791-806
[5]   DNA sequence quality trimming and vector removal [J].
Chou, HH ;
Holmes, MH .
BIOINFORMATICS, 2001, 17 (12) :1093-1104
[6]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185
[7]  
Gusfield D, 1997, ALGORITHMS STRINGS T
[8]   CAP3: A DNA sequence assembly program [J].
Huang, XQ ;
Madan, A .
GENOME RESEARCH, 1999, 9 (09) :868-877
[9]   TRANSCRIPTIONAL REGULATION OF PS-IAA4/5 AND PS-IAA6 EARLY GENE-EXPRESSION BY INDOLEACETIC-ACID AND PROTEIN-SYNTHESIS INHIBITORS IN PEA (PISUM-SATIVUM) [J].
KOSHIBA, T ;
BALLAS, N ;
WONG, LM ;
THEOLOGIS, A .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 253 (03) :396-413
[10]  
Lal A, 1999, CANCER RES, V59, P5403