Analysis of the cDNAs of hypothetical genes on Arabidopsis chromosome 2 reveals numerous transcript variants

Cited by: 43
Authors
Xiao, YL [1 ]
Smith, SR [1 ]
Ishmael, N [1 ]
Redman, JC [1 ]
Kumar, N [1 ]
Monaghan, EL [1 ]
Ayele, M [1 ]
Haas, BJ [1 ]
Wu, HC [1 ]
Town, CD [1 ]
Affiliations
[1] Inst Genom Res, Rockville, MD 20850 USA
Keywords
DOI
10.1104/pp.105.063479
Chinese Library Classification number
Q94 [Botany];
Discipline classification code
071001;
Abstract
In the fully sequenced Arabidopsis (Arabidopsis thaliana) genome, many gene models are annotated as "hypothetical protein": their gene structures are predicted solely by computer algorithms, with no support from expressed sequence matches from Arabidopsis or from nucleic acid or protein homologs in other species. To confirm their existence and predicted gene structures, a high-throughput method of rapid amplification of cDNA ends (RACE) was used to obtain their cDNA sequences from 11 cDNA populations. Primers were designed for all 797 hypothetical genes on chromosome 2, and, through 5' and 3' RACE, clones from 506 genes were sequenced and cDNA sequences from 399 target genes were recovered. The cDNA sequences were obtained by assembling their 5' and 3' RACE polymerase chain reaction products. These sequences revealed that (1) the structures of 151 hypothetical genes differed from their predictions; (2) 116 hypothetical genes had alternatively spliced transcripts and 187 genes displayed polyadenylation sites; and (3) some transcripts arose from both strands or from the strand opposite to that of the prediction, and some were possibly dicistronic. Promoters from five randomly chosen hypothetical genes (At2g02540, At2g31270, At2g33640, At2g35550, and At2g36340) were cloned into reporter constructs, and their expression was tissue specific or developmental stage specific. Our results indicate that at least 50% of hypothetical genes on chromosome 2 are expressed in the cDNA populations, with about 38% of the gene structures differing from their predictions. Thus, by using this targeted approach, high-throughput RACE, we revealed numerous transcripts, including many uncharacterized variants, from these hypothetical genes.
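The two percentages quoted in the abstract can be checked against its own counts. The short sketch below is illustrative only: the variable and function names are hypothetical, and the choice of 399 recovered genes as the denominator for the "about 38%" figure is an inference from the numbers, not something the abstract states.

```python
# Counts taken directly from the abstract.
TOTAL_HYPOTHETICAL_GENES = 797   # hypothetical genes on chromosome 2
GENES_WITH_CDNA = 399            # target genes with recovered cDNA sequences
GENES_DIFFERING = 151            # genes whose structure differed from prediction


def percent(part: int, whole: int) -> float:
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


# Share of hypothetical genes with recovered cDNA ("at least 50%").
expressed_pct = percent(GENES_WITH_CDNA, TOTAL_HYPOTHETICAL_GENES)

# Share of structures differing from prediction ("about 38%").
# Assumption: the denominator is the 399 genes with recovered cDNA.
differing_pct = percent(GENES_DIFFERING, GENES_WITH_CDNA)

print(f"expressed: {expressed_pct:.1f}%")
print(f"differing: {differing_pct:.1f}%")
```

Under that assumption the counts give roughly 50.1% and 37.8%, in line with the figures reported in the abstract.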
Pages: 1323-1337
Page count: 15