Statistical analysis of the 5′ untranslated region of human mRNA using "Oligo-Capped" cDNA libraries

Cited by: 124
Authors
Suzuki, Y
Ishihara, D
Sasaki, M
Nakagawa, H
Hata, H
Tsunoda, T
Watanabe, M
Komatsu, T
Ota, T
Isogai, T
Suyama, A
Sugano, S
Affiliations
[1] Univ Tokyo, Inst Med Sci, Dept Virol, Minato Ku, Tokyo 1088639, Japan
[2] RIKEN, Inst Phys & Chem Res, Genome Sci Ctr, Wako, Saitama 3510106, Japan
[3] Helix Res Inst, Kisarazu, Chiba 2920812, Japan
[4] Univ Tokyo, Dept Life Sci, Meguro Ku, Tokyo 1530041, Japan
Funding
Japan Science and Technology Agency
DOI
10.1006/geno.2000.6076
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
We constructed 34 types of human "full-length enriched" and "5'-end enriched" cDNA libraries based on the "Oligo-Capping" method. We randomly picked and sequenced 10,000 clones from these libraries. BLAST analysis showed that about 50% of the cDNAs were identical to known genes. Among them, we selected 954 species of cDNA that should represent the entire sequence from the mRNA start sites. Compared with previously reported sequences, they were on average 45 bp longer at the 5' end. Using these cDNA data, we statistically analyzed the sequence features of the 5'UTR. The average length of the 5'UTR was 125 bp, and there was little correlation with the corresponding mRNA length (correlation coefficient = 0.26). Of the 954 species of 5'UTR, 459 contained no in-frame terminator codon, contrary to common belief. Two hundred seventy-eight species contained at least one ATG codon upstream of the initiator ATG codon. We identified 569 upstream ATGs in total, 63% of which adequately satisfied Kozak's criteria. These findings are contrary to the typical translation initiation model, which states that translation is initiated from the "first" ATG codon. © 2000 Academic Press.
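The three 5'UTR features counted in the abstract (upstream ATGs, in-frame terminator codons, and agreement with Kozak's criteria) can be illustrated with a short sketch. This is not the authors' pipeline: the function names are hypothetical, the initiator ATG is assumed to start immediately after the supplied 5'UTR, and "adequate" Kozak context is assumed here to mean a purine at position -3 or a G at position +4, which is one common simplification and may differ from the criteria the paper applied.

```python
# Illustrative sketch only; names and the Kozak rule are assumptions,
# not taken from the paper.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def upstream_atg_positions(utr):
    """Return 0-based start positions of every ATG lying fully inside the 5'UTR."""
    utr = utr.upper()
    return [i for i in range(len(utr) - 2) if utr[i:i + 3] == "ATG"]


def has_inframe_stop(utr):
    """Return True if the 5'UTR holds a stop codon in frame with the initiator ATG.

    Assumes the initiator ATG begins right after the last base of `utr`,
    so in-frame codons are read backwards from the end in steps of three.
    """
    utr = utr.upper()
    for end in range(len(utr), 2, -3):
        if utr[end - 3:end] in STOP_CODONS:
            return True
    return False


def kozak_adequate(seq, atg_pos):
    """Simplified context check: purine at -3 or G at +4 (assumed rule).

    `seq` is the full sequence (5'UTR followed by coding region) and
    `atg_pos` is the 0-based index of the A of the ATG being tested.
    """
    seq = seq.upper()
    minus3 = seq[atg_pos - 3] if atg_pos >= 3 else ""
    plus4 = seq[atg_pos + 3] if atg_pos + 3 < len(seq) else ""
    return minus3 in ("A", "G") or plus4 == "G"


if __name__ == "__main__":
    utr = "GCTAGCATGGCC"   # toy 5'UTR, not real data
    cds = "ATGGCTTAA"      # toy coding region starting at the initiator ATG
    full = utr + cds
    for pos in upstream_atg_positions(utr):
        print(pos, kozak_adequate(full, pos))
    print(has_inframe_stop(utr))
```

Applied to a set of full-length cDNAs with annotated initiator codons, tallies of these three functions would correspond to the kinds of counts reported above, subject to the stated assumptions.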
Pages: 286-297
Page count: 12