mRNA 5′ region sequence incompleteness:: a potential source of systematic errors in translation initiation codon assignment in human mRNAs

被引:14
作者
Casadei, R [1 ]
Strippoli, P [1 ]
D'Addabbo, P [1 ]
Canaider, S [1 ]
Lenzi, L [1 ]
Vitale, L [1 ]
Giannone, S [1 ]
Frabetti, F [1 ]
Facchin, F [1 ]
Carinci, P [1 ]
Zannotti, M [1 ]
机构
[1] Univ Bologna, Ctr Res Mol Genet, Fondaz CARISBO, Inst Histol & Gen Embryol, I-40126 Bologna, Italy
关键词
genomics; 5 ' UTR (5 ' untranslated region); full-length mRNA; human genome; human chromosome 21;
D O I
10.1016/S0378-1119(03)00835-7
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The amino acid sequence of gene products is routinely deduced from the nucleotide sequence of the relative cloned cDNA, according to the rules for recognition of start codon (first-AUG rule, optimal sequence context) and the genetic code. From this prediction stem most subsequent types of product analysis, although all standard methods for cDNA cloning are affected by a potential inability to effectively clone the 5' region of mRNA. Revision by bioinformatics and cloning methods of 109 known genes located on human chromosome 21 (HC 21) shows that 60 mRNAs lack any in-frame stop upstream of the first-AUG, and that in five cases (DSCR1, KIAA0184, K1AA0539, SON, and TFF3) the coding region at the 5' end was incompletely characterized in the original descriptions. We describe the respective consequences for genomic annotation, domain and ortholog identification, and functional experiments design. We have also analyzed the sequences of 13,124 human mRNAs (RefSeq databank), discovering that in 6448 cases (49%), an in-frame stop codon is present upstream of the initiation codon, while in the other 6676 mRNAs (51%), identification of additional bases at the mRNA 5' region could well reveal some new upstream in-frame AUG codons in the optimal context. Proportionally to the HC 21 data, about 550 known human genes might thus be affected by this 5' end mRNA artifact., (C) 2003 Elsevier B.V. All rights reserved.
引用
收藏
页码:185 / 193
页数:9
相关论文
共 30 条
[1]   A NEW HUMAN GENE FROM THE DOWN-SYNDROME CRITICAL REGION ENCODES A PROLINE-RICH PROTEIN HIGHLY EXPRESSED IN FETAL BRAIN AND HEART [J].
FUENTES, JJ ;
PRITCHARD, MA ;
PLANAS, AM ;
BOSCH, A ;
FERRER, I ;
ESTIVILL, X .
HUMAN MOLECULAR GENETICS, 1995, 4 (10) :1935-1944
[2]   DSCR1, overexpressed in Down syndrome, is an inhibitor of calcineurin-mediated signaling pathways [J].
Fuentes, JJ ;
Genescà, L ;
Kingsbury, TJ ;
Cunningham, KW ;
Pérez-Riba, M ;
Estivill, X ;
de la Luna, S .
HUMAN MOLECULAR GENETICS, 2000, 9 (11) :1681-1690
[3]   Genomic organization, alternative splicing, and expression patterns of the DSCR1 (Down syndrome candidate region 1) gene [J].
Fuentes, JJ ;
Pritchard, MA ;
Estivill, X .
GENOMICS, 1997, 44 (03) :358-361
[4]   Annotation of human chromosome 21 for relevance to Down syndrome: Gene structure and expression analysis [J].
Gardiner, K ;
Slavov, D ;
Bechtel, L ;
Davisson, M .
GENOMICS, 2002, 79 (06) :833-843
[5]   Identification and characterization of a highly conserved calcineurin binding protein, CBP1/calcipressin, in Cryptococcus neoformans [J].
Görlach, J ;
Fox, DS ;
Cutler, NS ;
Cox, GM ;
Perfect, JR ;
Heitman, J .
EMBO JOURNAL, 2000, 19 (14) :3618-3629
[6]   The DNA sequence of human chromosome 21 [J].
Hattori, M ;
Fujiyama, A ;
Taylor, TD ;
Watanabe, H ;
Yada, T ;
Park, HS ;
Toyoda, A ;
Ishii, K ;
Totoki, Y ;
Choi, DK ;
Soeda, E ;
Ohki, M ;
Takagi, T ;
Sakaki, Y ;
Taudien, S ;
Blechschmidt, K ;
Polley, A ;
Menzel, U ;
Delabar, J ;
Kumpf, K ;
Lehmann, R ;
Patterson, D ;
Reichwald, K ;
Rump, A ;
Schillhabel, M ;
Schudy, A ;
Zimmermann, W ;
Rosenthal, A ;
Kudoh, J ;
Shibuya, K ;
Kawasaki, K ;
Asakawa, S ;
Shintani, A ;
Sasaki, T ;
Nagamine, K ;
Mitsuyama, S ;
Antonarakis, SE ;
Minoshima, S ;
Shimizu, N ;
Nordsiek, G ;
Hornischer, K ;
Brandt, P ;
Scharfe, M ;
Schön, O ;
Desario, A ;
Reichelt, J ;
Kauer, G ;
Blöcker, H ;
Ramser, J ;
Beck, A .
NATURE, 2000, 405 (6784) :311-319
[7]   THE SON GENE ENCODES A CONSERVED DNA-BINDING PROTEIN MAPPING TO HUMAN-CHROMOSOME-21 [J].
KHAN, IM ;
FISHER, RA ;
JOHNSON, KJ ;
BAILEY, MES ;
SICILIANO, MJ ;
KESSLING, AM ;
FARRER, M ;
KAMALATI, T ;
BULUWELA, L ;
CARRITT, B .
ANNALS OF HUMAN GENETICS, 1994, 58 :25-34
[8]  
Kingsbury TJ, 2000, GENE DEV, V14, P1595
[9]   CLONING OF GT BOX-BINDING PROTEINS - A NOVEL SP1 MULTIGENE FAMILY REGULATING T-CELL RECEPTOR GENE-EXPRESSION [J].
KINGSLEY, C ;
WINOTO, A .
MOLECULAR AND CELLULAR BIOLOGY, 1992, 12 (10) :4251-4261
[10]   Molecular cloning and functional characterization of a new Cap'n' collar family transcription factor Nrf3 [J].
Kobayashi, A ;
Ito, E ;
Toki, T ;
Kogame, K ;
Takahashi, S ;
Igarashi, K ;
Hayashi, N ;
Yamamoto, M .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1999, 274 (10) :6443-6452