Multifactorial Determinants of Protein Expression in Prokaryotic Open Reading Frames

Cited by: 80
Authors
Allert, Malin [1]
Cox, J. Colin [1]
Hellinga, Homme W. [1]
Affiliations
[1] Duke Univ, Med Ctr, Dept Biochem, Durham, NC 27710 USA
Keywords
protein expression; nucleotide composition; mRNA secondary structure; codon usage; synthetic genes; 16S RIBOSOMAL-RNA; MESSENGER-RNA; ESCHERICHIA-COLI; DOWNSTREAM BOX; TRANSLATION INITIATION; CODON USAGE; GENE-EXPRESSION; ENHANCEMENT; EFFICIENCY; SEQUENCES;
DOI
10.1016/j.jmb.2010.08.010
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
A quantitative description of the relationship between protein expression levels and open reading frame (ORF) nucleotide sequences is important for understanding natural systems, designing synthetic systems, and optimizing heterologous expression. Codon identity, mRNA secondary structure, and nucleotide composition within ORFs markedly influence expression levels. Bioinformatic analysis of ORF sequences in 816 bacterial genomes revealed that these features show distinct regional trends. To investigate their effects on protein expression, we designed 285 synthetic genes and determined corresponding expression levels in vitro using Escherichia coli extracts. We developed a mathematical function, parameterized using this synthetic gene data set, which enables computation of protein expression levels from ORF nucleotide sequences. In addition to its practical application in the design of heterologous expression systems, this equation provides mechanistic insight into the factors that control translation efficiency. We found that expression is strongly dependent on the presence of high AU content and low secondary structure in the ORF 5' region. Choice of high-frequency codons contributes to a lesser extent. The 3' terminal AU content makes modest, but detectable contributions. We present a model for the effect of these factors on the three phases of ribosomal function: initiation, elongation, and termination. (C) 2010 Elsevier Ltd. All rights reserved.
Pages: 905-918
Number of pages: 14