Analysis of high throughput protein expression in Escherichia coli

被引:13
作者
Benita, Yair
Wise, Michael J.
Lok, Martin C.
Humphery-Smith, Ian
Oosting, Ronald S.
机构
[1] Univ Utrecht, Dept Psychopharmacol, UIPS, NL-3584 CA Utrecht, Netherlands
[2] Univ Utrecht, Dept Pharmaceut, UIPS, NL-3584 CA Utrecht, Netherlands
[3] Univ Western Australia, Crawley, WA 6009, Australia
[4] Biosyst Informat Inst, Newcastle Upon Tyne NE1 4EP, Tyne & Wear, England
关键词
D O I
10.1074/mcp.M600140-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The ability to efficiently produce hundreds of proteins in parallel is the most basic requirement of many aspects of proteomics. Overcoming the technical and financial barriers associated with high throughput protein production is essential for the development of an experimental platform to query and browse the protein content of a cell ( e. g. protein and antibody arrays). Proteins are inherently different one from another in their physicochemical properties; therefore, no single protocol can be expected to successfully express most of the proteins. Instead of optimizing a protocol to express a specific protein, we used sequence analysis tools to estimate the probability of a specific protein to be expressed successfully using a given protocol, thereby avoiding a priori proteins with a low success probability. A set of 547 proteins, to be used for antibody production and selection, was expressed in Escherichia coli using a high throughput protein production pipeline. Protein properties derived from sequence alone were correlated to successful expression, and general guidelines are given to increase the efficiency of similar pipelines. A second set of 68 proteins was expressed to investigate the link between successful protein expression and inclusion body formation. More proteins were expressed in inclusion bodies; however, the formation of inclusion bodies was not a requirement for successful expression.
引用
收藏
页码:1567 / 1580
页数:14
相关论文
共 42 条
[1]   Affinity proteomics for systematic protein profiling of chromosome 21 gene products in human tissues [J].
Agaton, C ;
Galli, J ;
Guthenberg, IH ;
Janzon, L ;
Hansson, M ;
Asplund, A ;
Brundell, E ;
Lindberg, S ;
Ruthberg, I ;
Wester, K ;
Wurtz, D ;
Höög, C ;
Lundeberg, J ;
Ståhl, S ;
Pontén, F ;
Uhlén, M .
MOLECULAR & CELLULAR PROTEOMICS, 2003, 2 (06) :405-414
[2]  
[Anonymous], 2000, InNovations
[3]  
[Anonymous], 2006, R LANG ENV STAT COMP
[4]   Recombinant protein folding and misfolding in Escherichia coli [J].
Baneyx, F ;
Mujacic, M .
NATURE BIOTECHNOLOGY, 2004, 22 (11) :1399-1408
[5]   Regionalized GC content of template DNA as a predictor of PCR success [J].
Benita, Y ;
Oosting, RS ;
Lok, MC ;
Wise, MJ ;
Humphery-Smith, I .
NUCLEIC ACIDS RESEARCH, 2003, 31 (16)
[6]   SPINE: an integrated tracking database and data mining approach for identifying feasible targets in high-throughput structural proteomics [J].
Bertone, P ;
Kluger, Y ;
Lan, N ;
Zheng, DY ;
Christendat, D ;
Yee, A ;
Edwards, AM ;
Arrowsmith, CH ;
Montelione, GT ;
Gerstein, M .
NUCLEIC ACIDS RESEARCH, 2001, 29 (13) :2884-2898
[7]   Proteome-scale purification of human proteins from bacteria [J].
Braun, P ;
Hu, YH ;
Shen, BH ;
Halleck, A ;
Koundinya, M ;
Harlow, E ;
LaBaer, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (05) :2654-2659
[8]  
CHAPMAN B, 2000, ACM SIGBIO NEWSLETTE, V20, P15
[9]   Structural proteomics: prospects for high throughput sample preparation [J].
Christendat, D ;
Yee, A ;
Dharamsi, A ;
Kluger, Y ;
Gerstein, M ;
Arrowsmith, CH ;
Edwards, AM .
PROGRESS IN BIOPHYSICS & MOLECULAR BIOLOGY, 2000, 73 (05) :339-345
[10]   A phylogenetic approach to target selection for structural genomics: solution structure of YciH [J].
Cort, JR ;
Koonin, EV ;
Bash, PA ;
Kennedy, MA .
NUCLEIC ACIDS RESEARCH, 1999, 27 (20) :4018-4027