An integrated approach to optimization of Escherichia coli fermentations using historical data

被引:28
作者
Coleman, MC
Buck, KKS
Block, DE
机构
[1] Univ Calif Davis, Dept Chem Engn & Mat Sci, Davis, CA 95616 USA
[2] Univ Calif Davis, Dept Viticulture & Enol, Davis, CA 95616 USA
关键词
information theoretic subset selection (ITSS); decision tree analysis (RTA); neural networks; hybrid genetic algorithm; historical process data;
D O I
10.1002/bit.10719
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Using a fermentation database for Escherichia coli producing green fluorescent protein (GFP), we have implemented a novel three-step optimization method to identify the process input variables most important in modeling the fermentation, as well as the values of those critical input variables that result in an increase in the desired output. In the first step of this algorithm, we use either decision-tree analysis (DTA) or information theoretic subset selection (ITSS) as a database mining technique to identify which process input variables best classify each of the process outputs (maximum cell concentration, maximum product concentration, and productivity) monitored in the experimental fermentations. The second step of the optimization method is to train an artificial neural network (ANN) model of the process input-output data, using the critical inputs identified in the first step. Finally, a hybrid genetic algorithm (hybrid GA), which includes both gradient and stochastic search methods, is used to identify the maximum output modeled by the ANN and the values of the input conditions that result in that maximum. The results of the database mining techniques are compared, both in terms of the inputs selected and the subsequent ANN performance. For the E. coli process used in this study, we identified 6 inputs from the original 13 that resulted in an ANN that best modeled the GFP fluorescence outputs of an independent test set. Values of the six inputs that resulted in a modeled maximum fluorescence were identified by applying a hybrid GA to the ANN model developed. When these conditions were tested in laboratory fermentors, an actual maximum fluorescence of 2.16E6 AU was obtained. The previous high value of fluorescence that was observed was 1.51E6 AU. Thus, this input condition set that was suggested by implementing the proposed optimization scheme on the available historical database increased the maximum fluorescence by 55%. (C) 2003 Wiley Periodicals, Inc.
引用
收藏
页码:274 / 285
页数:12
相关论文
共 35 条
[1]  
Agresti A., 1990, Analysis of categorical data
[2]   Green fluorescent protein as a real time quantitative reporter of heterologous protein production [J].
Albano, CR ;
Randers-Eichhorn, L ;
Bentley, WE ;
Rao, G .
BIOTECHNOLOGY PROGRESS, 1998, 14 (02) :351-354
[3]  
[Anonymous], INT C NEURAL NETWORK
[4]  
[Anonymous], 1989, GENETIC ALGORITHM SE
[5]  
Blanch H.W., 1996, Biochemical engineering
[6]   Identification of critical batch operating parameters in fed-batch recombinant E-Coli Fermentations using decision tree analysis [J].
Buck, KKS ;
Subramanian, V ;
Block, DE .
BIOTECHNOLOGY PROGRESS, 2002, 18 (06) :1366-1376
[7]  
DeLisa MP, 1999, BIOTECHNOL BIOENG, V65, P54, DOI 10.1002/(SICI)1097-0290(19991005)65:1<54::AID-BIT7>3.3.CO
[8]  
2-I
[9]   TOWARDS IMPROVED PENICILLIN FERMENTATION VIA ARTIFICIAL NEURAL NETWORKS [J].
DIMASSIMO, C ;
MONTAGUE, GA ;
WILLIS, MJ ;
THAM, MT ;
MORRIS, AJ .
COMPUTERS & CHEMICAL ENGINEERING, 1992, 16 (04) :283-291
[10]   BIOPROCESS MODEL-BUILDING USING ARTIFICIAL NEURAL NETWORKS [J].
DIMASSIMO, C ;
WILLIS, MJ ;
MONTAGUE, GA ;
THAM, MT ;
MORRIS, AJ .
BIOPROCESS ENGINEERING, 1991, 7 (1-2) :77-82