Genetic algorithm-based feature selection in high-resolution NMR spectra

Cited by: 38
Authors
Cho, Hyun-Woo [2]
Kim, Seoung Bum [1]
Jeong, Myong K. [2]
Park, Youngja
Ziegler, Thomas R.
Jones, Dean P. [3]
Affiliations
[1] Univ Texas Arlington, Dept Ind & Mfg Syst Engn, Arlington, TX 76019 USA
[2] Univ Tennessee, Dept Ind & Informat Engn, Knoxville, TN 37996 USA
[3] Emory Univ, Dept Med, Ctr Clin & Mol Nutr, Clin Biomarkers Lab, Atlanta, GA 30322 USA
Keywords
metabolomics; nuclear magnetic resonance (NMR); feature selections; discrimination; genetic algorithm (GA); orthogonal signal correction filter
DOI
10.1016/j.eswa.2007.08.050
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
High-resolution nuclear magnetic resonance (NMR) spectroscopy has provided a new means of detecting and recognizing metabolic changes in biological systems in response to pathophysiological stimuli and to the intake of toxins or nutrition. To identify meaningful patterns in NMR spectra, various statistical pattern recognition methods have been applied to reduce their complexity and uncover implicit metabolic patterns. In this paper, we present a genetic algorithm (GA)-based feature selection method to determine the major metabolite features that play a significant role in discriminating samples among different conditions in high-resolution NMR spectra. In addition, an orthogonal signal filter was employed as a preprocessor of the NMR spectra in order to remove any unwanted variation in the data that is unrelated to the discrimination of different conditions. The results of k-nearest neighbors and partial least squares discriminant analysis of the experimental NMR spectra from human plasma showed the potential advantage of the features obtained from GA-based feature selection combined with an orthogonal signal filter. © 2007 Elsevier Ltd. All rights reserved.
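To make the idea of GA-based feature selection concrete, the following is a minimal sketch and not the authors' implementation. It evolves binary feature masks and scores each mask by leave-one-out k-nearest-neighbor accuracy. The toy data generator, the feature-count penalty, tournament selection, uniform crossover, the mutation rate, and all function names (`make_toy_spectra`, `loo_knn_accuracy`, `fitness`, `ga_select`) are assumptions chosen for illustration. The orthogonal signal filter described in the abstract is not included.

```python
import random

def make_toy_spectra(n=24, n_feat=12, informative=(2, 7), seed=1):
    """Hypothetical stand-in for binned spectra: Gaussian noise, two shifted bins."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(0, 1) for _ in range(n_feat)]
        for j in informative:
            row[j] += 3.0 * label  # only these bins separate the two classes
        X.append(row)
        y.append(label)
    return X, y

def loo_knn_accuracy(X, y, mask, k=3):
    """Leave-one-out k-NN accuracy using only the features where mask is 1."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return -1.0  # an empty feature subset is never acceptable
    correct = 0
    for i in range(len(X)):
        dists = sorted(
            (sum((X[i][j] - X[m][j]) ** 2 for j in cols), y[m])
            for m in range(len(X)) if m != i
        )
        votes = [label for _, label in dists[:k]]
        pred = max(set(votes), key=votes.count)  # majority vote
        correct += pred == y[i]
    return correct / len(X)

def fitness(X, y, mask, penalty=0.01):
    """Accuracy minus a small cost per selected feature (assumed trade-off)."""
    return loo_knn_accuracy(X, y, mask) - penalty * sum(mask)

def ga_select(X, y, pop_size=20, generations=10, p_mut=0.05, seed=0):
    rng = random.Random(seed)
    n_feat = len(X[0])
    # Seed the population with the all-features mask plus random masks.
    pop = [[1] * n_feat] + [
        [rng.randint(0, 1) for _ in range(n_feat)] for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        scores = [fitness(X, y, m) for m in pop]

        def pick():
            # Tournament selection: best of three randomly drawn individuals.
            i = max(rng.sample(range(pop_size), 3), key=lambda t: scores[t])
            return pop[i]

        elite = pop[max(range(pop_size), key=lambda t: scores[t])]
        nxt = [elite[:]]  # elitism: the best mask always survives
        while len(nxt) < pop_size:
            a, b = pick(), pick()
            # Uniform crossover followed by bit-flip mutation.
            child = [ai if rng.random() < 0.5 else bi for ai, bi in zip(a, b)]
            child = [1 - g if rng.random() < p_mut else g for g in child]
            nxt.append(child)
        pop = nxt
    scores = [fitness(X, y, m) for m in pop]
    best = max(range(pop_size), key=lambda t: scores[t])
    return pop[best], scores[best]

if __name__ == "__main__":
    X, y = make_toy_spectra()
    mask, score = ga_select(X, y)
    print("selected bins:", [j for j, b in enumerate(mask) if b])
    print("fitness:", round(score, 3))
```

Because the best mask is carried over unchanged in every generation and the all-features mask is in the starting population, the returned subset never scores worse than using every feature under this fitness definition.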
Pages: 967-975
Page count: 9