Comparative analysis of targeted metabolomics: Dominance-based rough set approach versus orthogonal partial least square-discriminant analysis

被引:84
作者
Blasco, H. [1 ,2 ,3 ]
Blaszczynski, J. [4 ]
Billaut, J. C. [6 ]
Nadal-Desbarats, L. [1 ,2 ,7 ]
Pradat, P. F. [8 ]
Devos, D. [9 ]
Moreau, C. [9 ]
Andres, C. R. [1 ,2 ,3 ]
Emond, P. [1 ,2 ,7 ]
Corcia, P. [1 ,2 ,10 ]
Slowinski, R. [4 ,5 ]
机构
[1] Inserm U930, Tours, France
[2] Univ Tours, Tours, France
[3] CHRU Tours, Lab Biochim & Biol Mol, F-37044 Tours 1, France
[4] Poznan Univ Tech, Inst Comp Sci, PL-60965 Poznan, Poland
[5] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[6] Univ Tours, CNRS, LI EA 6300, OC ERL CNRS 6305, Tours, France
[7] Univ Tours, PPF, Tours, France
[8] Hop La Pitie Salpetriere, Ctr Referent Malad Rare SLA, Federat Malad Syst Nerveux, Paris, France
[9] CHRU Lille, Serv Neurol, F-59037 Lille, France
[10] CHRU Bretonneau, Serv Neurol, Ctr SLA, Tours, France
关键词
Metabolomics; OPLS-DA; Dominance-based rough set approach; Bayesian confirmation; Diagnosis prediction; Amyotrophic lateral sclerosis; MASS-SPECTROMETRY; SYSTEMS BIOLOGY; STRATEGY;
D O I
10.1016/j.jbi.2014.12.001
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
Background: Metabolomics is an emerging field that includes ascertaining a metabolic profile from a combination of small molecules, and which has health applications. Metabolomic methods are currently applied to discover diagnostic biomarkers and to identify pathophysiological pathways involved in pathology. However, metabolomic data are complex and are usually analyzed by statistical methods. Although the methods have been widely described, most have not been either standardized or validated. Data analysis is the foundation of a robust methodology, so new mathematical methods need to be developed to assess and complement current methods. We therefore applied, for the first time, the dominance-based rough set approach (DRSA) to metabolomics data; we also assessed the complementarity of this method with standard statistical methods. Some attributes were transformed in a way allowing us to discover global and local monotonic relationships between condition and decision attributes. We used previously published metabolomics data (18 variables) for amyotrophic lateral sclerosis (ALS) and non-ALS patients. Results: Principal Component Analysis (PCA) and Orthogonal Partial Least Square-Discriminant Analysis (OPLS-DA) allowed satisfactory discrimination (72.7%) between ALS and non-ALS patients. Some discriminant metabolites were identified: acetate, acetone, pyruvate and glutamine. The concentrations of acetate and pyruvate were also identified by univariate analysis as significantly different between ALS and non-ALS patients. DRSA correctly classified 68.7% of the cases and established rules involving some of the metabolites highlighted by OPLS-DA (acetate and acetone). Some rules identified potential biomarkers not revealed by OPLS-DA (beta-hydroxybutyrate). We also found a large number of common discriminating metabolites after Bayesian confirmation measures, particularly acetate, pyruvate, acetone and ascorbate, consistent with the pathophysiological pathways involved in ALS. Conclusion: DRSA provides a complementary method for improving the predictive performance of the multivariate data analysis usually used in metabolomics. This method could help in the identification of metabolites involved in disease pathogenesis. Interestingly, these different strategies mostly identified the same metabolites as being discriminant. The selection of strong decision rules with high value of Bayesian confirmation provides useful information about relevant condition-decision relationships not otherwise revealed in metabolomics data. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:291 / 299
页数:9
相关论文
共 33 条
[1]
Blasco H, 2013, J PROTEOME RES
[2]
1H-NMR-Based Metabolomic Profiling of CSF in Early Amyotrophic Lateral Sclerosis [J].
Blasco, Helene ;
Corcia, Philippe ;
Moreau, Caroline ;
Veau, Segolene ;
Fournier, Clementine ;
Vourc'h, Patrick ;
Emond, Patrick ;
Gordon, Paul ;
Pradat, Pierre-Francois ;
Praline, Julien ;
Devos, David ;
Nadal-Desbarats, Lydie ;
Andres, Christian R. .
PLOS ONE, 2010, 5 (10)
[3]
Blaszczynski J, 2009, EUR C MACH LEARN PRI, P19
[4]
Inductive discovery of laws using monotonic rules [J].
Blaszczynski, Jerzy ;
Greco, Salvatore ;
Slowinski, Roman .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2012, 25 (02) :284-294
[5]
Blaszczynski J, 2011, LECT NOTES ARTIF INT, V6954, P36, DOI 10.1007/978-3-642-24425-4_7
[6]
Sequential covering rule induction algorithm for variable consistency rough set approaches [J].
Blaszczynski, Jerzy ;
Slowinski, Roman ;
Szelag, Marcin .
INFORMATION SCIENCES, 2011, 181 (05) :987-1002
[7]
Monotonic Variable Consistency Rough Set Approaches [J].
Blaszczynski, Jerzy ;
Greco, Salvatore ;
Slowinski, Roman ;
Szelag, Marcin .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2009, 50 (07) :979-999
[8]
Btaszczynski J, 2010, LNCS, V5946, P40
[9]
Evaluation of the orthogonal projection on latent structure model limitations caused by chemical shift variability and improved visualization of biomarker changes in 1H NMR spectroscopic metabonomic studies [J].
Cloarec, O ;
Dumas, ME ;
Trygg, J ;
Craig, A ;
Barton, RH ;
Lindon, JC ;
Nicholson, JK ;
Holmes, E .
ANALYTICAL CHEMISTRY, 2005, 77 (02) :517-526
[10]
Implementation of a semi-automated strategy for the annotation of metabolomic fingerprints generated by liquid chromatography-high resolution mass spectrometry from biological samples [J].
Courant, Frederique ;
Royer, Anne-Lise ;
Chereau, Sylvain ;
Morvan, Marie-Line ;
Monteau, Fabrice ;
Antignac, Jean-Philippe ;
Le Bizec, Bruno .
ANALYST, 2012, 137 (21) :4958-4967