Improved classification accuracy in 1-and 2-dimensional NMR metabolomics data using the variance stabilising generalised logarithm transformation

被引:172
作者
Parsons, Helen M.
Ludwig, Christian
Guenther, Ulrich L.
Viant, Mark R. [1 ]
机构
[1] Univ Birmingham, Ctr Syst Biol, Birmingham B15 2TT, W Midlands, England
[2] Univ Birmingham, Biomol NMR Spect, Birmingham B15 2TT, W Midlands, England
[3] Univ Birmingham, Sch Biosci, Birmingham B15 2TT, W Midlands, England
基金
英国自然环境研究理事会;
关键词
D O I
10.1186/1471-2105-8-234
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Classifying nuclear magnetic resonance (NMR) spectra is a crucial step in many metabolomics experiments. Since several multivariate classification techniques depend upon the variance of the data, it is important to first minimise any contribution from unwanted technical variance arising from sample preparation and analytical measurements, and thereby maximise any contribution from wanted biological variance between different classes. The generalised logarithm (glog) transform was developed to stabilise the variance in DNA microarray datasets, but has rarely been applied to metabolomics data. In particular, it has not been rigorously evaluated against other scaling techniques used in metabolomics, nor tested on all forms of NMR spectra including 1-dimensional (1D) H-1, projections of 2D H-1, H-1 J-resolved (pJRES), and intact 2D J-resolved (JRES). Results: Here, the effects of the glog transform are compared against two commonly used variance stabilising techniques, autoscaling and Pareto scaling, as well as unscaled data. The four methods are evaluated in terms of the effects on the variance of NMR metabolomics data and on the classification accuracy following multivariate analysis, the latter achieved using principal component analysis followed by linear discriminant analysis. For two of three datasets analysed, classification accuracies were highest following glog transformation: 100% accuracy for discriminating 1D NMR spectra of hypoxic and normoxic invertebrate muscle, and 100% accuracy for discriminating 2D JRES spectra of fish livers sampled from two rivers. For the third dataset, pJRES spectra of urine from two breeds of dog, the glog transform and autoscaling achieved equal highest accuracies. Additionally we extended the glog algorithm to effectively suppress noise, which proved critical for the analysis of 2D JRES spectra. Conclusion: We have demonstrated that the glog and extended glog transforms stabilise the technical variance in NMR metabolomics datasets. This significantly improves the discrimination between sample classes and has resulted in higher classification accuracies compared to unscaled, autoscaled or Pareto scaled data. Additionally we have confirmed the broad applicability of the glog approach using three disparate datasets from different biological samples using 1D NMR spectra, 1D projections of 2D JRES spectra, and intact 2D JRES spectra.
引用
收藏
页数:16
相关论文
共 27 条
[11]   Metabolomics by numbers: acquiring and understanding global metabolite data [J].
Goodacre, R ;
Vaidyanathan, S ;
Dunn, WB ;
Harrigan, GG ;
Kell, DB .
TRENDS IN BIOTECHNOLOGY, 2004, 22 (05) :245-252
[12]   Direct sampling of organisms from the field and knowledge of their phenotype: Key recommendations for environmental metabolomics [J].
Hines, Adam ;
Oladiran, Gbolahan Samuel ;
Bignell, John P. ;
Stentiford, Grant D. ;
Viant, Mark R. .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2007, 41 (09) :3375-3381
[13]   A functional analysis of mouse models of cardiac disease through metabolic profiling [J].
Jones, GLAH ;
Sang, E ;
Goddard, C ;
Mortishire-Smith, RJ ;
Sweatman, BC ;
Haselden, JN ;
Davies, K ;
Grace, AA ;
Clarke, K ;
Griffin, JL .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (09) :7530-7539
[14]   Improved analysis of multivariate data by variable stability scaling: application to NMR-based metabolic profiling [J].
Keun, HC ;
Ebbels, TMD ;
Antti, H ;
Bollard, ME ;
Beckonert, O ;
Holmes, E ;
Lindon, JC ;
Nicholson, JK .
ANALYTICA CHIMICA ACTA, 2003, 490 (1-2) :265-276
[15]  
KIEFTE M, DISCRIMINANT ANAL TO
[16]   Metabolomic analysis of methyl jasmonate treated Brassica rapa leaves by 2-dimensional NMR spectroscopy [J].
Liang, Yun-Sa ;
Choi, Young Hae ;
Kim, Hye Kyong ;
Linthorst, Huub J. M. ;
Verpoorte, Robert .
PHYTOCHEMISTRY, 2006, 67 (22) :2503-2511
[17]  
LIN CY, 2007, EVALUATION METABOLIT, V3, P55, DOI [10.1007/s11306-006-0043-1, DOI 10.1007/S11306-006-0043-1]
[18]   Pattern recognition methods and applications in biomedical magnetic resonance [J].
Lindon, JC ;
Holmes, E ;
Nicholson, JK .
PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY, 2001, 39 (01) :1-40
[19]   Discrimination models using variance-stabilizing transformation of metabolomic NMR data [J].
Purohit, PV ;
Rocke, DM ;
Viant, MR ;
Woodruff, DL .
OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2004, 8 (02) :118-130
[20]  
Ripley B.D., 1996, PATTERN RECOGN