No-alignment-strategies for exploring a set of two-way data tables obtained from capillary electrophores is-mass spectrometry

被引:16
作者
Daszykowski, M. [1 ]
Danielsson, R. [2 ]
Walczak, B. [1 ]
机构
[1] Silesian Univ, Inst Chem, Dept Chemometr, PL-40006 Katowice, Poland
[2] Uppsala Univ, Dept Phys & Analyt Chem Analyt Chem, SE-75124 Uppsala, Sweden
关键词
comparing data tables; hyphenated techniques; two-dimensional fingerprints; alignment; warping; chemometrics;
D O I
10.1016/j.chroma.2008.03.027
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Hyphenated techniques such as capillary electrophoresis-mass spectrometry (CE-MS) or high-performance liquid chromatography with diode array detection (HPLC-DAD), etc., are known to produce a huge amount of data since each sample is characterized by a two-way data table. In this paper different ways of obtaining sample-related information from a set of such tables are discussed. Working with original data requires alignment techniques due to time shifts caused by unavoidable variations in separation conditions. Other pre-processing techniques have been suggested to facilitate comparison among samples without prior peak alignment, for example, 'binning' and/or 'blurring' the data along the time dimension. All these techniques, however, require optimization of some parameters, and in this paper an alternative parameter-free method is proposed. The individual data tables (X) are represented as Gram matrices (XXT), where the summation is taken over the time dimension. Hence the possible variations in time scale are eliminated, while the time information is at least partly preserved by the correlation structure between the detection channels. For comparison among samples, a similarity matrix is constructed and explored by principal component analysis and hierarchical clustering. The Gram matrix approach was tested and compared to some other methods using 'binned' and 'blurred' data for a data set with CE-MS runs on urine samples. In addition to data exploration by principal component analysis and hierarchical clustering, a discriminant partial least squares model was constructed to discriminate between the samples that were taken with and without the prior intake of a drug. The result showed that the proposed method is at least as good as the others with respect to cluster identification and class prediction. A distinct advantage is that there is no need for parameter optimization, while a potential drawback is the large size of the Gram matrices for data with high mass resolution. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:157 / 165
页数:9
相关论文
共 17 条
[1]   Multivariate comparison between peptide mass fingerprints obtained by liquid chromatography-electrospray ionization-mass spectrometry with different trypsin digestion procedures [J].
Backstrom, Daniel ;
Moberg, My ;
Sjoberg, Per J. R. ;
Bergquist, Jonas ;
Danielsson, Rolf .
JOURNAL OF CHROMATOGRAPHY A, 2007, 1171 (1-2) :69-79
[2]  
Bro R, 1999, J CHEMOMETR, V13, P295, DOI 10.1002/(SICI)1099-128X(199905/08)13:3/4<295::AID-CEM547>3.0.CO
[3]  
2-Y
[4]   SAMPLE-DISTANCE PARTIAL LEAST-SQUARES - PLS OPTIMIZED FOR MANY VARIABLES, WITH APPLICATION TO COMFA [J].
BUSH, BL ;
NACHBAR, RB .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1993, 7 (05) :587-619
[5]  
CUESTASANCHEZ F, 1996, CHEMOM INTELL LAB SY, V34, P139
[6]   Rapid multivariate analysis of LC/GC/CE data (single or multiple channel detection) without prior peak alignment [J].
Danielsson, Rolf ;
Backstrom, Daniel ;
Ullsten, Sara .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2006, 84 (1-2) :33-39
[7]   Identifying potential biomarkers in LC-MS data [J].
Daszykowski, M. ;
Wu, W. ;
Nicholls, A. W. ;
Ball, R. J. ;
Czekaj, T. ;
Walczak, B. .
JOURNAL OF CHEMOMETRICS, 2007, 21 (7-9) :292-302
[8]   Use and abuse of chemometrics in chromatography [J].
Daszykowski, Michal ;
Walczak, Beata .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2006, 25 (11) :1081-1096
[9]   Statistical and computational methods for comparative proteomic profiling using liquid chromatography-tandem mass spectrometry [J].
Listgarten, J ;
Emili, A .
MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (04) :419-434
[10]  
Malinowski E.R., 1991, FACTOR ANAL CHEM