Toxicity classification from metabonomic data using a density superposition approach: 'CLOUDS'

被引:55
作者
Ebbels, T [1 ]
Keun, H [1 ]
Beckonert, O [1 ]
Antti, H [1 ]
Bollard, M [1 ]
Holmes, E [1 ]
Lindon, J [1 ]
Nicholson, J [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Fac Med, Biomed Sci Div, London SW7 2AZ, England
关键词
probabilistic classification; CLOUDS; probabilistic neural networks; metabonomics; toxicity prediction;
D O I
10.1016/S0003-2670(03)00121-1
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Predicting and avoiding the potential toxicity of candidate drugs is of fundamental importance to the pharmaceutical industry. The consortium for metabonomic toxicology (COMET) project aims to construct databases and metabolic models of drug toxicity using ca. 100,000 600 MHz H-1 NMR spectra of biofluids from laboratory rats and mice treated with model toxic compounds. Chemometric methods are being used to characterise the time-related and dose-specific effects of toxins on the endogenous metabolite profiles. Here we present a probabilistic approach to the classification of a large data set of COMET samples using Classification Of Unknowns by Density Superposition (CLOUDS), a novel non-neural implementation of a classification technique developed from probabilistic neural networks. NMR spectra of urine from rats from 19 different treatment groups, collected over 8 days, were processed to produce a data matrix with 2844 samples and 205 spectral variables. The spectra were normalised to account for gross concentration differences in the urine and regions corresponding to non-endogenous metabolites (0.4% of the data) were treated as missing values. Modeling the data according to organ of effect (control, liver, kidney or other organ), with a 50/50 train/test set split, over 90% of the test samples were classified as belonging to the correct group. In particular, samples from liver and kidney treatments were classified with 77 and 90% success, respectively, with only a 2% misclassification rate between these classes. Further analysis of the data, counting each of the 19 treatment groups as separate classes, resulted in a mean success rate across groups of 74%. Finally, as a severe test, the data were split into 88 classes, each representing a particular toxin at a particular time point. Fifty-four percent of the spectra from non-control samples were classified correctly, particularly successful when compared to the null success rate of similar to1% expected from random class assignment. The CLOUDS technique has advantages when modelling complex multi-dimensional distributions, giving a probabilistic rather than absolute class description of the data and is particularly amenable to inclusion of prior knowledge such as uncertainties in the data descriptors. This work shows that it is possible to construct viable and informative models of metabonomic data using the CLOUDS methodology, delineating the whole time course of toxicity. These models will be useful in building hybrid expert systems for predicting toxicology, which are the ultimate goal of the COMET project. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:109 / 122
页数:14
相关论文
共 29 条
  • [1] CLASSIFICATION OF TOXIN-INDUCED CHANGES IN H-1-NMR SPECTRA OF URINE USING AN ARTIFICIAL NEURAL-NETWORK
    ANTHONY, ML
    ROSE, VS
    NICHOLSON, JK
    LINDON, JC
    [J]. JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 1995, 13 (03) : 205 - 211
  • [2] Batch statistical processing of 1H NMR-derived urinary spectral data
    Antti, H
    Bollard, ME
    Ebbels, T
    Keun, H
    Lindon, JC
    Nicholson, JK
    Holmes, E
    [J]. JOURNAL OF CHEMOMETRICS, 2002, 16 (8-10) : 461 - 468
  • [3] Seismic vulnerability assessment of chemical plants through probabilistic neural networks
    Aoki, T
    Ceravolo, R
    De Stefano, A
    Genovese, C
    Sabia, D
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2002, 77 (03) : 263 - 268
  • [4] Azmi JK, 2001, TOXICOLOGY, V164, P226
  • [5] BECKONERT O, 2002, P ANAL CHIM ACTA
  • [6] Nuclear magnetic resonance spectroscopic and principal components analysis investigations into biochemical effects of three model hepatotoxins
    Beckwith-Hall, BM
    Nicholson, JK
    Nicholls, AW
    Foxall, PJD
    Lindon, JC
    Connor, SC
    Abdi, M
    Connelly, J
    Holmes, E
    [J]. CHEMICAL RESEARCH IN TOXICOLOGY, 1998, 11 (04) : 260 - 272
  • [7] Investigations into biochemical changes due to diurnal variation and estrus cycle in female rats using high-resolution 1H NMR spectroscopy of urine and pattern recognition
    Bollard, ME
    Holmes, E
    Lindon, JC
    Mitchell, SC
    Branstetter, D
    Zhang, W
    Nicholson, JK
    [J]. ANALYTICAL BIOCHEMISTRY, 2001, 295 (02) : 194 - 202
  • [8] Brindle JT, 2002, NAT MED, V8, P1439, DOI 10.1038/nm802
  • [9] Earthworm species of the genus Eisenia can be phenotypically differentiated by metabolic profiling
    Bundy, JG
    Spurgeon, DJ
    Svendsen, C
    Hankard, PK
    Osborn, D
    Lindon, JC
    Nicholson, JK
    [J]. FEBS LETTERS, 2002, 521 (1-3) : 115 - 120
  • [10] EBBELS TM, 2002, US200100293802001122