X-hitting: An algorithm for novelty detection and dereplication by UV spectra of complex mixtures of natural products

Cited by: 19
Authors
Hansen, ME [1]
Smedsgaard, J [1]
Larsen, TO [1]
Affiliations
[1] Tech Univ Denmark, CMB, BioCentrum, DK-2800 Lyngby, Denmark
DOI
10.1021/ac040191e
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
A major challenge in lead discovery is to detect well-known and trivial compounds rapidly, a process known as dereplication, so that isolation, structure elucidation, and pharmacological investigations can be focused on novel compounds. In this paper, we present a new algorithm, X-hitting, based on cross-sample comparison of full UV spectra from HPLC analysis of highly complex natural product extracts/samples. X-hitting allows automatic identification of known compounds but, more importantly, also allows potentially new or similar compounds to be found. We demonstrate this new algorithm by automatic identification of known structures, a task we call cross-hitting, and tentative identification of potentially new bioactive compounds, a task we call new-hitting, in HPLC data from analysis of fungal extracts. Both tasks are illustrated using 18 important reference compounds and complex fungal extracts obtained from isolates in the IBT Culture Collection held at BioCentrum-DTU, Technical University of Denmark. The receiver operating characteristic (ROC) statistic is used to evaluate the performance of the compound predictor, and it was found that compounds could be identified with high confidence (AUC ≈ 0.98). Based on high confidence in retrieving identical spectra, the method is extended to include similar but still different spectra.
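As a rough illustration of the evaluation idea in the abstract, the sketch below scores candidate UV spectra against a reference spectrum and summarizes the ranking with the area under the ROC curve. The abstract does not state which similarity measure X-hitting uses, so cosine similarity is an assumption here, and the function names (`cosine_similarity`, `roc_auc`) and the toy absorbance values are hypothetical, not taken from the paper.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length absorbance vectors.

    Assumed similarity measure; the abstract does not name the one used.
    """
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def roc_auc(scores, labels):
    """AUC as the probability that a random positive outscores a random
    negative, counting ties as one half (the rank-based definition)."""
    pos = [s for s, is_match in zip(scores, labels) if is_match]
    neg = [s for s, is_match in zip(scores, labels) if not is_match]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy data (invented): absorbance sampled at four wavelengths.
reference = [0.1, 0.8, 0.3, 0.05]
candidates = [
    ([0.2, 1.6, 0.6, 0.1], True),     # same shape, higher concentration
    ([0.1, 0.7, 0.35, 0.05], True),   # same compound, slight noise
    ([0.9, 0.1, 0.1, 0.6], False),    # different chromophore
    ([0.5, 0.5, 0.5, 0.5], False),    # flat background
]

scores = [cosine_similarity(reference, spectrum) for spectrum, _ in candidates]
labels = [is_match for _, is_match in candidates]
auc = roc_auc(scores, labels)
print(f"AUC on toy data: {auc:.2f}")
```

On real data the scores would come from comparing every peak spectrum in an extract against each reference compound, and an AUC close to 1 would indicate that true matches are consistently ranked above non-matches.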
Pages: 6805-6817
Number of pages: 13