Visualization of large-scale aqueous solubility data using a novel hierarchical data visualization technique

被引:14
作者
Yamashita, Fumiyoshi
Itoh, Takayuki
Hara, Hideto
Hashida, Mitsuru
机构
[1] Kyoto Univ, Grad Sch Pharmaceut Sci, Dept Drug Delivery Res, Sakyo Ku, Kyoto 6068501, Japan
[2] Ochanomizu Univ, Fac Sci, Dept Informat Sci, Bunkyo Ku, Tokyo 1128610, Japan
关键词
D O I
10.1021/ci0504770
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
It is a difficult task to recognize the trends in molecular physical properties relevant to a specific chemical class and find a way to optimize potential compounds. We present here a novel hierarchical data visualization technique, named "HeiankyoView", to visualize large-scale multidimensional chemical information. HeiankyoView represents hierarchically organized data objects by mapping leaf nodes as colored square icons and nonleaf nodes as rectangular borders. In this way, data objects can be expressed as equishaped icons without overlapping one another in the two-dimensional display space. HeiankyoView has been applied to visualize aqueous solubility data for 908 compounds collected from the published literature. When the results of a recursive partitioning analysis and hierarchical clustering analysis were visualized, the trends hidden in the solubility data could be effectively displayed as intuitively understandable visual images. Most interestingly, the data visualization technique, without any statistical computations, was able to assist us in extracting from such large-scale data meaningful information establishing that ClogP and the molecular weight are critical factors in determining aqueous solubility. Thus, HeiankyoView is a powerful tool to help us understand structure- activity relationships intuitively from a large-scale data set.
引用
收藏
页码:1054 / 1059
页数:6
相关论文
共 28 条
[1]   The correlation and prediction of the solubility of compounds in water using an amended solvation energy relationship [J].
Abraham, MH ;
Le, J .
JOURNAL OF PHARMACEUTICAL SCIENCES, 1999, 88 (09) :868-880
[2]  
[Anonymous], 1977, MULTIDIMENSIONAL SCA
[3]  
Bahler D, 1993, Proc Int Conf Intell Syst Mol Biol, V1, P29
[4]   Use of classification regression tree in predicting oral absorption in humans [J].
Bai, JPF ;
Utis, A ;
Crippen, G ;
He, HD ;
Fischer, V ;
Tullman, R ;
Yin, HQ ;
Hsu, CP ;
Jiang, L ;
Hwang, KK .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (06) :2061-2069
[5]   Ordered and quantum treemaps: Making effective use of 2D space to display hierarchies [J].
Bederson, BB ;
Shneiderman, B ;
Wattenberg, M .
ACM TRANSACTIONS ON GRAPHICS, 2002, 21 (04) :833-854
[6]  
Butina D, 2003, J CHEM INF COMP SCI, V43, P837, DOI [10.1021/ci020279y, 10.1021/6020279y]
[7]   Classification of drugs in absorption classes using the classification and regression trees (CART) methodology [J].
Deconinck, E ;
Hancock, T ;
Coomans, D ;
Massart, DL ;
Vander Heyden, Y .
JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2005, 39 (1-2) :91-103
[8]   Generation and validation of rapid computational filters for CYP2D6 and CYP3A4 [J].
Ekins, S ;
Berbaum, J ;
Harrison, RK .
DRUG METABOLISM AND DISPOSITION, 2003, 31 (09) :1077-1080
[9]   In silico rationalization of the structural and physicochemical requirements for photobiological activity in angelicine derivatives and their heteroanalogues [J].
Giordanetto, F ;
Fossa, P ;
Menozzi, G ;
Mosti, L .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2003, 17 (01) :53-64
[10]   Estimation of aqueous solubility for a diverse set of organic compounds based on molecular topology [J].
Huuskonen, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (03) :773-777