Anticancer drug sensitivity prediction in cell lines from baseline gene expression through recursive feature selection

被引:122
作者
Dong, Zuoli [1 ]
Zhang, Naiqian [1 ]
Li, Chun [2 ]
Wang, Haiyun [3 ]
Fang, Yun [1 ]
Wang, Jun [1 ]
Zheng, Xiaoqi [1 ]
机构
[1] Shanghai Normal Univ, Dept Math, Shanghai, Peoples R China
[2] Bohai Univ, Dept Math, Jinzhou, Peoples R China
[3] Tongji Univ, Sch Life Sci & Technol, Dept Bioinformat, Shanghai 200092, Peoples R China
基金
中国国家自然科学基金;
关键词
Drug sensitivity prediction; Feature selection; Recursive feature elimination; CANCER; CHEMOSENSITIVITY; IDENTIFICATION;
D O I
10.1186/s12885-015-1492-6
中图分类号
R73 [肿瘤学];
学科分类号
100214 [肿瘤学];
摘要
Background: An enduring challenge in personalized medicine is to select right drug for individual patients. Testing drugs on patients in large clinical trials is one way to assess their efficacy and toxicity, but it is impractical to test hundreds of drugs currently under development. Therefore the preclinical prediction model is highly expected as it enables prediction of drug response to hundreds of cell lines in parallel. Methods: Recently, two large-scale pharmacogenomic studies screened multiple anticancer drugs on over 1000 cell lines in an effort to elucidate the response mechanism of anticancer drugs. To this aim, we here used gene expression features and drug sensitivity data in Cancer Cell Line Encyclopedia (CCLE) to build a predictor based on Support Vector Machine (SVM) and a recursive feature selection tool. Robustness of our model was validated by cross-validation and an independent dataset, the Cancer Genome Project (CGP). Results: Our model achieved good cross validation performance for most drugs in the Cancer Cell Line Encyclopedia (>= 80 % accuracy for 10 drugs, >= 75 % accuracy for 19 drugs). Independent tests on eleven common drugs between CCLE and CGP achieved satisfactory performance for three of them, i.e., AZD6244, Erlotinib and PD-0325901, using expression levels of only twelve, six and seven genes, respectively. Conclusions: These results suggest that drug response could be effectively predicted from genomic features. Our model could be applied to predict drug response for some certain drugs and potentially play a complementary role in personalized medicine.
引用
收藏
页数:12
相关论文
共 28 条
[1]
A balanced iterative random forest for gene selection from microarray data [J].
Anaissi, Ali ;
Kennedy, Paul J. ;
Goyal, Madhu ;
Catchpoole, Daniel R. .
BMC BIOINFORMATICS, 2013, 14
[2]
The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity (vol 483, pg 603, 2012) [J].
Barretina, Jordi ;
Caponigro, Giordano ;
Stransky, Nicolas ;
Venkatesan, Kavitha ;
Margolin, Adam A. ;
Kim, Sungjoon ;
Wilson, Christopher J. ;
Lehar, Joseph ;
Kryukov, Gregory V. ;
Sonkin, Dmitriy ;
Reddy, Anupama ;
Liu, Manway ;
Murray, Lauren ;
Berger, Michael F. ;
Monahan, John E. ;
Morais, Paula ;
Meltzer, Jodi ;
Korejwa, Adam ;
Jane-Valbuena, Judit ;
Mapa, Felipa A. ;
Thibault, Joseph ;
Bric-Furlong, Eva ;
Raman, Pichai ;
Shipway, Aaron ;
Engels, Ingo H. ;
Cheng, Jill ;
Yu, Guoying K. ;
Yu, Jianjun ;
Aspesi, Peter, Jr. ;
de Silva, Melanie ;
Jagtap, Kalpana ;
Jones, Michael D. ;
Wang, Li ;
Hatton, Charles ;
Palescandolo, Emanuele ;
Gupta, Supriya ;
Mahan, Scott ;
Sougnez, Carrie ;
Onofrio, Robert C. ;
Liefeld, Ted ;
MacConaill, Laura ;
Winckler, Wendy ;
Reich, Michael ;
Li, Nanxin ;
Mesirov, Jill P. ;
Gabriel, Stacey B. ;
Getz, Gad ;
Ardlie, Kristin ;
Chan, Vivien ;
Myer, Vic E. .
NATURE, 2012, 492 (7428) :290-290
[3]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]
Three Epigenetic Biomarkers, GDF15, TMEFF2, and VIM, Accurately Predict Bladder Cancer from DNA-Based Analyses of Urine Samples [J].
Costa, Vera L. ;
Henrique, Rui ;
Danielsen, Stine A. ;
Duarte-Pereira, Sara ;
Eknaes, Mette ;
Skotheim, Rolf I. ;
Rodrigues, Angelo ;
Magalhaes, Jose S. ;
Oliveira, Jorge ;
Lothe, Ragnhild A. ;
Teixeira, Manuel R. ;
Jeronimo, Carmen ;
Lind, Guro E. .
CLINICAL CANCER RESEARCH, 2010, 16 (23) :5842-5851
[5]
Costello JC, 2014, ESANN, V32, P1202
[6]
Designing personalised cancer treatments [J].
Cree, Ian A. .
JOURNAL OF CONTROLLED RELEASE, 2013, 172 (02) :405-409
[7]
Feature Selection based F-score and ACO Algorithm in Support Vector Machine [J].
Ding, Sheng .
2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 1, 2009, :19-23
[8]
Sprouty 2, an inhibitor of mitogen-activated protein kinase signaling, is down-regulated in hepatocellular carcinoma [J].
Fong, CW ;
Chua, MS ;
McKie, AB ;
Ling, SHM ;
Mason, L ;
Li, R ;
Yusoff, P ;
Lo, TL ;
Leung, HY ;
So, SKS ;
Guy, GR .
CANCER RESEARCH, 2006, 66 (04) :2048-2058
[9]
Systematic identification of genomic markers of drug sensitivity in cancer cells [J].
Garnett, Mathew J. ;
Edelman, Elena J. ;
Heidorn, Sonja J. ;
Greenman, Chris D. ;
Dastur, Anahita ;
Lau, King Wai ;
Greninger, Patricia ;
Thompson, I. Richard ;
Luo, Xi ;
Soares, Jorge ;
Liu, Qingsong ;
Iorio, Francesco ;
Surdez, Didier ;
Chen, Li ;
Milano, Randy J. ;
Bignell, Graham R. ;
Tam, Ah T. ;
Davies, Helen ;
Stevenson, Jesse A. ;
Barthorpe, Syd ;
Lutz, Stephen R. ;
Kogera, Fiona ;
Lawrence, Karl ;
McLaren-Douglas, Anne ;
Mitropoulos, Xeni ;
Mironenko, Tatiana ;
Thi, Helen ;
Richardson, Laura ;
Zhou, Wenjun ;
Jewitt, Frances ;
Zhang, Tinghu ;
O'Brien, Patrick ;
Boisvert, Jessica L. ;
Price, Stacey ;
Hur, Wooyoung ;
Yang, Wanjuan ;
Deng, Xianming ;
Butler, Adam ;
Choi, Hwan Geun ;
Chang, JaeWon ;
Baselga, Jose ;
Stamenkovic, Ivan ;
Engelman, Jeffrey A. ;
Sharma, Sreenath V. ;
Delattre, Olivier ;
Saez-Rodriguez, Julio ;
Gray, Nathanael S. ;
Settleman, Jeffrey ;
Futreal, P. Andrew ;
Haber, Daniel A. .
NATURE, 2012, 483 (7391) :570-U87
[10]
Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines [J].
Geeleher, Paul ;
Cox, Nancy J. ;
Huang, R. Stephanie .
GENOME BIOLOGY, 2014, 15 (03)