Multi-class cancer classification using multinomial probit regression with Bayesian gene selection

被引：18

作者：

Zhou, X

Wang, X ^{[1
]}

Dougherty, ER

机构：

[1] Texas A&M Univ, Dept Elect Engn, College Stn, TX 77843 USA

[2] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA

[3] Univ Texas, MD Anderson Canc Ctr, Dept Pathol, Houston, TX 77030 USA

来源：

IEE PROCEEDINGS SYSTEMS BIOLOGY | 2006年 / 153卷 / 02期

关键词：

D O I：

10.1049/ip-syb:20050015

中图分类号：

Q2 [细胞生物学];

学科分类号：

071009 [细胞生物学]; 090102 [作物遗传育种];

摘要：

We consider the problems of multi-class cancer classification from gene expression data. After discussing the multinomial probit regression model with Bayesian gene selection, we propose two Bayesian gene selection schemes: one employs different strongest genes for different probit regressions; the other employs the same strongest genes for all regressions. Some fast implementation issues for Bayesian gene selection are discussed, including preselection of the strongest genes and recursive computation of the estimation errors using QR decomposition. The proposed gene selection techniques are applied to analyse real breast cancer data, small round blue-cell tumours, the national cancer institute's anti-cancer drug-screen data and acute leukaemia data. Compared with existing multi-class cancer classifications, our proposed methods can find which genes are the most important genes affecting which kind of cancer. Also, the strongest genes selected using our methods are consistent with the biological significance. The recognition accuracies are very high using our proposed methods.

引用

页码：70 / 78

页数：9

共 21 条

[1]

BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA [J].