Multi-class cancer classification using multinomial probit regression with Bayesian gene selection
被引:18
作者:
Zhou, X
论文数: 0引用数: 0
h-index: 0
机构:Texas A&M Univ, Dept Elect Engn, College Stn, TX 77843 USA
Zhou, X
Wang, X
论文数: 0引用数: 0
h-index: 0
机构:
Texas A&M Univ, Dept Elect Engn, College Stn, TX 77843 USATexas A&M Univ, Dept Elect Engn, College Stn, TX 77843 USA
Wang, X
[1
]
Dougherty, ER
论文数: 0引用数: 0
h-index: 0
机构:Texas A&M Univ, Dept Elect Engn, College Stn, TX 77843 USA
Dougherty, ER
机构:
[1] Texas A&M Univ, Dept Elect Engn, College Stn, TX 77843 USA
[2] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
[3] Univ Texas, MD Anderson Canc Ctr, Dept Pathol, Houston, TX 77030 USA
来源:
IEE PROCEEDINGS SYSTEMS BIOLOGY
|
2006年
/
153卷
/
02期
关键词:
D O I:
10.1049/ip-syb:20050015
中图分类号:
Q2 [细胞生物学];
学科分类号:
071009 [细胞生物学];
090102 [作物遗传育种];
摘要:
We consider the problems of multi-class cancer classification from gene expression data. After discussing the multinomial probit regression model with Bayesian gene selection, we propose two Bayesian gene selection schemes: one employs different strongest genes for different probit regressions; the other employs the same strongest genes for all regressions. Some fast implementation issues for Bayesian gene selection are discussed, including preselection of the strongest genes and recursive computation of the estimation errors using QR decomposition. The proposed gene selection techniques are applied to analyse real breast cancer data, small round blue-cell tumours, the national cancer institute's anti-cancer drug-screen data and acute leukaemia data. Compared with existing multi-class cancer classifications, our proposed methods can find which genes are the most important genes affecting which kind of cancer. Also, the strongest genes selected using our methods are consistent with the biological significance. The recognition accuracies are very high using our proposed methods.