Optimal approach for classification of acute leukemia subtypes based on gene expression data

被引:20
作者
Cho, JH
Lee, D
Park, JH
Kim, K
Lee, IB
机构
[1] Pohang Univ Sci & Technol, Dept Chem Engn, Pohang 790784, South Korea
[2] P&I Consulting Co Ltd, Pohang 790784, South Korea
[3] Jae I1 Hosp, Youngduk 766845, Kyungbook, South Korea
关键词
D O I
10.1021/bp025517o
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The classification of cancer subtypes, which is critical for successful treatment, has been studied extensively with the use of gene expression profiles from oligonucleotide chips or cDNA microarrays. Various pattern recognition methods have been successfully applied to gene expression data. However, these methods are not optimal, rather they are high-performance classifiers that emphasize only classification accuracy. In this paper, we propose an approach for the construction of the optimal linear classifier using gene expression data. Two linear classification methods, linear discriminant analysis (LDA) and discriminant partial least-squares (DPLS), are applied to distinguish acute leukemia subtypes. These methods are shown to give satisfactory accuracy. Moreover, we determined optimally the number of genes participating in the classification (a remarkably small number compared to previous results) on the basis of the statistical significance test. Thus, the proposed method constructs the optimal classifier that is composed of a small size predictor and provides high accuracy.
引用
收藏
页码:847 / 854
页数:8
相关论文
共 32 条
[1]   Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [J].
Alizadeh, AA ;
Eisen, MB ;
Davis, RE ;
Ma, C ;
Lossos, IS ;
Rosenwald, A ;
Boldrick, JG ;
Sabet, H ;
Tran, T ;
Yu, X ;
Powell, JI ;
Yang, LM ;
Marti, GE ;
Moore, T ;
Hudson, J ;
Lu, LS ;
Lewis, DB ;
Tibshirani, R ;
Sherlock, G ;
Chan, WC ;
Greiner, TC ;
Weisenburger, DD ;
Armitage, JO ;
Warnke, R ;
Levy, R ;
Wilson, W ;
Grever, MR ;
Byrd, JC ;
Botstein, D ;
Brown, PO ;
Staudt, LM .
NATURE, 2000, 403 (6769) :503-511
[2]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[3]  
ARTHUR DC, 1983, BLOOD, V61, P994
[4]  
BENDOR A, 2000, P 4 ANN INT C COMP M, P54
[5]   Gene expression data analysis [J].
Brazma, A ;
Vilo, J .
MICROBES AND INFECTION, 2001, 3 (10) :823-829
[6]   Fault diagnosis in chemical processes using Fisher discriminant analysis, discriminant partial least squares, and principal component analysis [J].
Chiang, LH ;
Russell, EL ;
Braatz, RD .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2000, 50 (02) :243-252
[7]  
CHOI S, 2001, AICHE ANN M
[8]   The use and misuse of chemometrics for treating classification problems [J].
Defernez, M ;
Kemsley, EK .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 1997, 16 (04) :216-221
[9]  
Duda R.O., 2001, Pattern Classification, V2nd
[10]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868