Finding genes in the C2C12 osteogenic pathway by k-nearest-neighbor classification of expression data

Cited by: 55
Authors
Theilhaber, J [1]
Connolly, T
Roman-Roman, S
Bushnell, S
Jackson, A
Call, K
Garcia, T
Baron, R
Affiliations
[1] Aventis Pharmaceut, Cambridge Genom Ctr, Cambridge, MA 02139 USA
[2] Aventis Pharmaceut, Bone Dis Grp, F-93235 Romainville, France
[3] CuraGen Corp, New Haven, CT 06511 USA
DOI
10.1101/gr.182601
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular biology];
Discipline classification code
071010; 081704;
Abstract
A supervised classification scheme for analyzing microarray expression data, based on the k-nearest-neighbor method coupled to noise-reduction filters, has been used to find genes involved in the osteogenic pathway of the mouse C2C12 cell line, studied here as a model for in vivo osteogenesis. The scheme uses as input a training set embodying expert biological knowledge, and provides internal estimates of its own misclassification errors, which in turn enable systematic optimization of the classifier parameters. On the basis of the C2C12-generated expression data set with 34,130 expression profiles across 2 time courses, each comprising 6 points, and a training set containing known members of the osteogenic, myoblastic, and adipocytic pathways, 176 new genes in addition to 28 originally in the training set are selected as relevant to osteogenesis. For this selection, the estimated sensitivity is 42% and the posterior false-positive rate (fraction of candidates that are spurious) is 12%. The corresponding sensitivity and false-positive rate for detection of myoblastic genes are 9% and 31%, respectively, and only 4% and ~100%, respectively, for adipocytic genes, in accordance with an experimental design that predominantly stimulated the osteogenic pathway. Validation of this selection is provided by examining expression of the genes in an independent biological assay involving mouse calvaria (skull bone) primary cell cultures, in which a large fraction of the 176 genes are seen to be strongly regulated, as well as by case-by-case analysis of the genes on the basis of expert domain knowledge. The methodology should be generalizable to any situation in which enough a priori biological knowledge exists to define a training set.
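As a rough illustration of the kind of scheme the abstract describes, the sketch below classifies 6-point expression profiles by k-nearest-neighbor voting and uses leave-one-out cross-validation on the training set to estimate sensitivity and the posterior false-positive rate (fraction of candidates that are spurious). It is not the authors' implementation: the Pearson-correlation distance, the leave-one-out procedure, the function names, and the toy profiles are all assumptions made for illustration, and the noise-reduction filters mentioned in the abstract are omitted.

```python
import math
from collections import Counter


def pearson_distance(a, b):
    """1 - Pearson correlation. An assumed metric, not taken from the paper."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 1.0  # flat profile: treat as uncorrelated
    return 1.0 - cov / math.sqrt(var_a * var_b)


def knn_predict(profile, training, k=3):
    """Majority vote among the k training profiles nearest to `profile`."""
    nearest = sorted(training, key=lambda item: pearson_distance(profile, item[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def leave_one_out(training, target, k=3):
    """Estimate (sensitivity, posterior false-positive rate) for class `target`."""
    tp = fp = fn = 0
    for i, (profile, label) in enumerate(training):
        rest = training[:i] + training[i + 1:]  # hold out one labeled profile
        predicted = knn_predict(profile, rest, k)
        if predicted == target and label == target:
            tp += 1
        elif predicted == target:
            fp += 1
        elif label == target:
            fn += 1
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    # Fraction of predicted positives that are spurious.
    false_positive_rate = fp / (tp + fp) if tp + fp else 0.0
    return sensitivity, false_positive_rate


# Synthetic 6-point profiles, invented purely for this example.
TOY_TRAINING = [
    ([0, 1, 2, 3, 4, 5], "osteogenic"),
    ([0, 1, 1, 3, 4, 6], "osteogenic"),
    ([1, 2, 3, 3, 5, 6], "osteogenic"),
    ([5, 4, 3, 2, 1, 0], "myoblastic"),
    ([6, 4, 3, 3, 1, 0], "myoblastic"),
    ([5, 5, 3, 2, 2, 0], "myoblastic"),
    ([0, 3, 5, 5, 3, 0], "adipocytic"),
    ([1, 3, 6, 5, 2, 0], "adipocytic"),
    ([0, 2, 5, 6, 3, 1], "adipocytic"),
]

if __name__ == "__main__":
    print(knn_predict([0, 2, 3, 5, 6, 9], TOY_TRAINING, k=3))
    print(leave_one_out(TOY_TRAINING, "osteogenic", k=3))
```

Because the error estimates come from the training set itself, parameters such as `k` could in principle be tuned by repeating `leave_one_out` over a range of values and keeping the setting with the best trade-off between sensitivity and false-positive rate.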
Pages: 165-176
Number of pages: 12