Nonnegative Matrix Factorization: An Analytical and Interpretive Tool in Computational Biology

被引:291
作者
Devarajan, Karthik [1 ]
机构
[1] Fox Chase Canc Ctr, Div Populat Sci, Philadelphia, PA 19111 USA
关键词
D O I
10.1371/journal.pcbi.1000029
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In the last decade, advances in high-throughput technologies such as DNA microarrays have made it possible to simultaneously measure the expression levels of tens of thousands of genes and proteins. This has resulted in large amounts of biological data requiring analysis and interpretation. Nonnegative matrix factorization (NMF) was introduced as an unsupervised, parts-based learning paradigm involving the decomposition of a nonnegative matrix V into two nonnegative matrices, W and H, via a multiplicative updates algorithm. In the context of a pxn gene expression matrix V consisting of observations on p genes from n samples, each column of W defines a metagene, and each column of H represents the metagene expression pattern of the corresponding sample. NMF has been primarily applied in an unsupervised setting in image and natural language processing. More recently, it has been successfully utilized in a variety of applications in computational biology. Examples include molecular pattern discovery, class comparison and prediction, cross-platform and cross-species analysis, functional characterization of genes and biomedical informatics. In this paper, we review this method as a data analytical and interpretive tool in computational biology with an emphasis on these applications.
引用
收藏
页数:12
相关论文
共 77 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]   Independent component representations for face recognition [J].
Bartlett, MS ;
Lades, HM ;
Sejnowski, TJ .
HUMAN VISION AND ELECTRONIC IMAGING III, 1998, 3299 :528-539
[3]  
Behnke S, 2003, IEEE IJCNN, P2758
[4]   Metagenes and molecular pattern discovery using matrix factorization [J].
Brunet, JP ;
Tamayo, P ;
Golub, TR ;
Mesirov, JP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) :4164-4169
[5]   Color categories revealed by non-negative matrix factorization of Munsell color spectra [J].
Buchsbaum, G ;
Bloch, O .
VISION RESEARCH, 2002, 42 (05) :559-563
[6]   Application of non-negative and local non negative matrix factorization to facial expression recognition [J].
Buciu, I ;
Pitas, I .
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, 2004, :288-291
[7]   Biclustering of gene expression data by non-smooth non-negative matrix factorization [J].
Carmona-Saez, P ;
Pascual-Marqui, RD ;
Tirado, F ;
Carazo, JM ;
Pascual-Montano, A .
BMC BIOINFORMATICS, 2006, 7 (1)
[8]   High-resolution genomic profiles define distinct clinico-pathogenetic subgroups of multiple myeloma patients [J].
Carrasco, DR ;
Tonon, G ;
Huang, YS ;
Zhang, YY ;
Sinha, R ;
Bin, F ;
Stewart, JP ;
Zhan, FG ;
Khatry, D ;
Protopopova, M ;
Protopopov, A ;
Sukhdeo, K ;
Hanamura, I ;
Stephens, O ;
Barlogie, B ;
Anderson, KC ;
Chin, L ;
Shaughnessy, JD ;
Brennan, C ;
DePinho, RA .
CANCER CELL, 2006, 9 (04) :313-325
[9]   Discovering semantic features in the literature: a foundation for building functional associations [J].
Chagoyen, M ;
Carmona-Saez, P ;
Shatkay, H ;
Carazo, JM ;
Pascual-Montano, A .
BMC BIOINFORMATICS, 2006, 7 (1)
[10]  
CHEN X, 2000, P IEEE COMP SOC C CO, V1, P1126