Hierarchical Generative Biclustering for MicroRNA Expression Analysis

Cited by: 10
Authors
Caldas, Jose [1]
Kaski, Samuel [1]
Affiliations
[1] Aalto Univ Sch Sci & Technol, Dept Informat & Comp Sci, Helsinki Inst Informat Technol, FI-00076 Aalto, Finland
Keywords
gene expression; machine learning; stochastic processes; trees; INTERACTS
DOI
10.1089/cmb.2010.0256
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
070307 [Chemical Biology]
Abstract
Clustering methods are a useful and common first step in gene expression studies, but the results may be hard to interpret. We bring in explicitly an indicator of which genes tie each cluster, changing the setup to biclustering. Furthermore, we make the indicators hierarchical, resulting in a hierarchy of progressively more specific biclusters. A non-parametric Bayesian formulation makes the model rigorous yet flexible and computations feasible. The model can additionally be used in information retrieval for relating relevant samples. We show that the model outperforms four other biclustering procedures on a large miRNA data set. We also demonstrate the model's added interpretability and information retrieval capability in a case study. Software is publicly available at http://research.ics.tkk.fi/mi/software/treebic/.
Pages: 251-261
Number of pages: 11