Improved recognition of figures containing fluorescence microscope images in online journal articles using graphical models

被引:18
作者
Qian, Yuntao [1 ,2 ,3 ]
Murphy, Robert F. [1 ,2 ,4 ,5 ]
机构
[1] Carnegie Mellon Univ, Ctr Bioimage Informat, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA 15213 USA
[3] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China
[4] Carnegie Mellon Univ, Dept Biol Sci, Pittsburgh, PA 15213 USA
[5] Carnegie Mellon Univ, Dept Biomed Engn, Pittsburgh, PA 15213 USA
关键词
D O I
10.1093/bioinformatics/btm561
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: There is extensive interest in automating the collection, organization and analysis of biological data. Data in the form of images in online literature present special challenges for such efforts. The first steps in understanding the contents of a figure are decomposing it into panels and determining the type of each panel. In biological literature, panel types include many kinds of images collected by different techniques, such as photographs of gels or images from microscopes. We have previously described the SLIF system (http://slif.cbi.cmu.edu) that identifies panels containing fluorescence microscope images among figures in online journal articles as a prelude to further analysis of the subcellular patterns in such images. This system contains a pretrained classifier that uses image features to assign a type (class) to each separate panel. However, the types of panels in a figure are often correlated, so that we can consider the class of a panel to be dependent not only on its own features but also on the types of the other panels in a figure. Results: In this article, we introduce the use of a type of probabilistic graphical model, a factor graph, to represent the structured information about the images in a figure, and permit more robust and accurate inference about their types. We obtain significant improvement over results for considering panels separately.
引用
收藏
页码:569 / 576
页数:8
相关论文
共 22 条
[11]  
Kou Z, 2003, P 3 ACM SIGKDD WORKS, P2
[12]   Factor graphs and the sum-product algorithm [J].
Kschischang, FR ;
Frey, BJ ;
Loeliger, HA .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2001, 47 (02) :498-519
[13]  
Minka T.P., 2001, P 17 C UNC ART INT, V17, P362, DOI [10.48550/arXiv.1301.2294, DOI 10.48550/ARXIV.1301.2294]
[14]  
Murphy R. F., 2004, P IASTED INT C KNOWL, P109
[15]   Release of phosphorus from sediments in Lake Biwa [J].
Murphy T. ;
Lawson A. ;
Kumagai M. ;
Nalewajko C. .
Limnology, 2001, 2 (2) :119-128
[16]  
Platt JC, 2000, ADV NEUR IN, P61
[17]   SOME GENERALIZED ORDER-DISORDER TRANSFORMATIONS [J].
POTTS, RB .
PROCEEDINGS OF THE CAMBRIDGE PHILOSOPHICAL SOCIETY, 1952, 48 (01) :106-109
[18]  
Rafkind B., 2006, P HLT NAACL BIONLP W, P73
[19]   Integrating image data into biomedical text categorization [J].
Shatkay, Hagit ;
Chen, Nawei ;
Blostein, Dorothea .
BIOINFORMATICS, 2006, 22 (14) :E446-E453
[20]  
Yedidia J.S., 2000, P 13 INT C NEURAL IN, P689