A graphical model approach to automated classification of protein subcellular location patterns in multi-cell images

被引:18
作者
Chen, Shann-Ching
Murphy, Robert F. [1 ]
机构
[1] Carnegie Mellon Univ, Dept Biomed Engn, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Ctr Bioimage Informat, Pittsburgh, PA 15213 USA
[3] Carnegie Mellon Univ, Dept Biol Sci, Pittsburgh, PA 15213 USA
[4] Carnegie Mellon Univ, Ctr Automated Learning & Discovery, Pittsburgh, PA 15213 USA
关键词
D O I
10.1186/1471-2105-7-90
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Knowledge of the subcellular location of a protein is critical to understanding how that protein works in a cell. This location is frequently determined by the interpretation of fluorescence microscope images. In recent years, automated systems have been developed for consistent and objective interpretation of such images so that the protein pattern in a single cell can be assigned to a known location category. While these systems perform with nearly perfect accuracy for single cell images of all major subcellular structures, their ability to distinguish subpatterns of an organelle ( such as two Golgi proteins) is not perfect. Our goal in the work described here was to improve the ability of an automated system to decide which of two similar patterns is present in a field of cells by considering more than one cell at a time. Since cells displaying the same location pattern are often clustered together, considering multiple cells may be expected to improve discrimination between similar patterns. Results: We describe how to take advantage of information on experimental conditions to construct a graphical representation for multiple cells in a field. Assuming that a field is composed of a small number of classes, the classification accuracy can be improved by allowing the computed probability of each pattern for each cell to be influenced by the probabilities of its neighboring cells in the model. We describe a novel way to allow this influence to occur, in which we adjust the prior probabilities of each class to reflect the patterns that are present. When this graphical model approach is used on synthetic multi-cell images in which the true class of each cell is known, we observe that the ability to distinguish similar classes is improved without suffering any degradation in ability to distinguish dissimilar classes. The computational complexity of the method is sufficiently low that improved assignments of classes can be obtained for fields of twelve cells in under 0.04 second on a 1600 megahertz processor. Conclusion: We demonstrate that graphical models can be used to improve the accuracy of classification of subcellular patterns in multi-cell fluorescence microscope images. We also describe a novel algorithm for inferring classes from a graphical model. The performance and speed suggest that the method will be particularly valuable for analysis of images from high-throughput microscopy. We also anticipate that it will be useful for analyzing the mixtures of cell types typically present in images of tissues. Lastly, we anticipate that the method can be generalized to other problems.
引用
收藏
页数:13
相关论文
共 28 条
  • [1] Boland MV, 1998, CYTOMETRY, V33, P366
  • [2] A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells
    Boland, MV
    Murphy, RF
    [J]. BIOINFORMATICS, 2001, 17 (12) : 1213 - 1223
  • [3] Chen X, 2004, P ANN INT IEEE EMBS, V26, P1632
  • [4] Objective clustering of proteins based on subcellular location patterns
    Chen, X
    Murphy, RF
    [J]. JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2005, (02): : 87 - 95
  • [5] Prediction and classification of protein subcellular location - Sequence-order effect and pseudo amino acid composition
    Chou, KC
    Cai, YD
    [J]. JOURNAL OF CELLULAR BIOCHEMISTRY, 2003, 90 (06) : 1250 - 1260
  • [6] Automatic identification of subcellular phenotypes on human cell arrays
    Conrad, C
    Erfle, H
    Warnat, P
    Daigle, N
    Lörch, T
    Ellenberg, J
    Pepperkok, R
    Eils, R
    [J]. GENOME RESEARCH, 2004, 14 (06) : 1130 - 1136
  • [7] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [8] Duda R., 1973, PATTERN CLASSIFICATI, V1st ed., P482
  • [9] Felzenszwalb PR, 2004, PROC CVPR IEEE, P261
  • [10] Global analysis of protein expression in yeast
    Ghaemmaghami, S
    Huh, W
    Bower, K
    Howson, RW
    Belle, A
    Dephoure, N
    O'Shea, EK
    Weissman, JS
    [J]. NATURE, 2003, 425 (6959) : 737 - 741