A decomposition model to track gene expression signatures: preview on observer-independent classification of ovarian cancer

被引:44
作者
Martoglio, AM
Miskin, JW
Smith, SK
MacKay, DJC
机构
[1] Univ Cambridge, Dept Pathol, Reprod Mol Res Grp, Cambridge CB2 1QP, England
[2] Univ Cambridge, Dept Obstet & Gynaecol, Cambridge CB2 1QP, England
[3] Univ Cambridge, Cavendish Lab, Cavendish Astrophys Grp, Cambridge CB3 0HE, England
关键词
D O I
10.1093/bioinformatics/18.12.1617
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A number of algorithms and analytical models have been employed to reduce the multidimensional complexity of DNA array data and attempt to extract some meaningful interpretation of the results. These include clustering, principal components analysis, self-organizing maps, and support vector machine analysis. Each method assumes an implicit model for the data, many of which separate genes into distinct clusters defined by similar expression profiles in the samples tested. A point of concern is that many genes may be involved in a number of distinct behaviours, and should therefore be modelled to fit into as many separate clusters as detected in the multidimensional gene expression space. The analysis of gene expression data using a decomposition model that is independent of the observer involved would be highly beneficial to improve standard and reproducible classification of clinical and research samples. Results: We present a variational independent component analysis (ICA) method for reducing high dimensional DNA array data to a smaller set of latent variables, each associated with a gene signature. We present the results of applying the method to data from an ovarian cancer study, revealing a number of tissue type-specific and tissue type-independent gene signatures present in varying amounts among the samples surveyed. The observer independent results of such molecular analysis of biological samples could help identify patients who would benefit from different treatment strategies. We further explore the application of the model to similar high-throughput studies.
引用
收藏
页码:1617 / 1624
页数:8
相关论文
共 16 条
[1]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[2]   Tissue inhibitors of matrix metalloproteinases in cancer [J].
Blavier, L ;
Henriet, P ;
Imren, S ;
DeClerck, YA .
INHIBITION OF MATRIX METALLOPROTEINASES: THERAPEUTIC APPLICATIONS, 1999, 878 :108-119
[3]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[4]   Exploring expression data: Identification and analysis of coexpressed genes [J].
Heyer, LJ ;
Kruglyak, S ;
Yooseph, S .
GENOME RESEARCH, 1999, 9 (11) :1106-1115
[5]   Generation of expression plasmids for angiostatin, endostatin and TIMP-2 for cancer gene therapy [J].
Indraccolo, S ;
Minuzzo, S ;
Gola, E ;
Habeler, W ;
Carrozzino, F ;
Noonan, D ;
Albini, A ;
Santi, L ;
Amadori, A ;
Chieco-Bianchi, L .
INTERNATIONAL JOURNAL OF BIOLOGICAL MARKERS, 1999, 14 (04) :251-256
[6]   Collagen and elastin degradation by matrix metalloproteinases and tissue inhibitors of matrix metalloproteinase in aortic dissection [J].
Ishii, T ;
Asuwa, N .
HUMAN PATHOLOGY, 2000, 31 (06) :640-646
[7]   Linear modes of gene expression determined by independent component analysis [J].
Liebermeister, W .
BIOINFORMATICS, 2002, 18 (01) :51-60
[8]   Changes in tumorigenesis- and angiogenesis-related gene transcript abundance profiles in ovarian cancer detected by tailored high density cDNA arrays [J].
Martoglio, AM ;
Tom, BDM ;
Starkey, M ;
Corps, AN ;
Charnock-Jones, DS ;
Smith, SK .
MOLECULAR MEDICINE, 2000, 6 (09) :750-765
[9]  
MARTOGLIO AM, 2000, THESIS U CAMBRIDGE
[10]  
MISKIN JW, 2001, THESIS U CAMBRIDGE