Cross-platform comparison and visualisation of gene expression data using co-inertia analysis -: art. no. 59

被引:102
作者
Culhane, AC [1 ]
Perrière, G
Higgins, DG
机构
[1] Natl Univ Ireland Univ Coll Cork, Dept Biochem, Biosci Inst, Cork, Ireland
[2] Univ Lyon 1, CNRS, UMR 5558, Lab Biometrie & Biol Evolut, F-69622 Villeurbanne, France
关键词
D O I
10.1186/1471-2105-4-59
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Rapid development of DNA microarray technology has resulted in different laboratories adopting numerous different protocols and technological platforms, which has severely impacted on the comparability of array data. Current cross-platform comparison of microarray gene expression data are usually based on cross-referencing the annotation of each gene transcript represented on the arrays, extracting a list of genes common to all arrays and comparing expression data of this gene subset. Unfortunately, filtering of genes to a subset represented across all arrays often excludes many thousands of genes, because different subsets of genes from the genome are represented on different arrays. We wish to describe the application of a powerful yet simple method for cross-platform comparison of gene expression data. Co-inertia analysis (CIA) is a multivariate method that identifies trends or co-relationships in multiple datasets which contain the same samples. CIA simultaneously finds ordinations (dimension reduction diagrams) from the datasets that are most similar. It does this by finding successive axes from the two datasets with maximum covariance. CIA can be applied to datasets where the number of variables (genes) far exceeds the number of samples (arrays) such is the case with microarray analyses. Results: We illustrate the power of CIA for cross-platform analysis of gene expression data by using it to identify the main common relationships in expression profiles on a panel of 60 tumour cell lines from the National Cancer Institute (NCI) which have been subjected to microarray studies using both Affymetrix and spotted cDNA array technology. The co-ordinates of the CIA projections of the cell lines from each dataset are graphed in a bi-plot and are connected by a line, the length of which indicates the divergence between the two datasets. Thus, CIA provides graphical representation of consensus and divergence between the gene expression profiles from different microarray platforms. Secondly, the genes that define the main trends in the analysis can be easily identified. Conclusions: CIA is a robust, efficient approach to coupling of gene expression datasets. CIA provides simple graphical representations of the results making it a particularly attractive method for the identification of relationships between large datasets.
引用
收藏
页数:15
相关论文
共 41 条
  • [31] Systematic variation in gene expression patterns in human cancer cell lines
    Ross, DT
    Scherf, U
    Eisen, MB
    Perou, CM
    Rees, C
    Spellman, P
    Iyer, V
    Jeffrey, SS
    Van de Rijn, M
    Waltham, M
    Pergamenschikov, A
    Lee, JCE
    Lashkari, D
    Shalon, D
    Myers, TG
    Weinstein, JN
    Botstein, D
    Brown, PO
    [J]. NATURE GENETICS, 2000, 24 (03) : 227 - 235
  • [32] A gene expression database for the molecular pharmacology of cancer
    Scherf, U
    Ross, DT
    Waltham, M
    Smith, LH
    Lee, JK
    Tanabe, L
    Kohn, KW
    Reinhold, WC
    Myers, TG
    Andrews, DT
    Scudiero, DA
    Eisen, MB
    Sausville, EA
    Pommier, Y
    Botstein, D
    Brown, PO
    Weinstein, JN
    [J]. NATURE GENETICS, 2000, 24 (03) : 236 - 244
  • [33] Comparison of five antibodies as markers in the diagnosis of melanoma in cytologic preparations
    Sheffield, MV
    Yee, H
    Dorvault, CC
    Weilbaecher, KN
    Eltoum, IA
    Siegal, GP
    Fisher, DE
    Chhieng, DC
    [J]. AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2002, 118 (06) : 930 - 936
  • [34] Chemosensitivity prediction by transcriptional profiling
    Staunton, JE
    Slonim, DK
    Coller, HA
    Tamayo, P
    Angelo, MJ
    Park, J
    Scherf, U
    Lee, JK
    Reinhold, WO
    Weinstein, JN
    Mesirov, JP
    Lander, ES
    Golub, TR
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (19) : 10787 - 10792
  • [35] TERBRAAK CJF, 1986, ECOLOGY, V67, P1167
  • [36] Epithelial-mesenchymal transitions in tumour progression
    Thiery, JP
    [J]. NATURE REVIEWS CANCER, 2002, 2 (06) : 442 - 454
  • [37] ADE-4: A multivariate analysis and graphical display software
    Thioulouse, J
    Chessel, D
    Doledec, S
    Olivier, JM
    [J]. STATISTICS AND COMPUTING, 1997, 7 (01) : 75 - 83
  • [38] THIOULOUSE J, 1995, COMPUT APPL BIOSCI, V11, P321
  • [39] THIOULOUSE J, 1996, BIOMETRIC B, V13, P10
  • [40] Missing value estimation methods for DNA microarrays
    Troyanskaya, O
    Cantor, M
    Sherlock, G
    Brown, P
    Hastie, T
    Tibshirani, R
    Botstein, D
    Altman, RB
    [J]. BIOINFORMATICS, 2001, 17 (06) : 520 - 525