Cross-platform comparison and visualisation of gene expression data using co-inertia analysis -: art. no. 59

被引:102
作者
Culhane, AC [1 ]
Perrière, G
Higgins, DG
机构
[1] Natl Univ Ireland Univ Coll Cork, Dept Biochem, Biosci Inst, Cork, Ireland
[2] Univ Lyon 1, CNRS, UMR 5558, Lab Biometrie & Biol Evolut, F-69622 Villeurbanne, France
关键词
D O I
10.1186/1471-2105-4-59
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Rapid development of DNA microarray technology has resulted in different laboratories adopting numerous different protocols and technological platforms, which has severely impacted on the comparability of array data. Current cross-platform comparison of microarray gene expression data are usually based on cross-referencing the annotation of each gene transcript represented on the arrays, extracting a list of genes common to all arrays and comparing expression data of this gene subset. Unfortunately, filtering of genes to a subset represented across all arrays often excludes many thousands of genes, because different subsets of genes from the genome are represented on different arrays. We wish to describe the application of a powerful yet simple method for cross-platform comparison of gene expression data. Co-inertia analysis (CIA) is a multivariate method that identifies trends or co-relationships in multiple datasets which contain the same samples. CIA simultaneously finds ordinations (dimension reduction diagrams) from the datasets that are most similar. It does this by finding successive axes from the two datasets with maximum covariance. CIA can be applied to datasets where the number of variables (genes) far exceeds the number of samples (arrays) such is the case with microarray analyses. Results: We illustrate the power of CIA for cross-platform analysis of gene expression data by using it to identify the main common relationships in expression profiles on a panel of 60 tumour cell lines from the National Cancer Institute (NCI) which have been subjected to microarray studies using both Affymetrix and spotted cDNA array technology. The co-ordinates of the CIA projections of the cell lines from each dataset are graphed in a bi-plot and are connected by a line, the length of which indicates the divergence between the two datasets. Thus, CIA provides graphical representation of consensus and divergence between the gene expression profiles from different microarray platforms. Secondly, the genes that define the main trends in the analysis can be easily identified. Conclusions: CIA is a robust, efficient approach to coupling of gene expression datasets. CIA provides simple graphical representations of the results making it a particularly attractive method for the identification of relationships between large datasets.
引用
收藏
页数:15
相关论文
共 41 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Ball CA, 2002, SCIENCE, V298, P539
  • [3] Pharmacogenomic analysis: Correlating molecular substructure classes with microarray gene expression data
    Blower P.E.
    Yang C.
    Fligner M.A.
    Verducci J.S.
    Yu L.
    Richman S.
    Weinstein J.N.
    [J]. The Pharmacogenomics Journal, 2002, 2 (4) : 259 - 271
  • [4] MatchMiner: a tool for batch navigation among gene and gene product identifiers
    Bussey, KJ
    Kane, D
    Sunshine, M
    Narasimhan, S
    Nishizuka, S
    Reinhold, WC
    Zeeberg, B
    Ajay
    Weinstein, JN
    [J]. GENOME BIOLOGY, 2003, 4 (04)
  • [5] Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks
    Butte, AJ
    Tamayo, P
    Slonim, D
    Golub, TR
    Kohane, IS
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (22) : 12182 - 12186
  • [6] Between-group analysis of microarray data
    Culhane, AC
    Perrière, G
    Considine, EC
    Cotter, TG
    Higgins, DG
    [J]. BIOINFORMATICS, 2002, 18 (12) : 1600 - 1608
  • [7] Degen WGJ, 1996, INT J CANCER, V65, P460, DOI 10.1002/(SICI)1097-0215(19960208)65:4<460::AID-IJC12>3.0.CO
  • [8] 2-E
  • [9] SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data
    Diehn, M
    Sherlock, G
    Binkley, G
    Jin, H
    Matese, JC
    Hernandez-Boussard, T
    Rees, CA
    Cherry, JM
    Botstein, D
    Brown, PO
    Alizadeh, AA
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 219 - 223
  • [10] CO-INERTIA ANALYSIS - AN ALTERNATIVE METHOD FOR STUDYING SPECIES ENVIRONMENT RELATIONSHIPS
    DOLEDEC, S
    CHESSEL, D
    [J]. FRESHWATER BIOLOGY, 1994, 31 (03) : 277 - 294