Measuring gene expression divergence: the distance to keep

被引:20
作者
Glazko, Galina [1 ]
Mushegian, Arcady [2 ,3 ]
机构
[1] Univ Rochester, Med Ctr, Dept Biostat & Computat Biol, Rochester, NY 14642 USA
[2] Stowers Inst Med Res, Kansas City, MO 64110 USA
[3] Univ Kansas, Dept Microbiol Mol Genet & Immunol, Med Ctr, Kansas City, KS 66160 USA
关键词
EVOLUTION; NORMALIZATION; PROFILES;
D O I
10.1186/1745-6150-5-51
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Gene expression divergence is a phenotypic trait reflecting evolution of gene regulation and characterizing dissimilarity between species and between cells and tissues within the same species. Several distance measures, such as Euclidean and correlation-based distances have been proposed for measuring expression divergence. Results: We show that different distance measures identify different trends in gene expression patterns. When comparing orthologous genes in eight rat and human tissues, the Euclidean distance identified genes uniformly expressed in all tissues near the expression background as genes with the most conserved expression pattern. In contrast, correlation-based distance and generalized average distance identified genes with concerted changes among homologous tissues as those most conserved. On the other hand, correlation-based distance, Euclidean distance and generalized-average distance highlight quite well the relatively high similarity of gene expression patterns in homologous tissues between species, compared to non-homologous tissues within species. Conclusions: Different trends exist in the high-dimensional numeric data, and to highlight a particular trend an appropriate distance measure needs to be chosen. The choice of the distance measure for measuring expression divergence can be dictated by the expression patterns that are of interest in a particular study. Reviewers: This article was reviewed by Mikhail Gelfand, Eugene Koonin and Subhajyoti De (nominated by Sarah Teichmann).
引用
收藏
页数:10
相关论文
共 22 条
  • [1] GOstat: find statistically overrepresented Gene Ontologies within a group of genes
    Beissbarth, T
    Speed, TP
    [J]. BIOINFORMATICS, 2004, 20 (09) : 1464 - 1465
  • [2] A Phylogenetic Mixture Model for the Evolution of Gene Expression
    Eng, Kevin H.
    Bravo, Hector Corrada
    Keles, Suenduez
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2009, 26 (10) : 2363 - 2372
  • [3] Heterogeneous Patterns of Gene-Expression Diversification in Mammalian Gene Duplicates
    Farre, Domenec
    Mar Alba, M.
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2010, 27 (02) : 325 - 335
  • [4] Homology - a personal view on some of the problems
    Fitch, WM
    [J]. TRENDS IN GENETICS, 2000, 16 (05) : 227 - 231
  • [5] DISTINGUISHING HOMOLOGOUS FROM ANALOGOUS PROTEINS
    FITCH, WM
    [J]. SYSTEMATIC ZOOLOGY, 1970, 19 (02): : 99 - &
  • [6] Interpreting expression profiles of cancers by genome-wide survey of breadth of expression in normal tissues
    Ge, XJ
    Yamamoto, S
    Tsutsumi, S
    Midorikawa, Y
    Ihara, S
    Wang, SM
    Aburatani, H
    [J]. GENOMICS, 2005, 86 (02) : 127 - 141
  • [7] The choice of optimal distance measure in genome-wide datasets
    Glazko, G
    Gordon, A
    Mushegian, A
    [J]. BIOINFORMATICS, 2005, 21 : 2 - 10
  • [8] Gusfield D., 1999, ALGORITHMS STRINGS T
  • [9] Exploration, normalization, and summaries of high density oligonucleotide array probe level data
    Irizarry, RA
    Hobbs, B
    Collin, F
    Beazer-Barclay, YD
    Antonellis, KJ
    Scherf, U
    Speed, TP
    [J]. BIOSTATISTICS, 2003, 4 (02) : 249 - 264
  • [10] Evolutionary significance of gene expression divergence
    Jordan, IK
    Mariño-Ramírez, L
    Koonin, EV
    [J]. GENE, 2005, 345 (01) : 119 - 126