Correlating Information Contents of Gene Ontology Terms to Infer Semantic Similarity of Gene Products

被引:8
作者
Gan, Mingxin [1 ]
机构
[1] Univ Sci & Technol Beijing, Dongling Sch Econ & Management, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
DATABASE; NETWORK; TOOL;
D O I
10.1155/2014/891842
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Successful applications of the gene ontology to the inference of functional relationships between gene products in recent years have raised the need for computational methods to automatically calculate semantic similarity between gene products based on semantic similarity of gene ontology terms. Nevertheless, existing methods, though having been widely used in a variety of applications, may significantly overestimate semantic similarity between genes that are actually not functionally related, thereby yielding misleading results in applications. To overcome this limitation, we propose to represent a gene product as a vector that is composed of information contents of gene ontology terms annotated for the gene product, and we suggest calculating similarity between two gene products as the relatedness of their corresponding vectors using three measures: Pearson's correlation coefficient, cosine similarity, and the Jaccard index. We focus on the biological process domain of the gene ontology and annotations of yeast proteins to study the effectiveness of the proposed measures. Results show that semantic similarity scores calculated using the proposed measures are more consistent with known biological knowledge than those derived using a list of existing methods, suggesting the effectiveness of our method in characterizing functional relationships between gene products.
引用
收藏
页数:9
相关论文
共 25 条
[1]  
[Anonymous], 1997, P 10 RES COMPUTATION
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[4]   Integrating human omics data to prioritize candidate genes [J].
Chen, Yong ;
Wu, Xuebing ;
Jiang, Rui .
BMC MEDICAL GENOMICS, 2013, 6
[5]   Identifying potential cancer driver genes by genomic data integration [J].
Chen, Yong ;
Hao, Jingjing ;
Jiang, Wei ;
He, Tong ;
Zhang, Xuegong ;
Jiang, Tao ;
Jiang, Rui .
SCIENTIFIC REPORTS, 2013, 3
[6]   Saccharomyces Genome Database: the genomics resource of budding yeast [J].
Cherry, J. Michael ;
Hong, Eurie L. ;
Amundsen, Craig ;
Balakrishnan, Rama ;
Binkley, Gail ;
Chan, Esther T. ;
Christie, Karen R. ;
Costanzo, Maria C. ;
Dwight, Selina S. ;
Engel, Stacia R. ;
Fisk, Dianna G. ;
Hirschman, Jodi E. ;
Hitz, Benjamin C. ;
Karra, Kalpana ;
Krieger, Cynthia J. ;
Miyasato, Stuart R. ;
Nash, Rob S. ;
Park, Julie ;
Skrzypek, Marek S. ;
Simison, Matt ;
Weng, Shuai ;
Wong, Edith D. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D700-D705
[7]   Controlled vocabularies and semantics in systems biology [J].
Courtot, Melanie ;
Juty, Nick ;
Knuepfer, Christian ;
Waltemath, Dagmar ;
Zhukova, Anna ;
Draeger, Andreas ;
Dumontier, Michel ;
Finney, Andrew ;
Golebiewski, Martin ;
Hastings, Janna ;
Hoops, Stefan ;
Keating, Sarah ;
Kell, Douglas B. ;
Kerrien, Samuel ;
Lawson, James ;
Lister, Allyson ;
Lu, James ;
Machne, Rainer ;
Mendes, Pedro ;
Pocock, Matthew ;
Rodriguez, Nicolas ;
Villeger, Alice ;
Wilkinson, Darren J. ;
Wimalaratne, Sarala ;
Laibe, Camille ;
Hucka, Michael ;
Le Novere, Nicolas .
MOLECULAR SYSTEMS BIOLOGY, 2011, 7
[8]   Measuring semantic similarity between Gene Ontology terms [J].
Couto, Francisco M. ;
Silva, Mario J. ;
Coutinho, Pedro M. .
DATA & KNOWLEDGE ENGINEERING, 2007, 61 (01) :137-152
[9]   From Ontology to Semantic Similarity: Calculation of Ontology-Based Semantic Similarity [J].
Gan, Mingxin ;
Dou, Xue ;
Jiang, Rui .
SCIENTIFIC WORLD JOURNAL, 2013,
[10]   Network motif identification in stochastic networks [J].
Jiang, Rui ;
Tu, Zhidong ;
Chen, Ting ;
Sun, Fengzhu .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (25) :9404-9409