Integrative mining of traditional Chinese medicine literature and MEDLINE for functional gene networks

被引:64
作者
Zhou, Xuezhong [1 ]
Liu, Baoyan
Wu, Zhaohui
Feng, Yi
机构
[1] Chinese Med Sci, China Acad, Beijing 100700, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China
[3] Chinese Med Sci, China Acad, Guanganmen Hosp, Beijing 100053, Peoples R China
基金
中国博士后科学基金;
关键词
integrative data mining; functional gene network; traditional Chinese medicine literature; MEDLINE; text mining;
D O I
10.1016/j.artmed.2007.07.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: The amount of biomedical data in different disciplines is growing at an exponential rate. Integrating these significant knowledge sources to generate novel hypotheses for systems biology research is difficult. Traditional Chinese medicine (TCM) is a completely different discipline, and is a complementary knowledge system to modern biomedical science. This paper uses a significant TCM bibliographic literature database in China, together with MEDLINE, to help discover novel gene functional knowledge. Materials and methods: We present an integrative mining approach to uncover the functional gene relationships from MEDLINE and TCM bibliographic literature. This paper introduces TCM Literature (about 50,000 records) as one knowledge source for constructing Literature-based gene networks. We use the TCM diagnosis, TCM syndrome, to automatically congregate the related genes. The syndrome-gene relationships are discovered based on the syndrome-disease relationships extracted from TCM literature and the disease-gene relationships in MEDLINE. Based on the bubble-bootstrapping and relation weight computing methods, we have developed a prototype system called MeDisco/3S, which has name entity and relation extraction, and online analytical processing (OLAP) capabilities, to perform the integrative mining process. Results: We have got about 200,000 syndrome-gene relations, which could help generate syndrome-based gene networks, and help analyze the functional knowledge of genes from syndrome perspective. We take the gene network of Kidney-Yang Deficiency syndrome (KYD syndrome) and the functional analysis of some genes, such as CRH (corticotropin releasing hormone), PTH (parathyroid hormone), PRL (protactin), BRCA1 (breast cancer 1, early onset) and BRCA2 (breast cancer 2, early onset), to demonstrate the preliminary results. The underlying hypothesis is that the related genes of the same syndrome will have some biological functional relationships, and will constitute a functional network. Conclusion: This paper presents an approach to integrate TCM literature and modern biomedical data to discover novel gene networks and functional knowledge of genes. The preliminary results show that the novel gene functional knowledge and gene networks, which are worthy of further investigation, could be generated by integrating the two complementary biomedical data sources. It wilt be a promising research field through integrative mining of TCM and modern life science literature. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:87 / 104
页数:18
相关论文
共 69 条
[1]   A literature based method for identifying gene-disease connections [J].
Adamic, LA ;
Wilkinson, D ;
Huberman, BA ;
Adar, E .
CSB2002: IEEE COMPUTER SOCIETY BIOINFORMATICS CONFERENCE, 2002, :109-117
[2]   Using literature and data to learn Bayesian networks as clinical models of ovarian tumors [J].
Antal, P ;
Fannes, G ;
Timmerman, D ;
Moreau, Y ;
De Moor, B .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2004, 30 (03) :257-281
[3]  
*BEIJ U TRAD CHIN, 2002, BAS THEOR TRAD CHIN
[4]  
Blaschke C, 1999, Proc Int Conf Intell Syst Mol Biol, P60
[5]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[6]  
Brin S, 1999, LECT NOTES COMPUT SC, V1590, P172
[7]   Comparative experiments on learning information extractors for proteins and their interactions [J].
Bunescu, R ;
Ge, RF ;
Kate, RJ ;
Marcotte, EM ;
Mooney, RJ ;
Ramani, AK ;
Wong, YW .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2005, 33 (02) :139-155
[8]  
Bunescu RR, 2006, P HLT NAACL BIONLP W, P49, DOI DOI 10.3115/1654415.1654424
[9]  
Chen JX, 2006, PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON VEHICULAR ELECTRONICS AND SAFETY, P367
[10]  
CHEN K, 2006, TRADIT CHIN MED MAT, V8, P1