T-HOD: a literature-based candidate gene database for hypertension, obesity and diabetes

被引:35
作者
Dai, Hong-Jie [1 ]
Wu, Johnny Chi-Yang [1 ]
Tsai, Richard Tzong-Han [2 ]
Pan, Wen-Harn [3 ]
Hsu, Wen-Lian [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Intelligent Agent Syst Lab, Taipei, Taiwan
[2] Yuan Ze Univ, Dept Comp Sci & Engn, Tao Yuan, Taiwan
[3] Natl Hlth Res Inst, Inst Populat Hlth Sci, Div Prevent Med & Hlth Serv Res, Taipei, Taiwan
来源
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2013年
关键词
BIOMEDICAL LITERATURE; ASSOCIATION; NORMALIZATION; METHYLATION; INTEGRATION; EXTRACTION; DISEASE;
D O I
10.1093/database/bas061
中图分类号
Q [生物科学];
学科分类号
090105 [作物生产系统与生态工程];
摘要
Researchers are finding it more and more difficult to follow the changing status of disease candidate genes due to the exponential increase in gene mapping studies. The Text-mined Hypertension, Obesity and Diabetes candidate gene database (T-HOD) is developed to help trace existing research on three kinds of cardiovascular diseases: hypertension, obesity and diabetes, with the last disease categorized into Type 1 and Type 2, by regularly and semiautomatically extracting HOD-related genes from newly published literature. Currently, there are 837, 835 and 821 candidate genes recorded in T-HOD for hypertension, obesity and diabetes, respectively. T-HOD employed the state-of-art text-mining technologies, including a gene/disease identification system and a disease-gene relation extraction system, which can be used to affirm the association of genes with three diseases and provide more evidence for further studies. The primary inputs of T-HOD are the three kinds of diseases, and the output is a list of disease-related genes that can be ranked based on their number of appearance, protein-protein interactions and single-nucleotide polymorphisms. Unlike manually constructed disease gene databases, the content of T-HOD is regularly updated by our text-mining system and verified by domain experts. The interface of T-HOD facilitates easy browsing for users and allows T-HOD curators to verify data efficiently. We believe that T-HOD can help life scientists in search for more disease candidate genes in a less time-and effort-consuming manner.
引用
收藏
页数:8
相关论文
共 19 条
[1]
T2D-Db: An integrated platform to study the molecular basis of Type 2 diabetes [J].
Agrawal, Shipra ;
Dimitrova, Nevenka ;
Nathan, Prasanthi ;
Udayakumar, K. ;
Lakshmi, S. Sai ;
Sriram, S. ;
Manjusha, N. ;
Sengupta, Urmi .
BMC GENOMICS, 2008, 9 (1)
[2]
Arighi C.N., 2012, P 2012 BIOCREATIVE W
[3]
Becker KG, 2004, NAT GENET, V36, P431, DOI 10.1038/ng0504-431
[4]
Dai H.-J., 2011, P 5 INT JOINT C NAT, P846
[5]
Integration of gene normalization stages and co-reference resolution using a Markov logic network [J].
Dai, Hong-Jie ;
Chang, Yen-Ching ;
Tsai, Richard Tzong-Han ;
Hsu, Wen-Lian .
BIOINFORMATICS, 2011, 27 (18) :2586-2594
[6]
Multistage Gene Normalization and SVM-Based Ranking for Protein Interactor Extraction in Full-Text Articles [J].
Dai, Hong-Jie ;
Lai, Po-Ting ;
Tsai, Richard Tzong-Han .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (03) :412-420
[7]
Hypertension genetics, single nucleotide polymorphisms, and the common disease: Common variant hypothesis [J].
Doris, PA .
HYPERTENSION, 2002, 39 (02) :323-331
[8]
MeInfoText 2.0: gene methylation and cancer relation extraction from biomedical literature [J].
Fang, Yu-Ching ;
Lai, Po-Ting ;
Dai, Hong-Jie ;
Hsu, Wen-Lian .
BMC BIOINFORMATICS, 2011, 12
[9]
TCMGeneDIT: a database for associated traditional Chinese medicine, gene and disease information using text mining [J].
Fang, Yu-Ching ;
Huang, Hsuan-Cheng ;
Chen, Hsin-Hsi ;
Juan, Hsueh-Fen .
BMC COMPLEMENTARY AND ALTERNATIVE MEDICINE, 2008, 8 (1)
[10]
Microarray technology and applications in the arena of genome-wide association [J].
Grant, Struan F. A. ;
Hakonarson, Hakon .
CLINICAL CHEMISTRY, 2008, 54 (07) :1116-1124