MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database

Cited by: 108
Authors
Davis, Allan Peter [1]
Wiegers, Thomas C. [1]
Rosenstein, Michael C. [1]
Mattingly, Carolyn J. [1]
Affiliations
[1] Mount Desert Island Biological Laboratory, Department of Bioinformatics, Salisbury Cove, ME 04672, USA
Source
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2012
Keywords
TOOL; ONTOLOGY; GENOMICS;
DOI
10.1093/database/bar065
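Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
090105 [Crop Production Systems and Ecological Engineering];
Abstract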
The Comparative Toxicogenomics Database (CTD) is a public resource that promotes understanding about the effects of environmental chemicals on human health. CTD biocurators manually curate a triad of chemical-gene, chemical-disease and gene-disease relationships from the scientific literature. The CTD curation paradigm uses controlled vocabularies for chemicals, genes and diseases. To curate disease information, CTD first had to identify a source of controlled terms. Two resources seemed to be good candidates: the Online Mendelian Inheritance in Man (OMIM) and the 'Diseases' branch of the National Library of Medicine's Medical Subject Headings (MeSH). To maximize the advantages of both, CTD biocurators undertook a novel initiative to map the flat list of OMIM disease terms into the hierarchical nature of the MeSH vocabulary. The result is CTD's 'merged disease vocabulary' (MEDIC), a unique resource that integrates OMIM terms, synonyms and identifiers with MeSH terms, synonyms, definitions, identifiers and hierarchical relationships. MEDIC is both a deep and broad vocabulary, composed of 9700 unique diseases described by more than 67 000 terms (including synonyms). It is freely available to download in various formats from CTD. While neither a true ontology nor a perfect solution, this vocabulary has nonetheless proved to be extremely successful and practical for our biocurators in generating over 2.5 million disease-associated toxicogenomic relationships in CTD. Other external databases have also begun to adopt MEDIC for their disease vocabulary. Here, we describe the construction, implementation, maintenance and use of MEDIC to raise awareness of this resource and to offer it as a putative scaffold in the formal construction of an official disease ontology.
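The idea of mapping a flat term list into a hierarchical vocabulary can be pictured with a small sketch. This is only an illustration under stated assumptions, not CTD's actual pipeline or file format: the `DiseaseNode` class, the `merge_flat_terms` and `ancestors` helpers, and every identifier and disease name below (`MESH:A`, `OMIM:1`, and so on) are hypothetical placeholders. It assumes a curator supplies the mapping from each flat-list term to a hierarchy node, and that a mapped term contributes its identifier and, where its name differs, a synonym.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class DiseaseNode:
    """One node of a MeSH-like hierarchy (hypothetical structure)."""
    node_id: str
    name: str
    parents: List[str] = field(default_factory=list)
    synonyms: Set[str] = field(default_factory=set)
    alt_ids: Set[str] = field(default_factory=set)  # identifiers merged in from the flat list


def merge_flat_terms(hierarchy: Dict[str, DiseaseNode],
                     flat_terms: Dict[str, str],
                     curated_map: Dict[str, str]) -> List[str]:
    """Attach each flat-list term to the node a curator mapped it to.

    Returns the identifiers of flat terms that had no usable mapping,
    so they can be sent back for manual review.
    """
    unmapped = []
    for term_id, term_name in flat_terms.items():
        target = curated_map.get(term_id)
        if target not in hierarchy:
            unmapped.append(term_id)
            continue
        node = hierarchy[target]
        node.alt_ids.add(term_id)
        # Keep the flat term's name as a synonym unless it duplicates the node name.
        if term_name.lower() != node.name.lower():
            node.synonyms.add(term_name)
    return unmapped


def ancestors(hierarchy: Dict[str, DiseaseNode], node_id: str) -> Set[str]:
    """Collect every ancestor of a node by walking its parent links."""
    seen: Set[str] = set()
    stack = list(hierarchy[node_id].parents)
    while stack:
        parent = stack.pop()
        if parent not in seen:
            seen.add(parent)
            stack.extend(hierarchy[parent].parents)
    return seen


# Placeholder data: two hierarchy nodes and two flat-list terms (all invented).
hierarchy = {
    "MESH:A": DiseaseNode("MESH:A", "Disease group"),
    "MESH:B": DiseaseNode("MESH:B", "Specific disease", parents=["MESH:A"]),
}
flat_terms = {
    "OMIM:1": "Specific disease, familial",
    "OMIM:2": "Unmapped syndrome",
}
curated_map = {"OMIM:1": "MESH:B"}  # curator-supplied; OMIM:2 is deliberately left out

unmapped = merge_flat_terms(hierarchy, flat_terms, curated_map)
```

Once a flat term is attached to a node, it inherits that node's position in the hierarchy, so a query for a broad category can also retrieve annotations made to the narrower, merged terms.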
Pages: 9