Chemical Entity Semantic Specification: Knowledge representation for efficient semantic cheminformatics and facile data integration

被引:16
作者
Chepelev, Leonid L. [1 ]
Dumontier, Michel [1 ,2 ,3 ]
机构
[1] Carleton Univ, Dept Biol, Ottawa, ON K1S 5B6, Canada
[2] Carleton Univ, Inst Biochem, Ottawa, ON K1S 5B6, Canada
[3] Carleton Univ, Sch Comp Sci, Ottawa, ON K1S 5B6, Canada
来源
JOURNAL OF CHEMINFORMATICS | 2011年 / 3卷
关键词
MARKUP LANGUAGE; XML; ONTOLOGY; SYSTEMS;
D O I
10.1186/1758-2946-3-20
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: Over the past several centuries, chemistry has permeated virtually every facet of human lifestyle, enriching fields as diverse as medicine, agriculture, manufacturing, warfare, and electronics, among numerous others. Unfortunately, application-specific, incompatible chemical information formats and representation strategies have emerged as a result of such diverse adoption of chemistry. Although a number of efforts have been dedicated to unifying the computational representation of chemical information, disparities between the various chemical databases still persist and stand in the way of cross-domain, interdisciplinary investigations. Through a common syntax and formal semantics, Semantic Web technology offers the ability to accurately represent, integrate, reason about and query across diverse chemical information. Results: Here we specify and implement the Chemical Entity Semantic Specification (CHESS) for the representation of polyatomic chemical entities, their substructures, bonds, atoms, and reactions using Semantic Web technologies. CHESS provides means to capture aspects of their corresponding chemical descriptors, connectivity, functional composition, and geometric structure while specifying mechanisms for data provenance. We demonstrate that using our readily extensible specification, it is possible to efficiently integrate multiple disparate chemical data sources, while retaining appropriate correspondence of chemical descriptors, with very little additional effort. We demonstrate the impact of some of our representational decisions on the performance of chemically-aware knowledgebase searching and rudimentary reaction candidate selection. Finally, we provide access to the tools necessary to carry out chemical entity encoding in CHESS, along with a sample knowledgebase. Conclusions: By harnessing the power of Semantic Web technologies with CHESS, it is possible to provide a means of facile cross-domain chemical knowledge integration with full preservation of data correspondence and provenance. Our representation builds on existing cheminformatics technologies and, by the virtue of RDF specification, remains flexible and amenable to application-and domain-specific annotations without compromising chemical data integration. We conclude that the adoption of a consistent and semantically-enabled chemical specification is imperative for surviving the coming chemical data deluge and supporting systems science research.
引用
收藏
页数:19
相关论文
共 40 条
[1]   Chemical Markup, XML and the World-Wide Web. 8. Polymer Markup Language [J].
Adams, Nico ;
Winter, Jerry ;
Murray-Rust, Peter ;
Rzepa, Henry S. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (11) :2118-2128
[2]  
Alan M., 2006, CHEM INT, V1, P12, DOI DOI 10.1515/CI.2006.28.6.12
[3]  
[Anonymous], CHEMINF Ontology
[4]  
[Anonymous], Pellet: Owl 2 reasoner for java
[5]  
[Anonymous], Chemical Entity Semantic Specification
[6]  
[Anonymous], Linking Open Data Initiative
[7]  
[Anonymous], Connectivity Table File Formats
[8]  
[Anonymous], OP BAB CHEM TOOLB
[9]  
[Anonymous], WEB ONTOLOGY LANGUAG
[10]  
[Anonymous], Linking Open Drug Data Project