PubChemRDF: towards the semantic annotation of PubChem compound and substance databases

被引:77
作者
Fu, Gang [1 ]
Batchelor, Colin [2 ]
Dumontier, Michel [3 ]
Hastings, Janna [4 ]
Willighagen, Egon [5 ]
Bolton, Evan [1 ]
机构
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
[2] Royal Soc Chem, Cambridge, England
[3] Stanford Univ, Stanford Ctr Biomed Informat Res, Stanford, CA 94305 USA
[4] EMBL EBI, Hinxton, Cambs, England
[5] Maastricht Univ, Dept Bioinformat BiGCaT, NUTRIM, Maastricht, Netherlands
来源
JOURNAL OF CHEMINFORMATICS | 2015年 / 7卷
关键词
SYSTEMS CHEMICAL BIOLOGY; WEB; ONTOLOGY; CHEMINFORMATICS; INTEGRATION; DISCOVERY; ENTITIES; PLATFORM; CHEBI;
D O I
10.1186/s13321-015-0084-4
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: PubChem is an open repository for chemical structures, biological activities and biomedical annotations. Semantic Web technologies are emerging as an increasingly important approach to distribute and integrate scientific data. Exposing PubChem data to Semantic Web services may help enable automated data integration and management, as well as facilitate interoperable web applications. Description: This work, one of a series covering the PubChemRDF project, describes an approach to translate PubChem Substance and Compound information into Resource Description Framework (RDF) format. Basic examples are provided to demonstrate its use. The aim of this effort is to provide two new primary benefits to researchers in a cost-effective manner. Firstly, we aim to remove the inherent limitations of using the web-based resource PubChem by allowing a researcher to use readily available semantic technologies (namely, RDF triple stores and their corresponding SPARQL query engines) to query and analyze PubChem data on local computing resources. Secondly, this work intends to help improve data sharing, analysis, and integration of PubChem data to resources external to NCBI and across scientific domains, by means of the association of PubChem data to existing ontological frameworks, including CHEMical INFormation ontology, Semanticscience Integrated Ontology, and others. Conclusions: With the goal of semantically describing information available in the PubChem archive, pre-existing ontological frameworks were used, rather than creating new ones. Semantic relationships between compounds and substances, chemical descriptors associated with compounds and substances, interrelationships between chemicals, as well as provenance and attribute metadata of substances are described.
引用
收藏
页数:15
相关论文
共 42 条
[1]  
[Anonymous], P 2 INT WORKSH LINK
[2]  
Beckett D, 2011, W3C TEAM SUBM
[3]   Bio2RDF: Towards a mashup to build bioinformatics knowledge systems [J].
Belleau, Francois ;
Nolin, Marc-Alexandre ;
Tourigny, Nicole ;
Rigault, Philippe ;
Morissette, Jean .
JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (05) :706-716
[4]  
Berners-Lee T, REQ COMM 3986
[5]  
Biron PV, 2004, XAL SCHEMA 2
[6]  
Bolton EE, PUBCHEM SYNONY UNPUB
[7]  
Bolton EE, 2011, J CHEMINFORMATICS, V3, P32
[8]   PubChem3D: Similar conformers [J].
Bolton, Evan E. ;
Kim, Sunghwan ;
Bryant, Stephen H. .
JOURNAL OF CHEMINFORMATICS, 2011, 3
[9]   PubChem3D: Conformer generation [J].
Bolton, Evan E. ;
Kim, Sunghwan ;
Bryant, Stephen H. .
JOURNAL OF CHEMINFORMATICS, 2011, 3
[10]  
Bolton EE, 2010, ANN REP COMP CHEM, V4, P217, DOI 10.1016/S1574-1400(08)00012-1