OSCAR4: a flexible architecture for chemical text-mining

被引:115
作者
Jessop, David M. [1 ]
Adams, Sam E. [1 ]
Willighagen, Egon L. [1 ]
Hawizy, Lezan [1 ]
Murray-Rust, Peter [1 ]
机构
[1] Univ Cambridge, Dept Chem, Unilever Ctr Mol Sci Informat, Cambridge CB2 1EW, England
来源
JOURNAL OF CHEMINFORMATICS | 2011年 / 3卷
基金
英国工程与自然科学研究理事会;
关键词
SOURCE [!text type='JAVA']JAVA[!/text] LIBRARY; DEVELOPMENT KIT CDK; INFORMATION; DICTIONARY; SMILES;
D O I
10.1186/1758-2946-3-41
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The Open-Source Chemistry Analysis Routines (OSCAR) software, a toolkit for the recognition of named entities and data in chemistry publications, has been developed since 2002. Recent work has resulted in the separation of the core OSCAR functionality and its release as the OSCAR4 library. This library features a modular API (based on reduction of surface coupling) that permits client programmers to easily incorporate it into external applications. OSCAR4 offers a domain-independent architecture upon which chemistry specific text-mining tools can be built, and its development and usage are discussed.
引用
收藏
页数:12
相关论文
共 41 条
[1]   Experimental data checker: better information for organic chemists [J].
Adams, SE ;
Goodman, JM ;
Kidd, RJ ;
McNaught, AD ;
Murray-Rust, P ;
Norton, FR ;
Townsend, JA ;
Waudby, CA .
ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (21) :3067-3070
[2]  
[Anonymous], IUPAC INT CHEM INDEN
[3]  
[Anonymous], OPSIN OPEN PARSER SY
[4]  
[Anonymous], OSCAR4 LAUNCH
[5]  
[Anonymous], CHEMICALTAGGER
[6]  
[Anonymous], FIX ONTOLOGY
[7]  
[Anonymous], OSCAR3 ALPHA 5
[8]  
[Anonymous], 2018, ChEBI
[9]  
[Anonymous], YOURKIT JAVA PROFILE
[10]  
[Anonymous], RSC EXPT DATA CHECKE