共 41 条
OSCAR4: a flexible architecture for chemical text-mining
被引:115
作者:
Jessop, David M.
[1
]
Adams, Sam E.
[1
]
Willighagen, Egon L.
[1
]
Hawizy, Lezan
[1
]
Murray-Rust, Peter
[1
]
机构:
[1] Univ Cambridge, Dept Chem, Unilever Ctr Mol Sci Informat, Cambridge CB2 1EW, England
来源:
JOURNAL OF CHEMINFORMATICS
|
2011年
/
3卷
基金:
英国工程与自然科学研究理事会;
关键词:
SOURCE [!text type='JAVA']JAVA[!/text] LIBRARY;
DEVELOPMENT KIT CDK;
INFORMATION;
DICTIONARY;
SMILES;
D O I:
10.1186/1758-2946-3-41
中图分类号:
O6 [化学];
学科分类号:
0703 ;
摘要:
The Open-Source Chemistry Analysis Routines (OSCAR) software, a toolkit for the recognition of named entities and data in chemistry publications, has been developed since 2002. Recent work has resulted in the separation of the core OSCAR functionality and its release as the OSCAR4 library. This library features a modular API (based on reduction of surface coupling) that permits client programmers to easily incorporate it into external applications. OSCAR4 offers a domain-independent architecture upon which chemistry specific text-mining tools can be built, and its development and usage are discussed.
引用
收藏
页数:12
相关论文