A robot-based resource discovery tool for adding chemical meta-information and value to web-based documents

被引:4
作者
Gkoutos, GV [1 ]
Kenway, PR
Rzepa, HS
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Chem, London SW7 2AY, England
[2] Merck Sharp & Dohme Res Labs, Neurosci Res Ctr, Harlow CM20 2QR, Essex, England
关键词
D O I
10.1039/b009040i
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We report a set of tools to be used in conjunction with a robot-based Internet indexing engine which can be used to convert non-conforming HTML collections to well-formed and valid XHTML documents. The tools, inter alia, can correct invalid syntax which can occur in embedded RasMol scripts and extract chemical mete-information from normally inaccessible document components, including transcluded chemical files. The index that can be built from the transformed documents can be used to improve the quality of searches carried out in a chemical context.
引用
收藏
页码:635 / 638
页数:4
相关论文
共 16 条
[1]  
Berners-Lee T, 2000, RECHERCHE, P62
[2]   HYPERACTIVE MOLECULES AND THE WORLD-WIDE-WEB INFORMATION-SYSTEM [J].
CASHER, O ;
CHANDRAMOHAN, GK ;
HARGREAVES, MJ ;
LEACH, C ;
MURRAYRUST, P ;
RZEPA, HS ;
SAYLE, R ;
WHITAKER, BJ .
JOURNAL OF THE CHEMICAL SOCIETY-PERKIN TRANSACTIONS 2, 1995, (01) :7-11
[3]   The semantic Web:: The roles of XML and RDF [J].
Decker, S ;
Melnik, S ;
Van Harmelen, F ;
Fensel, D ;
Klein, M ;
Broekstra, J ;
Erdmann, M ;
Horrocks, I .
IEEE INTERNET COMPUTING, 2000, 4 (05) :63-74
[4]  
DEITEL H, 2000, XML HOW TO PROGRAM
[5]  
DEITEL H, 2000, COMPLETE XML TRAININ
[6]  
GKOUTOS GV, 1999, EL C SYNTH ORG CHEM
[7]  
GKOUTOS GV, 2001, IN PRESS J CHEM INF
[8]  
GKOUTOS GV, 2000, INTERNET J CHEM, V3
[9]   A model for enhancing Internet medical document retrieval with "medical core metadata" [J].
Malet, G ;
Munoz, F ;
Appleyard, R ;
Hersh, W .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1999, 6 (02) :163-172
[10]   Chemical markup, XML, and the Worldwide Web. 1. Basic principles [J].
Murray-Rust, P ;
Rzepa, HS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (06) :928-942