JChemTidy: A tool for converting chemical Web document collections to an XHTML representation

Cited by: 6
Authors
Gkoutos, GV
Kenway, PR
Rzepa, HS [1]
Affiliations
[1] Univ London Imperial Coll Sci Technol & Med, Dept Chem, London SW7 2AY, England
[2] Merck Sharp & Dohme Res Labs, Neurosci Res Ctr, Harlow CM20 2QR, Essex, England
Source
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2001 / Vol. 41 / No. 2
DOI
10.1021/ci000396y
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
A robot-based procedure is described for traversing a collection of hyperlinked documents written in HTML and converting them to the XML-compliant, well-formed XHTML representation. Transcluded chemical content invoked using <embed> or <applet> HTML calls is converted to the XHTML-recommended <object> form. Additional attributes, such as title, or derived chemical attributes, such as a SMILES descriptor, are added to improve the indexing of the resulting document collection. Conformance tests for the popular Web browsers are reported.
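The <embed>-to-<object> step described in the abstract can be illustrated with a minimal Python sketch. This is not the JChemTidy implementation, whose internals are not given in this record. The class name `EmbedToObject`, the helper `to_xhtml`, the `titles` lookup table, and the choice to map leftover <embed> attributes onto <param> children are all assumptions made for illustration. The sketch does not handle <applet>, does not derive SMILES strings, and does not repair unclosed elements.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape, quoteattr
import xml.etree.ElementTree as ET

# Elements that are empty in HTML and must be self-closed in XHTML.
VOID = {"br", "hr", "img", "input", "meta", "link", "param"}


def fmt(attrs):
    """Serialise a dict as quoted, escaped XML attributes."""
    return "".join(f" {k}={quoteattr(v)}" for k, v in attrs.items())


class EmbedToObject(HTMLParser):
    """Re-serialise an HTML fragment, turning <embed> into <object> (illustrative only)."""

    def __init__(self, titles=None):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.titles = titles or {}  # hypothetical map: src value -> title text

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases names; expand minimised attributes (e.g. "hidden").
        attrs = {k: (k if v is None else v) for k, v in attrs}
        if tag == "embed":
            src = attrs.pop("src", "")
            obj = {"data": src}
            for key in ("type", "width", "height"):
                if key in attrs:
                    obj[key] = attrs.pop(key)
            if src in self.titles:
                obj["title"] = self.titles[src]
            self.out.append("<object" + fmt(obj) + ">")
            # Assumption: remaining plugin-specific attributes become <param> children.
            for name, value in attrs.items():
                self.out.append("<param" + fmt({"name": name, "value": value}) + " />")
            self.out.append("</object>")
        elif tag in VOID:
            self.out.append("<" + tag + fmt(attrs) + " />")
        else:
            self.out.append("<" + tag + fmt(attrs) + ">")

    def handle_endtag(self, tag):
        # <embed> and void elements were already closed when they were opened.
        if tag != "embed" and tag not in VOID:
            self.out.append("</" + tag + ">")

    def handle_data(self, data):
        self.out.append(escape(data))


def to_xhtml(fragment, titles=None):
    parser = EmbedToObject(titles)
    parser.feed(fragment)
    parser.close()
    return "".join(parser.out)


sample = "<P>Caffeine: <EMBED SRC=caffeine.mol WIDTH=200 HEIGHT=200><BR></P>"
result = to_xhtml(sample, {"caffeine.mol": "Caffeine"})
ET.fromstring(result)  # raises ParseError if the output is not well-formed XML
print(result)
```

Parsing the result with an XML parser, as in the last lines, is one simple way to confirm well-formedness. A crawler that follows hyperlinks between documents would sit on top of a conversion step like this one.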
Pages: 253-258
Page count: 6