9 entries in total
[1]
Microsoft Research Asia at the Web Track of TREC 2003. WEN JR,SONG RH,CAI D,et al. The Twelfth Text Retrieval Conference (TREC 12) . 2003
[2]
Record-Boundary Discovery in Web Documents. EMBLEY DW,JIANG YS,NG YK. SIGMOD 99 Proceedings . 1999
[3]
Record Location and Reconfiguration in Unstructured Multiple-Record Web Documents. EMBLEY DW,LI X. WebDB 00 Proceedings . 2000
[4]
Discovering Informative Content Blocks from Web Documents. LIN SH,HO JM. KDD . 2002
[5]
Improving Pseudo-Relevance Feedback in Web Information Retrieval Using Web Page Segmentation. YU SP,CAI D,WEN JR,et al. http://research.microsoft.com/research/pubs/view.aspx?type=Technical%20Report&id=632 . 2002
[6]
Extracting Structures of HTML Documents Using a High-Level Stack Machine. LIM SJ,NG YK. Information Networking in Asia . 2001
[7]
The W3C Protocol Library. http://www.w3.org/Library/ . 2004
[8]
A Heuristic Approach for Converting HTML Documents to XML Documents. LIM SJ,NG YK. Proceedings of the Sixth International Conference on Rules and Objects in Databases (DOOD 2000) . 2000
[9]
Integrating HTML Tables Using Semantic Hierarchies and Meta-Data Sets. LIM SJ,NG YK,YANG XC. International Database Engineering and Applications Symposium (IDEAS 02) . 2002