共 9 条
[2]
Mirror, mirror on the Web: A study of host pairs with replicated content
[J].
Computer Networks,
1999, 31 (11)
:1579-1590
[3]
Finding related pages in the World Wide Web.[J].Jeffrey Dean;Monika R Henzinger.Computer Networks.1999, 11
[4]
Syntactic clustering of the Web.[J].Andrei Z. Broder;Steven C. Glassman;Mark S. Manasse;Geoffrey Zweig.Computer Networks and ISDN Systems.1997, 8
[5]
News article extraction with template-independent wrapper..Wang;J;He;X;Wang;C;Pei;J;Bu;J;Chen;C;Guan;Z;Lu;G;.Proceedings of the 18th international conference on World wide web.2009,
[6]
Detecting Near- Duplicates for Web Crawlng..Gurmeet Singh Manku;Arvind Jain;Anish Das Sarma;.International World Wide Web Conference.2007,
[7]
Spotsigs:Robust and Efficient Near Duplicate Detection in Large Web Collections..Theobald; M;Siddharth; J;Paepcke; A;.Proceedings of the 31 st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.2008,
[8]
[9]

