Soft computing techniques for rank aggregation on the World Wide Web

被引:33
作者
Beg, MMS [1 ]
Ahmad, N [1 ]
机构
[1] Indian Inst Technol, Dept Elect Engn, New Delhi 110016, India
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2003年 / 6卷 / 01期
关键词
World Wide Web; rank aggregation; genetic algorithm; fuzzy ordering; meta-searching;
D O I
10.1023/A:1022344031752
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Rank aggregation is the problem of generating a near-"consensus" ranking for a given set of rankings. When applied to the web, this finds applications in meta-searching, search engine comparison, spam fighting and word association techniques. The rank aggregation obtained by optimizing the Spearman footrule distance is called footrule optimal aggregation (FOA), and it also satisfies the Condorcet property. We find in literature a polynomial time algorithm to compute FOA for full lists. However, when collating the results of the search engines, the lists are almost invariably always the partial ones, as different search engines usually return non-overlapping lists of documents. The FOA for partial lists, however, is NP-hard. This NP-hard nature of partial footrule optimal aggregation problem (PFOA) motivates us to apply genetic algorithm (GA) for the PFOA problem. The GA based technique may take long to compute, but we propose to decide upon the number of generations of GA based on the time limit allowed by the user. We have also considered some "positional" methods, as they are linear in complexity. A classical positional method is the Borda's methods Since, fuzzy logic has been extensively studied in literature for arriving at consensus in group decision making, the adoption of some fuzzy techniques is also being investigated here for getting an improvement over the Borda's method. We have not only adopted and compared the classical fuzzy rank ordering techniques for web applications, but also proposed three novel techniques that outshine the existing techniques.
引用
收藏
页码:5 / 22
页数:18
相关论文
共 12 条
[1]  
AHMAD N, 2002, P INT C ART INT ENG, P363
[2]  
[Anonymous], FUZZY LOGIC ENG APPL
[3]  
[Anonymous], 1989, GENETIC ALGORITHM SE
[4]  
[Anonymous], P 10 WORLD WID WEB C
[5]  
Beg MMS, 2002, PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, P329
[6]  
BHARAT K, 1988, ACM SIGIR, P104
[7]  
Borda J.-C., 1781, HIST ACAD ROYAL DES
[8]  
CHAKRABARTI S, 1998, P ACM SIGIR WORKSH H
[9]   SPEARMANS FOOTRULE AS A MEASURE OF DISARRAY [J].
DIACONIS, P ;
GRAHAM, RL .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (02) :262-268
[10]   GENETIC ALGORITHMS - PRINCIPLES OF NATURAL-SELECTION APPLIED TO COMPUTATION [J].
FORREST, S .
SCIENCE, 1993, 261 (5123) :872-878