Genetic-based approaches in ranking function discovery and optimization in information retrieval - A framework

被引:21
作者
Fan, Weiguo [2 ]
Pathak, Praveen [1 ]
Zhou, Mi [3 ]
机构
[1] Univ Florida, Worrington Coll Business Adm, Gainesville, FL 32611 USA
[2] Virginia Tech, Blacksburg, VA 24061 USA
[3] Xi An Jiao Tong Univ, Sch Management, Xian 710049, Shaanxi, Peoples R China
基金
美国国家科学基金会;
关键词
Information retrieval; Artificial intelligence; Evolutionary computations; Data fusion; Genetic algorithms; RELEVANCE FEEDBACK; WEB SEARCH; ALGORITHMS; QUERY; MODEL;
D O I
10.1016/j.dss.2009.04.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An Information Retrieval (IR) system consists of document collection, queries issued by users. and the matching/ranking functions used to rank documents in the predicted order of relevance for a given query. A variety of ranking functions have been used in the literature. But studies show that these functions do not perform consistently well across different contexts. In this paper we propose a two-stage integrated framework for discovering and optimizing ranking functions used in IR. The first stage, discovery process, is accomplished by intelligently leveraging the structural and statistical information available in HTML documents by using Genetic Programming techniques to yield novel ranking functions. in the second stage, the optimization process, document retrieval scores of various well-known ranking functions are combined using Genetic Algorithms. The overall discovery and optimization framework is tested oil the well-known TREC collection of web documents for both the ad-hoc retrieval task and the routing task. Utilizing our framework we observe a significant increase in retrieval performance compared to some of the well-known stand alone ranking functions. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:398 / 407
页数:10
相关论文
共 50 条
  • [1] [Anonymous], NIST SPECIAL PUBLICA
  • [2] [Anonymous], 2003, Genetic programming IV: routine human-competitive machine intelligence
  • [3] Bartell BT, 1998, J AM SOC INFORM SCI, V49, P742, DOI 10.1002/(SICI)1097-4571(199806)49:8<742::AID-ASI8>3.0.CO
  • [4] 2-H
  • [5] BARTELL BT, 1994, P 17 ANN INT ACM SIG, P173, DOI DOI 10.1007/978-1-4471-2099-5_18
  • [6] COMBINING THE EVIDENCE OF MULTIPLE QUERY REPRESENTATIONS FOR INFORMATION-RETRIEVAL
    BELKIN, NJ
    KANTOR, P
    FOX, EA
    SHAW, JA
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1995, 31 (03) : 431 - 448
  • [7] Learning retrieval expert combinations with genetic algorithms
    Billhardt, H
    Borrajo, D
    Maojo, V
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2003, 11 (01) : 87 - 113
  • [8] BOOKSTEIN A, 1985, ANNU REV INFORM SCI, V20, P117
  • [9] BORDOGNA G, 1993, J AM SOC INFORM SCI, V44, P70, DOI 10.1002/(SICI)1097-4571(199303)44:2<70::AID-ASI2>3.0.CO
  • [10] 2-I