Search log analysis: What it is, what's been done, how to do it

被引:124
作者
Jansen, Bemard J. [1 ]
机构
[1] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
关键词
D O I
10.1016/j.lisr.2006.06.005
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
The use of data stored in transaction logs of Web search engines, Intranets, and Web sites can provide valuable insight into understanding the information-searching process of online searchers. This understanding can enlighten information system design, interface development, and devising the information architecture for content collections. This article presents a review and foundation for conducting Web search transaction log analysis. A methodology is outlined consisting of three stages, which are collection, preparation, and analysis. The three stages of the methodology are presented in detail with discussions of goals, metrics, and processes at each stage. Critical terms in transaction log analysis for Web searching are defined. The strengths and limitations of transaction log analysis as a research method are presented. An application to log client-side interactions that supplements transaction logs is reported on, and the application is made available for use by the research community. Suggestions are provided on ways to leverage the strengths of, while addressing the limitations of, transaction log analysis for Web-searching research. Finally, a complete flat text transaction log from a commercial search engine is available as supplementary material with this manuscript. (c) 2006 Elsevier Inc. All rights reserved.
引用
收藏
页码:407 / 432
页数:26
相关论文
共 70 条
[31]  
Jansen BJ, 2001, J AM SOC INF SCI TEC, V52, P235, DOI 10.1002/1097-4571(2000)9999:9999<::AID-ASI1607>3.0.CO
[32]  
2-F
[33]  
JANSEN BJ, 2006, J AM SOC INFORM SCI, V56, P1480
[34]  
JANSEN BJ, 2003, ACM CHI 2003 C HUM F
[35]  
JANSEN BJ, 2003, 2003 IEEE INT C SYST
[36]  
JONES S, 1998, 3 ACM C DIG LIBR PIT
[37]  
Kelly D., 2003, SIGIR Forum, V37, P18, DOI 10.1145/959258.959260
[38]  
Kelly D, 2004, UNDERSTANDING IMPLIC
[39]  
KINSELLA J, 1987, LIBR TRENDS, V35, P619
[40]  
KORFHAGE RR, 1997, INFORMATION STORAGE