A day in the life of PubMed: Analysis of a typical day's query log

Cited by: 91
Authors
Herskovic, Jorge R.
Tanaka, Len Y.
Hersh, William
Bernstam, Elmer V.
Affiliations
[1] Univ Texas, Sch Hlth Informat Sci, Houston, TX 77030 USA
[2] Univ Texas, Sch Med, Dept Pediat, Div Pediat Crit Care, Houston, TX 77030 USA
[3] Oregon Hlth & Sci Univ, Dept Med Informat & Clin Epidemiol, Portland, OR USA
[4] Univ Texas, Sch Med, Dept Internal Med, Div Gen Internal Med, Houston, TX 77030 USA
Keywords
WEB; MEDLINE;
DOI
10.1197/jamia.M2191
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Objective: To characterize PubMed usage over a typical day and compare it to previous studies of user behavior on Web search engines. Design: We performed a lexical and semantic analysis of 2,689,166 queries issued on PubMed over 24 consecutive hours on a typical day. Measurements: We measured the number of queries, number of distinct users, queries per user, terms per query, common terms, Boolean operator use, common phrases, result set size, and MeSH categories; used semantic measurements to group queries into sessions; and studied the addition and removal of terms from consecutive queries to gauge search strategies. Results: The size of the result sets from a sample of queries showed a bimodal distribution, with peaks at approximately 3 and 100 results, suggesting that a large group of queries was tightly focused and another was broad. Like Web search engine sessions, most PubMed sessions consisted of a single query. However, PubMed queries contained more terms. Conclusion: PubMed's usage profile should be considered when educating users, building user interfaces, and developing future biomedical information retrieval systems.
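As an illustration of the lexical measurements named in the abstract (queries per user, terms per query, common terms, Boolean operator use), the following is a minimal sketch and not the authors' method. The log format (pairs of user ID and query string), whitespace tokenization, the rule that only uppercase `AND`, `OR`, and `NOT` count as operators, and the function name `summarize_query_log` are all assumptions made for this example.

```python
from collections import Counter

# Assumption: only these uppercase tokens are treated as Boolean operators.
BOOLEAN_OPERATORS = {"AND", "OR", "NOT"}


def summarize_query_log(log):
    """Compute simple lexical statistics for a query log.

    `log` is an iterable of (user_id, query_string) pairs; this format is
    hypothetical and chosen only for illustration.
    """
    queries_per_user = Counter()
    term_counts = Counter()
    operator_counts = Counter()
    total_terms = 0
    total_queries = 0

    for user_id, query in log:
        tokens = query.split()  # naive whitespace tokenization
        operators = [t for t in tokens if t in BOOLEAN_OPERATORS]
        terms = [t for t in tokens if t not in BOOLEAN_OPERATORS]

        queries_per_user[user_id] += 1
        operator_counts.update(operators)
        term_counts.update(t.lower() for t in terms)
        total_terms += len(terms)
        total_queries += 1

    distinct_users = len(queries_per_user)
    return {
        "queries": total_queries,
        "distinct_users": distinct_users,
        # Guard against an empty log to avoid division by zero.
        "mean_queries_per_user": total_queries / distinct_users if distinct_users else 0.0,
        "mean_terms_per_query": total_terms / total_queries if total_queries else 0.0,
        "common_terms": term_counts.most_common(10),
        "operator_counts": dict(operator_counts),
    }


if __name__ == "__main__":
    sample_log = [
        ("u1", "asthma AND children"),
        ("u1", "asthma treatment"),
        ("u2", "diabetes"),
    ]
    print(summarize_query_log(sample_log))
```

A real analysis of a log this size would need to handle quoted phrases, field tags, and session grouping, none of which this sketch attempts.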
Pages: 212-220
Number of pages: 9