Geographic intention and modification in web search

被引:37
作者
Jones, Rosie [1 ]
Zhang, Wei V. [1 ]
Rey, Benjamin [1 ]
Jhala, Pradhuman [1 ]
Stipp, Eugene [1 ]
机构
[1] Yahoo, Burbank, CA 91504 USA
关键词
web search query log analysis; query reformulation; geo-correction; query geo-modification; geographic intent; geographical information retrieval (GIR);
D O I
10.1080/13658810701626186
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web searchers signal their geographic intent by using place-names in search queries. They also indicate their flexibility about geographic specificity by reformulating their queries. By examining this data we can learn to understand web searcher flexibility with respect to geographic intent. We examine aggregated data of queries with locations, and locations identified from IP addresses, to identify overall distance preferences, as well as distance preferences by search topic. We also examine query rewriting: both deliberate query rewriting, conducted in web search sessions, and automated query rewriting, with manual relevance judgments of geo-modified queries. We find geo-specification in 12.7% of user query rewrites in search sessions, and show the breakdown into sub-classes such as same-city, same-state, same-country and different-country. We also measure the dependence between US-state-name and distance-of-modified-location-from-original-location, finding that Vermont web searchers modify their locations greater distances than California web searchers. We find that automatically-modified queries are perceived as much more relevant when the geographic component is unchanged. We look at the relationship between the non-location part of a query and the distance from the user. We see that people search for child day-care near their locations and maps far from where they are located. We also give distance profiles for the top topics which cooccur with place-names in queries, which could be used to set document priors based on document proximity and query topic.
引用
收藏
页码:229 / 246
页数:18
相关论文
共 14 条
[1]  
Amitay E., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P273, DOI 10.1145/1008992.1009040
[2]   Web-based delineation of imprecise regions [J].
Arampatzis, Avi ;
van Kreveld, Marc ;
Reinbacher, Iris ;
Jones, Christopher B. ;
Vaid, Subodh ;
Clough, Paul ;
Joho, Hideo ;
Sanderson, Mark .
COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2006, 30 (04) :436-459
[3]   Contextualization of geospatial database semantics for human-GIS interaction [J].
Cai, Guoray .
GEOINFORMATICA, 2007, 11 (02) :217-237
[4]  
Dunning T., 1993, Computational Linguistics, V19, P61
[5]  
FU G, 2005, P LECT NOTES COMPUTE, P1466
[6]  
Gravano L., 2003, P 12 INT C INF KNOWL, P325
[7]  
Jones R., 2003, P 26 ANN INT ACM SIG, P435
[8]  
Jones Rosie, 2006, P 15 INT C WORLD WID, P387, DOI [DOI 10.1145/1135777.1135835, 10.1145/1135777.1135835]
[9]  
Larson RR, 2004, LECT NOTES COMPUT SC, V3232, P45
[10]  
LEIDNER J, 2006, LECT NOTES COMPUTER, V4002, P987