A comparison of geometric approaches to assessing spatial similarity for GIR

被引:46
作者
Frontiera, Patricia [2 ]
Larson, Ray [1 ]
Radke, John [2 ]
机构
[1] Univ Calif Berkeley, Sch Informat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Geog Informat Sci Ctr, Berkeley, CA 94720 USA
关键词
geographic information retrieval; GIR; spatial similarity; spatial ranking; spatial search; geographic relevance; geometric approximations;
D O I
10.1080/13658810701626293
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This research compares the geographic information retrieval (GIR) performance of a set of logistic regression models with those of five non-probabilistic methods that compute a spatial similarity score for a query-document pair. All methods are applied to a test collection of queries and documents indexed spatially by two convex conservative geometric approximations: the minimum bounding box (MBB) and the convex hull. In the comparison, the tested logistic regression models outperform, in terms of standard information retrieval recall and precision measures, all of the non-probabilistic methods. The retrieval performance achieved by the logistic regression models on MBB approximations is similar to that achieved by the use of the non-probabilistic methods on convex hulls. Although these results are valid only for the test collection used in this study, they suggest that a logistic regression approach to GIR provides an alternative to the use of higher-quality geometric representations that are more difficult to obtain, implement, and process. Additionally, this research demonstrates the ability of a probabilistic approach to effectively incorporate information about geographic context in the spatial ranking process.
引用
收藏
页码:337 / 360
页数:24
相关论文
共 60 条
[1]   Voronoi-based region approximation for geographical information retrieval with gazetteers [J].
Alani, H ;
Jones, CB ;
Tudhope, D .
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2001, 15 (04) :287-306
[2]  
AMITAY E, 2004, SIGIR04 25 29 JUL SH
[3]  
[Anonymous], P 16 ANN INT ACM SIG
[4]   A LINEAR TIME ALGORITHM FOR THE HAUSDORFF DISTANCE BETWEEN CONVEX POLYGONS [J].
ATALLAH, MJ .
INFORMATION PROCESSING LETTERS, 1983, 17 (04) :207-209
[5]  
Baeza-Yates R.A., 1999, Modern Information Retrieval
[6]   Multidimensional ranking for data in digital spatial libraries [J].
Beard K. ;
Sharma V. .
International Journal on Digital Libraries, 1997, 1 (2) :153-160
[7]  
BRINKHOFF T, 1993, SPATIAL DATABASE SYS, P40
[8]  
BUCHER B, 2005, D317301 SPIRIT
[9]  
CAI G, 2002, LECT NOTES COMPUTER, V2478, P65
[10]   A model for representing topological relationships between complex geometric features in spatial databases [J].
Clementini, E ;
DiFelice, P .
INFORMATION SCIENCES, 1996, 90 (1-4) :121-136