Data quality assurance for volunteered geographic information

被引:46
作者
Ali, Ahmed Loai [1 ,3 ]
Schmid, Falko [1 ,2 ]
机构
[1] Cognitive Systems Group, University of Bremen
[2] SFB/TR 8 Spatial Cognition, University of Bremen
来源
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2014年 / 8728卷
关键词
D O I
10.1007/978-3-319-11593-1_9
中图分类号
学科分类号
摘要
The availability of technology and tools enables the public to participate in the collection, contribution, editing, and usage of geographic information, a domain previously reserved for mapping agencies or companies. The data of Volunteered Geographic Information (VGI) systems, such as OpenStreetMap (OSM), is based on the availability of technology and participation of individuals. However, this combination also implies quality issues related to the data: some of the contributed entities can be assigned to wrong or implausible classes, due to individual interpretation of the submitted data, or due to misunderstanding about available classes. In this paper we propose two methods to check the integrity of VGI data with respect to hierarchical consistency and classification plausibility. These methods are based on constraint checking and machine learning methods. They can be used to check the validity of data during contribution or at a later stage for collaborative manual or automatic data correction. © Springer International Publishing Switzerland 2014.
引用
收藏
页码:126 / 141
页数:15
相关论文
共 28 条
[21]  
Tobler W.R., A computer movie simulating urban growth in the detroit region, Economic Geography, 46, pp. 234-240, (1970)
[22]  
Devogele T., Parent C., Spaccapietra S., On spatial database integration, International Journal of Geographical Information Science, 12, 4, pp. 335-352, (1998)
[23]  
Bishop C.M., Pattern Recognition and Machine Learning (Information Science and Statistics), (2006)
[24]  
Han J., Kamber M., Pei J., Data Mining: Concepts and Techniques, (2011)
[25]  
Cover T., Hart P., Nearest Neighbor pattern classification, IEEE Transactions on Information Theory, 13, 1, pp. 21-27, (1967)
[26]  
Witten I.H., Frank E., Data Mining: PracticalMachine Learning Tools and Techniques, (2005)
[27]  
Kohavi R., Et al., A study of Cross-Validation and Bootstrap for accuracy estimation and model selection, Proc. International Joint Conference on Artificial Intelligence (IJCAI), pp. 1137-1145, (1995)
[28]  
Fawcett T., An introduction to ROC analysis, Pattern Recognition Letters, 27, 8, pp. 861-874, (2006)