A Metrics-Driven Approach for Quality Assessment of Linked Open Data

Cited by: 44
Authors
Behkamal, Behshid [1]
Kahani, Mohsen [1]
Bagheri, Ebrahim [2]
Jeremic, Zoran [2]
Affiliations
[1] Ferdowsi Univ Mashhad, Dept Comp Engn, Mashhad, Iran
[2] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON, Canada
Source
JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH | 2014 / Vol. 9 / No. 2
Keywords
Metrics; Linked open data; Correctness; Consistency; Quality assessment;
DOI
10.4067/S0718-18762014000200006
Chinese Library Classification
F [Economics];
Discipline classification code
02;
Abstract
The main objective of the Web of Data paradigm is to crystallize knowledge through the interlinking of already existing but dispersed data. The usefulness of the developed knowledge depends strongly on the quality of the published data. Researchers have observed many deficiencies with regard to the quality of Linked Open Data. The first step towards improving the quality of data released as a part of the Linked Open Data Cloud is to develop tools for measuring the quality of such data. To this end, the main objective of this paper is to propose and validate a set of metrics for evaluating the inherent quality characteristics of a dataset before it is released to the Linked Open Data Cloud. These inherent characteristics are semantic accuracy, syntactic accuracy, uniqueness, completeness and consistency. We follow the Goal-Question-Metric approach to propose various metrics for each of these five quality characteristics. We provide both theoretical validation and empirical observation of the behavior of the proposed metrics in this paper. The proposed set of metrics establishes a starting point for a systematic inherent quality analysis of open datasets.
Pages: 64-79
Number of pages: 16