Knowle: A semantic link network based system for organizing large scale online news events

Cited by: 79
Authors
Xu, Zheng [1, 3]
Wei, Xiao [4]
Luo, Xiangfeng [2]
Liu, Yunhuai [1]
Mei, Lin [1]
Hu, Chuanping [1]
Chen, Lan [1]
Affiliations
[1] Minist Publ Secur, Res Inst 3, Shanghai, Peoples R China
[2] Shanghai Univ, Sch Comp, Shanghai, Peoples R China
[3] Tsinghua Univ, Beijing 100084, Peoples R China
[4] Shanghai Inst Technol, Shanghai, Peoples R China
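Source
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2015 / Vol. 43-44
Funding
National High Technology Research and Development Program of China (863 Program); National Science Foundation (USA);
Keywords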
News events; Health domain; Big data; Semantic link network; WEB; SIMULATION; SCIENCE; TIME;
DOI
10.1016/j.future.2014.04.002
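Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
080201 [Mechanical Manufacturing and Automation];
Abstract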
Explosive growth in the volume, velocity, and variety of data available on the Internet has been witnessed recently. Data originating from many types of sources, including mobile devices, sensors, individual archives, social networks, the Internet of Things, enterprises, cameras, software logs, and health data, has led to one of the most challenging research issues of the big data era. In this paper, Knowle, an online news management system built on the semantic link network model, is introduced. Knowle is a news-event-centric data management system. Its core elements are news events on the Web, which are linked by their semantic relations. Knowle is a hierarchical data system with three layers: the bottom layer (concepts), the middle layer (resources), and the top layer (events). The basic building blocks of the Knowle system (news collection, resource representation, semantic relation mining, and semantic linking of news events) are given. Knowle does not require data providers to follow semantic standards such as RDF or OWL; it is a semantics-rich, self-organized network that reflects various semantic relations among concepts, news, and events. Moreover, in the case study, Knowle is used for organizing and mining health news, which shows its potential as a basis for designing and developing a big-data-analytics-based innovation framework in the health domain. (C) 2014 Elsevier B.V. All rights reserved.
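The three-layer structure described in the abstract can be pictured with a small sketch. This is only an illustration under stated assumptions, not the authors' implementation: the class `SemanticLinkNetwork`, its method names, the sample news items, and the use of Jaccard overlap as a link weight are all hypothetical choices made here, since the abstract does not specify how semantic relations are mined.

```python
from dataclasses import dataclass, field


@dataclass
class SemanticLinkNetwork:
    """Toy three-layer network: concepts (bottom), resources (middle), events (top)."""

    # resource id -> set of concepts appearing in that news item
    resources: dict = field(default_factory=dict)
    # event id -> set of resource ids that report the event
    events: dict = field(default_factory=dict)

    def add_resource(self, resource_id, concepts):
        self.resources[resource_id] = set(concepts)

    def add_event(self, event_id, resource_ids):
        self.events[event_id] = set(resource_ids)

    def event_concepts(self, event_id):
        """Union of the concepts of every resource attached to an event."""
        concepts = set()
        for rid in self.events[event_id]:
            concepts |= self.resources[rid]
        return concepts

    def link_weight(self, event_a, event_b):
        """Assumed link strength: Jaccard overlap of the two concept sets."""
        a, b = self.event_concepts(event_a), self.event_concepts(event_b)
        union = a | b
        return len(a & b) / len(union) if union else 0.0

    def linked_events(self, threshold=0.2):
        """Return (event_a, event_b, weight) for pairs at or above the threshold."""
        ids = sorted(self.events)
        links = []
        for i, a in enumerate(ids):
            for b in ids[i + 1:]:
                weight = self.link_weight(a, b)
                if weight >= threshold:
                    links.append((a, b, weight))
        return links


# Hypothetical sample data: four news items grouped into three events.
net = SemanticLinkNetwork()
net.add_resource("n1", ["flu", "vaccine", "hospital"])
net.add_resource("n2", ["flu", "outbreak"])
net.add_resource("n3", ["vaccine", "trial", "flu"])
net.add_resource("n4", ["election", "vote"])
net.add_event("flu_outbreak", ["n1", "n2"])
net.add_event("vaccine_trial", ["n3"])
net.add_event("local_election", ["n4"])

links = net.linked_events()
print(links)
```

In this toy data, only the two health-related events share concepts, so they are the only pair that gets linked. No RDF or OWL annotations are needed from the data provider, which mirrors the self-organizing property the abstract claims.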
Pages: 40-50
Page count: 11