Extracting Cybersecurity Related Linked Data from Text

被引:71
作者
Joshi, Arnav [1 ]
Lal, Ravendar [1 ]
Finin, Tim [1 ]
Joshi, Anupam [1 ]
机构
[1] Univ Maryland Baltimore Cty, Baltimore, MD 21250 USA
来源
2013 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2013) | 2013年
关键词
cybersecurity; linked data; information extraction; ontology;
D O I
10.1109/ICSC.2013.50
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Web is typically our first source of information about new software vulnerabilities, exploits and cyber-attacks. Information is found in semi-structured vulnerability databases as well as in text from security bulletins, news reports, cybersecurity blogs and Internet chat rooms. It can be useful to cybersecurity systems if there is a way to recognize and extract relevant information and represent it as easily shared and integrated semantic data. We describe such an automatic framework that generates and publishes a RDF linked data representation of cybersecurity concepts and vulnerability descriptions extracted from the National Vulnerability Database and from text sources. A CRF-based system is used to identify cybersecurity-related entities, concepts and relations in text, which are then represented using custom ontologies for the cybersecurity domain and also mapped to objects in the DBpedia knowledge base. The resulting cybersecurity linked data collection can be used for many purposes, including automating early vulnerability identification, mitigation and prevention efforts.
引用
收藏
页码:252 / 259
页数:8
相关论文
共 21 条
[1]  
[Anonymous], 2008, W3C RECOMMENDATION
[2]  
[Anonymous], 2011, P 7 INT C SEM SYST, DOI [10.1145/2063518.2063519, DOI 10.1145/2063518.2063519]
[3]  
[Anonymous], HPL2003146
[4]  
[Anonymous], 2007, WWW
[5]  
[Anonymous], 2004, W3C RECOMMENDATION
[6]  
[Anonymous], 2001, PROC 18 INT C MACH L
[7]   Linked Data - The Story So Far [J].
Bizer, Christian ;
Heath, Tom ;
Berners-Lee, Tim .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (03) :1-22
[8]  
Bollacker K., 2008, P 2008 ACM SIGMOD IN, P1247, DOI DOI 10.1145/1376616.1376746
[9]  
Finkel J.R., 2005, P 43 ANN M ASS COMP
[10]  
Joshi A., 2013, THESIS U MARYLAND BA