A semantic frame-based intelligent agent for topic detection

被引:10
作者
Chang, Yung-Chun [1 ,2 ]
Hsieh, Yu-Lun [1 ,3 ]
Chen, Cen-Chieh [1 ,3 ]
Hsu, Wen-Lian [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Informat Management, Taipei, Taiwan
[3] Natl Chengchi Univ, Dept Comp Sci, Taipei, Taiwan
关键词
Topic detection; Semantic frame; Semantic class; Partial matching; FUZZY ONTOLOGY;
D O I
10.1007/s00500-015-1695-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting the topic of documents can help readers construct the background of the topic and facilitate document comprehension. In this paper, we propose a semantic frame-based topic detection (SFTD) that simulates such process in human perception. We take advantage of multiple knowledge sources and extracted discriminative patterns from documents through a highly automated, knowledge-supported frame generation and matching mechanisms. Using a Chinese news corpus containing over 111,000 news articles, we provide a comprehensive performance evaluation which demonstrates that our novel approach can effectively detect the topic of a document by exploiting the syntactic structures, semantic association, and the context within the text. Experimental results show that SFTD is comparable to other well-known topic detection methods.
引用
收藏
页码:391 / 401
页数:11
相关论文
共 22 条
[1]   Automatic ontology-based knowledge extraction from web documents [J].
Alani, H ;
Kim, S ;
Millard, DE ;
Weal, MJ ;
Hall, W ;
Lewis, PH ;
Shadbolt, NR .
IEEE INTELLIGENT SYSTEMS, 2003, 18 (01) :14-21
[2]  
[Anonymous], 1999, FDN STAT NATURAL LAN
[3]  
[Anonymous], 3 IEEE EMRITE
[4]  
[Anonymous], P 3 INT C INF THEOR
[5]  
[Anonymous], TECH REP
[6]  
[Anonymous], 2011, Modern Information Retrieval: The Concepts and Technology behind Search
[7]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[8]  
Bollacker K., 2008, P 2008 ACM SIGMOD IN, P1247, DOI DOI 10.1145/1376616.1376746
[9]  
Bun KK, 2002, WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, P73, DOI 10.1109/WISE.2002.1181645
[10]  
Dong Z., 2010, P 23 INT C COMP LING, P53