基于LDA模型的Ad hoc信息检索方法研究

被引:8
作者
卜质琼 [1 ]
郑波尽 [2 ]
机构
[1] 广东技术师范学院计算机学院
[2] 中南民族大学计算机学院
关键词
信息检索; 语言模型; 文档模型; 话题模型;
D O I
暂无
中图分类号
TP391.1 [文字信息处理];
学科分类号
摘要
传统的话题模型假设每个文档只属于一个话题,而实际情况下一个文档往往与多个话题相关。应用LDA模型将文档表示为多个话题的组合,并基于语言模型框架,提出了一种基于LDA的混合模型用于文本信息的Ad hoc检索。该方法将LDA模型与文档模型相结合,与聚类模型相比,在保持较低的计算复杂度外,具有很高的检索性能,因此更适用于大规模文档集的信息检索。
引用
收藏
页码:1369 / 1372
页数:4
相关论文
共 15 条
[1]  
Probabilistic latent semantic indexing. Thomas Hofmann. Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval . 1999
[2]  
A nonlocal Bayesian image denoising algorithm. Lebru M,Buades A,Morel J. SIAM Journal on Imaging Sciences . 2013
[3]  
A New Bag of Words LBP (BoWL) Descriptor for Scene Image Classification. Banerji S,Sinha A,Liu C. Computer Analysis of Images and Patterns . 2013
[4]  
Iterative expectation for multi period information retrieval. SLOAN M,WANG Jun. Proc of WSDM Workshop on Web Search Click Data . 2013
[5]  
Blog topic analysis using TF smoothing and LDA. LEE S,LEE J,PARK C Y,et al. Proc of the 7th International Conference on Ubiquitous Information Management and Communication . 2013
[6]  
Lexical and hierarchical topic regression. NGUYEN V A,BOYD-GRABER J,RESNIK P. Advances in Neural Information Processing Systems . 2013
[7]   Content-based information retrieval and digital libraries [J].
Wan, Gary ;
Liu, Zao .
INFORMATION TECHNOLOGY AND LIBRARIES, 2008, 27 (01) :41-47
[8]  
A mixture clustering model for pseudo feedback in information retrieval. Tao T,Zhai C X. Classification,Clustering,and Data Mining Applications . 2004
[9]   A mixed iteration for nonnegative matrix factorizations [J].
Soltuz, Stefan M. ;
Rhoades, B. E. .
APPLIED MATHEMATICS AND COMPUTATION, 2013, 219 (18) :9847-9855
[10]  
基于数据间内在关联性的自适应模糊聚类模型[J]. 唐成龙,王石刚.  自动化学报. 2010(11)