Temporal profiles of queries

Cited by: 107
Authors
Jones, Rosie
Diaz, Fernando
Affiliations
[1] Yahoo Res, Burbank, CA 91504 USA
[2] Univ Massachusetts, Dept Comp Sci, Ctr Intelligent Informat Retrieval, Amherst, MA 01003 USA
Keywords
algorithms; experimentation; theory; time; temporal profiles; ambiguity; precision prediction; query classification; event detection; language models
DOI
10.1145/1247715.1247720
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Documents with timestamps, such as email and news, can be placed along a timeline. The timeline for a set of documents returned in response to a query gives an indication of how documents relevant to that query are distributed in time. Examining the timeline of a query result set allows us to characterize both how temporally dependent the topic is and how relevant the results are likely to be. We outline characteristic patterns in query result set timelines, and show experimentally that we can automatically classify documents into these classes. We also show that properties of the query result set timeline can help predict the mean average precision of a query. These results show that meta-features associated with a query can be combined with text retrieval techniques to improve our understanding and treatment of text search on documents with timestamps.
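
As a rough illustration of the "timeline of a query result set" described in the abstract, the sketch below counts how many returned documents fall on each day and normalizes the counts into a distribution. It is a simplified, unweighted histogram under stated assumptions, not the authors' model: the function name `temporal_profile` and the sample timestamps are hypothetical and do not come from the paper.

```python
from collections import Counter
from datetime import date


def temporal_profile(timestamps):
    """Map each day to the fraction of result documents dated that day.

    Assumption: every document counts equally. The abstract does not
    specify how documents are weighted, so this is only a sketch.
    """
    counts = Counter(timestamps)
    total = sum(counts.values())
    return {day: n / total for day, n in sorted(counts.items())}


# Hypothetical timestamps of the documents returned for one query.
results = [
    date(2003, 3, 19),
    date(2003, 3, 20),
    date(2003, 3, 20),
    date(2003, 3, 20),
    date(1999, 1, 5),
]

profile = temporal_profile(results)

# The day holding the largest share of the result set.
peak_day = max(profile, key=profile.get)
```

A profile that concentrates most of its mass on a few adjacent days would suggest a temporally dependent topic, while a flat profile would suggest an atemporal one.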
Pages: 31