New Information Distance Measure and Its Application in Question Answering System

被引:3
作者
张显 [1 ]
郝宇 [1 ]
朱小燕 [1 ]
李明 [2 ]
机构
[1] Department of Computer Science and Technology,Tsinghua University
[2] David R.Cheriton School of Computer Science,University of Waterloo
基金
中国国家自然科学基金;
关键词
information distance; normalized information distance; question answering system;
D O I
暂无
中图分类号
TP391.1 [文字信息处理];
学科分类号
081203 ; 0835 ;
摘要
In a question answering (QA) system,the fundamental problem is how to measure tile distance between a question and an answer,hence ranking different answers.We demonstrate that such a distance can he precisely and mathematically defined.Not only such a definition is possible,it is actually provably better than any other feasible definitions. Not only such an ultimate definition is possible,but also it can be conveniently and fruitfully applied to construct a QA system.We have built such a system——QUANTA.Extensive experiments are conducted to justify the new theory.
引用
收藏
页码:557 / 572
页数:16
相关论文
共 9 条
  • [1] The context-tree kernel for strings
    Cuturi, M
    Vert, JP
    [J]. NEURAL NETWORKS, 2005, 18 (08) : 1111 - 1123
  • [2] Philipp Cimiano,Steffen Staab.Learning by googling[J].ACM SIGKDD Explorations Newsletter,2004
  • [3] Algorithmic clustering of music based on string compression
    Cilibrasi, R
    Vitányi, P
    de Wolf, R
    [J]. COMPUTER MUSIC JOURNAL, 2004, 28 (04) : 49 - 67
  • [4] Andrej A. Muchnik.Conditional complexity and codes[J].Theoretical Computer Science,2002(1)
  • [5] Nikolai K. Vereshchagin,Michael V. Vyugin.Independent minimum length programs to translate between given strings[J].Theoretical Computer Science,2002(1)
  • [6] Alexei Chernov,Andrej Muchnik,Andrei Romashchenko,Alexander Shen,Nikolai Vereshchagin.Upper semi-lattice of binary strings with the relation “ x is simple conditional to y ”[J].Theoretical Computer Science,2002(1)
  • [7] Ming Li,Jonathan H. Badger,Xin Chen,Sam Kwong,Paul Kearney.An information-based sequence distance and its application to whole mitochondrial genome phylogeny[J].Bioinformatics,2001
  • [8] Relaxing the Triangle Inequality in Pattern Matching
    Ronald Fagin
    Larry Stockmeyer
    [J]. International Journal of Computer Vision, 1998, 30 : 219 - 231
  • [9] Lin J.The web as a resource for question answering:Per- spectives and challenges[K].Proc.3rd Int.Conf.Language Resources and Evolution,2002