Constructing an associative concept space for literature-based discovery

Cited by: 46
Authors
van der Eijk, CC
van Mulligen, EM
Kors, JA
Mons, B
van den Berg, J
Affiliations
[1] Univ Med Ctr Rotterdam, Erasmus MC, Dept Med Informat, NL-3000 DR Rotterdam, Netherlands
[2] Erasmus Univ, Fac Econ, Dept Comp Sci, NL-3000 DR Rotterdam, Netherlands
Source
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2004 / Vol. 55 / No. 05
DOI
10.1002/asi.10392
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Scientific literature is often fragmented, which implies that certain scientific questions can only be answered by combining information from various articles. In this paper, a new algorithm is proposed for finding associations between related concepts present in literature. To this end, concepts are mapped to a multidimensional space by a Hebbian type of learning algorithm using co-occurrence data as input. The resulting concept space allows exploration of the neighborhood of a concept and finding potentially novel relationships between concepts. The obtained information retrieval system is useful for finding literature supporting hypotheses and for discovering previously unknown relationships between concepts. Tests on artificial data show the potential of the proposed methodology. In addition, preliminary tests on a set of Medline abstracts yield promising results.
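The idea of placing concepts in a multidimensional space from co-occurrence data can be illustrated with a small sketch. This is not the authors' algorithm, whose details the abstract does not give. It is a toy Hebbian-style rule under stated assumptions: each concept starts on its own axis, every co-occurring pair is pulled slightly closer together, and the three "documents", the function names `build_concept_space` and `neighbors`, the learning rate, and the epoch count are all hypothetical.

```python
from itertools import combinations
from math import dist


def build_concept_space(documents, learning_rate=0.1, epochs=30):
    """Toy Hebbian-style embedding: co-occurring concepts are pulled together."""
    concepts = sorted({c for doc in documents for c in doc})
    n = len(concepts)
    # Assumption: one-hot start, so all concepts are initially equidistant.
    space = {c: [1.0 if i == k else 0.0 for i in range(n)]
             for k, c in enumerate(concepts)}
    for _ in range(epochs):
        for doc in documents:
            # Every pair of concepts in the same document counts as a co-occurrence.
            for a, b in combinations(sorted(set(doc)), 2):
                va, vb = space[a], space[b]
                for i in range(n):
                    # Move both vectors a small step toward each other.
                    step = learning_rate * (vb[i] - va[i])
                    va[i] += step
                    vb[i] -= step
    return space


def neighbors(space, concept):
    """Return the other concepts ordered by Euclidean distance to `concept`."""
    others = [c for c in space if c != concept]
    return sorted(others, key=lambda c: dist(space[concept], space[c]))


# Hypothetical corpus: A and C never co-occur directly, but both co-occur with B.
documents = [["A", "B"], ["B", "C"], ["X", "Y"]]
space = build_concept_space(documents)
ranked = neighbors(space, "A")
print(ranked)
```

In this toy corpus, C ends up closer to A than the unrelated X and Y do, even though A and C never share a document. That is the kind of indirect association the abstract describes as a potentially novel relationship. The sketch has no repulsion or normalization step, so a large connected corpus would eventually collapse to a single point; a practical method would need some counterweight.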
Pages: 436-444
Page count: 9