Extraction and representation of contextual information for knowledge discovery in texts

被引:25
作者
Perrin, P [1 ]
Petry, FE
机构
[1] Merck Res Labs, Med Chem Mol Syst, Rahway, NJ 07065 USA
[2] Tulane Univ, Dept Elect Engn & Comp Sci, New Orleans, LA 70118 USA
关键词
text mining; text feature construction; extraction and selection; collocational expressions; text representation; first-order logic;
D O I
10.1016/S0020-0255(02)00400-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the role of lexical contextual relations for the problem of unsupervised knowledge discovery in full texts. Narrative texts have inherent structure dictated by language usage in generating them. We suggest that the relative distance of terms within a text gives sufficient information about its structure and its relevant content. Furthermore, this structure can be used to discover implicit knowledge embedded in the text, therefore serving as a good candidate to represent effectively the text content for knowledge elicitation tasks. We qualitatively demonstrate that a useful text structure and content can be systematically extracted by collocational lexical analysis without the need to encode any supplemental sources of knowledge. We present an algorithm that systematically extracts the most relevant facts in the texts and labels them by their overall theme, dictated by local contextual information. It exploits domain independent lexical frequencies and mutual information measures to find the relevant Contextual units in the texts. We report results from experiments in a real-world textual database of psychiatric evaluation reports. (C) 2002 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:125 / 152
页数:28
相关论文
共 47 条
[1]  
Akman V, 1996, AI MAG, V17, P55
[2]   SOME COMPUTATIONAL EXPERIMENTS IN SUMMARY [J].
ALTERMAN, R ;
BOOKMAN, LA .
DISCOURSE PROCESSES, 1990, 13 (02) :143-174
[3]  
Austin John., 1955, How to do Things with Words
[4]  
BARWISE J, 1992, PHILOS PERSPECTIVES, V8
[5]  
Berry-Rogghe GLM, 1973, COMPUTER LIT STUDIES, P103
[6]  
BOWER B, 1996, SCI NEWS, V150, P126
[7]   SENTENCE MEMORY - CONSTRUCTIVE VERSUS INTERPRETIVE APPROACH [J].
BRANSFORD, JD ;
BARCLAY, JR ;
FRANKS, JJ .
COGNITIVE PSYCHOLOGY, 1972, 3 (02) :193-209
[8]  
Brill E, 1997, AI MAG, V18, P13
[9]  
Brown G., 1983, DISCOURSE ANAL
[10]  
Brown P. F., 1990, Computational Linguistics, V16, P79