Towards new information resources for public health - From WordNet to MedicalWordNet

Cited by: 8
Authors
Fellbaum, Christiane
Hahn, Udo
Smith, Barry
Affiliations
[1] Princeton Univ, Dept Psychol, Princeton, NJ 08544 USA
[2] Berlin Brandenburg Acad Sci, Berlin, Germany
[3] Univ Jena, JULIE, D-6900 Jena, Germany
[4] Univ Saarbrucken, IFOMIS, Saarbrucken, Germany
[5] SUNY Buffalo, Dept Philosophy, Buffalo, NY 14260 USA
Keywords
WordNet; MedicalWordNet; MedicalFactNet; medical terminology; lexical database; expert terminology; layperson's terminology
DOI
10.1016/j.jbi.2005.09.004
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
In the last two decades, WORDNET has evolved as the most comprehensive computational lexicon of general English. In this article, we discuss its potential for supporting the creation of an entirely new kind of information resource for public health, viz. MEDICALWORDNET. This resource is not to be conceived merely as a lexical extension of the original WORDNET to medical terminology; indeed, there is already a considerable degree of overlap between WORDNET and the vocabulary of medicine. Instead, we propose a new type of repository, consisting of three large collections of (1) medically relevant word forms, structured along the lines of the existing Princeton WORDNET; (2) medically validated propositions, referred to here as medical facts, which will constitute what we shall call MEDICALFACTNET; and (3) propositions reflecting laypersons' medical beliefs, which will constitute what we shall call the MEDICALBELIEFNET. We introduce a methodology for setting up the MEDICALWORDNET. We then turn to the discussion of research challenges that have to be met to build this new type of information resource.
We build a database of sentences relevant to the medical domain. The sentences are generated from WordNet via its relations as well as from medical statements broken down into elementary propositions. Two subcorpora of sentences are distinguished, MedicalBeliefNet and MedicalFactNet. The former is rated for assent by laypersons; the latter for correctness by medical experts. The sentence corpora will be valuable for a variety of applications in information retrieval as well as in research in linguistics and psychology with respect to the study of expert and non-expert beliefs and their linguistic expressions. Our work has to meet several considerable challenges. These include accounting for the distinction between medical experts and laypersons, the social issues of expert-layperson communication in different media, the linguistic aspects of encoding medical knowledge, and the reliability, volume, and emergence of medical knowledge. The work described here has been tested in a small pilot experiment [39] and awaits large-scale implementation. (c) 2005 Elsevier Inc. All rights reserved.
Pages: 321-332
Number of pages: 12
Related papers
49 entries in total
[1] The tools and techniques of evidence-based medicine [J].
Abalos, E.
BEST PRACTICE & RESEARCH CLINICAL OBSTETRICS & GYNAECOLOGY, 2005, 19(01): 15-26
[2] [Anonymous], NORMS WORD ASS
[3] Baader F, 2003, DESCRIPTION LOGIC HANDBOOK: THEORY, IMPLEMENTATION AND APPLICATIONS, P43
[4] Themes of holism, empowerment, access, and legitimacy define complementary, alternative, and integrative medicine in relation to conventional biomedicine [J].
Barrett, B; Marchand, L; Scheder, J; Plane, MB; Maberry, R; Appelbaum, D; Rakel, D; Rabago, D.
JOURNAL OF ALTERNATIVE AND COMPLEMENTARY MEDICINE, 2003, 9(06): 937-947
[5] BODENREIDER O, 2002, P 1 GLOB WORDNET C, P223
[6] BODENREIDER O, 2004, KR MED 2004
[7] BODENREIDER O, 2003, MED INFORM EUROPE, P379
[8] Brachman Ronald J., 2004, KNOWLEDGE REPRESENTA
[9] BUITELAAR P, 2002, P 1 INT C GLOB WORDN
[10] Burgun Anita, 2001, P NAACL 2001 WORKSH, P77