Using literature and data to learn Bayesian networks as clinical models of ovarian tumors

被引:45
作者
Antal, P
Fannes, G
Timmerman, D
Moreau, Y
De Moor, B
机构
[1] Katholieke Univ Leuven, ESAT SCD, Dept Elect Engn, B-3001 Louvain, Belgium
[2] Katholieke Univ Leuven Hosp, Dept Obstet & Gynecol, B-3000 Louvain, Belgium
关键词
text mining; literature networks; Bayesian networks; prior incorporation; structure learning;
D O I
10.1016/j.artmed.2003.11.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Thanks to its increasing availability, electronic literature has become a potential source of information for the development of complex Bayesian networks (BN), when human expertise is missing or data is scarce or contains much noise. This opportunity raises the question of how to integrate information from free-text resources with statistical data in learning Bayesian networks. Firstly, we report on the collection of prior information resources in the ovarian cancer domain, which includes "kernel" annotations of the domain variables. We introduce methods based on the annotations and literature to derive informative pairwise dependency measures, which are derived from the statistical cooccurrence of the names of the variables, from the similarity of the "kernel" descriptions of the variables and from a combined method. We perform wide-scale evaluation of these text-based dependency scores against an expert reference and against data scores (the mutual information (MI) and a Bayesian score). Next, we transform the text-based dependency measures into informative text-based priors for Bayesian network structures. Finally, we report the benefit of such informative text-based priors on the performance of a Bayesian network for the classification of ovarian tumors from clinical data. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:257 / 281
页数:25
相关论文
共 50 条
[1]   Web-based data collection for uterine adnexal tumors: A case study [J].
Aerts, S ;
Antal, P ;
Timmerman, D ;
De Moor, B ;
Moreau, Y .
PROCEEDINGS OF THE 15TH IEEE SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, 2002, :282-287
[2]   Challenges for biomedical informatics and pharmacogenomics [J].
Altman, RB ;
Klein, TE .
ANNUAL REVIEW OF PHARMACOLOGY AND TOXICOLOGY, 2002, 42 :113-133
[3]   Bayesian applications of belief networks and multilayer perceptrons for ovarian tumor classification with rejection [J].
Antal, P ;
Fannes, G ;
Timmerman, D ;
Moreau, Y ;
De Moor, B .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2003, 29 (1-2) :39-60
[4]   Domain knowledge based information retrieval language:: An application of Annotated Bayesian Networks in ovarian cancer domain [J].
Antal, P ;
De Moor, B ;
Timmerman, D ;
Mészáros, T ;
Dobrowiecki, T .
PROCEEDINGS OF THE 15TH IEEE SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, 2002, :213-218
[5]   Annotated Bayesian networks:: a tool to integrate textual and probabilistic medical knowledge [J].
Antal, P ;
Mészáros, T ;
De Moor, B ;
Dobrowiecki, T .
FOURTEENTH IEEE SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS, 2001, :177-182
[6]  
Antal P., 2002, P 8 ACM SIGKDD INT C, P405
[7]  
BAEZAYATES RA, 1999, MODERN INFORMATION R
[8]  
Blaschke C, 2002, IEEE INTELL SYST, V17, P14, DOI 10.1109/MIS.2002.999215
[9]  
BROOKS T, 1998, P ASIS, P33
[10]  
Buntine W., 1991, P 7 C UNC ART INT, P52