Degree centrality for semantic abstraction summarization of therapeutic studies

被引:48
作者
Zhang, Han [1 ,2 ]
Fiszman, Marcelo [2 ]
Shin, Dongwook [2 ]
Miller, Christopher M. [2 ]
Rosemblat, Graciela [2 ]
Rindflesch, Thomas C. [2 ]
机构
[1] China Med Univ, Dept Med Informat, Shenyang, Peoples R China
[2] NIH, Natl Lib Med, Bethesda, MD 20892 USA
基金
美国国家卫生研究院;
关键词
Automatic summarization; Natural language processing; Graph theory; Degree centrality; Semantic processing; Disease treatment; MEDICAL LANGUAGE SYSTEM;
D O I
10.1016/j.jbi.2011.05.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automatic summarization has been proposed to help manage the results of biomedical information retrieval systems. Semantic MEDLINE, for example, summarizes semantic predications representing assertions in MEDLINE citations. Results are presented as a graph which maintains links to the original citations. Graphs summarizing more than 500 citations are hard to read and navigate, however. We exploit graph theory for focusing these large graphs. The method is based on degree centrality, which measures connectedness in a graph. Four categories of clinical concepts related to treatment of disease were identified and presented as a summary of input text. A baseline was created using term frequency of occurrence. The system was evaluated on summaries for treatment of five diseases compared to a reference standard produced manually by two physicians. The results showed that recall for system results was 72%, precision was 73%, and F-score was 0.72. The system F-score was considerably higher than that for the baseline (0.47). Published by Elsevier Inc.
引用
收藏
页码:830 / 838
页数:9
相关论文
共 48 条