CoPub update: CoPub 5.0 a text mining system to answer biological questions

Cited by: 29
Authors
Fleuren, Wilco W. M. [1, 2]
Verhoeven, Stefan [3]
Frijters, Raoul [1]
Heupers, Bart [4]
Polman, Jan [3]
van Schaik, Rene [3]
de Vlieg, Jacob [1, 3]
Alkema, Wynand [3]
Affiliations
[1] Radboud Univ Nijmegen, Med Ctr, NCMLS, CMBI, NL-6500 HB Nijmegen, Netherlands
[2] Netherlands Bioinformat Ctr NBIC, NL-6500 HB Nijmegen, Netherlands
[3] MSD, NL-5340 BH Oss, Netherlands
[4] SARA Comp & Network Serv, Amsterdam, Netherlands
DOI
10.1093/nar/gkr310
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
In this article, we present CoPub 5.0, a publicly available text mining system, which uses Medline abstracts to calculate robust statistics for keyword co-occurrences. CoPub was initially developed for the analysis of microarray data, but we broadened the scope by implementing new technology and new thesauri. In CoPub 5.0, we integrated existing CoPub technology with new features, and provided a new advanced interface, which can be used to answer a variety of biological questions. CoPub 5.0 allows searching for keywords of interest and their relations to curated thesauri, and provides highlighting and sorting mechanisms, using its statistics, to retrieve the most important abstracts in which the terms co-occur. It also provides a way to search for indirect relations between genes, drugs, pathways and diseases, following an ABC principle, in which A and C have no direct connection but are connected via shared B intermediates. With CoPub 5.0, it is possible to create, annotate and analyze networks using the layout and highlight options of Cytoscape Web, allowing for literature-based systems biology. Finally, the operations of the CoPub 5.0 Web service make it possible to integrate CoPub technology into bioinformatics workflows. CoPub 5.0 can be accessed through the CoPub portal at http://www.copub.org.
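The ABC principle mentioned in the abstract can be illustrated with a small sketch. This is not CoPub's implementation or its Web service API: the function name `indirect_relations`, the pair-list input, and the example terms are all hypothetical, and the sketch treats any co-occurrence as a link, ignoring the statistics CoPub uses to score relations.

```python
from collections import defaultdict


def indirect_relations(cooccurrences, a):
    """Map each term C to the shared B terms linking it to `a`.

    Hypothetical helper: `cooccurrences` is an iterable of (term, term)
    pairs that co-occur in at least one abstract. C qualifies only if it
    has no direct co-occurrence with `a`.
    """
    # Build an undirected adjacency map from the co-occurrence pairs.
    neighbors = defaultdict(set)
    for x, y in cooccurrences:
        neighbors[x].add(y)
        neighbors[y].add(x)

    direct = neighbors[a]  # the candidate B terms
    shared = defaultdict(set)
    for b in direct:
        for c in neighbors[b]:
            # Skip A itself and anything already directly linked to A.
            if c != a and c not in direct:
                shared[c].add(b)
    return {c: sorted(bs) for c, bs in shared.items()}


# Made-up example data, for illustration only.
pairs = [
    ("geneA", "pathwayX"),
    ("geneA", "pathwayY"),
    ("pathwayX", "diseaseC"),
    ("pathwayY", "diseaseC"),
    ("pathwayX", "drugD"),
    ("geneA", "drugD"),  # direct link, so drugD is not an indirect hit
]
links = indirect_relations(pairs, "geneA")
print(links)
```

Here `diseaseC` is reported because it reaches `geneA` only through the two pathway terms, while `drugD` is excluded because it co-occurs with `geneA` directly.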
Pages: W450-W454
Page count: 5