A mouse protein interactome through combined literature mining with multiple sources of interaction evidence

Cited by: 20
Authors
Li, Xiao [1 ]
Cai, Haoyang [1 ]
Xu, Jiabao [1 ]
Ying, Sancheng [2 ]
Zhang, Yizheng [1 ]
Affiliations
[1] Sichuan Univ, Coll Life Sci, Key Lab Bioresource & Ecoenvironm, Minist Educ,Sichuan Key Lab Mol Biol & Biotechnol, Chengdu 610065, Peoples R China
[2] Sichuan Anim Sci Acad, Chengdu 610066, Peoples R China
Keywords
Interactome; Mouse; Protein interaction network; Protein-protein interaction; BIOLOGICAL NETWORKS; SACCHAROMYCES-CEREVISIAE; INTERACTION MAP; DATABASE; PREDICTION; MODELS; YEAST; IDENTIFICATION; CYTOSCAPE; EVOLUTION;
DOI
10.1007/s00726-009-0335-7
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
Protein-protein interactions (PPIs) play crucial roles in many biological processes. Protein interaction networks (PINs) have recently been generated for several model organisms and for humans, but few large-scale studies of the mouse have been carried out, either experimentally or computationally. In this work, we set out to map a mouse PIN from the protein interactions buried in the large body of biomedical literature. Following a co-occurrence-based text-mining approach, a probabilistic model, naïve Bayes, was used to filter out false-positive interactions by integrating heterogeneous evidence from genomic and proteomic datasets. A support vector machine algorithm was then used to select protein pairs with physical interactions. Comparison with currently available PPI datasets from several model organisms and humans showed that the derived mouse PINs have similar topological properties at the global level but high local divergence. The mouse protein interaction dataset is stored in the Mouse protein-protein interaction DataBase (MppDB), a useful source of information for system-level understanding of gene function and biological processes in mammals. MppDB is publicly available at http://bio.scu.edu.cn/mppi.
Pages: 1237-1252
Page count: 16
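
The first two stages named in the abstract (co-occurrence text mining, then naïve Bayes filtering) can be illustrated with a minimal sketch. This is not the authors' implementation: the sentences, protein names, prior odds, and likelihood ratios below are invented for illustration, the function names are hypothetical, and the support vector machine step is not shown.

```python
import math
from collections import Counter
from itertools import combinations


def cooccurrence_counts(sentences, protein_names):
    """Count how often each unordered protein pair shares a sentence."""
    counts = Counter()
    for sentence in sentences:
        tokens = set(sentence.lower().split())
        # Keep names whose lowercase form appears as a whole token.
        found = sorted(n for n in protein_names if n.lower() in tokens)
        for pair in combinations(found, 2):
            counts[pair] += 1
    return counts


def posterior_odds(prior_odds, likelihood_ratios):
    """Naive Bayes combination: prior odds times each evidence
    likelihood ratio, assuming the evidence sources are independent."""
    log_odds = math.log(prior_odds)
    log_odds += sum(math.log(lr) for lr in likelihood_ratios)
    return math.exp(log_odds)


# Toy input (hypothetical sentences and names, not from the paper).
sentences = [
    "Trp53 binds Mdm2 in vivo",
    "Mdm2 and Trp53 form a complex",
    "Akt1 phosphorylates Mdm2",
]
names = ["Trp53", "Mdm2", "Akt1"]
pairs = cooccurrence_counts(sentences, names)

# Hypothetical likelihood ratios for two evidence sources,
# for example co-expression and shared annotation.
odds = posterior_odds(prior_odds=0.01, likelihood_ratios=[10.0, 5.0])
```

Working in log space keeps the product numerically stable when many evidence sources are combined. A candidate pair would be retained only if its posterior odds exceed a chosen cutoff, which is a free parameter in this sketch.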