Systematic comparison of the protein-protein interaction databases from a user's perspective

被引:63
作者
Bajpai, Akhilesh Kumar [1 ,2 ]
Davuluri, Sravanthi [1 ,2 ]
Tiwary, Kriti [2 ]
Narayanan, Sithalechumi [2 ]
Oguru, Sailaja [3 ]
Basavaraju, Kavyashree [3 ]
Dayalan, Deena [3 ]
Thirumurugan, Kavitha [1 ]
Acharya, Kshitish K. [2 ,3 ,4 ]
机构
[1] Vellore Inst Technol VIT Univ, Ctr Biomed Res, SBST, Struct Biol Lab, Vellore 632014, Tamil Nadu, India
[2] Shodhaka Life Sci Pvt Ltd, Phase 1, Bangalore 560100, Karnataka, India
[3] Biol Data Analyzers Assoc BdataA, Phase 1, Bangalore 560100, Karnataka, India
[4] IBAB, Phase 1, Bangalore 560100, Karnataka, India
关键词
Protein interaction databases; Database comparisons; Protein interactions; Molecular networks; Systems biology; Database and software selection; INTERACTION NETWORKS; CURATION; VIEW;
D O I
10.1016/j.jbi.2020.103380
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
In absence of periodic systematic comparisons, biologists/bioinformaticians may be forced to make a subjective selection among the many protein-protein interaction (PPI) databases and tools. We conducted a comprehensive compilation and comparison of such resources. We compiled 375 PPI resources, short-listed 125 important ones (both lists are available at startbioinfo.com), and compared the features and coverage of 16 carefully-selected databases related to human PPIs. We quantitatively compared the coverage of 'experimentally verified' as well as `total' (experimentally verified and predicted) PPIs for these 16 databases. Coverage was compared in two ways: (a) PPIs obtained in response to gene queries using the web interfaces were compared. As a query set, 108 genes expressed differently across tissues (specific to kidney, testis, and uterus, and ubiquitous - i.e., expressed in 43 human normal tissues) or associated with certain diseases (breast cancer, lung cancer, Alzheimer's, cystic fibrosis, diabetes, and cardiomyopathy) were chosen. The coverage was also compared for the well-studied genes versus the less-studied ones. The coverage of the databases for high-quality interactions was separately assessed using a set of literature curated experimentally-proven PPIs (gold standard PPI-set); (b) the back-end-data from 15 PPI databases was downloaded and compared. Combined results from STRING and UniHI covered around 84% of 'experimentally verified' PPIs. Approximately 94% of the 'total' PPIs available across the databases were retrieved by the combined use of hPRINT, STRING, and IID. Among the experimentally verified PPIs found exclusively in each database, STRING contributed around 71% of the hits. The coverage of certain databases was skewed for some gene-types. Analysis with the gold-standard PPI-set revealed that GPS-Prot, STRING, APID, and HIPPIE, each covered 70% of the curated interactions. The database usage frequencies did not always correlate with their respective advantages, thereby justifying the need for more frequent studies of this nature.
引用
收藏
页数:15
相关论文
共 37 条
[1]
A novel tissue-specific meta-analysis approach for gene expression predictions, initiated with a mammalian gene expression testis database [J].
Acharya, Kshitish K. ;
Chandrashekar, Darshan S. ;
Chitturi, Neelima ;
Shah, Hardik ;
Malhotra, Varun ;
Sreelakshmi, K. S. ;
Deepti, H. ;
Bajpai, Akhilesh ;
Davuluri, Sravanthi ;
Bora, Pranami ;
Rao, Leena .
BMC GENOMICS, 2010, 11
[2]
HIPPIE v2.0: enhancing meaningfulness and reliability of protein-protein interaction networks [J].
Alanis-Lobato, Gregorio ;
Andrade-Navarro, Miguel A. ;
Schaefer, Martin H. .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D408-D414
[3]
APID interactomes: providing proteome-based interactomes with controlled quality for multiple species and derived networks [J].
Alonso-Lopez, Diego ;
Gutierrez, Miguel A. ;
Lopes, Katia P. ;
Prieto, Carlos ;
Santamaria, Rodrigo ;
De Las Rivas, Javier .
NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) :W529-W535
[4]
OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders [J].
Amberger, Joanna S. ;
Bocchini, Carol A. ;
Schiettecatte, Francois ;
Scott, Alan F. ;
Hamosh, Ada .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D789-D798
[5]
Bajpai A.K., 2011, NATURE PRECEDINGS, V2101, DOI [10.1038/npre.2011.2101.3, DOI 10.1038/NPRE.2011.2101.3.]
[6]
The TissueNet v.2 database: A quantitative view of protein-protein interactions across human tissues [J].
Basha, Omer ;
Barshir, Ruth ;
Sharon, Moran ;
Lerman, Eugene ;
Kirson, Binyamin F. ;
Hekselman, Idan ;
Yeger-Lotem, Esti .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D427-D431
[7]
mentha: a resource for browsing integrated protein-interaction networks [J].
Calderone, Alberto ;
Castagnoli, Luisa ;
Cesareni, Gianni .
NATURE METHODS, 2013, 10 (08) :690-691
[8]
Identification of Human Housekeeping Genes and Tissue-Selective Genes by Microarray Meta-Analysis [J].
Chang, Cheng-Wei ;
Cheng, Wei-Chung ;
Chen, Chaang-Ray ;
Shu, Wun-Yi ;
Tsai, Min-Lung ;
Huang, Ching-Lung ;
Hsu, Ian C. .
PLOS ONE, 2011, 6 (07)
[9]
The BioGRID interaction database: 2017 update [J].
Chatr-aryamontri, Andrew ;
Oughtred, Rose ;
Boucher, Lorrie ;
Rust, Jennifer ;
Chang, Christie ;
Kolas, Nadine K. ;
O'Donnell, Lara ;
Oster, Sara ;
Theesfeld, Chandra ;
Sellam, Adnane ;
Stark, Chris ;
Breitkreutz, Bobby-Joe ;
Dolinski, Kara ;
Tyers, Mike .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D369-D379
[10]
HINT: High-quality protein interactomes and their applications in understanding human disease [J].
Das, Jishnu ;
Yu, Haiyuan .
BMC SYSTEMS BIOLOGY, 2012, 6