Recommending research collaborations using link prediction and random forest classifiers

Cited by: 94
Authors
Guns, Raf [1]
Rousseau, Ronald [1, 2]
Affiliations
[1] Univ Antwerp, IBW, Inst Educ & Informat Sci, B-2000 Antwerp, Belgium
[2] Katholieke Univ Leuven, B-3000 Leuven, Belgium
Keywords
Collaboration; Networks; Link prediction; Machine learning; Random forest classifiers; Recommendation; Facilitator cities
DOI
10.1007/s11192-013-1228-9
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
We introduce a method to predict or recommend high-potential future (i.e., not yet realized) collaborations. The proposed method is based on a combination of link prediction and machine learning techniques. First, a weighted co-authorship network is constructed. We calculate scores for each node pair according to different measures called predictors. The resulting scores can be interpreted as indicative of the likelihood of future linkage for the given node pair. To determine the relative merit of each predictor, we train a random forest classifier on older data. The same classifier can then generate predictions for newer data. The top predictions are treated as recommendations for future collaboration. We apply the technique to research collaborations between cities in Africa, the Middle East and South-Asia, focusing on the topics of malaria and tuberculosis. Results show that the method yields accurate recommendations. Moreover, the method can be used to determine the relative strengths of each predictor.
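The scoring step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not name the predictors, so common neighbors, Jaccard, and Adamic/Adar are assumed here as typical link prediction measures. The network `GRAPH`, its node names, and all function names are hypothetical, and the edge weights are stored but ignored for brevity.

```python
import math
from itertools import combinations

# Hypothetical weighted co-authorship network (illustrative data only):
# node -> {neighbor: number of co-authored papers}
GRAPH = {
    "A": {"B": 3, "C": 1},
    "B": {"A": 3, "C": 2, "D": 1},
    "C": {"A": 1, "B": 2, "D": 4},
    "D": {"B": 1, "C": 4},
    "E": {},
}


def neighbors(graph, node):
    """Return the set of nodes linked to `node` (weights ignored)."""
    return set(graph[node])


def common_neighbors(graph, u, v):
    """Number of nodes linked to both u and v."""
    return len(neighbors(graph, u) & neighbors(graph, v))


def jaccard(graph, u, v):
    """Shared neighbors divided by all neighbors of u or v."""
    union = neighbors(graph, u) | neighbors(graph, v)
    if not union:
        return 0.0
    return len(neighbors(graph, u) & neighbors(graph, v)) / len(union)


def adamic_adar(graph, u, v):
    """Sum of 1 / log(degree) over shared neighbors.

    A shared neighbor is linked to both u and v, so its degree is at
    least 2 and the logarithm is always positive.
    """
    shared = neighbors(graph, u) & neighbors(graph, v)
    return sum(1.0 / math.log(len(graph[z])) for z in shared)


def score_unlinked_pairs(graph):
    """Map each not-yet-linked node pair to a tuple of predictor scores."""
    scores = {}
    for u, v in combinations(sorted(graph), 2):
        if v in graph[u]:
            continue  # already collaborating, nothing to predict
        scores[(u, v)] = (
            common_neighbors(graph, u, v),
            jaccard(graph, u, v),
            adamic_adar(graph, u, v),
        )
    return scores


scores = score_unlinked_pairs(GRAPH)
# Rank candidate pairs by one predictor (here Adamic/Adar), highest first.
ranked = sorted(scores, key=lambda pair: scores[pair][2], reverse=True)
```

In the method the abstract describes, score vectors like these would be computed on older data, labeled by whether the link later appeared, and used to train a random forest classifier; that training step is omitted from this sketch.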
Pages: 1461-1473
Number of pages: 13