Analysis of user keyword similarity in online social networks

被引:81
作者
Bhattacharyya, Prantik [1 ]
Garg, Ankush [1 ]
Wu, Shyhtsun Felix [1 ]
机构
[1] Univ Calif Davis, Dept Comp Sci, One Shields Ave, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
Online social network; User keywords; User similarity; Homophily measurement; Semantic analysis;
D O I
10.1007/s13278-010-0006-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
How do two people become friends? What role does homophily play in bringing two people closer to help them forge friendship? Is the similarity between two friends different from the similarity between any two people? How does the similarity between a friend of a friend compare to similarity between direct friends? In this work, our goal is to answer these questions. We study the relationship between semantic similarity of user profile entries and the social network topology. A user profile in an on-line social network is characterized by its profile entries. The entries are termed as user keywords. We develop a model to relate keywords based on their semantic relationship and define similarity functions to quantify the similarity between a pair of users. First, we present a 'forest model' to categorize keywords across multiple categorization trees and define the notion of distance between keywords. Second, we use the keyword distance to define similarity functions between a pair of users. Third, we analyze a set of Facebook data according to the model to determine the effect of homophily in on-line social networks. Based on our evaluations, we conclude that direct friends are more similar than any other user pair. However, the more striking observation is that except for direct friends, similarities between users are approximately equal, irrespective of the topological distance between them.
引用
收藏
页码:143 / 158
页数:16
相关论文
共 23 条
[1]  
Adamic L. A., 2003, First Monday, V8, DOI 10.5210/fm.v8i6.1057
[2]   Friends and neighbors on the Web [J].
Adamic, LA ;
Adar, E .
SOCIAL NETWORKS, 2003, 25 (03) :211-230
[3]  
Banks L, 2007, LSAD 07, P121
[4]   Davis Social Links: Leveraging Social Networks for Future Internet Communication [J].
Banks, Lerone ;
Bhattacharyya, Prantik ;
Spear, Matthew ;
Wu, Shyhtsun Felix .
2009 9TH ANNUAL INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET, 2009, :165-168
[5]   Social Network Model based on Keyword Categorization [J].
Bhattacharyya, Prantik ;
Garg, Ankush ;
Wu, S. Felix .
2009 INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, 2009, :170-175
[6]  
Crandall D.J., 2008, PROC 14 ACM SIGKDD I, P160, DOI DOI 10.1145/1401890.1401914
[7]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
[8]  
2-9
[9]  
Fellbaum C., 1998, WORDNET ELECT LEXICA
[10]  
Howe Daniel C., 2009, RITA WORDNET JAVA BA