Mining communities and their relationships in blogs: A study of online hate groups

Cited by: 138
Authors
Chau, Michael [1]
Xu, Jennifer [2]
Affiliations
[1] Univ Hong Kong, Sch Business, Pokfulam, Hong Kong, Peoples R China
[2] Bentley Coll, Dept Comp Informat Syst, Waltham, MA 02452 USA
Keywords
blogs; social network analysis; hate groups; Web mining; NETWORKS; CENTRALITY; FRAMEWORK; ERROR; WEB
DOI
10.1016/j.ijhcs.2006.08.009
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Blogs, often treated as the equivalent of online personal diaries, have become one of the fastest growing types of Web-based media. Anyone can easily express opinions and emotions through blogs. In the blogosphere, many communities have emerged, including hate groups and racists that are trying to share their ideology, express their views, or recruit new group members. It is important to analyze these virtual communities, defined based on membership and subscription linkages, in order to monitor for activities that are potentially harmful to society. While many Web mining and network analysis techniques have been used to analyze the content and structure of the Web sites of hate groups on the Internet, these techniques have not been applied to the study of hate groups in blogs. To address this issue, we proposed a semi-automated approach in this research. The proposed approach consists of four modules, namely blog spider, information extraction, network analysis, and visualization. We applied this approach to identify and analyze a selected set of 28 anti-Black hate groups (820 bloggers) on Xanga, one of the most popular blog hosting sites. Our analysis results revealed some interesting demographic and topological characteristics in these groups, and identified at least two large communities on top of the smaller ones. The study also demonstrated the feasibility of applying the proposed approach to the study of hate groups and other related communities in blogs. (c) 2006 Elsevier Ltd. All rights reserved.
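As a rough illustration of what the network analysis module might compute from subscription linkages, here is a minimal Python sketch. It is not the authors' implementation: the blogger names and links are made up, the functions `build_graph`, `degree_centrality`, and `connected_components` are hypothetical helpers, and connected components stand in for whatever community-identification method the paper actually used.

```python
from collections import defaultdict, deque


def build_graph(subscriptions):
    """Build an undirected graph from (blogger, blogger) subscription pairs."""
    graph = defaultdict(set)
    for a, b in subscriptions:
        if a == b:
            continue  # ignore self-subscriptions
        graph[a].add(b)
        graph[b].add(a)
    return dict(graph)


def degree_centrality(graph):
    """Degree of each node divided by the maximum possible degree (n - 1)."""
    n = len(graph)
    if n < 2:
        return {node: 0.0 for node in graph}
    return {node: len(neighbors) / (n - 1) for node, neighbors in graph.items()}


def connected_components(graph):
    """Return the connected components as sets, largest first (BFS)."""
    seen = set()
    components = []
    for start in graph:
        if start in seen:
            continue
        seen.add(start)
        queue = deque([start])
        component = set()
        while queue:
            node = queue.popleft()
            component.add(node)
            for neighbor in graph[node]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    queue.append(neighbor)
        components.append(component)
    return sorted(components, key=len, reverse=True)


# Hypothetical subscription links between made-up bloggers (assumption).
subscriptions = [
    ("blogger_a", "blogger_b"),
    ("blogger_b", "blogger_c"),
    ("blogger_a", "blogger_c"),
    ("blogger_d", "blogger_e"),
]

graph = build_graph(subscriptions)
centrality = degree_centrality(graph)
communities = connected_components(graph)

for name in sorted(centrality):
    print(name, round(centrality[name], 2))
for community in communities:
    print(sorted(community))
```

Degree centrality highlights bloggers with many subscription links, while the component listing gives a coarse view of separate groups. A real study would likely need a finer-grained community detection method than connected components.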
Pages: 57-70
Page count: 14