Mining communities and their relationships in blogs: A study of online hate groups

被引:138
作者
Chau, Michael [1 ]
Xu, Jennifer
机构
[1] Univ Hong Kong, Sch Business, Pokfulam, Hong Kong, Peoples R China
[2] Bentley Coll, Dept Comp Informat Syst, Waltham, MA 02452 USA
关键词
blogs; social network analysis; hate groups; Web mining; NETWORKS; CENTRALITY; FRAMEWORK; ERROR; WEB;
D O I
10.1016/j.ijhcs.2006.08.009
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Blogs, often treated as the equivalence of online personal diaries, have become one of the fastest growing types of Web-based media. Everyone is free to express their opinions and emotions very easily through blogs. In the blogosphere, many communities have emerged, which include hate groups and racists that are trying to share their ideology, express their views, or recruit new group members. It is important to analyze these virtual communities, defined based on membership and subscription linkages, in order to monitor for activities that are potentially harmful to society. While many Web mining and network analysis techniques have been used to analyze the content and structure of the Web sites of hate groups on the Internet, these techniques have not been applied to the study of hate groups in blogs. To address this issue, we have proposed a semi-automated approach in this research. The proposed approach consists of four modules, namely blog spider, information extraction, network analysis, and visualization. We applied this approach to identify and analyze a selected set of 28 anti-Blacks hate groups (820 bloggers) on Xanga, one of the most popular blog hosting sites. Our analysis results revealed some interesting demographical and topological characteristics in these groups, and identified at least two large communities on top of the smaller ones. The study also demonstrated the feasibility in applying the proposed approach in the study of hate groups and other related communities in blogs. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:57 / 70
页数:14
相关论文
共 74 条
[11]  
Bollobas, 2001, CAMBRIDGE STUDIES AD, V73
[12]   The network paradigm in organizational research: A review and typology [J].
Borgatti, SP ;
Foster, PC .
JOURNAL OF MANAGEMENT, 2003, 29 (06) :991-1013
[13]  
BRIN S, 1998, P 7 WWW C BRISB AUST
[14]  
Burris V., 2000, SOCIOL FOCUS, V33, DOI https://doi.org/10.1080/00380237.2000.10571166
[15]  
CHAKRABARTI S, 1999, KP 8 INT WORLD WID W
[16]   SpidersRUs: Automated development of vertical search engines in different domains and languages [J].
Chau, M ;
Qin, JL ;
Zhou, YL ;
Tseng, CJ ;
Chen, H .
PROCEEDINGS OF THE 5TH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, PROCEEDINGS, 2005, :110-111
[17]  
CHAU M, 2001, P 1 ACM IEEE CS JOIN, P79
[18]  
Chau M., 2002, Proc. Nat'l Conf. Digital Government Research, P271
[19]  
CHAU M, 2005, P 4 WORKSH E BUS WEB
[20]   Crime data mining: A general framework and some examples [J].
Chen, HC ;
Chung, WY ;
Xu, JJ ;
Wang, G ;
Qin, Y ;
Chau, M .
COMPUTER, 2004, 37 (04) :50-+