Inference of demographic attributes based on mobile phone usage patterns and social network topology

Cited by: 8
Authors
Sarraute C. [1]
Brea J. [1]
Burroni J. [1]
Blanc P. [2]
Affiliations
[1] Grandata Labs, Bartolomé Cruz 1818, Vicente Lopez, Buenos Aires
[2] IMAS, UBA-CONICET, FCEN, Ciudad Universitaria, Int Guiraldes 2160, CABA, Buenos Aires
Keywords
Call detail records; Demographics; Graph mining; Homophily; Mobile phone social network; Social network analysis
DOI
10.1007/s13278-015-0277-x
Abstract
Mobile phone usage provides a wealth of information that can be used to better understand the demographic structure of a population. In this paper, we focus on the population of Mexican mobile phone users. We first present an observational study of mobile phone usage by gender and age group, and detect significant differences in phone usage among subgroups of the population. We then study the performance of different machine learning (ML) methods for predicting demographic features (namely, age and gender) of unlabeled users by leveraging individual calling patterns as well as the structure of the communication graph. We show that a specific implementation of a diffusion model, which harnesses the graph structure, performs significantly better than standard node-based ML methods. We provide details of the methodology together with an analysis of the robustness of our results to changes in the model parameters. Furthermore, by carefully examining the topological relations of the training nodes (seed nodes) to the rest of the nodes in the network, we find topological metrics that have a direct influence on the performance of the algorithm. © 2015, Springer-Verlag Wien.
Pages: 1 / 18
Number of pages: 17