A brief survey of automatic methods for author name disambiguation

被引:169
作者
机构
[1] Departamento de Computação, Universidade Federal de Ouro Preto
[2] Departamento de Ciência da Computação, Universidade Federal de Minas Gerais
来源
Ferreira, A.A. (ferreira@dcc.ufmg.br) | 1600年 / Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States卷 / 41期
关键词
51;
D O I
10.1145/2350036.2350040
中图分类号
学科分类号
摘要
Name ambiguity in the context of bibliographic citation records is a hard problem that affects the quality of services and content in digital libraries and similar systems. The challenges of dealing with author name ambiguity have led to a myriad of disambiguation methods. Generally speaking, the proposed methods usually attempt to group citation records of a same author by finding some similarity among them or try to directly assign them to their respective authors. Both approaches may either exploit supervised or unsupervised techniques. In this article, we propose a taxonomy for characterizing the current author name disambiguation methods described in the literature, present a brief survey of the most representative ones and discuss several open challenges.
引用
收藏
页码:15 / 26
页数:11
相关论文
共 51 条
  • [1] Bagga A., Baldwin B., Algorithms for scoring coreference chains, LREC, pp. 563-566, (1998)
  • [2] Bekkerman R., McCallum A., Disambiguating web appearances of people in a social network, WWW, pp. 463-470, (2005)
  • [3] Bhattacharya I., Getoor L., A latent dirichlet model for unsupervised entity resolution, SDM, (2006)
  • [4] Bhattacharya I., Getoor L., Collective entity resolution in relational data, ACM TKDD, 1, 1
  • [5] Blei D.M., Ng A.Y., Jordan M.I., Latent dirichlet allocation, JMLR, 3, pp. 993-1022, (2003)
  • [6] Cohen W.W., Ravikumar P.D., Fienberg S.E., A comparison of string distance metrics for name-matching tasks, IIWeb, pp. 73-78, (2003)
  • [7] Cota R.G., Ferreira A.A., Goncalves M.A., Laender A.H.F., Nascimento C., An unsupervised heuristic-based hierarchical method for name disambiguation in bibliographic citations, JASIST, 61, 9, pp. 1853-1870, (2010)
  • [8] Crammer K., Singer Y., Ultraconservative online algorithms for multiclass problems, JMLR, 3, pp. 951-991, (2003)
  • [9] Culotta A., Kanani P., Hall R., Wick M., McCallum A., Author disambiguation using error-driven machine learning with a ranking loss function, IIWeb, (2007)
  • [10] De Carvalho A.P., Ferreira A.A., Laender A.H.F., Goncalves M.A., Incremental unsupervised name disambiguation in cleaned digital libraries, JIDM, 2, 3, pp. 289-304, (2011)