Adaptive name matching in information integration

被引:202
作者
Bilenko, M
Mooney, R
Cohen, W
Ravikumar, P
Fienberg, S
机构
[1] Univ Texas, Dept Comp Sci, Austin, TX 78712 USA
[2] Carnegie Mellon Univ, Ctr Automated Learning & Discovery, Pittsburgh, PA 15213 USA
[3] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[4] Carnegie Mellon Univ, Dept Stat, Pittsburgh, PA 15213 USA
[5] Carnegie Mellon Univ, Ctr Comp & Commun Secur, Pittsburgh, PA 15213 USA
基金
美国安德鲁·梅隆基金会; 美国国家科学基金会;
关键词
D O I
10.1109/MIS.2003.1234765
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Different approaches to the name-matching problem in information integration are discussed. Those methods are used that adapt to a specific domain by combining multiple string similarity methods that capture different notions of similarity. Edit distance metrics are widely used and many variations are possible. An adaptive version of edit distance with affine gaps is proposed.
引用
收藏
页码:16 / 23
页数:8
相关论文
共 27 条