Selectively acquiring customer information: A new data acquisition problem and an active learning-based solution

被引:34
作者
Zheng, Zhiqiang
Padmanabhan, Balaji
机构
[1] Univ Calif Riverside, A Gary Anderson Grad Sch Management, Riverside, CA 92521 USA
[2] Univ Penn, Wharton Sch, Philadelphia, PA 19104 USA
关键词
selective information acquisition; active learning; data mining;
D O I
10.1287/mnsc.1050.0488
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper presents a new information acquisition problem motivated by business applications where customer data has to be acquired with a specific modeling objective in mind. In the last two decades, there has been substantial work in two different fields-optimal experimental design and machine learning-that has addressed the issue of acquiring data in a selective manner with a specific objective in mind. We show that the problem presented here is different from the classic model-based data acquisition problems considered thus far in the literature in both fields. Building on work in optimal experimental design and in machine learning, we develop a new active learning technique for the information acquisition problem presented in this paper. We demonstrate that the proposed method performs well based on results from applying this method across 20 Web usage and machine learning data sets.
引用
收藏
页码:697 / 712
页数:16
相关论文
共 60 条